> job detail
A
đ˝Other
Site Reliability Engineer III
American Express ¡ Bengaluru, KA, India
// classified as
Other (Adjacent or hard to classify.)
posted
1d ago
location
Bengaluru, KA, India
languages
java
tools
â
> stack
java
> description
Site Reliability Engineer III advances efforts to enhance system resilience, scalability, and performance through feature development, automation, architectural design, chaos engineering, and disaster recovery planning, while promoting best practices for continuous improvement and reliability.
- Provides overall management of the SRE and Production activities for Marketing and Personalization Applications and trusted advisor to business partners, and development teams for support;Â
- Work with development organization to create strategic & tactical architecture plans to ensure maximum SRE Governance models
- Provides consulting and technical expertise to business partners, architects and developers as well as identifies and drives opportunities for improved performance and availabilityÂ
- Reviews design and develop code bases ensure solutions are viable, scalable, and will meet performance standards and support requirements.Â
- Monitors the quality of vendor service delivery resources by reporting on any trends, issues and achievements and escalating where appropriate
- Responsible for ensuring that the engineered environment meets the specifications in terms of support requirements, application design and infrastructure i.e. accountable for the performance and efficiency of Security applications
- Pragmatic, results-driven approach to problem solving. Demonstrated ability to present multiple technical solution options, negotiate amongst options, and make decisions with varying levels of information completeness
- Advises development and support teams as may be necessary to ensure delivery of optimal technical designs and implementation on holistic monitoring solutions.
- Participates as may be necessary in resolving complex technical issues across the portfolio
- Bachelorâs or master's degree in computer science, Information Systems, or other related field (or has equivalent work experience)
- Minimum 5 yearsâ experience with a hands-on service management knowledge in a broad range of distributed technologies infrastructure and BigData systems and platforms
- Requires advanced to expert level knowledge and understanding of high availability architecture/support processes and performance/availability metrics and monitoring
- Extensive experience using a systems analysis and design methodology and an excellent understanding of enterprise system monitoring tools.
- Strong communication skills both verbal and writtenÂ
- Effective consultative skills across a multi-functional environmentÂ
- Possesses knowledge about industry best-practices and trends in infrastructure and security SRE
- Demonstrated knowledge of Personalization and Marketing Domain Â
- Experience with distributed, web, mid-range, and mobile enterprise solutions
- Experience in Finance ManagementÂ
- Â Hands on knowledge in one or more of Java/ Py/Go/ React
- Public Cloud Certification is an add-on