
Site Reliability Engineer II

Mastercard · Pune, Mahārāshtra, India
classified as: Other (Adjacent or hard to classify.)
posted: 1d ago
location: Pune, Mahārāshtra, India
languages: python, shell, sql
tools: —
> description

Our Purpose

Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.

Title and Summary

Site Reliability Engineer II

As a Site Reliability Engineer (SRE) II, you will play a critical role in ensuring the reliability, availability, scalability, and performance of mission‑critical payment platforms within Mastercard’s Treasury Services and the broader TECH organization. You are expected to operate with strong technical ownership, engineering judgment, and accountability, contributing not only to operational excellence but also to system resilience, automation, and reliability engineering practices.
Business Operations is driving Mastercard’s SRE transformation, embedding principles of reliability engineering, observability, automation, and reduced operational toil across the application lifecycle. In this role, you will champion SRE best practices, including SLIs/SLOs, error budgets, incident management, and reliability-driven development, while mentoring junior engineers and collaborating closely with product, platform, and engineering teams.

Key Responsibilities

Reliability & Service Ownership
Own the end‑to‑end reliability and operational health of BizOps‑owned services across production and non‑production environments, ensuring adherence to defined SLIs, SLOs, and error budgets.


Incident Management & Resilience Engineering
Lead major incident response, including advanced triage, impact assessment, and driving blameless root cause analysis (RCA).
Ensure systemic fixes via corrective and preventive actions to eliminate recurring issues and improve overall resilience.


Observability & Proactive Monitoring
Design and enhance observability frameworks using tools such as Splunk, Dynatrace, and custom telemetry.
Continuously improve monitoring, alerting, and dashboards to enable early detection and reduce Mean Time To Detect (MTTD) and Mean Time To Resolve (MTTR).


Automation & Toil Reduction
Design and implement automation solutions across operational workflows, CI/CD pipelines, and infrastructure management.
Focus on reducing manual effort (toil) through scripting, self-healing mechanisms, and intelligent alerting systems.


Release Engineering & Production Readiness
Actively participate in release planning and execution, ensuring all deployments meet production readiness criteria, including monitoring coverage, rollback strategies, and operational documentation.


Capacity Planning & Performance Optimization
Collaborate with engineering teams to assess and improve system performance, scalability, and capacity planning, ensuring systems can handle peak payment workloads.


Continuous Improvement & Reliability Engineering
Identify recurring incidents, architectural limitations, and operational inefficiencies using data-driven insights, and drive systemic reliability improvements.


Operational Excellence & Documentation
Develop and maintain runbooks, playbooks, and SOPs, ensuring they are reliable, actionable, and aligned with evolving system architecture.


Collaboration & Engineering Partnership
Act as a key reliability partner to Engineering, Product, Infrastructure, and Platform teams, embedding SRE practices early in the software development lifecycle (SDLC).


Mentorship & Knowledge Sharing
Mentor junior engineers, promoting SRE principles, coding practices, and operational excellence, fostering a strong engineering culture.


Governance, Risk & Compliance
Ensure adherence to Mastercard operational standards, audit requirements, and regulatory compliance, representing SRE/BizOps in change governance, audits, and risk reviews.



Skills & Qualifications
Required

Bachelor’s degree in Computer Science or a related technical field, or equivalent practical experience.
Strong hands‑on experience in Linux/Unix environments, SQL, and programming/scripting (Python, Shell, Groovy).
Solid understanding of distributed systems, APIs, networking, and system design fundamentals.
Proven ability to debug and resolve complex production issues across application, infrastructure, and integrations.
Experience managing high-availability systems in fast-paced operational environments.
Strong analytical, problem-solving, and decision-making skills.
Excellent communication skills with the ability to clearly articulate technical issues and reliability risks.
Ability to independently own services and drive outcomes.

Preferred

Experience supporting systems in financial services or regulated environments.
Hands-on experience with observability platforms (Splunk, Dynatrace, etc.).
Experience with CI/CD pipelines, infrastructure-as-code (IaC), and release automation.
Exposure to cloud platforms, containers, and platform services.
Experience implementing SRE concepts (SLIs, SLOs, error budgets, toil reduction).
Prior experience in on-call rotations and 24x7 production support models.

Corporate Security Responsibility


All activities involving access to Mastercard assets, information, and networks come with an inherent risk to the organization. Therefore, every person working for, or on behalf of, Mastercard is responsible for information security and must:

  • Abide by Mastercard’s security policies and practices;

  • Ensure the confidentiality and integrity of the information being accessed;

  • Report any suspected information security violation or breach; and

  • Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.