Director , Site Reliability Engineering
hace 1 semana
Our Purpose
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Title and Summary
Director , Site Reliability EngineeringWho is Mastercard?Mastercard is a global technology company in the payments industry. Our mission is to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential.
Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. With connections across more than 210 countries and territories, we are building a sustainable world that unlocks priceless possibilities for all.
Overview
Are you a visionary leader who thrives on driving transformation in complex infrastructure environments? Do you excel at building high-performing teams, fostering innovation, and aligning technology with business outcomes? The Distributed Platform Operations team is seeking a Director of Site Reliability Engineering (SRE) to lead strategic initiatives that ensure the reliability, scalability, and performance of our VMware and Oracle Linux platforms.
This role is ideal for a seasoned leader who combines deep technical expertise with a passion for operational excellence, automation, and cross-functional collaboration.
Skills
Strategic Leadership & Vision
• Define and execute the strategic roadmap for Site Reliability Engineering across distributed platforms.
• Lead modernization efforts including hardware lifecycle management, virtualization upgrades, and infrastructure optimization.
• Champion a culture of automation, resilience, and continuous improvement.
• Build, mentor, and scale a high-impact SRE organization with a focus on technical excellence and career development.
• Establish clear objectives, performance metrics, and development plans for team members.
• Promote knowledge sharing and operational maturity through documentation and onboarding programs.
• Oversee the health and performance of VMware clusters, ESXi hosts, and Oracle Linux environments.
• Ensure robust disaster recovery and high availability strategies are in place and tested.
• Drive incident management and root cause analysis for critical infrastructure issues.
• Lead the adoption of Infrastructure-as-Code and automation frameworks using tools like Chef, Ansible, PowerCLI, Python, and Jenkins.
• Reduce operational toil through scalable automation and self-healing systems.
• Align engineering practices with DevOps principles and agile methodologies.
• Architect observability solutions using Prometheus, Grafana, Splunk, and Dynatrace.
• Define and enforce Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
• Optimize alerting and telemetry to support proactive incident response.
• Ensure infrastructure compliance with security baselines, OS configurations, and regulatory standards.
• Collaborate with InfoSec and audit teams to maintain a secure and compliant environment.
• Partner with application, network, and storage teams to align infrastructure capabilities with business needs.
• Communicate technical strategies, upgrade plans, and operational impacts to executive stakeholders.
• Influence enterprise architecture and platform engineering decisions.
Experience:
• 10+ years of experience in infrastructure, SRE, or platform engineering roles, with 5+ years in leadership.
• Proven success in leading large-scale infrastructure modernization and automation initiatives.
• Deep expertise in VMware, Linux systems, and SRE practices.
• Strong executive communication, strategic thinking, and stakeholder management skills.
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
Abide by Mastercard's security policies and practices;
Ensure the confidentiality and integrity of the information being accessed;
Report any suspected information security violation or breach, and
Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.
-
Site Reliability Engineer
hace 1 semana
Ciudad Guayana, Estado Bolívar, Argentina Mastercard A tiempo completoOur PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...
-
Site Reliability Engineer
hace 1 semana
Ciudad Guayana, Estado Bolívar, Argentina Mastercard A tiempo completoOur PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...
-
Lead Product Manager
hace 2 días
Ciudad Guayana, Estado Bolívar, Argentina Mastercard A tiempo completoJob Title:Lead Product Manager - Technical Overview:OverviewArcus, a Mastercard company, powers the next generation of real-time payments and financial connectivity, with a strong focus on driving innovation across Mexico's financial ecosystem. Our platform enables secure, compliant, and scalable money movement for fintechs, financial institutions, and...
-
Site Reliability Engineer
hace 3 semanas
Ciudad Autónoma De Buenos Aires, Argentina Chevron A tiempo completoOverview Improves and protects the software and systems behind all of organization’s IT services, including management of scalability, availability, latency, performance, security, and capacity, and delivering of software faster, better, and cheaper. The Chevron Business Support Center (BASSC), located in Buenos Aires (Puerto Madero), Argentina, is...
-
Site Reliability Engineer: Scale, Automation
hace 3 semanas
Ciudad Autónoma De Buenos Aires, Argentina Chevron A tiempo completoA global energy company in Buenos Aires is looking for a Site Reliability Engineer to improve IT services and systems. Responsibilities include proactive incident prevention, leading troubleshooting efforts, and optimizing system performance. The ideal candidate has hands-on IT experience, full stack knowledge, and strong communication skills. Don't miss the...
-
Site Reliability Operator
hace 4 semanas
Ciudad Autónoma De Buenos Aires, Argentina Netrix Global A tiempo completoAbout The Opportunity Netrix Global is looking for a Site Reliability Operator to join our Managed Services – Site Operations team. How You Will Make An Impact Develop our clients Work closely with a cross-functional team of developers, designers, cloud engineers, solution architects, and DBA's engineers to ensure the infrastructure 24x7 Participate on an...
-
Senior SRE: Cloud-Native Infra, CI/CD, Flexible Hours
hace 24 horas
Ciudad de Mendoza, Argentina AgileEngine A tiempo completoA leading software development company in Mendoza, Argentina, is seeking a Site Reliability Engineer to design and implement scalable AWS infrastructure, mentor teams in DevSecOps practices, and enhance system reliability. The ideal candidate has 8–10 years of experience in infrastructure roles, strong skills in Infrastructure-as-Code, and excellent...
-
Plant Manager, Argentina
hace 4 semanas
Ciudad Autónoma De Buenos Aires, Argentina Ball A tiempo completoOverview Further your career at Ball, a world leader in manufacturing sustainable aluminum packaging. Achieve extraordinary things when you join our team, and make a difference in your professional development, the community, and around the globe! Ball is thrilled to receive Newsweek's 2023 Top 100 Global Most Loved Workplace award! As a sustainable product...
-
Plant Manager, Argentina
hace 4 semanas
Ciudad Autónoma De Buenos Aires, Argentina Ball Corporation A tiempo completoFurther your career at Ball, a world leader in manufacturing sustainable aluminum packaging. Achieve extraordinary things when you join our team, and make a difference in your professional development, the community, and around the globe! Ball is thrilled to receive Newsweek's 2023 Top 100 Global Most Loved Workplace award! As a sustainable product leader,...
-
Reliability Technician
hace 4 semanas
Distrito Ciudad de Godoy Cruz, Argentina Smiths Group A tiempo completoSerá responsable de la implementación y seguimiento de los contratos de Servicio con foco en la mejora de la confiabilidad de equipos rotantes mediante la aplicación de herramientas específicas como ACR FMEA RCM etc. para asegurar los objetivos de JC. Además es responsable de aplicar técnicas de mantenimiento predictivo como vibraciones y termografía....
-
DevOps Engineer
hace 4 semanas
Ciudad de Mendoza, Argentina AgileEngine A tiempo completo2 weeks ago Be among the first 25 applicants AgileEngine is an Inc. 5000 company that creates award‑winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people‑first culture has earned us multiple Best Place to Work awards. WHY JOIN US...
-
Senior Storage and Data Management Architect
hace 1 semana
Ciudad Autónoma De Buenos Aires, Argentina Financecolombia A tiempo completoOverview Senior Storage and Data Management Architect - Executive Director Remote (Heredia Province, Heredia, Costa Rica) International candidates ok? No About JPMorganChase JPMorganChase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world’s most prominent...
-
Design Engineer, Quick Response Center
hace 4 semanas
Ciudad de Mendoza, Argentina Flowserve Corporation A tiempo completoOverview Flowserve is a world-leading manufacturer and aftermarket service provider of comprehensive flow-control systems. We seek people who are committed to excellence, innovation and ownership. As an individual contributor or a leader of people, your enterprise mindset will help Flowserve maintain its position as a global standard. We offer professional...