Site Reliability Engineer
hace 5 días
Job Title:
Site Reliability Engineer (Automation & virtualization)Overview:
Site Reliability EngineerAbout the Role
We're looking for a passionate and skilled Site Reliability Engineer (SRE) to join our Platform Engineering team. This role is pivotal in automating and managing VMware ESXi hypervisors across Dell and Cisco UCS platforms, ensuring high reliability, scalability, and performance of our infrastructure.
You'll work at the intersection of infrastructure and software, driving automation, observability, and operational excellence across our virtualization stack.
---
Key Responsibilities
Hypervisor & Infrastructure Management
- Deploy, configure, and patch ESXi hosts using tools like VMware Update Manager, iDRAC, and UCS Central.
- Validate host readiness and enforce consistency across environments.
Automation & Infrastructure as Code
- Build and maintain automation pipelines using PowerCLI, Python, Terraform, and Ansible.
- Develop Infrastructure-as-Code (IaC) templates for scalable provisioning.
NSX & Network Integration
- Administer NSX-T/V for logical switching, routing, and micro-segmentation.
- Troubleshoot endpoint tagging and network performance issues between NSX and ESXi.
Monitoring & Observability
- Implement observability stacks using Prometheus, Grafana, Splunk, and Dynatrace.
- Define and track SLOs, SLIs, and error budgets.
Security & Compliance
Planning & Optimization
- Lead modernization efforts including UCS blade decommissioning and Dell R760 upgrades.
- Optimize cluster and VM sizing for performance and cost efficiency.
Collaboration & Stakeholder Engagement
- Partner with application, storage, and network teams to align infrastructure with workload needs.
- Communicate upgrade plans and maintenance schedules across teams.
Documentation & Knowledge Sharing
- Maintain build guides, validation checklists, and operational runbooks.
- Contribute to internal wikis and onboarding materials.
Required Skills
- 5+ years in SRE, DevOps, or Platform Engineering roles.
- Strong scripting in PowerCLI, Python, or Go.
- Experience with VMware ESXi, vCenter, NSX, and UCS Manager.
- Proficiency in Terraform, Ansible, and CI/CD pipeline tools.
- Familiarity with observability platforms and incident response workflows.
Preferred Qualifications
- Experience with REST API integration for ESXi and vCenter.
- Knowledge of GitOps, AIOps, and chaos engineering practices.
- Certifications: VMware VCP, CKA/CKAD, or equivalent.To find US Salary Ranges, visit People Place. Under the Compensation tab, select "Salary Structures." Within the text of "Salary Structures," click on the link "salary structures 2025," through which you will be able to access the salary ranges for each Mastercard job family. For more information regarding US benefits, visit People Place and review the Benefits tab and the Time Off & Leave tab.
-
Site Reliability Engineer
hace 1 semana
Ciudad Guayana, Estado Bolívar, Argentina Mastercard A tiempo completoOur PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...
-
Site Reliability Engineer
hace 1 semana
Ciudad Guayana, Estado Bolívar, Argentina Mastercard A tiempo completoOur PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...
-
Site Reliability Engineer
hace 1 semana
Ciudad Guayana, Estado Bolívar, Argentina Mastercard A tiempo completoJob Title:Site Reliability Engineer (Data Protection) Overview:Our PurposeWe work to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions,...
-
Lead Site Reliability Engineer
hace 1 semana
Ciudad Guayana, Estado Bolívar, Argentina Mastercard A tiempo completoOur PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...
-
Director , Site Reliability Engineering
hace 1 semana
Ciudad Guayana, Estado Bolívar, Argentina Mastercard A tiempo completoOur PurposeMastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships...
-
Platform Engineer II
hace 5 días
Ciudad Guayana, Estado Bolívar, Argentina Mastercard A tiempo completoJob Title:Platform Engineer II Overview:OverviewArcus Engineering team is looking for a Site Reliability Engineer to drive our customer experience strategy forward by consistently innovating and problem-solving. The ideal candidate is passionate about the customer experience journey, highly motivated, intellectually curious, analytical, and possesses an...
-
Site Reliability Engineer: Scale, Automation
hace 2 semanas
Ciudad Autónoma De Buenos Aires, Argentina Chevron A tiempo completoA global energy company in Buenos Aires is looking for a Site Reliability Engineer to improve IT services and systems. Responsibilities include proactive incident prevention, leading troubleshooting efforts, and optimizing system performance. The ideal candidate has hands-on IT experience, full stack knowledge, and strong communication skills. Don't miss the...
-
Senior SRE: Cloud-Native Infra, CI/CD, Flexible Hours
hace 4 semanas
Ciudad de Mendoza, Argentina AgileEngine A tiempo completoA leading software development company in Mendoza, Argentina, is seeking a Site Reliability Engineer to design and implement scalable AWS infrastructure, mentor teams in DevSecOps practices, and enhance system reliability. The ideal candidate has 8–10 years of experience in infrastructure roles, strong skills in Infrastructure-as-Code, and excellent...
-
Remote IT Operations Engineer — Incident
hace 2 semanas
Ciudad de Mendoza, Argentina Wakapi A tiempo completoA leading technology company in Mendoza, Argentina, seeks an IT Operations Technical Support Engineer to provide critical technical support. This role involves ensuring system reliability and resolving incidents while collaborating with internal teams. Candidates must have a Bachelor's degree in IT or a related field and strong skills in Linux commands, SQL,...
-
QA Automation Engineer – Data Analytics
hace 16 horas
Ciudad de Mendoza, Argentina AgileEngine A tiempo completoA dynamic software development company in Mendoza is looking for a Middle/Senior Quality Assurance Automation Engineer. This role focuses on ensuring the quality and reliability of analytics products in healthcare. Candidates should have 2–4 years of QA experience with strong SQL skills and API testing proficiency. The position offers a flexible work...
-
IT Operations Engineer: Automation
hace 2 semanas
Ciudad de Mendoza, Argentina Wakapi Software A tiempo completoA leading software company in Mendoza is seeking an IT Operations Technical Support Engineer to provide technical-level support to vendors and internal stakeholders. Responsibilities include troubleshooting software issues, collaborating with development teams, and ensuring system reliability. Ideal candidates should have a Bachelor's degree in IT or...
-
Senior Mobile Engineer
hace 1 semana
Ciudad de Mendoza, Argentina AgileEngine A tiempo completoA leading software development company in Mendoza is seeking a Senior/Lead Mobile Software Engineer to own and evolve mobile applications. This role involves direct impact on user experience and the full mobile application lifecycle with a focus on reliability and performance. The ideal candidate has over 5 years of mobile development experience, is...
-
Senior Software Engineer — Real-Time Trading
hace 2 semanas
Ciudad de Mendoza, Argentina AgileEngine A tiempo completoA software development company in Mendoza, Argentina seeks a Senior Software Engineer to design algorithms for market surveillance and data analysis. The successful candidate will have 4+ years of experience in Java and the trading industry, ensuring reliability and performance of systems. The role offers a chance to work on exciting projects with top-tier...
-
Software Engineer
hace 4 días
Ciudad de Mendoza, Argentina IQVIA A tiempo completoSoftware Engineer (Senior/Lead) ID41548 ($2,500 signing bonus) AgileEngine whatjobs.com AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple...
-
Software Engineer
hace 16 horas
Ciudad de Mendoza, Argentina Talent Connect A tiempo completoSoftware Engineer (Senior/Lead) ID41548 ($2,500 signing bonus) AgileEngine AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to...
-
Software Engineer
hace 4 semanas
Ciudad de Mendoza, Argentina AgileEngine A tiempo completoSoftware Engineer (Senior) ID44885 Join to apply for the Software Engineer (Senior) ID44885 role at AgileEngine. AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture...