Site Reliability Engineer
6 days ago
We are looking for a Site Reliability Engineer to join a new team at one of our clients, a major American pet care retailer offering supplies, services, and care solutions. This is an opportunity to join a large, well-established organization that combines retail, services, and digital solutions to improve the lives of pets and their owners, in a collaborative environment with the chance to work on impactful, customer-facing products at scale.
Responsibilities
- Ensure high availability, reliability, and performance of retail systems (e-commerce, checkout, inventory), especially during peak sales events.
- Monitor systems using SLIs/SLOs, lead incident response, and perform root cause analysis to reduce downtime and customer impact.
- Design and maintain scalable, fault-tolerant infrastructure using cloud platforms, containers, and Infrastructure as Code.
- Automate deployments, testing, and operational tasks through CI/CD pipelines and self-healing systems.
- Implement robust monitoring, logging, and alerting to proactively detect and resolve issues.
Requirements
- Strong experience with Linux/Unix systems and cloud platforms (GCP).
- Proficiency in at least one programming/scripting language (Python, Bash, Node.js).
- Hands-on experience with containers and orchestration (Docker, Kubernetes).
- Solid understanding of monitoring, logging, and alerting tools and SRE concepts (SLIs, SLOs, SLAs).
- Experience building or supporting high-traffic, customer-facing systems, preferably in e-commerce or retail environments.
- Knowledge of CI/CD pipelines, Infrastructure as Code (Terraform), and reliability best practices.
Nice to have
- Experience with observability.
- Experience with e-commerce.
We offer
- Flexible working hours (full-time).
- One "Flex Day" off per month – eligible after six months with the company.
- 10 business days of vacation.
- Swiss Medical health coverage.
- 100% remote work.
- Permanent contract with salary review every four months (in ARS).
- Access to Udemy and Platzi for professional training.
- Employee Assistance Program (financial, nutritional, psychological support, etc.).
- Fully covered English classes during working hours.
- Discounts through Club de Beneficios and on Samsung products.
- Birthday day off.
About Us
Mobile Computing is joining Grid Dynamics (NASDAQ: GDYN), a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.
-
Site Reliability Engineer
2 weeks ago
Buenos Aires, Buenos Aires C.F., Argentina
Blockscout Limited
Full-time
Blockscout is a leading provider of indexing and UI services for EVM chains. Our team hosts explorers for many of the largest chains in the industry. Reliability is vital to our company's success. We are looking for a Site Reliability Engineer to strengthen our DevOps and Support teams. Key responsibilities: Monitor systems: Proactively watch production systems...
-
Site Reliability Engineer
2 weeks ago
Buenos Aires, Buenos Aires C.F., Argentina
Sur
Full-time
As the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission-critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to...
-
Site Reliability Engineer
2 weeks ago
Buenos Aires, Buenos Aires C.F., Argentina
DevRev
Full-time
At DevRev, we're building the future of work with Computer – your AI teammate. Computer is not just another tool. It's built on the belief that the future of work should be about genuine human connection and collaboration – not piling on more apps. Computer is the best kind of teammate: it amplifies your strengths, takes repetition and frustration...
-
Site Reliability Engineering
6 days ago
Buenos Aires, Buenos Aires C.F., Argentina
Modo
Full-time
We are MODO, the fintech of Argentina's banks that is revolutionizing the way people pay and save with promotions in Argentina. We are at the center of the payments ecosystem, developing innovative payment experiences in QR, NFC, and online with all payment methods, and creating the best place to run and enjoy promotions. In addition, we create the...
-
Senior Site Reliability Engineer, Observability
2 weeks ago
Buenos Aires, Buenos Aires C.F., Argentina
Chainlink Labs
Full-time
About Chainlink: Chainlink is the industry-standard oracle platform bringing the capital markets onchain and powering the majority of decentralized finance (DeFi). The Chainlink stack provides the essential data, interoperability, compliance, and privacy standards needed to power advanced blockchain use cases for institutional tokenized assets, lending,...
-
Sr Reliability Engineer
2 weeks ago
Buenos Aires, Buenos Aires C.F., Argentina
Human Consulting
Full-time
Job description: For our client, PLUSPETROL, we are looking for a Senior Reliability Engineer. The role will be responsible for analysis of the following processes: Availability, Maintainability, and Reliability. Through their technical expertise, they will coordinate, promote, and develop industry best practices, the...
-
Senior SRE
5 days ago
Buenos Aires, Buenos Aires C.F., Argentina
JPMorganChase
Full-time
Description: Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the CIB, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical...
-
SRE
2 weeks ago
Buenos Aires, Buenos Aires C.F., Argentina
ZirconTech
Full-time
Site Reliability Engineer (SRE) – Level 2 (L2). Experience: 6–8 years of relevant experience. Role Summary: The L2 SRE is responsible for managing and maintaining the reliability, performance, and scalability of cloud-based systems. This role handles incident management, monitoring, automation, and contributes to system design while mentoring junior team...