Site Reliability Engineer
3 days ago
We are looking for a Site Reliability Engineer to join a new team at one of our clients, a major American pet care retailer offering supplies, services, and care solutions. This is an opportunity to join a large, well-established organization that combines retail, services, and digital solutions to improve the lives of pets and their owners, in a collaborative environment with the chance to work on impactful, customer-facing products at scale.
Responsibilities
- Ensure high availability, reliability, and performance of retail systems (e-commerce, checkout, inventory), especially during peak sales events.
- Monitor systems using SLIs/SLOs, lead incident response, and perform root cause analysis to reduce downtime and customer impact.
- Design and maintain scalable, fault-tolerant infrastructure using cloud platforms, containers, and Infrastructure as Code.
- Automate deployments, testing, and operational tasks through CI/CD pipelines and self-healing systems.
- Implement robust monitoring, logging, and alerting to proactively detect and resolve issues.
Requirements
- Strong experience with Linux/Unix systems and cloud platforms (GCP).
- Proficiency in at least one programming/scripting language (Python, Bash, Node).
- Hands-on experience with containers and orchestration (Docker, Kubernetes).
- Solid understanding of monitoring, logging, and alerting tools and SRE concepts (SLIs, SLOs, SLAs).
- Experience building or supporting high-traffic, customer-facing systems, preferably in e-commerce or retail environments.
- Knowledge of CI/CD pipelines, Infrastructure as Code (Terraform), and reliability best practices.
Nice to have
- Experience with observability.
- Experience with e-commerce.
We offer
- Flexible working hours (full-time).
- One "Flex Day" off per month – eligible after six months with the company.
- 10 business days of vacation.
- Swiss Medical health coverage.
- 100% remote work.
- Permanent contract with salary review every four months (in ARS).
- Access to Udemy and Platzi for professional training.
- Employee Assistance Program (financial, nutritional, psychological support, etc.).
- Fully covered English classes during working hours.
- Discounts on Club de Beneficios and Samsung products.
- Birthday day off.
About Us
Mobile Computing is joining Grid Dynamics (NASDAQ: GDYN), a leading provider of technology consulting, platform and product engineering, AI, and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical challenges and enable positive business outcomes for enterprise companies undergoing business transformation. A key differentiator for Grid Dynamics is our 8 years of experience and leadership in enterprise AI, supported by profound expertise and ongoing investment in data, analytics, cloud & DevOps, application modernization and customer experience. Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.
-
Site Reliability Engineer
1 week ago
Buenos Aires, Buenos Aires C.F., Argentina
Blockscout Limited
Full-time
Blockscout is a leading provider of indexing and UI services for EVM chains. Our team hosts explorers for many of the largest chains in the industry. Reliability is vital to our company's success. We are looking for a Site Reliability Engineer to strengthen our DevOps and Support teams.
Key responsibilities
- Monitor systems: Proactively watch production systems...
-
Site Reliability Engineer
1 week ago
Buenos Aires, Buenos Aires C.F., Argentina
Sur
Full-time
As the Site Reliability Engineer, you will support and scale the infrastructure powering their secure, mission-critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to...
-
Site Reliability Engineer
1 week ago
Buenos Aires, Buenos Aires C.F., Argentina
DevRev
Full-time
At DevRev, we're building the future of work with Computer – your AI teammate. Computer is not just another tool. It's built on the belief that the future of work should be about genuine human connection and collaboration – not piling on more apps. Computer is the best kind of teammate: it amplifies your strengths, takes repetition and frustration...
-
Senior Site Reliability Engineer
2 weeks ago
Buenos Aires, Buenos Aires C.F., Argentina
Dev
Full-time
Are you in Brazil or Argentina? Join us as we actively recruit in these locations, offering a comfortable remote environment. Submit your CV in English, and we'll get back to you. We invite a Senior Site Reliability Engineer to join our dynamic team. In this hands-on role, you'll focus on improving the stability, observability, and efficiency of our services....
-
Site Reliability
1 week ago
Buenos Aires, Buenos Aires C.F., Argentina
Canonical - Jobs
Full-time
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers,...
-
Site Reliability Engineer
1 week ago
Buenos Aires, Buenos Aires C.F., Argentina
Canonical - Jobs
Full-time
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and...
-
Senior Site Reliability Engineer
1 week ago
Buenos Aires, Buenos Aires C.F., Argentina
Canonical - Jobs
Full-time
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers,...
-
Senior Site Reliability
1 week ago
Buenos Aires, Buenos Aires C.F., Argentina
Canonical - Jobs
Full-time
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers,...