Cloud Reliability Engineer, Coverage

3 days ago


Rosario, Argentina · Avature · Full-time

Avature’s Coverage team is dedicated to maintaining and improving the quality of our monitoring tools and practices, as applied during on-call shifts and other incident-detection work. The team's scope ranges from managing and continuously improving our server and service monitoring and alerting to maintaining a holistic view of system reliability.
As a Cloud Reliability Engineer, you’ll strive to implement tools and processes that improve observability, monitoring, and incident management, minimize emergency response time, and provide a pain-free experience for the teams involved in incident management.
Your challenges and objectives
- Understand Avature’s infrastructure and processes.
- Contribute to defining standards with a DevOps/SRE mindset and advocate for them.
- Identify and address weaknesses in our infrastructure to ensure service availability.
- Develop strategies to mitigate and prevent interruptions in critical services.
- Occasionally perform troubleshooting on ongoing incidents.

Your day-to-day activities
- Participate in the definition and implementation of SRE policies and practices.
- Collaborate with other infrastructure and development teams in the continuous improvement of their services’ monitoring and observability.
- Work with development and engineering teams to implement SRE practices from the early stages of the software development life cycle.
- Engage in incident management, conducting post-mortem analyses and proposing preventive measures to avoid future disruptions.
- Occasionally perform troubleshooting on ongoing incidents.

About you
- Knowledge of observability: logs (ELK stack), metrics (e.g. Prometheus, Grafana), and tracing (e.g. Jaeger, OpenTelemetry).
- Experience creating and maintaining fault-tolerant and distributed systems.
- Solid experience in Linux system administration.
- Analytical and troubleshooting skills.
- Infrastructure-as-code mindset.
- Software development (Python, Golang) and configuration management (Puppet, Ansible) skills.
- Knowledge of incident management and related tools, such as Splunk On-Call.

About us
Avature is a market-leading enterprise SaaS solution provider for global talent acquisition and talent management. We have a strong commitment to high-quality engineering and customer service and are recognized innovators in the very large company market. We currently work with over 650 companies worldwide, including 110 of the Fortune 500, all of the Big Four consulting firms, the largest banks and manufacturers in the world, and five governments.
We design, build, implement, and support our product ourselves. With 26 releases a year and a strong commitment to innovation and quality engineering, our private cloud platform has become the product of choice for very large global organizations.
At Avature, we value opportunities to learn and grow within a dynamic, creative, and collaborative environment. We encourage autonomy and empower our people to approach challenges innovatively while bringing their unique perspective to the table. We offer a career development program that supports continuous learning and thoughtful leadership, and that meaningfully impacts each individual’s professional trajectory.
What we offer
- A fast-paced, energetic, and engaging environment.
- Flexible hours.
- Work remotely or come by the office as much as you want.
- Four salary reviews per year.
- Option to earn part of your salary in US dollars.
- Three weeks of vacation from the first year.
- Four weeks of paternity leave.
- OSDE 310 health coverage (family plan).
- Four days a year to attend events related to professional development.
- End-of-year week off (December 26 to 31).
- Internet service expenses.
- Birthdays off.

An organizational culture that empowers everyone to be themselves is key to thriving in business, but more importantly, it is a pathway to creating a more equitable society. Avature fosters a diverse and inclusive environment and celebrates that each unique person brings something different to our team. We are committed to considering all qualified applicants equally and to promoting equal opportunities within our organization.


