Senior Site Reliability Engineer

4 days ago


Argentina Svitla Systems, Inc. Full-time

Svitla Systems Inc. is looking for a Senior Site Reliability Engineer for a full-time position (40 hours per week) in Argentina.

Our client is a leading expert network, providing business and government professionals with opportunities to communicate with industry and subject-matter experts to answer research questions. Their customers consult with these experts over the phone, in teleconferences, and in person at conferences, custom events, and workshops, or may gather their primary research data through surveys, polls, or web-based data offerings. Experts are categorized into six main industry sectors: healthcare; financial and business services; consumer goods and services; energy, industrials, and basic materials; tech, media, and telecom; and legal and regulatory. Since 2003, the company has provided its customers with primary research services, helping professionals gain a comprehensive understanding of a topic before making significant investments and/or business decisions. Their multinational client list includes nine of the top 10 consulting firms, hundreds of hedge funds, and many of the largest private equity firms and Fortune-ranked companies.

We are seeking a skilled Site Reliability Engineer (SRE) with experience managing production-level SaaS applications hosted on Azure. The ideal candidate will be adept at monitoring, analyzing, and troubleshooting application and infrastructure-related issues in a timely manner.

Requirements:
• Bachelor's degree in Computer Science, a related technical field, or equivalent practical experience.
• 5-7 years of software development experience in one or more programming languages.
• 3+ years of experience in designing, analyzing, and troubleshooting distributed systems.
• Strong expertise with Infrastructure as Code, Kubernetes and CI/CD, Python scripting, and any observability tooling (e.g., Datadog, Prometheus, Grafana, New Relic).

Nice to have:
• Experience with event-driven architectures, particularly RabbitMQ (RMQ).
• Familiarity with DevOps practices, CI/CD pipelines, and Infrastructure as Code (IaC) principles.
• Experience with Azure DevOps or similar platforms for managing builds and releases.
• Ability to thrive in a fast-paced, collaborative environment while handling multiple priorities.
• Knowledge of or expertise with Datadog.

Responsibilities:
• Design, implement, and manage highly available and scalable infrastructure on Microsoft Azure, leveraging Terraform and Python for automation.
• Build and operate Kubernetes (AKS) clusters to support containerized microservices, ensuring high reliability and performance.
• Develop and maintain Azure DevOps CI/CD pipelines to facilitate secure, consistent, and repeatable deployments.
• Proactively monitor production systems using Datadog; triage incidents, perform root-cause analysis, and implement post-incident improvements.
• Troubleshoot issues in both production and non-production environments using logs, metrics, traces, and system-level debugging to ensure system health.
• Collaborate with engineering teams to optimize system and application performance, resolving latency or capacity bottlenecks.
• Operate and support Azure-native services, including Azure Functions, SQL databases, storage accounts, and event-driven integrations (e.g., RabbitMQ).
• Define and maintain SLIs and SLOs, and participate in error-budget practices to align system reliability with business goals.
• Enhance system observability by improving monitoring, alerting, and logging strategies, and implement automation to reduce manual intervention and operational toil.
• Collaborate cross-functionally with developers, QA, and product stakeholders to ensure application operability, resilience, and seamless deployments.
• Participate in the on-call rotation, ensuring service uptime and reliability during production incidents.

We offer:
• US and EU projects based on advanced technologies.
• Competitive compensation based on skills and experience.
• Regular performance appraisals to support your growth.
• Flexible workspace: remote, our welcoming office, or a local coworking space.
• Bonuses for recommendations of new employees.
• Bonuses for article writing, public talks, and other activities.
• 15 vacation days, 10 national holidays, and sick leave.
• Personalized learning program tailored to your interests and skill development.
• Free tech webinars and meetups organized by Svitla.
• Fun corporate online/offline celebrations and activities.
• Awesome team and a friendly, supportive community.

About Svitla

Svitla Systems is a global digital solutions company headquartered in the U.S. and operating across the Americas, Europe, Asia, and APAC. Since 2003, we have served a wide range of clients, from innovative start-ups to Fortune 500 companies. Our success is built on partnership. By integrating seamlessly with clients' teams, we create lasting collaborations that drive real results. We are strong advocates of workplace flexibility, remote culture, and an individual approach to professional and personal growth.

Svitla is proud to be an equal opportunity employer. All qualified applicants will receive consideration for cooperation without regard to age, gender identity, sexual orientation, religion, race, color, national origin, disability, or any other characteristic protected by applicable law.

Our global mission is to build a business that contributes to the wellbeing of our partners, personnel, and their families, improves our communities, and makes a lasting difference in the world. Together, we are coding a brighter tomorrow and living it.



  • Argentina Laravel Full-time

    At Laravel, we don’t just build tools; we build the foundation that empowers millions of developers to ship their dreams. We are looking for a Senior Site Reliability Engineer to help us scale that mission by ensuring our global infrastructure remains as elegant and reliable as the code we write. If you are energized by the challenge of managing...

  • Site Reliability Engineer

    2 weeks ago


    Argentina Epsilon Solutions Ltd. SA de CV. Full-time

    Job profile: Sr. Site Reliability Engineer. Location: Argentina (remote). Job type: full-time contract. Overview: We are looking for a highly skilled SRE with an engineering background to support our top clients with strong technical exposure and...


  • Argentina Mas Global Consulting Llc Full-time

    Hi, this is Monica Hernandez, Founder and CEO of MAS Global. I started MAS with the idea that we could be more than a business; that's why we like to say that MAS is More. I was born and raised in Medellin, Colombia, and thanks to a scholarship I became a Software Engineer and built a career in the US, where I now live. Starting MAS was my way to give back...

  • Site Reliability Engineer

    4 weeks ago


    Argentina Capchase Full-time

    Capchase provides flexible payment solutions to B2B software, cloud, and AI companies. Our core product, Capchase Pay, offers a buy-now-pay-later payment option for B2B SaaS, hardware, and cloud purchases, helping companies sell more and collect cash faster. Founded in 2020 and headquartered...


  • Argentina Next League Full-time

    Senior Engineering Manager, Site Reliability. Posted 3 days ago. As the Senior Manager of Site Reliability Engineering, you will be responsible for ensuring the reliability, scalability, and efficiency of a wide range of client systems, including...


  • Argentina Mas Global Consulting Llc Full-time

    A leading technology firm in Argentina is seeking a Senior Site Reliability Engineer to ensure system reliability, scalability, and security. The ideal candidate will have over 5 years of experience in Site Reliability Engineering or DevOps, strong skills in AWS, Docker, Kubernetes, and automation. This role involves driving automation initiatives,...


  • Argentina Ciklum Full-time

    Ciklum is looking for a Site Reliability Engineer to join our team in Argentina. We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers, consultants, analysts, and product owners, we engineer...


  • Argentina Unosquare Full-time

    Are you looking for a career that makes a positive difference in your life and reimagines learners and educators across the globe? Do you want to work with fun and social people in a positive and engaged virtual office environment? We are hiring a Senior Site Reliability Engineer who will build and support reliable, high capacity, and well-performing systems...

  • Site Reliability Engineer

    4 weeks ago


    Argentina Felix Full-time

    About Us: At Félix, we're building the financial ecosystem for Latin immigrants in the U.S., starting with a revolution in remittances. Our core product is an AI-powered chatbot built on WhatsApp, allowing our users to send money home as easily as sending a text message. We leverage cutting-edge technology like AI, blockchain, and stablecoins to make...