Senior SRE

hace 2 semanas


Argentina Werben HR A tiempo completo

Locations*: Buenos Aires, Argentina, is preferable; other locations are in Argentina, Brazil, Colombia, Peru, Chile, Mexico, Bolivia, Spain, Serbia (Belgrade), Czechia (Prague), Ukraine, Portugal. Type of work: Remote, full-time We are seeking a Senior SRE engineer to join a team that works on a complex distributed architecture, spanning physical machines - and virtualizing on-prem host/cloud computing. The role is to help set up centralized DevOps and help existing teams adopt more centralized best practices. The ideal candidate will have the ability to manage complexity and tackle problems across multiple stack layers as a part of a small team championing operational excellence. Our environment is relaxed yet intellectually intense. Our teams are lean and agile, which means rapid prototyping of products with immediate user feedback. We seek people who think in code, aspire to solve undiscovered computer science challenges, and are motivated by being around like-minded people. In fact, of the 600 employees globally, approximately 500 of them code daily. Job (Project) Description The customer develops and deploys systematic financial strategies across a variety of asset classes and global markets. We seek to produce high-quality predictive signals (alphas) through our proprietary research platform to employ financial strategies focused on exploiting market inefficiencies. Our teams work collaboratively to drive the production of alphas and financial strategies – the foundation of a sustainable, global investment platform. Key Responsibilities: Architecture and Automation: Design and deploy As-A-Service solutions using open-source software to automate system management, scaling, and monitoring. System Optimization: Develop tools to streamline deployment, monitoring, and incident management for large-scale, distributed environments. Collaboration Across Teams: Work with development and operations teams to design and implement software solutions that enhance the overall reliability of services. Contribute to the ongoing DevOps and Agile transformation. Monitoring & Incident Response: Set up, configure, and maintain monitoring and alerting systems to ensure real-time visibility into system performance. Participate in on-call rotations to respond to incidents and mitigate downtime. CI/CD & Infrastructure Management: Continuously improve CI/CD pipelines using tools like GitLab, Helm, Terraform, and Ansible, ensuring fast, safe, and reliable deployments. Container Orchestration: Leverage container orchestration platforms like Kubernetes (K8S) to manage distributed systems at scale. Experience with Slurm or similar cluster management is a plus. Cloud and Automation Tools: Use cloud infrastructure (AWS, GCP, etc.) and Infrastructure as Code (IaC) tools to automate the provisioning and scaling of resources. Key Skills and Requirements: Linux Systems: Deep expertise and hands-on experience working with Linux-based systems, with a focus on optimization and troubleshooting. Python Proficiency: Strong skills in Python for scripting, automation, and system management. Containerization & Orchestration: In-depth knowledge of container orchestration technologies such as Kubernetes (K8S). Experience with other cluster management tools like Slurm is a plus. Infrastructure as Code (IaC): Hands-on experience with tools like Helm, Terraform, and Ansible to manage infrastructure in a scalable and automated way. Container Technologies: Strong working knowledge of Docker, Podman, or other containerization systems to enable efficient and consistent deployment. CI/CD Pipelines: Experience working with CI/CD tools, especially GitLab (preferred), GitHub, or Git, to ensure smooth and rapid delivery cycles. Monitoring & Logging: Experience with monitoring and logging solutions such as Prometheus, Grafana, and the ELK stack to provide comprehensive insights into system performance and health. Relational Databases: Understanding of relational databases, their performance tuning, and management in distributed systems. Agile Development: Familiarity with Agile development methodologies, with a focus on continuous improvement and collaboration. Cloud Experience: Exposure to cloud technologies such as AWS or Google Cloud (GCP) is a strong plus. Collaboration & Communication: A team-first attitude with excellent verbal and written communication skills in English, able to work collaboratively with peers across the organization. #J-18808-Ljbffr


  • Senior SRE

    hace 1 semana


    , , Argentina Laravel A tiempo completo

    A leading technology company is seeking a Senior Site Reliability Engineer to enhance its global infrastructure's reliability and elegance. This role involves establishing SRE as a core function, designing multi-region Kubernetes systems, and creating observability tools. Candidates should possess deep proficiency in Linux administration, AWS, and...


  • , , Argentina Next League A tiempo completo

    Senior Engineering Manager, Site Reliability Join to apply for the Senior Engineering Manager, Site Reliability role at Next League. 3 days ago Be among the first 25 applicants. As the Senior Manager of Site Reliability Engineering, you will be responsible for ensuring the reliability, scalability, and efficiency for a wide range of client systems, including...


  • , , Argentina Mas Global Consulting Llc A tiempo completo

    Hi, this is Monica Hernandez, Founder and CEO of MAS Global. I started MAS with the idea that we could be more than a business, that’s why we like to say that MAS is More . I was born and raised in Medellin, Colombia and thanks to a scholarship I became a Software Engineer and built a career in the US, where I now live. Starting MAS was my way to give back...

  • Senior SRE – ID #00140

    hace 3 semanas


    , , Argentina Werben HR A tiempo completo

    Locations*: Buenos Aires, Argentina, is preferable; other locations are in Argentina, Brazil, Colombia, Peru, Chile, Mexico, Bolivia, Spain, Serbia (Belgrade), Czechia (Prague), Ukraine, Portugal. Type of work: Remote, full-time We are seeking a Senior SRE engineer to join a team that works on a complex distributed architecture, spanning physical machines -...


  • Argentina Unosquare A tiempo completo

    A global technology firm in Argentina is seeking a Senior Site Reliability Engineer to build and support high-capacity systems. The ideal candidate has strong experience in Terraform, EKS, and K8S, and will collaborate across functional teams. Responsibilities include monitoring AWS costs and managing automation tools to enhance operational efficiency. This...


  • , , Argentina Ciklum A tiempo completo

    A custom product engineering company is seeking a Senior DevOps Engineer to enhance and oversee their production environments in Argentina. The ideal candidate will ensure optimal performance and security measures in both Windows and Linux systems, while working with cutting-edge technologies. Responsibilities include monitoring production, deploying...


  • , , Argentina Next League A tiempo completo

    A digital growth consultant is seeking a Senior Engineering Manager for Site Reliability to lead a team of engineers and ensure the reliability and efficiency of client systems. The position involves responsibilities that include both leadership duties and hands-on technical tasks, focusing on maintaining high service standards for major clients. Candidates...


  • , , Argentina Svitla Systems, Inc. A tiempo completo

    A global digital solutions company is seeking a Senior Site Reliability Engineer in Argentina. The role requires expertise in managing production-level SaaS applications, particularly on Azure. Responsibilities include designing scalable infrastructure and monitoring system performance. The ideal candidate should have a Bachelor's degree, 5-7 years of...

  • Senior DevOps Engineer

    hace 2 semanas


    , , Argentina Ciklum A tiempo completo

    Ciklum is looking for a Senior DevOps Engineer to join our team in Argentina. We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers, consultants, analysts and product owners, we engineer...


  • , , Argentina Mas Global Consulting Llc A tiempo completo

    A leading technology firm in Argentina is seeking a Senior Site Reliability Engineer to ensure system reliability, scalability, and security. The ideal candidate will have over 5 years of experience in Site Reliability Engineering or DevOps, strong skills in AWS, Docker, Kubernetes, and automation. This role involves driving automation initiatives,...