Senior SRE

hace 4 semanas


Argentina Werben HR A tiempo completo

Locations*: Buenos Aires, Argentina, is preferable; other locations are in Argentina, Brazil, Colombia, Peru, Chile, Mexico, Bolivia, Spain, Serbia (Belgrade), Czechia (Prague), Ukraine, Portugal. Type of work: Remote, full-time We are seeking a Senior SRE engineer to join a team that works on a complex distributed architecture, spanning physical machines - and virtualizing on-prem host/cloud computing. The role is to help set up centralized DevOps and help existing teams adopt more centralized best practices. The ideal candidate will have the ability to manage complexity and tackle problems across multiple stack layers as a part of a small team championing operational excellence. Our environment is relaxed yet intellectually intense. Our teams are lean and agile, which means rapid prototyping of products with immediate user feedback. We seek people who think in code, aspire to solve undiscovered computer science challenges, and are motivated by being around like-minded people. In fact, of the 600 employees globally, approximately 500 of them code daily. Job (Project) Description The customer develops and deploys systematic financial strategies across a variety of asset classes and global markets. We seek to produce high-quality predictive signals (alphas) through our proprietary research platform to employ financial strategies focused on exploiting market inefficiencies. Our teams work collaboratively to drive the production of alphas and financial strategies – the foundation of a sustainable, global investment platform. Key Responsibilities: Architecture and Automation: Design and deploy As-A-Service solutions using open-source software to automate system management, scaling, and monitoring. System Optimization: Develop tools to streamline deployment, monitoring, and incident management for large-scale, distributed environments. Collaboration Across Teams: Work with development and operations teams to design and implement software solutions that enhance the overall reliability of services. Contribute to the ongoing DevOps and Agile transformation. Monitoring & Incident Response: Set up, configure, and maintain monitoring and alerting systems to ensure real-time visibility into system performance. Participate in on-call rotations to respond to incidents and mitigate downtime. CI/CD & Infrastructure Management: Continuously improve CI/CD pipelines using tools like GitLab, Helm, Terraform, and Ansible, ensuring fast, safe, and reliable deployments. Container Orchestration: Leverage container orchestration platforms like Kubernetes (K8S) to manage distributed systems at scale. Experience with Slurm or similar cluster management is a plus. Cloud and Automation Tools: Use cloud infrastructure (AWS, GCP, etc.) and Infrastructure as Code (IaC) tools to automate the provisioning and scaling of resources. Key Skills and Requirements: Linux Systems: Deep expertise and hands-on experience working with Linux-based systems, with a focus on optimization and troubleshooting. Python Proficiency: Strong skills in Python for scripting, automation, and system management. Containerization & Orchestration: In-depth knowledge of container orchestration technologies such as Kubernetes (K8S). Experience with other cluster management tools like Slurm is a plus. Infrastructure as Code (IaC): Hands-on experience with tools like Helm, Terraform, and Ansible to manage infrastructure in a scalable and automated way. Container Technologies: Strong working knowledge of Docker, Podman, or other containerization systems to enable efficient and consistent deployment. CI/CD Pipelines: Experience working with CI/CD tools, especially GitLab (preferred), GitHub, or Git, to ensure smooth and rapid delivery cycles. Monitoring & Logging: Experience with monitoring and logging solutions such as Prometheus, Grafana, and the ELK stack to provide comprehensive insights into system performance and health. Relational Databases: Understanding of relational databases, their performance tuning, and management in distributed systems. Agile Development: Familiarity with Agile development methodologies, with a focus on continuous improvement and collaboration. Cloud Experience: Exposure to cloud technologies such as AWS or Google Cloud (GCP) is a strong plus. Collaboration & Communication: A team-first attitude with excellent verbal and written communication skills in English, able to work collaboratively with peers across the organization. #J-18808-Ljbffr



  • , , Argentina Glia A tiempo completo

    A leading technology company based in Argentina is seeking a Senior Site Reliability Engineer to ensure the health and performance of production services. This role involves defining Service Level Objectives, leading incident responses, and improving CI/CD pipelines while collaborating remotely within a dedicated team. Ideal for candidates passionate about...


  • , , Argentina Cloudbeds A tiempo completo

    A leading tech company in Argentina seeks a Sr. Site Reliability Engineer to ensure the reliability and performance of their platform. The role involves architecting AWS cloud solutions, maintaining Kubernetes clusters, and championing SRE practices across teams. Candidates should have over 5 years of experience with AWS and Kubernetes. This remote position...

  • Senior Azure Infra

    hace 1 semana


    , , Argentina UTR Sports A tiempo completo

    A leading sports technology company in Argentina is seeking a Senior Infrastructure Engineer to build and manage cloud-native infrastructure. The ideal candidate will have over 7 years of experience, primarily working with Azure and Terraform, and must excel in a dynamic environment. This role encompasses setting standards for infrastructure and CI/CD...


  • , , Argentina Upbound A tiempo completo

    A cutting-edge technology company based in Argentina is seeking a Senior Production Engineer to enhance the reliability and scalability of their cloud platform. This position involves collaborating with software engineering teams to design, build and maintain robust systems. Ideal candidates should have extensive experience in distributed systems,...


  • , , Argentina UTR Sports A tiempo completo

    Why Join UTR Sports? UTR Sports is a leader in using innovative technology to elevate the sports of tennis and pickleball, providing a dynamic, fast‑paced work environment where you can make a real impact. We offer competitive compensation, opportunities for growth, and the chance to work with a passionate team of sports enthusiasts and technology...

  • Senior Cloud SRE

    hace 3 semanas


    , , Argentina Nearsure A tiempo completo

    A remote technology company in Argentina is seeking a Senior Site Reliability Engineer to collaborate with product teams on cloud infrastructure. Ideal candidates will have over 8 years of software development experience, including 5 years with AWS, Kubernetes, and Docker. You’ll play a crucial role in designing solutions for reliability and efficiency,...


  • , , Argentina xLabs A tiempo completo

    A blockchain infrastructure company is seeking a Senior / Staff Site Reliability Engineer. This remote position offers opportunities to work with cutting-edge blockchain technologies. Ideal candidates will have experience in Infrastructure-as-Code and container orchestration with Kubernetes. Responsibilities include collaborating with teams, problem-solving,...

  • Senior Azure DevOps

    hace 2 semanas


    , Chubut, Argentina Ingenierojob A tiempo completo

    A leading technology firm is seeking a Senior DevOps / Site Reliability Engineer to enhance their Azure cloud infrastructure. The role involves architecting and maintaining cloud environments, ensuring secure deployments, and optimizing reliability. Candidates should have solid experience with Azure and CI/CD automation, and demonstrate strong communication...


  • , , Argentina Glia A tiempo completo

    About Glia Our award-winning technology powers conversations with customers for some of the world’s largest enterprises. We believe that combining the human touch with technology is the best way to create amazing customer experiences. When human abilities such as problem-solving, creative thinking and relationship building are enhanced with technology......

  • Azure DevOps

    hace 2 semanas


    , , Argentina Blink Digital A tiempo completo

    A leading technology company is seeking a Senior DevOps / Site Reliability Engineer based in Buenos Aires to architect and manage Azure cloud environments. This role involves automating processes, ensuring secure deployments, and collaborating with cross-functional teams. Ideal candidates will possess deep expertise in Azure, CI/CD automation, and a strong...