Site Reliability Engineer

hace 1 semana

Argentina Félix A tiempo completo

About Us At Félix, we're building the financial ecosystem for Latin immigrants in the U.S., starting with a revolution in remittances. Our core product is an AI-powered chatbot built on WhatsApp, allowing our users to send money home as easily as sending a text message. We leverage cutting-edge technology like AI, blockchain, and stablecoins to make cross-border payments faster, more affordable, and more accessible than ever before. We are a hyper-growth Series B company, backed by over $100 million in funding from top-tier global investors, including QED, Castle Island, Switch Ventures, HTwenty, Monashees, and General Catalyst Customer Value Fund. This isn't just about the numbers; it's a testament to the trust our investors have in our vision and our team. Additionally, Félix was selected as an “Endeavour Entrepreneur” and was a recipient of the CrossTech Fintech Startups Award. We are a group of extremely talented and dedicated high-performers, united by our shared obsession with a single goal: empowering our customers. We are all owners of Félix, driven by a bias for action and a true experimentation spirit to get shit done with urgency and focus. Joining Félix means you will be part of a team building a legacy, a company that will outlive us all. This is a rare opportunity to apply your skills to a deeply meaningful mission—serving a community that has been underserved for too long. We are a team that is fiercely loyal to each other, where radical transparency and constructive feedback are how we grow and push for excellence. We are bold, we care less about what others are doing, and more about creating sustainable value and a product that truly makes our users' lives better. We are building the future, today. About the Role We’re looking for a Site Reliability Engineer (SRE) to join our Engineering Operations team, reporting directly to Damian Finol, Head of EngOps. This is a new role focused on strengthening the reliability, scalability, and security of the infrastructure that powers our fintech platform. You’ll work closely with Engineering and SecOps to ensure our systems are highly available, observable, and cost-efficient. The role blends software engineering, systems operations, and security practices, with a strong emphasis on automation, proactive monitoring, and continuous improvement. Responsibilities Manage and optimize our infrastructure on Google Cloud Platform (GCP) and Google Kubernetes Engine (GKE). Automate provisioning and configuration using Terraform, Helm, and scripting languages such as Go, Python, and Bash. Build, maintain, and improve monitoring and alerting systems using Prometheus, Grafana, and centralized logging tools (e.g., ELK or Loki). Participate in on-call rotations, incident response, and post-mortem analyses, ensuring rapid recovery and continuous learning from failures. Define and track SLOs/SLIs and error budgets to monitor service health and performance. Implement cloud security best practices to protect sensitive data and maintain the integrity of our systems. Collaborate across Engineering, Security, and Product teams to embed reliability and automation in every phase of development and deployment. Contribute to GKE cost optimization and resource management strategies to enhance efficiency and control operational spend. Requirements 4+ years of experience as an SRE, DevOps, Infrastructure, or Platform Engineer. Strong hands-on experience with GCP and GKE. Proficiency in Kubernetes (architecture, deployments, networking, and troubleshooting). Solid programming or scripting skills in Go, Python, or Bash. Experience with Terraform and Helm for Infrastructure as Code. Strong understanding of monitoring and observability using Prometheus, Grafana, and logging frameworks. Familiarity with incident management, on-call operations, and post-mortem processes. Knowledge of network fundamentals (TCP/IP, DNS, load balancing). Experience with PostgreSQL or distributed databases. Awareness of FinOps and cloud cost management principles. Excellent problem-solving, communication, and collaboration skills, with a proactive mindset. Certified Kubernetes Administrator (CKA). Experience in FinOps, cloud security, or regulated industries. Familiarity with PagerDuty or similar incident management tools. Background implementing SLOs/SLIs and error budgets in production environments. These are the applicable requisites, although equivalent competencies in any of the above will also be considered. What We Offer Competitive salary Initial stock options grant Annual performance bonus Health, dental, and vision plans Remote work environment, although we have offices in Miami and México City and would love to work in hybrid model if you are up to it. Continuous learning opportunities Unlimited PTO Paid parental leave Empowering opportunities for growth in a dynamic entrepreneurial environment Equal Opportunity Employer At Félix, we are committed to providing equal employment opportunities to all qualified employees and applicants without regard to race, religion, nationality, sex, sexual orientation, gender identity, age, or disability. This policy applies to all terms and conditions of employment, including recruitment, hiring, placement, promotion, training, compensation, benefits, and termination. Want to learn more about our privacy practices? Check out our Privacy Policy. #J-18808-Ljbffr

Site Reliability Engineer: Scale, Reliability

hace 3 semanas

, , Argentina Capchase A tiempo completo

Join a forward-thinking company as a Site Reliability Engineer, where you'll play a crucial role in building scalable, high-performing systems. This position offers the opportunity to shape the future of reliability engineering while ensuring the availability, latency, and performance of our systems. You'll collaborate with a diverse team to define the...
Site Reliability Engineer

hace 3 días

, , Argentina Epsilon Solutions Ltd. SA de CV. A tiempo completo

Direct message the job poster from Epsilon Solutions Ltd. SA de CV. Technical Recruiter at Epsilon Solutions Ltd. Job profile: Sr. Site Reliability Engineer Location: Argentina (REMOTE) Job Type: Full Time Contract Overview We are looking for a highly skilled SRE with an engineering background to support our top clients with strong technical exposure and...
Site Reliability Engineer

hace 4 semanas

, , Argentina AgileEngine A tiempo completo

Join to apply for the Site Reliability Engineer role at AgileEngine AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work...
Site Reliability Engineer

hace 2 semanas

, , Argentina Capchase A tiempo completo

Join to apply for the Site Reliability Engineer role at Capchase . Capchase provides flexible payment solutions to B2B software, cloud, and AI companies. Our core product, Capchase Pay , offers a buy-now-pay-later payment option for B2B SaaS, hardware, and cloud purchases, helping companies sell more and collect cash faster. Founded in 2020 and headquartered...
Site Reliability Engineer

hace 4 semanas

, , Argentina Semperti A tiempo completo

¡En Semperti nos encontramos en la búsqueda de SRE SSR para sumarse al team! El Site Reliability Engineer (SRE) tendrá la misión de ayudar a nuestros clientes a garantizar la disponibilidad de sus sistemas como así también de liderar la adopción de nuevas herramientas para contribuir a la mejora de sus procesos, trabajando con numerosas tecnologías...
Site Reliability Engineer

hace 2 semanas

Argentina Description Ciklum A tiempo completo

DescriptionCiklum is looking for a Site Reliability Engineer to join our team in Argentina.We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers, consultants, analysts and product owners, we...
Senior Site Reliability Engineer

hace 7 días

, , Argentina Mas Global Consulting Llc A tiempo completo

Hi, this is Monica Hernandez, Founder and CEO of MAS Global. I started MAS with the idea that we could be more than a business, that’s why we like to say that MAS is More . I was born and raised in Medellin, Colombia and thanks to a scholarship I became a Software Engineer and built a career in the US, where I now live. Starting MAS was my way to give back...
Senior Devops

hace 3 semanas

, Chubut, Argentina Ingenierojob A tiempo completo

Senior DevOps / Site Reliability Engineer (Azure) (Ref-Lch) ARS 1.200.000 - 1.500.000 Senior DevOps / Site Reliability Engineer (Azure) (Ref-Lch) We are looking for a highly skilled Senior DevOps / Site Reliability Engineer with deep experience in Azure cloud, CI/CD automation, and secure workload identity. This role is ideal for someone who masters modern...
Site Reliability Engineer

hace 2 semanas

, , Argentina Felix A tiempo completo

About Us At Félix, we're building the financial ecosystem for Latin immigrants in the U.S., starting with a revolution in remittances. Our core product is an AI-powered chatbot built on WhatsApp, allowing our users to send money home as easily as sending a text message. We leverage cutting-edge technology like AI, blockchain, and stablecoins to make...
Senior Site Reliability Engineer

hace 6 días

Argentina Svitla Systems A tiempo completo

REMOTEARGENTINANovember 18, 2025Svitla Systems Inc. is looking for a Senior Site Reliability Engineer for a full-time position (40 hours per week) in Argentina. Our client is a leading expert network, providing business and government professionals with opportunities to communicate with industry and subject-matter experts to answer research questions. Their...

América

Europa

Asia / Oceanía

África

Site Reliability Engineer