Senior Site Reliability Engineer

hace 2 semanas

Buenos Aires, Argentina Masabi A tiempo completo

**Introducing Masabi**

// At Masabi, we’re driving the fare payment revolution, powering the journeys of millions all over the world. We build fare collection platforms that allow riders to seamlessly buy and present tickets for public transport either on their mobile phones, from a ticket machine, or even by tapping their bank card to travel.

Our Justride platform is used in over 250 locations globally, including some of the largest cities in the world. With our industry-first mobile ticketing SDK, we’ve partnered with large players in the transport space, including Uber, Moovit and Transit.

Your own journey is important to us too. Choosing a role here means joining a network of innovators from all walks of life; a group of passionate individuals who consistently deliver. Here, you’ll find the tools you need to build the career you want. Whether you’re taking the direct route or trying a new path, we’ll support you no matter what.

**Role Description_**

// We're looking for a Senior Site Reliability Engineer to join Masabi and be at the forefront of ensuring our platform's reliability, performance, and security. In this role, you'll be pivotal to scaling and modernising our platform while ensuring uptime, performance, and security. You'll work across legacy and modern infrastructure, drive key improvements, and collaborate closely with architecture and product teams to enable reliable delivery across the business.

**Location_**

// This role is available in a fully remote model for contractors based in Argentina.

**Responsibilities_**
- Automation and Scalability: Drive automation to reduce operational overhead and human error. Build CI/CD pipelines, develop Infrastructure as Code (IaC) using tools like Terraform and CloudFormation, and design scalable systems to handle high traffic while optimising resource utilisation. Drive the effort to scale up new environments as we expand globally.
- Continuous Improvement: Refine processes, tools, and workflows to enhance system reliability, scalability, and efficiency. Plan capacity to anticipate future needs and support high-performance systems.
- Security and Compliance: Ensure infrastructure meets organisational security standards and supports compliance frameworks like SOC 2 and PCI.
- Monitoring and Reliability: Maintain real-time monitoring systems aligned with SLIs and SLOs, ensuring uptime and performance meet or exceed SLAs. Set up proactive alerting mechanisms to address issues before they escalate.
- Cost Optimisation: Monitor and optimise cloud infrastructure costs through autoscaling, rightsizing, and architectural reviews to balance cost-effectiveness with reliability.
- Disaster Recovery and Redundancy: Implement failover strategies, disaster recovery plans, and redundancy to ensure system resilience under all conditions.
- Incident Management: Respond to production incidents, minimise downtime, and restore availability. Perform root cause analysis, implement preventive measures, and contribute to post-incident reviews to share lessons learned.
- Collaboration and Mentorship: Partner with developers to design reliable, maintainable systems. Coach teams on best practices for reliability, scalability, and observability, fostering a culture of ownership.
- Documentation and Knowledge Sharing: Maintain detailed documentation for infrastructure, incident response, and workflows. Develop playbooks and runbooks to ensure seamless knowledge transfer.

// Our platform is JVM-based and cloud-native, hosted on AWS. We utilise standard tooling, including Gitlab, Terraform, CloudFormation, Puppet, Kibana, Grafana and Confluent Cloud.

**Key Tools and Technologies SREs Work With_**
- Monitoring: Grafana, Prometheus, CloudWatch, Pingdom, Kibana
- CI/CD: GitLab CI, Rundeck
- IaC: Terraform, CloudFormation
- Cloud Platforms: AWS

**About You_**
- Significant experience in SRE or related roles, with a proven track record in building and maintaining reliable systems
- Expertise in AWS Cloud technologies
- Hands-on experience with Terraform and Grafana, along with strong knowledge of security principles and networking components
- Experience in building pipelines and robust CI/CD infrastructure
- A collaborative team player who approaches projects with an open mind and prioritises security
- Passionate about leveraging technology to drive advancements while ensuring reliability and security
- Excellent communication skills, a collaborative mindset, and a willingness to learn and contribute to team success
- Self-sufficient and capable of working independently, while also knowing when to seek support or input

**Desirable_**
- Familiarity with PCI DSS v4 Compliance requirements is a plus
- AWS Cloud certification
- Experience with orchestrating containers

**Careers at Masabi are for people going places - driven by a mission to make transit fair and accessible for all.**

We are a network of innovators from all walks of life, passionate about making a difference.

Site Reliability Engineer

hace 2 semanas

Buenos Aires, Argentina Careers at SunDevs A tiempo completo

**Descripción del puesto**: Como Site Reliability Engineer en SunDevs, colaborarás con otros ingenieros de software senior y Platform Engineers para diseñar y desarrollar sistemas y plataformas en la nube altamente disponibles, escalables, seguras y mantenibles para resolver grandes desafíos. Brindarás asesoramiento y guía a nuestros ingenieros de...
Site Reliability Engineer

hace 2 semanas

Buenos Aires, Argentina Launchpad Technologies A tiempo completo

Launchpad, a people-first technology company, is a leader in North America´s rapidly growing tech sector. Through two solutions, Launchpad supports its clients with digital transformation: - PaasportTM, our iPaaS solution, streamlines software integration and automates workflows. - Nearshore Staff Augmentation, our managed IT staffing service, connects top...
Senior DevOps

hace 3 semanas

Buenos Aires, Argentina Itps A tiempo completo

Senior DevOps / Site Reliability Engineer (Azure) (Ref-Lch) We are looking for a highly skilled Senior DevOps / Site Reliability Engineer with deep experience in Azure cloud, CI/CD automation, and secure workload identity. This role is ideal for someone who masters modern DevOps practices, understands cloud architecture at scale, and can lead the design and...
Senior Site Reliability Engineer

hace 2 semanas

Capital Federal, Buenos Aires, Argentina Business Commercial Management A tiempo completo

BCM Uruguay is Hiring! Senior Site Reliability Engineer - Remote Remote - LATAM **English Level**: B2+ / C1 - Advanced Contractor - USD ⏱ Full-Time Para empresa multinacional de servicios en ingeniería digital, especialista en software de última generación y en desarrollo de productos digitales. Cuando una idea aparece, nacen la motivación y el deseo...
Site Reliability Engineer

hace 2 semanas

Buenos Aires, Argentina Right Balance A tiempo completo

**Overview** We're looking for a Site Reliability Engineer. Headquartered in Los Angeles, California, Right Balance provides top-tier technology talent for innovative companies in the US. We’re in the top 50 companies to watch in LA. **Engagement Details** Our client is a USA-based company producing video solutions with the mission to advance scientific...
Sre - Site Reliability Engineer - Remoto - 1526

hace 2 semanas

Buenos Aires, Argentina Web: A tiempo completo

Descripción del empleo: ¿Qué hace la compañía? **Empresa de ingeniería digital que desde 2009 se dedica a mejorar equipos de productos digitales y facilitar iniciativas de transformación digital.** Con más de 1000 empleados en cinco países (México, Colombia, Bolivia, Argentina, Irlanda del Norte), su enfoque innovador de "flujos de trabajo" asegura...
Site Reliability Engineer

hace 4 días

Buenos Aires, Argentina Wise Athena A tiempo completo

**Join Our Team as an SRE!** Wise Athena looking for a **Site Reliability Engineer (SRE)** to join our dynamic and innovative team! At our company, we’re revolutionizing Revenue Growth Management (RGM) with the power of AI. You will work with a passionate, forward-thinking team. This is a fully remote position. **Key Responsibilities** - **Problem...
Site Reliability Engineer

hace 2 semanas

Buenos Aires, Argentina ConglomerateIT A tiempo completo

**Location: Remote** **Job Title : Site Reliability Engineer** **Job Summary**: **Responsibilities**: - Development: - Mirakl Integration: Implement and customize Mirakl integration features, including data synchronization, order processing, and seamless communication between Magento and Mirakl. - Collaboration: Collaborate closely with cross-functional...
Site Reliability Engineer: Scale, Automation

hace 3 semanas

Ciudad Autónoma De Buenos Aires, Argentina Chevron A tiempo completo

A global energy company in Buenos Aires is looking for a Site Reliability Engineer to improve IT services and systems. Responsibilities include proactive incident prevention, leading troubleshooting efforts, and optimizing system performance. The ideal candidate has hands-on IT experience, full stack knowledge, and strong communication skills. Don't miss the...
Site Reliability Engineer

hace 4 días

Buenos Aires, Argentina Solstice A tiempo completo

Solstice is an innovation and emerging technology firm that helps Fortune 500 companies seize new opportunities through groundbreaking digital solutions. As strategists and consultants, we help organizations evolve their digital strategy to solve mission-critical problems. As designers and developers, we build incredible digital solutions that transcend a...

América

Europa

Asia / Oceanía

África

Senior Site Reliability Engineer