Senior Site Reliability Engineer

hace 19 horas


Argentina Laravel A tiempo completo

At Laravel, we don't just build tools; we build the foundation that empowers millions of developers to ship their dreams. We are looking for a Senior Site Reliability Engineer to help us scale that mission by ensuring our global infrastructure remains as elegant and reliable as the code we write. If you are energized by the challenge of managing multi-region Kubernetes clusters, building robust observability systems, and solving complex operational puzzles with code, you've found your next home. Location / Timezone: Between US West and US East for optimal collaboration with the team. Description Of The Role As a Senior Site Reliability Engineer, you will be a founding member of our dedicated SRE function, reporting directly to Florian Beer. This is a high-impact, autonomous role where you will design and implement the systems that power Laravel Cloud, Nightwatch, Forge and Vapor. You will act as a bridge between development and operations, advocating for a blameless culture and shared responsibility for reliability across the entire organization. Your 12-Month Mission Imagine we are all at Laracon in 12 months' time. You are telling the team about your first year, and the impact is undeniable: First 30 Days: You have stabilized our incident response by creating comprehensive, actionable runbooks for our core alerts. Day 60: You have pioneered "observability as code" by migrating our alert rules and dashboards into version control Day 90: You have established clear, data-driven SLOs for all customer-facing products, giving us a unified language for reliability Year One: You have transformed our visibility by creating beautiful, insightful dashboards used by the entire company and have significantly reduced manual toil through sophisticated automation. What You Will Do Architect Reliability: Establish SRE as a core function at Laravel, building the fundamentals from the ground up. System Design: Design, build, and maintain multi-region Kubernetes infrastructure and global distributed systems. Automation: Solve operational challenges through software, reducing manual intervention (toil) for our product teams Observability: Design and implement monitoring, logging, and alerting systems using tools like Prometheus, Grafana, and Loki Collaboration: Partner with product leads and SecOps to make reliability a shared responsibility Incident Response: Lead incident reviews and postmortems in a strictly blameless environment to foster continuous learning Requirements Requirements - What You Will Bring Infrastructure Mastery: Deep experience with Linux system administration and cloud platforms, specifically AWS Orchestration & IaC: Proficiency with Kubernetes, Docker, and managing infrastructure via Terraform. Programming Skills: The ability to solve problems with software and scripting using PHP, Bash, or Go. Systems Thinking: A "smart and passionate" approach to troubleshooting, with the ability to deconstruct complex systems into triagable components. Reliability Mindset: Experience with SLO/SLI/SLA definition, capacity planning, and performance tuning. Soft Skills: A commitment to documentation, cross-team collaboration, and an automation-first mindset. Requirements - Bonus Skills Framework Familiarity: Previous experience working with the Laravel framework or our existing product suite (Cloud, Forge, Vapor, etc.) is highly preferred. Advanced Observability: Experience with Prometheus and Grafana Mimir for metrics storage and alerting. Cost optimization: Specialized knowledge in managing and optimizing resource usage and cloud costs Benefits Small tight-knit team where every developer counts Fully remote and globally distributed working environment Option to attend Laracon conferences around the world Health care plan (Medical, Dental & Vision) Paid time off (Vacation, Sick & Public holidays) Family leave (Maternity, Paternity) Pension plans (As locally applicable) Performance based bonus plan Company equity #J-18808-Ljbffr



  • Argentina MAS Global Consulting A tiempo completo

    Who We AreAt MAS Global Consulting, we are a premium digital engineering partner delivering technology solutions to some of the world's most innovative companies — from high-growth startups to Fortune 500 enterprises.With a people-first culture and a commitment to excellence, we combine nearshore talent, agile delivery, and technical depth to build...


  • , , Argentina Laravel A tiempo completo

    At Laravel, we don’t just build tools; we build the foundation that empowers millions of developers to ship their dreams. We are looking for a Senior Site Reliability Engineer to help us scale that mission by ensuring our global infrastructure remains as elegant and reliable as the code we write. If you are energized by the challenge of managing...


  • Argentina Jobgether A tiempo completo

    This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Site Reliability Engineer in Argentina. We are looking for an experienced Senior Site Reliability Engineer to help scale and secure a high-traffic, rapidly growing platform. In this role, you will be responsible for ensuring system reliability,...


  • , , Argentina Capchase A tiempo completo

    Join a forward-thinking company as a Site Reliability Engineer, where you'll play a crucial role in building scalable, high-performing systems. This position offers the opportunity to shape the future of reliability engineering while ensuring the availability, latency, and performance of our systems. You'll collaborate with a diverse team to define the...

  • Site Reliability Engineer

    hace 4 semanas


    , , Argentina Epsilon Solutions Ltd. SA de CV. A tiempo completo

    Direct message the job poster from Epsilon Solutions Ltd. SA de CV. Technical Recruiter at Epsilon Solutions Ltd. Job profile: Sr. Site Reliability Engineer Location: Argentina (REMOTE) Job Type: Full Time Contract Overview We are looking for a highly skilled SRE with an engineering background to support our top clients with strong technical exposure and...


  • , , Argentina Capchase A tiempo completo

    Join to apply for the Site Reliability Engineer role at Capchase . Capchase provides flexible payment solutions to B2B software, cloud, and AI companies. Our core product, Capchase Pay , offers a buy-now-pay-later payment option for B2B SaaS, hardware, and cloud purchases, helping companies sell more and collect cash faster. Founded in 2020 and headquartered...


  • , , Argentina Third-Party Job Posts A tiempo completo

    A leading hospitality technology company is seeking a Sr. Site Reliability Engineer to ensure the reliability and performance of their platform. The successful candidate will design AWS architectures, manage Kubernetes clusters, and automate deployments with Terraform. Must have over 5 years of relevant experience and good English communication skills. This...


  • Argentina Jobgether A tiempo completo

    This position is posted bygether on behalf of a partner company. We are currently looking for a Senior Site Reliability Engineer in Argentina. We are looking for an experienced Senior Site Reliability Engineer to help scale and secure a high-traffic, rapidly growing platform. In this role, you will be responsible for ensuring system reliability, performance,...


  • Argentina Svitla Systems, Inc. A tiempo completo

    Svitla Systems Inc. is looking for a Senior Site Reliability Engineer for a full‑time position (40 hours per week) in Argentina. Our client is a leading expert network, providing business and government professionals with opportunities to communicate with industry and subject‑matter experts to answer research questions. Their customers consult with these...


  • , , Argentina Next League A tiempo completo

    Senior Engineering Manager, Site Reliability Join to apply for the Senior Engineering Manager, Site Reliability role at Next League. 3 days ago Be among the first 25 applicants. As the Senior Manager of Site Reliability Engineering, you will be responsible for ensuring the reliability, scalability, and efficiency for a wide range of client systems, including...