Senior Site Reliability Engineer
hace 19 horas
At Emi Labs we are on a mission to increase Frontline Workers’ access to professional opportunities.
This is a 2.7 billion population that accounts for 80% of the world’s workforce. They are digitally invisible, as there’s little to no data available on who they are, their career history, or their skill set, limiting their access to professional opportunities and growth. We're here to transform this by building the infrastructure to make Frontline Workers visible.
Emi, our main product, is an A.I. recruitment assistant that enables companies to engage in a conversation with each applicant to detect interested and qualified individuals while saving Recruiters a huge amount of time by automating tasks such as screening, validating skills, scheduling interviews, and collecting documents.
We were part of Y-Combinator's Winter 2019 batch and in 2022 we have raised an $11M funding round co-led by Merus Capital and Khosla Ventures.
**About the SRE Team**
Our team is responsible for availability, performance, change management, monitoring, and capacity planning. We work to increase the productivity and autonomy of Emi's development teams, providing services, automation, and a secure and reliable platform in which the right systems can emerge. Right now, Emi's technology stack includes:
Kubernetes (EKS)
Helm
ArgoCD
Docker
Terraform, Terragrunt
OpenTelemetry
Prometheus, Thanos, Loki
AWS CloudWatch, Grafana, Sentry, New Relic
- Aurora PostgreSQL, Redshift, OpenSearch, MongoDB
React, Node.js, TypeScript, Python
**What You'll be doing**
You'll be a key player in the project of centralizing our microservices telemetry (metrics, traces and logs) in a single platform and soon will become the technical owner of our monitoring and observability stack. You will need to move fast, streamlining our tooling to unlock the full potential of our development teams.As a member of our SRE team, you will also identify, propose, and implement improvements in our cloud infrastructure and Kubernetes clusters to ensure the best possible experience for our developers and the highest uptime rate for our services.
**What we are looking for**:
- We are looking for an experienced SRE Engineer with strong focus on observability, monitoring and developer experience.
- 4 years of experience working with microservices and distributed systems: tracing, load balancing, concurrency, event-driven architecture patterns.
- 4 years of experience working as an SRE or DevOps engineer in cloud solution architecture.
- 3 years of experience administrating Kubernetes clusters inEKS/AKS/GKE.
- 2 years of experience administrating any observability platform.
- Fluent in scripting, Linux, Docker and CI pipelines.
- Experience dealing with production incidents, troubleshooting and remediation.
- Familiar with any modern Infra as Code solution, Terraform is a plus.
- You enjoy designing and discussing technical solutions.
- You have a strong DevOps mindset and a drive to give developers' autonomy.
- Humble learner: You are curious, enjoy learning and are avid to learn from your own mistakes.
- Proactive: You strive to solve problems to ease your team and others' lives.
- People-driven: You thrive at teamwork, and are fluent in helping others and asking for help.
- Deliver with focus: You excel at organizing your work and have a deliberate drive for the simplest solutions.
- Advanced English level.
**Bonus if you have**:
- Experience with OpenTelemetry, Grafana, Sentry, and/or New Relic is a huge plus.
- Avid to train developers and new colleagues in the DevOps mindset.
- Strong background in designing backend systems on any high-level language.
- Enterprise integrations experience: webhooks, public APIs.
- Application performance optimization: SQL queries, profiling, memory heap, garbage collection.
- Deployment strategies: rolling update, blue/green, canary.
- FinOps experience.
- Experience designing and implementing Disaster Recovery Plans.
Emi Labs is committed to fostering a fair, inclusive, and equal work environment. We believe diversity is crucial to building the best team and solving Frontline Worker's access to professional opportunities, that is why Emi aims to be a leader in workplace equality and move both our company and the industry forward.
Emi is a very dynamic and new startup where growth opportunities are there for the taking We are just building a team with great impact so now is the best time to jump aboard
Interested in knowing more about Emi Labs?
LI-Remote
-
Lead Site Reliability Engineer
hace 2 semanas
Buenos Aires, Argentina Ecolab A tiempo completoJOB DESCRIPTION Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.The Infrastructure Engineering team is responsible for the design and engineering of solutions and technologies working with other engineering teams to support the...
-
Site Reliability Engineer Observability Lead
hace 3 semanas
Buenos Aires, Argentina Unilever A tiempo completoSite Reliability Engineer Observability Lead Responsibilities Create a robust observability framework, including an APM, alarming, dashboarding, event correlation, integrated to an existing observability platform. Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions. Troubleshoot priority incidents,...
-
Site Reliability Engineer
hace 1 semana
Capital Federal, Buenos Aires, Argentina Rp consultoria A tiempo completoNos encontramos en búsqueda de un/a **Ssr. Site Reliability Engineer **para incorporar a nuestro equipo en Buenos Aires, Argentina. ¿Qué buscamos en un **Ssr. Site Reliability Engineer**? Ser un colaborador activo de la automatización de tareas que necesiten intervención manual en el ciclo de desarrollo de software. Con muchas ganas de aprender,...
-
Senior Site Reliability Engineer
hace 7 días
Buenos Aires, Argentina Neara A tiempo completoNeara is a high-growth, venture-backed Series B, tech company headquartered in Sydney, Australia. We work with 75% of the utilities in Australia and New Zealand and are growing rapidly across the US and Europe. Our mission is to revolutionise the utilities industry by helping them future-proof their infrastructure and navigate the challenges of the clean...
-
Site Reliability Engineer
hace 1 semana
Buenos Aires, Argentina Launchpad Technologies A tiempo completoLaunchpad, a people-first technology company, is a leader in North America´s rapidly growing tech sector. Through two solutions, Launchpad supports its clients with digital transformation: - PaasportTM, our iPaaS solution, streamlines software integration and automates workflows. - Nearshore Staff Augmentation, our managed IT staffing service, connects top...
-
Senior Site Reliability Engineer, Observability
hace 4 días
Buenos Aires, Argentina Chainlink A tiempo completoSenior Site Reliability Engineer, Observability Join to apply for the Senior Site Reliability Engineer, Observability role at Chainlink Labs. Company Overview Chainlink is the industry‑standard oracle platform that brings the capital markets on‑chain and powers the majority of decentralized finance (DeFi). Our stack provides essential data,...
-
Senior Ai Site Reliability Engineer
hace 2 semanas
Buenos Aires, Argentina SQUIRE A tiempo completo**WHO WE ARE** SQUIRE is the leading business management system designed for the needs of barbers, shop owners, and their communities. We believe the pursuit of artistry and autonomy should not be restricted by the complexities of running a business. With SQUIRE, we provide custom-branded tools, resources, and guidance to help barbers of all stages and...
-
Site Reliability Engineer
hace 3 semanas
Buenos Aires, Argentina Exxon Mobil A tiempo completoA global energy company is seeking a Site Reliability Engineer to manage and automate operations in Buenos Aires. Ideal candidates should have a Bachelor's degree and over 2 years of experience in site reliability or infrastructure engineering, specifically within a DevOps framework. The role involves developing infrastructure as code, performance...
-
Senior Site Reliability Engineer
hace 5 días
Capital Federal, Buenos Aires, Argentina Business Commercial Management A tiempo completoBCM Uruguay is Hiring! Senior Site Reliability Engineer - Remote Remote - LATAM **English Level**: B2+ / C1 - Advanced Contractor - USD ⏱ Full-Time Para empresa multinacional de servicios en ingeniería digital, especialista en software de última generación y en desarrollo de productos digitales. Cuando una idea aparece, nacen la motivación y el deseo...
-
Site Reliability Engineer
hace 4 días
Buenos Aires, Argentina VS-Staffing A tiempo completoJob Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, LATAM **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. - Procedure...