Senior Site Reliability Engineer, Observability
2 days ago
Senior Site Reliability Engineer, Observability role at Chainlink Labs.

Company Overview
Chainlink is the industry-standard oracle platform that brings the capital markets on-chain and powers the majority of decentralized finance (DeFi). Our stack provides essential data, interoperability, compliance, and privacy standards needed for advanced blockchain use cases. Since inventing decentralized oracle networks, Chainlink has enabled tens of trillions in transaction value and now secures the vast majority of DeFi.

Responsibilities
Build and orchestrate a modern OTEL-based observability platform.
Support multiple telemetry types, including metrics, logs, and traces.
Define and enforce governance for observability and large-scale problem management.
Ensure reliability, security, and performance exceed defined SLAs.
Collaborate with engineers across the company to troubleshoot issues, deploy new products, and increase velocity while reducing cognitive load.
Lead the design and deployment of monitoring/observability services to detect incidents and trigger alerts.
Ingest, aggregate, transform, and utilize data from a multitude of sources in real-time data pipelines.
Oversee availability, performance, and supportability of our observability infrastructure.
Create processes for alert response operations and support the team for reliable delivery of oracle data.
Recommend metrics collection to enable robust alerting with every new feature release.
Champion reliability and security by adopting best practices from the start.

Qualifications
7+ years of relevant professional experience, typically in dev-ops, infrastructure, SRE, or platform roles.
Ability to develop software beyond typical infrastructure requirements and configurations.
Proficiency in programming languages such as C, C++, Java, Python, Go, Perl, or Ruby.
Expert knowledge in designing, developing, and managing large real-time systems.
Experience with monitoring and logging: exporting metrics with Prometheus, building Grafana dashboards, and using a centralized logging solution (ELK, Splunk, or Grafana Stack).
Experience with distributed systems and container orchestration: maintaining or building Kubernetes clusters and deploying new services on them.
Strong communication skills: capable of giving and receiving constructive feedback, and comfortable in planning meetings and code reviews.

Desired Qualifications
Passion for blockchain, Web 3.0, and related decentralized technologies.
Experience running infrastructure in the blockchain/Web3 space.
Ability to scale systems sustainably through automation and iterative improvement.
Experience working remotely in a distributed team.
Strong desire to grow and challenge yourself, constantly seeking ways to improve and automate services to reduce toil.

Tools & Services
AWS; Terraform/Terragrunt; Kubernetes, Calico, ArgoCD; Prometheus and Grafana; GitHub Actions; Packer.
We expect proficiency in most of these tools and deep expertise in several.

Seniority Level: Mid-Senior level
Employment Type: Full-time
Job Function: Engineering and Information Technology
Industries: Technology, Information and Internet

Work Environment
All roles with Chainlink Labs are global and remote-based. Unless otherwise stated, we ask that applicants try to overlap some working hours with Eastern Standard Time (EST).

Application Process
We carefully review all applications and aim to provide a response to every candidate within two weeks after the job posting closes. The closing date is listed on the job advert; applicants are encouraged to prepare their application thoughtfully in advance.

Commitment to Equal Opportunity
Chainlink Labs is an equal opportunity employer. All qualified applicants will receive equal consideration for employment in compliance with applicable laws, regulations, or ordinances.
If you need assistance or accommodation due to a disability or special need when applying for a role or in our recruitment process, please contact us via our form.

Global Data Privacy Notice for Job Candidates and Applicants
Information collected and processed as part of your Chainlink Labs Careers profile, and any job applications you submit, is subject to our Privacy Policy. By submitting your application, you agree to our use and processing of your data as required.
-
Site Reliability Engineer Observability Lead
3 weeks ago
Buenos Aires, Argentina Unilever Full-time
Site Reliability Engineer Observability Lead. Responsibilities: Create a robust observability framework, including APM, alarming, dashboarding, and event correlation, integrated with an existing observability platform. Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions. Troubleshoot priority incidents,...
-
Lead Site Reliability Engineer
2 weeks ago
Buenos Aires, Argentina Ecolab Full-time
JOB DESCRIPTION Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability. The Infrastructure Engineering team is responsible for the design and engineering of solutions and technologies working with other engineering teams to support the...
-
Senior Site Reliability Engineer
20 hours ago
Buenos Aires, Argentina Dev.Pro Full-time
🟢 Are you in Brazil or Argentina? Join us as we actively recruit in these locations, offering a comfortable remote environment. Submit your CV in English, and we'll get back to you! We invite a Senior Site Reliability Engineer to join our dynamic team. In this hands-on role, you’ll focus on improving the stability, observability, and efficiency of our...
-
Senior SRE
1 week ago
Buenos Aires, Argentina Dev PRO Full-time
A top financial technology firm is seeking a Senior Site Reliability Engineer to improve the stability and efficiency of their services. The role involves leading initiatives in monitoring and automation, collaborating with engineering teams, and ensuring observability readiness. Candidates must have 5+ years in site reliability or platform engineering, with...
-
Senior Site Reliability Engineer
2 weeks ago
Buenos Aires, Buenos Aires C.F., Argentina Dev Full-time
Are you in Brazil or Argentina? Join us as we actively recruit in these locations, offering a comfortable remote environment. Submit your CV in English, and we'll get back to you. We invite a Senior Site Reliability Engineer to join our dynamic team. In this hands-on role, you'll focus on improving the stability, observability, and efficiency of our services....
-
Senior AI Site Reliability Engineer
2 weeks ago
Buenos Aires, Argentina SQUIRE Full-time
**WHO WE ARE** SQUIRE is the leading business management system designed for the needs of barbers, shop owners, and their communities. We believe the pursuit of artistry and autonomy should not be restricted by the complexities of running a business. With SQUIRE, we provide custom-branded tools, resources, and guidance to help barbers of all stages and...
-
Senior Site Reliability Engineer, Observability
4 days ago
Buenos Aires, Buenos Aires C.F., Argentina Chainlink Labs Full-time
About Chainlink: Chainlink is the industry-standard oracle platform bringing the capital markets onchain and powering the majority of decentralized finance (DeFi). The Chainlink stack provides the essential data, interoperability, compliance, and privacy standards needed to power advanced blockchain use cases for institutional tokenized assets, lending,...
-
Senior Site Reliability Engineer
3 weeks ago
Buenos Aires, Argentina Dev.Pro Full-time
We are a US-based outsourcing software development company that has been delivering exceptional software experiences to our clients since 2011, helping technology companies become industry leaders. Over the past few years, we’ve been hiring specialists all over the world while our main development centers were in Ukraine. Now, we keep expanding and start...
-
Senior API Engineer, Search
2 weeks ago
Buenos Aires, Argentina Ailet Full-time
Lead API Engineer (Search and Analytics), Blue Orange Digital. Site Reliability Engineer Observability Lead, Unilever. Site Reliability Engineer Observability Lead. Responsibilities: Create a robust observability framework, including APM, alarming, dashboarding, and event correlation, integrated with an existing observability platform. Perform analytics on previous...