Site Reliability Engineer
hace 4 días
DevRev
At DevRev, we're building the future of work with Computer – your AI teammate.
Computer is not just another tool. It's built on the belief that the future of work should be about genuine human connection and collaboration – not piling on more apps.
Computer is the best kind of teammate: it amplifies your strengths, takes repetition and frustration out of your day, and gives you more time and energy to do your best work.
How?
Easy: it's the only platform capable of…
Complete data unification
Most AI products focus on either structured data (like CRM records and support tickets), or unstructured data (like documents and emails). Computer AirSync connects everything, unifying all your data sources (like Google Workspace, Jira, Notion) into one AI-ready source of truth: Computer Memory.
Powerful search, reasoning, and action
Once connected to all your tools and apps, Computer is embedded in your full business context. It can find and summarize, sure. Even more impressive: it offers employees insights, strategic and proactive suggestions, plus powerful agentic actions.
Extensions for your teams and customers
Computer doesn't make you choose between new software and old. Its AI-native platform lets you extend existing tools with sophisticated apps and agents. So your teams – and your customers – can take action, seamlessly. These agents work alongside you: updating workflows, coordinating across teams, and syncing back to your systems.
This isn't just software. Computer brings people back together, breaking down silos and ushering in the future of teamwork, through human-AI collaboration. Stop managing software. Stop wasting time. Start solving bigger problems, building better products, and making your customers happier.
We call this Team Intelligence. It's why DevRev exists.
Trusted by global companies across multiple industries, DevRev is backed by Khosla Ventures and Mayfield, with $150M+ raised. We are 650+ people, across eight global offices.
About the RoleWe are seeking an experienced Site Reliability Engineer / Platform Engineer to join our team and help build and maintain a resilient, scalable infrastructure supporting our applications across multiple cloud providers. In this role, you will design and implement infrastructure solutions, automate operational processes, and work closely with development teams to ensure reliable, efficient systems that scale with our business.
Key Responsibilities- Design, build, and maintain infrastructure across AWS, GCP, and Azure using Infrastructure as Code (IaC) principles
- Implement and optimize CI/CD pipelines using tools like Argo and CircleCI to enable rapid, reliable deployments
- Manage and scale Kubernetes clusters in production environments, ensuring high availability and optimal resource utilization
- Administer and optimize cloud databases including MongoDB, Redis, RDS, and other data stores for performance and reliability
- Develop monitoring, alerting, and observability solutions to identify and resolve issues before they impact users
- Automate routine operational tasks to reduce manual toil and improve system reliability
- Conduct incident response and post-mortem analysis to drive continuous improvement
- Collaborate with development teams to design systems with reliability, scalability, and operational excellence in mind
- Document infrastructure architecture, runbooks, and operational procedures
- Evaluate and implement new tools and technologies to improve platform capabilities
- 3+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering
- Strong hands-on experience with at least two major cloud providers (AWS, GCP, Azure)
- Proficiency with Kubernetes for container orchestration and management
- Demonstrated expertise with IaC tools (Terraform, CloudFormation, Pulumi, or similar)
- Experience with CI/CD platforms, particularly Argo and/or CircleCI
- Solid understanding of database technologies including MongoDB, Redis, and relational databases (RDS)
- Proficiency in at least one programming or scripting language (Python, Go, Bash, etc.)
- Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK, CloudWatch)
- Experience implementing and managing OpenTelemetry (OTEL) for distributed tracing, metrics, and logging
- Strong understanding of networking, security, and infrastructure best practices
- Experience managing multi-cloud or hybrid cloud environments
- Familiarity with service mesh technologies (Istio, Linkerd)
- Knowledge of security hardening and compliance in cloud environments
- Experience with cost optimization in cloud infrastructure
- Contributions to open-source infrastructure or DevOps projects
- Certifications from major cloud providers
- Problem-solver who thrives on automation and reducing operational burden
- Clear communicator who can explain technical concepts to both technical and non-technical stakeholders
- Detail-oriented with strong attention to reliability and security
- Collaborative team player who enjoys mentoring others
- Self-motivated learner who stays current with infrastructure and cloud technology trends
- Competitive salary and comprehensive benefits package
- Opportunity to work with cutting-edge cloud technologies and tools
- Collaborative environment focused on knowledge sharing and professional growth
- Remote or flexible work arrangement
- Continuous learning and development opportunities
Culture
The foundation of DevRev is its culture -- our commitment to those who are hungry, humble, honest, and who act with heart. Our vision is to help build the earth's most customer-centric companies. Our mission is to leverage design, data engineering, and machine intelligence to empower engineers to embrace their customers.
That is DevRev
-
Site Reliability Engineer
hace 2 días
Buenos Aires, Buenos Aires C.F., Argentina Blockscout Limited A tiempo completoBlockscout is a leading provider of indexing and UI services for EVM chains. Our team hosts explorers for many of the largest chains in the industry. Reliability is vital to our company's success. We are looking for a Site Reliability Engineer to strengthen our DevOps and Support teams.Key responsibilitiesMonitor systems: Proactively watch production systems...
-
Site Reliability Engineer
hace 4 días
Buenos Aires, Buenos Aires C.F., Argentina Sur A tiempo completoAs the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission-critical SaaS platform. You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to...
-
Site Reliability Engineer
hace 4 días
Buenos Aires, Buenos Aires C.F., Argentina Sur A tiempo completoAs the Site Reliability Engineer you will support and scale the infrastructure powering their secure, mission-critical SaaS platform.You must be confident in operating and debugging both modern infrastructure (cloud-native, containerized services) and classic Windows production environments (IIS, SQL Server AlwaysOn, Service Broker), with the ability to...
-
Site Reliability Engineer
hace 4 días
Buenos Aires, Buenos Aires C.F., Argentina DevRev A tiempo completoDevRevAt DevRev, we're building the future of work with Computer – your AI teammate.Computer is not just another tool. It's built on the belief that the future of work should be about genuine human connection and collaboration – not piling on more apps.Computer is the best kind of teammate: it amplifies your strengths, takes repetition and frustration...
-
Site Reliability Engineer
hace 2 semanas
Buenos Aires, Buenos Aires C.F., Argentina Chevron A tiempo completoImproves and protects the software and systems behind all of organization's IT services, including management of scalability, availability, latency, performance, security, and capacity, and delivering of software faster, better, and cheaper.The Chevron Business Support Center (BASSC), located in Buenos Aires (Puerto Madero), Argentina, is accepting online...
-
Senior Site Reliability Engineer
hace 1 semana
Buenos Aires, Buenos Aires C.F., Argentina Dev A tiempo completoAre you in Brazil or Argentina? Join us as we actively recruit in these locations, offering a comfortable remote environment. Submit your CV in English, and we'll get back to youWe invite a Senior Site Reliability Engineer to join our dynamic team. In this hands-on role, you'll focus on improving the stability, observability, and efficiency of our services....
-
Senior Site Reliability Engineer
hace 1 semana
Buenos Aires, Buenos Aires C.F., Argentina Dev A tiempo completoAre you in Brazil or Argentina? Join us as we actively recruit in these locations, offering a comfortable remote environment. Submit your CV in English, and we'll get back to youWe invite a Senior Site Reliability Engineer to join our dynamic team. In this hands-on role, you'll focus on improving the stability, observability, and efficiency of our services....
-
Site Reliability Engineer
hace 2 semanas
Buenos Aires, Buenos Aires C.F., Argentina Capchase A tiempo completoCapchase is the #1 platform for vendor financing in tech. We help software and hardware vendors offer flexible installment payments as part of the sales process, improving conversion rates and cashflow. We provide an awesome buyer experience.Capchase was founded in 2020 and is headquartered in NYC. We've provided over $2.5B in funding to thousands of...
-
Site Reliability
hace 4 días
Buenos Aires, Buenos Aires C.F., Argentina Canonical - Jobs A tiempo completoCanonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers,...
-
Site Reliability Engineer
hace 4 días
Buenos Aires, Buenos Aires C.F., Argentina Canonical - Jobs A tiempo completoCanonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and...