C++ Engineer Ai Runtime
hace 3 días
**About Us**:
We are a **stealth-mode startup** building next-generation infrastructure for the AI industry. Our team has decades of experience in software, systems, and deep tech. We are working on a new kind of AI runtime that pushes the boundaries of performance and flexibility making advanced models portable, efficient, and customizable for real-world deployment.
If you want to be part of a small, fast-moving team shaping the **future of applied AI systems**, this is your opportunity.
**Role**:
We are looking for a **C++ Engineer** with strong systems and GPU programming background to help extend and optimize an open-source AI inference runtime. You will work on low-level internals of large language model serving, focusing on:
- Dynamic adapter integration (e.g., LoRA/QLoRA)
- Incremental model update mechanisms
- Multi-session inference caching and scheduling
- GPU performance improvements (Tensor Cores, CUDA/ROCm)
This is a **hands-on role**: you will be designing, coding, profiling, and iterating on high-performance inference code that runs directly on CPUs and GPUs.
**Responsibilities**:
- Implement support for **runtime adapter loading (LoRA)**, enabling models to be customized on the fly without retraining or model merges.
- Design and implement mechanisms for **incremental model deltas**, allowing models to be extended and updated efficiently.
- Extend runtime to handle **multi-session execution**, with isolation and caching strategies for concurrent users.
- Optimize core math kernels and memory layouts to improve inference performance on **CPU and GPU backends**.
- Collaborate with backend and infrastructure engineers to integrate your work into APIs and orchestration layers.
- Write benchmarks, unit tests, and profiling tools to ensure correctness and measure performance gains.
- Contribute to system architecture discussions and help define the roadmap for future runtime features.
**Requirements**:
- Strong proficiency in **modern C++ (C++14/17/20)** and systems programming.
- Solid understanding of **low-level performance optimization**: memory management, multithreading, SIMD, cache efficiency.
- Experience with **CUDA** and/or **ROCm/HIP** GPU programming.
- Familiarity with **linear algebra kernels** (matrix multiply, attention) and how they map to hardware acceleration (Tensor Cores, BLAS libraries, etc.).
- Exposure to **machine learning inference frameworks** (e.g., llama.cpp, TensorRT, ONNX Runtime, TVM, PyTorch internals) is a plus.
- Comfortable working in a **Unix/Linux** environment; experience with build systems (CMake, Bazel) and CI pipelines.
- Strong problem-solving and debugging skills; ability to dive deep into both code and performance traces.
- Self-motivated and able to thrive in a **fast-moving startup** environment.
**Nice to Have**:
- Experience implementing **LoRA or adapter-based fine-tuning** in inference runtimes.
- Knowledge of **quantization methods** and deploying quantized models efficiently.
- Background in distributed systems or multi-GPU orchestration.
- Contributions to **open-source ML/AI systems**.
**Why Join**:
- Build core IP at the intersection of **AI and systems engineering**.
- Work with a highly technical founding team on problems that are both intellectually challenging and commercially impactful.
- Opportunity to shape the direction of a new AI platform from the ground up
- Competitive compensation (contract or full-time), equity potential, and flexible remote work.
-
Backend Engineer
hace 3 días
Rosario, Argentina Baasi A tiempo completo**About Us**: We are a **stealth-mode startup** building new infrastructure for the AI industry. Our mission is to make advanced language models deployable, customizable, and secure across diverse environments. Our platform leverages an existing SaaS codebase for authentication, billing, and user management, and we are extending it with AI-specific features...
-
Senior C/C++ Code Reviewer for AI Data Training
hace 2 semanas
Rosario, Argentina G2i Inc. A tiempo completoA leading AI training firm is seeking a skilled Code Reviewer with expertise in C/C++. The successful candidate will review evaluations of AI-generated C/C++ code, ensuring adherence to quality standards. Responsibilities include auditing code accuracy, providing feedback to annotators, and ensuring compliance with evaluation guidelines. The role offers...
-
Software Engineer C++
hace 2 semanas
Rosario, Argentina Aliantec A tiempo completoIntroducción **¿Qué querés saber primero sobre la posición «Software Engineer C++»?** - **¿Qué hace la compañía?** ***: - **¿Qué necesitás para ser parte del equipo?**: - **¿Qué vas a hacer?** ***: - **¿Cuál es el desafío de la posición?**: - **¿Con quién trabajarás?** ***: - **¿Cuándo y dónde trabajarás?**: - **¿Qué...
-
Remote AI Prompt Engineer for Scalable AI Systems
hace 4 horas
Rosario, Argentina FullStack A tiempo completoA leading IT talent network is hiring an AI Prompt Engineer to work with U.S. clients on flexible, remote projects. You will design and optimize workflows, analyze AI outputs, and integrate AI-driven logic into production systems. The ideal candidate has over 4 years of experience as a Software or AI Engineer, strong skills in prompt engineering, and...
-
Remote Frontend Engineer
hace 3 semanas
Rosario, Argentina Scale Up Recruiting Partners A tiempo completoA technology recruitment firm is seeking a Frontend Engineer (Angular + AI Integration) to develop responsive and AI-enhanced user interfaces. This fully remote position allows you to collaborate with an international team, working on cutting-edge AI products. The ideal candidate will have over 5 years of experience in Angular and TypeScript, with proven...
-
Senior C# Backend Engineer
hace 4 semanas
Rosario, Argentina AgileEngine A tiempo completoA leading software development firm in Rosario, Argentina is seeking a Senior/Lead C# Backend Engineer to drive the modernization of core systems. You will lead the design and development of concurrent applications, optimizing data operations and deploying scalable solutions on AWS. The ideal candidate has over 5 years of experience in C#, performance...
-
Remote Senior C# Code Review Engineer for LLM Data Training
hace 2 semanas
Rosario, Argentina G2i Inc. A tiempo completoA technology services company is seeking a Code Reviewer with deep expertise in C#. You will review evaluations of AI-generated C# code, ensuring quality standards are maintained. Ideal candidates should have 5–7+ years of C# development or code review experience, strong knowledge of the .NET ecosystem, and fluent English skills. The role is remote,...
-
Software Engineer C++ Remoto
hace 6 días
Rosario, Argentina Aliantec A tiempo completo**¿Qué hace la compañía?** **Empresa líder de América Latina, dedicada al desarrollo y venta de tecnología de precisión para la maquinaria agrícola.** Se encuentra ubicada en Santa Fe y cuenta con más de 130 colaboradores/as. Buscan ser los proveedores más eficaces y confiables de productos agrícolas a nível internacional. **¿Qué necesitás...
-
Software Engineer C++
hace 1 semana
Rosario, Argentina Aliantec A tiempo completo**¿Qué hace la compañía?** **Empresa líder de América Latina, dedicada al desarrollo y venta de tecnología de precisión para la maquinaria agrícola.** Se encuentra ubicada en Santa Fe y cuenta con más de 130 colaboradores/as. Buscan ser los proveedores más eficaces y confiables de productos agrícolas a nível internacional. **¿Qué necesitás...
-
Frontend Engineer
hace 3 semanas
Rosario, Argentina Scale Up Recruiting Partners A tiempo completoLocation: Latin America (Remote) Employment type: Contractor (Long-term) Language: Advanced English Compensation: USD salary – based on experience About The Client Our client is a fast-growing U.S.-based technology company building AI-powered products that simplify complex workflows and enhance user interaction through intelligent automation. They are...