MCP & Tools Python Developer - Agent Evaluation Infrastructure

1 week ago


Buenos Aires, Buenos Aires C.F., Argentina | Mindrift | Full-time

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. 

What we do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who we're looking for

Calling all security researchers, engineers, and penetration testers with a strong foundation in problem-solving, offensive security, and AI-related risk assessment.

If you thrive on digging into complex systems, uncovering hidden vulnerabilities, and thinking creatively under constraints, join us.

We're looking for someone who can bring a hands-on approach to technical challenges, whether breaking into systems to expose weaknesses or building secure tools and processes. We value contributors with a passion for continuous learning, experimentation, and adaptability. 

About the project

We're on the hunt for hands-on Python engineers for a new project focused on developing Model Context Protocol (MCP) servers and internal tools for running and evaluating agent behavior. You'll implement base methods for agent action verification, integrate with internal and client infrastructures, and help fill tooling gaps across the team.

What you'll be doing:

  • Developing and maintaining MCP-compatible evaluation servers
  • Implementing logic to check agent actions against scenario definitions
  • Creating or extending tools that writers and QAs use to test agents
  • Working closely with infrastructure engineers to ensure compatibility
  • Occasionally helping with test writing or debug sessions when needed
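
To make the second task above concrete, here is a minimal sketch of checking recorded agent actions against a scenario definition. Everything in it is an assumption for illustration: the `ExpectedAction` and `AgentAction` schemas, the `verify_run` function, and the tool names are hypothetical and do not come from the MCP specification or from the project's actual codebase.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class ExpectedAction:
    """One step a scenario expects the agent to take (hypothetical schema)."""
    tool: str
    required_args: dict[str, Any] = field(default_factory=dict)


@dataclass
class AgentAction:
    """One tool call recorded from an agent run (hypothetical schema)."""
    tool: str
    args: dict[str, Any] = field(default_factory=dict)


def matches(expected: ExpectedAction, actual: AgentAction) -> bool:
    """True when the tool name matches and every required argument has the expected value."""
    if expected.tool != actual.tool:
        return False
    return all(
        key in actual.args and actual.args[key] == value
        for key, value in expected.required_args.items()
    )


def verify_run(scenario: list[ExpectedAction], actions: list[AgentAction]) -> list[str]:
    """Return failure messages; an empty list means the run passed.

    Expected steps must appear in order, but unrelated actions may occur in between.
    """
    failures = []
    position = 0  # index of the first action not yet consumed by a matched step
    for step in scenario:
        for index in range(position, len(actions)):
            if matches(step, actions[index]):
                position = index + 1
                break
        else:
            failures.append(f"missing or out-of-order step: {step.tool} {step.required_args}")
    return failures


# Illustrative data only; the tool names and arguments are made up.
scenario = [
    ExpectedAction("search_flights", {"destination": "EZE"}),
    ExpectedAction("book_flight"),
]
actions = [
    AgentAction("search_flights", {"destination": "EZE", "date": "2025-01-01"}),
    AgentAction("get_weather"),
    AgentAction("book_flight", {"flight_id": "AR1300"}),
]
print(verify_run(scenario, actions))  # []
```

Returning a list of failure messages instead of a single boolean is one possible design choice: it gives writers and QAs something readable to act on when a run fails.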

Although we're only looking for experts for this current project, contributors with consistent high-quality submissions may receive an invitation for ongoing collaboration across future projects. 

How to get started:

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements

The ideal contributor will have:

  • 4+ years of Python development experience, ideally in backend or tools
  • Solid experience building APIs, testing frameworks, or protocol-based interfaces
  • Understanding of Docker, Linux CLI, and HTTP-based communication
  • Ability to integrate new tools into existing infrastructures
  • Familiarity with how LLM agents are prompted, executed, and evaluated
  • Clear documentation and communication skills - you'll work with QA and writers

We also value applicants who have:

  • Experience with Model Context Protocol (MCP) or similar structured agent-server interfaces
  • Knowledge of FastAPI or similar async web frameworks
  • Experience working with LLM logs, scoring functions, or sandbox environments
  • Ability to support dev environments (devcontainers, CI configs, linters)
  • JavaScript (JS) experience

Benefits

  • Get paid for your expertise, with rates that can go up to $17/hour depending on your skills, experience, and project needs.
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
  • Influence how future AI models understand and communicate in your field of expertise.

