Empleos actuales relacionados con AI Agent Evaluation Analyst - Argentina - Mindrift

  • Ai Agent Engineer

    hace 2 horas


    Argentina Velocity Agent A tiempo completo

    We are looking for AI Agent Engineer / AI Agent Developer to design, build autonomous and deploy autonomous AI Agents that support our investigative and administrative workflows. The role involves turning business process into reliable AI-driven automation. -      Translate business processes (investigations, case updates, invoicing) into...


  • , , Argentina Mindrift A tiempo completo

    This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI....


  • , , Argentina Mindrift A tiempo completo

    Be among the first 25 applicants. Get AI-powered advice on this job and more exclusive features. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We...


  • , , Argentina Mindrift A tiempo completo

    This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English. At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we do...


  • , , Argentina Mindrift A tiempo completo

    2 weeks ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets...


  • , , Argentina Mindrift A tiempo completo

    1 day ago – Be among the first 25 applicants. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We believe in using the power of collective...


  • , , Argentina Siena AI A tiempo completo

    Join to apply for the Engineering Manager role at Siena AI . Meet Siena Siena is the first intelligence layer for customer experience. We're creating an operating system of AI agents that learn, remember, and act across every customer touchpoint—from support conversations to shopping experiences to voice and social media interactions. Siena doesn’t just...

  • Senior AI Coach

    hace 1 semana


    , , Argentina Applaudo A tiempo completo

    You are a highly skilled AI professional who thrives at the intersection of engineering, solution design, and team enablement. You excel at building real‑world LLM‑powered systems while also mentoring, coaching, and uplifting cross‑functional teams. You are passionate about helping organizations adopt Generative AI responsibly and effectively, and you...

  • AI Strategist

    hace 17 horas


    , , Argentina Siena AI A tiempo completo

    AI Strategist Join to apply for the AI Strategist role at Siena AI. About Siena Siena is the first intelligence layer for customer experience. We’re creating an operating system of AI agents that learn, remember, and act across every customer touchpoint—from support conversations to shopping experiences to voice and social media interactions. Siena...

  • AI Engineer

    hace 1 semana


    , , Argentina CRAFTLabs A tiempo completo

    We are software artisans passionate about what we do: help companies build awesome solutions. With an agile process that is built on top of the best engineering practices. We believe transparent, honest and fluent communication, both remotely and on-site is a key factor to the success of any project. What are we looking for? Our ideal candidate is a seasoned...

AI Agent Evaluation Analyst

hace 4 semanas


Argentina Mindrift A tiempo completo

Job Overview At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. What We Do The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting‑edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real‑world expertise from across the globe. Who We're Looking For We’re seeking curious, intellectually proactive contributors—people who double‑check assumptions and play devil’s advocate. If you’re comfortable with ambiguity and complexity, and enjoy remote, flexible, project‑based work, you’re a good fit. About the Project We need QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. The role combines quality assurance, research, and logical problem‑solving. What You'll Be Doing Reviewing evaluation tasks and scenarios for logic, completeness, and realism. Identifying inconsistencies, missing assumptions, or unclear decision points. Helping define clear expected behaviors (gold standards) for AI agents. Annotating cause‑effect relationships, reasoning paths, and plausible alternatives. Thinking through complex systems and policies as a human would to ensure agents are tested properly. Working closely with QA, writers, or developers to suggest refinements or edge‑case coverage. Requirements Excellent analytical thinking: reason about complex systems, scenarios, and logical implications. Strong attention to detail: spot contradictions, ambiguities, and vague requirements. Familiarity with structured data formats: read JSON/YAML (writing not required). Ability to assess scenarios holistically: identify missing, unrealistic, or potentially breaking elements. Good communication and clear writing in English to document findings. Optional: Experience with policy evaluation, logic puzzles, case studies, or structured scenario design. Optional: Background in consulting, academia, olympiads (e.g., logic/math/informatics), or research. Optional: Exposure to LLMs, prompt engineering, or AI‑generated content. Optional: Familiarity with QA or test‑case thinking (edge cases, failure modes, "what could go wrong"). Optional: Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.). Benefits Get paid for your expertise, with rates up to $17/hour depending on skills, experience, and project needs. Participate in a flexible, remote, freelance project that fits around your primary professional or academic commitments. Gain valuable experience in advanced AI projects to enhance your portfolio. Influence how future AI models understand and communicate in your field of expertise. Seniority Level Internship Employment Type Part‑time Job Function Other Industries: IT Services and IT Consulting #J-18808-Ljbffr