Current jobs related to AI Agent Evaluation Analyst - Argentina - Mindrift

AI Agent Engineer

2 hours ago

Argentina Velocity Agent Full-time

We are looking for an AI Agent Engineer / AI Agent Developer to design, build, and deploy autonomous AI agents that support our investigative and administrative workflows. The role involves turning business processes into reliable AI-driven automation. - Translate business processes (investigations, case updates, invoicing) into...
AI Agent Evaluation Analyst

2 weeks ago

Argentina Mindrift Full-time

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI....
Freelance Agent Evaluation Analyst

2 weeks ago

Argentina Mindrift Full-time

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We...
Evaluation Scenario Writer

2 weeks ago

Argentina Mindrift Full-time

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English. At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI. What we do...
MCP & Tools Python Developer - Agent Evaluation Infrastructure

17 hours ago

Argentina Mindrift Full-time

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets...
Freelance Economic Analyst

4 weeks ago

Argentina Mindrift Full-time

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We believe in using the power of collective...
Engineering Manager, Agent

17 hours ago

Argentina Siena AI Full-time

Meet Siena. Siena is the first intelligence layer for customer experience. We're creating an operating system of AI agents that learn, remember, and act across every customer touchpoint, from support conversations to shopping experiences to voice and social media interactions. Siena doesn’t just...
Senior AI Coach

1 week ago

Argentina Applaudo Full-time

You are a highly skilled AI professional who thrives at the intersection of engineering, solution design, and team enablement. You excel at building real‑world LLM‑powered systems while also mentoring, coaching, and uplifting cross‑functional teams. You are passionate about helping organizations adopt Generative AI responsibly and effectively, and you...
AI Strategist

17 hours ago

Argentina Siena AI Full-time

About Siena. Siena is the first intelligence layer for customer experience. We’re creating an operating system of AI agents that learn, remember, and act across every customer touchpoint, from support conversations to shopping experiences to voice and social media interactions. Siena...
AI Engineer

1 week ago

Argentina CRAFTLabs Full-time

We are software artisans passionate about what we do: helping companies build awesome solutions with an agile process built on top of the best engineering practices. We believe transparent, honest and fluent communication, both remotely and on-site, is a key factor in the success of any project. What are we looking for? Our ideal candidate is a seasoned...

AI Agent Evaluation Analyst

4 weeks ago

Argentina Mindrift Full-time

Job Overview

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who We're Looking For

We’re seeking curious, intellectually proactive contributors: people who double-check assumptions and play devil’s advocate. If you’re comfortable with ambiguity and complexity, and enjoy remote, flexible, project-based work, you’re a good fit.

About the Project

We need QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. The role combines quality assurance, research, and logical problem-solving.

What You'll Be Doing

- Reviewing evaluation tasks and scenarios for logic, completeness, and realism.
- Identifying inconsistencies, missing assumptions, or unclear decision points.
- Helping define clear expected behaviors (gold standards) for AI agents.
- Annotating cause-effect relationships, reasoning paths, and plausible alternatives.
- Thinking through complex systems and policies as a human would to ensure agents are tested properly.
- Working closely with QA, writers, or developers to suggest refinements or edge-case coverage.

Requirements

- Excellent analytical thinking: reason about complex systems, scenarios, and logical implications.
- Strong attention to detail: spot contradictions, ambiguities, and vague requirements.
- Familiarity with structured data formats: read JSON/YAML (writing not required).
- Ability to assess scenarios holistically: identify missing, unrealistic, or potentially breaking elements.
- Good communication and clear writing in English to document findings.
- Optional: Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.
- Optional: Background in consulting, academia, olympiads (e.g., logic/math/informatics), or research.
- Optional: Exposure to LLMs, prompt engineering, or AI-generated content.
- Optional: Familiarity with QA or test-case thinking (edge cases, failure modes, "what could go wrong").
- Optional: Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).

Benefits

- Get paid for your expertise, with rates up to $17/hour depending on skills, experience, and project needs.
- Participate in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
- Gain valuable experience in advanced AI projects to enhance your portfolio.
- Influence how future AI models understand and communicate in your field of expertise.

Seniority Level: Internship
Employment Type: Part-time
Job Function: Other
Industries: IT Services and IT Consulting

