AI/ML Engineer

hace 1 semana


Alta Gracia, Argentina Zyte A tiempo completo

About Us At Zyte, we eat data for breakfast and you can eat your breakfast anywhere and work for Zyte. Founded in 2010, we are a globally distributed team of over 250 Zytans working from over 28 countries who are on a mission to enable our customers to extract the data they need to continue to innovate and grow their businesses. We believe that all businesses deserve a smooth pathway to data For more than a decade, Zyte has led the way in building powerful, easy-to-use tools to collect, format, and deliver web data, quickly, dependably, and at scale. And today, the data we extract helps thousands of organizations make smarter business decisions, secure competitive advantage, and drive sustainable growth. Today, over 3,000 companies and 1 million developers rely on our tools and services to get the data they need from the web. Data QA is an important function within Zyte. The Data QA team works to ensure that the quality and usability of the data scraped by our web scrapers meets and exceeds the expectations of our enterprise clients. Are you passionate about data and data quality and integrity? Do you enjoy using Python and AI to analyze and manipulate data, detect data quality issues, and visualize your findings? Are you highly customer-focused with excellent attention to detail? Owing to growing business and the need for ever more sophisticated Data QA, we are looking for a talented Data Scientist to join our team. As a Zyte Engineer, you work on AI-based data wrangling, data manipulation, and data visualisation techniques and apply them in the verification and validation of data quality as it pertainsto data extracted from the web. Roles & Responsibilities Design and implement AI-driven quality checks : build models to detect anomalies, identify schema drift, and classify data errors in real time. Automate and scale QA : replace manual and rule-based validation with ML-powered solutions that continuously improve. Leverage GenAI for validation : use embedding models, LLMs, and prompt-driven pipelines to perform semantic checks on scraped data. Develop monitoring & alerting pipelines : quantify data quality via KPIs, dashboards, and automated reports for stakeholders. Experiment & innovate : research and prototype new AI techniques for QA, using embeddings, synthetic data, and reinforcement learning to stress-test scrapers. Collaborate cross-functionally : work with developers, product managers, and account teams to integrate AI-based QA into production workflows. Communicate insights : present findings with clear visualizations, metrics, and evidence-based recommendations to technical and non-technical audiences. Requirements Proficiency in Python & PyData stack (NumPy, pandas, scikit-learn, PyTorch/TensorFlow preferred). 3+ years in a data science, applied ML, or data engineering role (ideally with exposure to QA or data validation at scale). Hands-on experience with GenAI tools: LLM APIs (OpenAI, Anthropic, Google), prompt engineering, cost/token optimization. Strong ML fundamentals: anomaly detection, classification, clustering, embeddings, evaluation metrics. Experience with big data frameworks (Spark, BigQuery, or similar). Ability to work with very large datasets (millions+ of records). Version control skills (GitHub/Bitbucket). Excellent communication in English, both technical and non-technical. Desired Skills Prior experience in data quality automation or web data QA. Familiarity with LangChain, MCP, Marvin, or similar orchestration frameworks. Experience building QA dashboards or visualization layers. Background in statistics or applied mathematics. Previous remote/distributed work experience. Benefits As a new Zytan, you will: Become part of a self-motivated, progressive, multi-cultural team. Have the freedom and flexibility to work from where you do your best work. Attend conferences and meet with team members from across the globe. Work with cutting-edge open source technologies and tools. #J-18808-Ljbffr



  • Alta Gracia, Argentina Zyte A tiempo completo

    A data technology company is seeking a talented Data Scientist to join their team in Argentina. The role involves designing AI-driven quality checks, automating QA processes, and leveraging GenAI for data validation. Ideal candidates will have a strong proficiency in Python and at least 3 years in data science or a related field. This position offers the...


  • Alta Gracia, Argentina Darwoft A tiempo completo

    Project: Life Sciences & Healthcare Data Intelligence Time Zone: ART Get to Know Us At Darwoft, we partner with cutting‑edge companies around the world to build digital products that create real impact. One of our clients is a leading Life Sciences and Healthcare data intelligence company that is transforming decision‑making by empowering global...


  • Alta Gracia, Argentina Darwoft A tiempo completo

    - Project: Life Sciences & Healthcare Data Intelligence - Time Zone: ART Get to Know Us At Darwoft, we partner with cutting‑edge companies around the world to build digital products that create real impact. One of our clients is a leading Life Sciences and Healthcare data intelligence company that is transforming decision‑making by empowering global...

  • Remote DataOps Engineer

    hace 1 semana


    Alta Gracia, Argentina Darwoft A tiempo completo

    A leading technology firm in Argentina seeks a Data Operations Engineer to ensure the stability and reliability of data platforms. You will monitor and optimize data pipelines, develop automation tools using Python, and collaborate across teams to deliver robust data solutions. Ideal candidates have 4+ years in data operations and strong skills in cloud...

  • Remote DataOps Engineer

    hace 2 semanas


    Alta Gracia, Argentina Darwoft A tiempo completo

    A leading technology firm in Argentina seeks a Data Operations Engineer to ensure the stability and reliability of data platforms. You will monitor and optimize data pipelines, develop automation tools using Python, and collaborate across teams to deliver robust data solutions. Ideal candidates have 4+ years in data operations and strong skills in cloud...