Zyte

Machine Learning Engineer - Web Data Quality

Job Location

Rio de Janeiro, Brazil

Job Description

At Zyte, we make the world's web data accessible to everyone. Our technology powers data extraction at scale, helping businesses and researchers unlock the full potential of the web. We're a remote-first, multicultural team of engineers, data scientists, and innovators who believe in curiosity, collaboration, and continuous learning. If you're passionate about building reliable AI systems and improving the quality of web data, we'd love to hear from you.

About The Role

As a Machine Learning Engineer (Web Data Quality), you will design and implement intelligent systems that automatically detect, measure, and improve the quality of large-scale web datasets. You will work at the intersection of data science, AI, and distributed systems, collaborating closely with product, engineering, and data teams to make data accuracy measurable, scalable, and actionable.

What You’ll Do

Develop and deploy ML models for anomaly detection, schema drift, and content validation
Build and improve data quality pipelines leveraging modern data and MLOps tools
Design and optimize embeddings and GenAI models to enhance data consistency
Collaborate with engineers to integrate AI systems into production workflows
Conduct experiments, evaluate performance, and iterate for continuous improvement
Stay up to date on AI/ML and GenAI research to guide innovation within Zyte

Required Qualifications

3 years of experience in Machine Learning / Data Science / AI Engineering
Strong Python skills and experience with ML frameworks (PyTorch, TensorFlow, scikit-learn)
Experience with data validation, anomaly detection, or data quality systems
Familiarity with data pipelines (Airflow, Spark, or similar)
Understanding of model evaluation, metrics, and deployment best practices
Excellent problem-solving, communication, and collaboration skills

Preferred Qualifications

Experience with LangChain, LlamaIndex, or GenAI model orchestration
Familiarity with data labeling tools and active learning approaches
Contributions to open-source or public ML projects
Experience working in a remote, cross-functional team environment

Benefits

35 days of paid time off
Health & wellness support
Inclusive and supportive team environment
Attend conferences and meet with team members from across the globe
Work with cutting-edge open source technologies and tools

Seniority Level: Mid-Senior level
Employment Type: Full-time
Job Function: Information Technology
Industries: IT Services and IT Consulting

Location: Rio de Janeiro, Rio de Janeiro, BR

Posted Date: 11/24/2025

Contact Information

Contact Human Resources
Zyte

UID: 5463586814

AboutJobs.com does not guarantee the validity or accuracy of the job information posted in this database. It is the job seeker's responsibility to independently review all posting companies, contracts and job offers.