Eximietas Design
Senior Software Engineer
Job Location
bangalore, India
Job Description
About Eximietas: Eximietas Design is a leading technology consulting and solutions development firm specializing in Chip design, Firmware & Embedded Software development, Cloud Computing, cybersecurity, and AI/ML domains. Our success is anchored in the unparalleled expertise of our engineering leadership team, who have collectively taped out over 100 chips and released countless software solutions for renowned tech giants like Google, Cisco, Microsoft, Oracle, Uber, Broadcom and Sun. With a commitment to innovation and excellence, we deliver cutting-edge solutions that empower businesses to thrive in the ever-evolving digital landscape. We are an ISO 9001 and ISO 27001 certified company with development centres in the US and India. Website link: https://www.eximietas.design Role: ML Data Engineer Location: Bangalore ( 5 days - Work from Office) Experience: 4 Years - 8 Years We are seeking a highly skilled and motivated candidate with expertise in programming, problem-solving, and machine learning (ML) and artificial intelligence (AI). The ideal candidate will possess strong programming skills, with a particular focus on Python, and experience utilising key data manipulation libraries, such as Pandas and NumPy. Familiarity with the Hugging Face Transformers and Datasets libraries is highly desirable. Key Requirements: 1. Proficiency in Python and its data manipulation libraries (e.g., Datasets, Pandas, NumPy). 2. Demonstrable experience designing and building scalable data pipelines for collecting, cleaning, transforming, and versioning large-scale datasets (text, code, structured) specifically for LLM finetuning. 3. Hands-on experience in preparing and formatting diverse datasets into specific structures required for LLM finetuning (e.g., prompt-completion, instruction-following, chat formats). 4. Experience in curating, cleaning, and structuring datasets for LLM evaluation, ensuring data quality and relevance for various benchmarks. 5. Familiarity with common challenges in LLM data preparation, such as bias detection/mitigation, data distribution analysis, and data augmentation techniques. 6. Solid understanding of data engineering best practices, including data quality, versioning, and efficient data processing. In addition to technical expertise, the ideal candidate will have experience with Git and GitHub for version control and a proven ability to collaborate effectively in a team environment, particularly when working on shared codebases and remote projects. Strong data management and manipulation skills are crucial, as is experience working on remote servers to develop and deploy machine learning models.
Location: bangalore, IN
Posted Date: 6/17/2025
Location: bangalore, IN
Posted Date: 6/17/2025
Contact Information
Contact | Human Resources Eximietas Design |
---|