Coders Brain Technology Pvt. Ltd.

PySpark Developer - Big Data Apps

Job Location

Chennai, India

Job Description

Position Title: Senior PySpark Developer
Experience Range: 6 to 10 Years
Location: Chennai
Job Type: Full-Time

Job Overview:
We are looking for an experienced Senior PySpark Developer to join our data engineering team. This role involves designing and building high-performance data processing systems using Apache Spark (PySpark) and other big data technologies. The ideal candidate will have a deep understanding of data engineering principles, the big data ecosystem, and cloud-based data solutions. You will play a critical role in developing scalable data pipelines, optimizing performance, and ensuring data integrity across platforms.

Key Responsibilities:
- Big Data Application Development: Design, develop, and maintain distributed data processing solutions using Apache Spark and Python (PySpark).
- Data Pipeline Engineering: Build and manage end-to-end data pipelines for ingestion, transformation, and delivery of structured and unstructured data.
- Performance Optimization: Tune and optimize Spark applications for speed, efficiency, and fault tolerance, especially with large-scale datasets.
- Collaboration: Work closely with data engineers, analysts, data scientists, and business stakeholders to understand requirements and translate them into scalable solutions.
- Data Quality & Governance: Implement best practices for data quality, integrity, and compliance in all stages of the data pipeline.
- Automation & Orchestration: Develop automated workflows using tools like Apache Airflow, and integrate with CI/CD pipelines.
- Cloud Integration: Design and deploy data solutions in cloud environments such as AWS, Azure, or GCP.
- Troubleshooting & Support: Identify and resolve bottlenecks, failures, and latency issues in distributed systems.
- Technical Leadership: Mentor junior developers and contribute to code reviews, architecture discussions, and best practice implementation.
- Continuous Learning: Stay up to date with the latest trends in big data, cloud computing, and data engineering technologies.

Required Skills & Qualifications:
- 6 years of hands-on experience in data engineering with a focus on Apache Spark and Python (PySpark).
- Proven ability to build and maintain scalable and high-performance big data applications.
- Strong knowledge of big data frameworks like Hadoop, Hive, and Kafka.
- Experience with cloud platforms such as AWS (Glue, EMR, S3), Azure (Databricks, Data Lake), or GCP.
- Expertise in SQL, data warehousing, and data modeling concepts.
- Proficiency with workflow orchestration tools such as Apache Airflow or similar.
- Familiarity with CI/CD pipelines and tools like Jenkins, Git, etc.
- Hands-on experience with containerization tools like Docker and orchestration using Kubernetes.
- Understanding of distributed computing, parallel processing, and fault-tolerant system design.
- Excellent problem-solving skills and the ability to work independently in a fast-paced environment.
- Strong communication skills to effectively collaborate across teams.

Preferred Qualifications:
- Experience leading or mentoring data engineering teams.
- Familiarity with Spark Structured Streaming and real-time data processing.
- Knowledge of machine learning workflows and integration with data pipelines.
- Certifications in Big Data (Cloudera, Hortonworks) or Cloud Platforms (AWS, Azure, GCP).
- Experience in implementing data governance and compliance standards.

(ref:hirist.tech)

Location: Chennai, IN

Posted Date: 5/1/2025

Contact Information

Contact Human Resources
Coders Brain Technology Pvt. Ltd.

Posted

May 1, 2025
UID: 5156986814
