Job Summary :
We are seeking a motivated Data Engineer to join our Data Engineering team. The ideal candidate will have exposure in Python, and SQL, and will be responsible for designing, developing, and maintaining robust data pipelines for structured, semi-structured, and unstructured data. This role is ideal for someone passionate about building scalable data solutions and enabling advanced analytics across the organization.
Key Responsibilities :
- Design, develop, and optimize data pipelines using Python, Spark, and SQL.
- Ingest, process, and analyze structured (., relational databases), semi-structured (., JSON, XML), and unstructured data (., text, logs, images) from diverse sources.
- Implement data quality checks, validation, and transformation logic to ensure data integrity and reliability.
- Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver solutions.
- Develop and maintain data models, data dictionaries, and technical documentation.
- Monitor, troubleshoot, and optimize data workflows for performance and scalability.
- Ensure compliance with data governance, security, and privacy policies.
- Support data migration, integration, and modernization initiatives, including cloud-based solutions (AWS, Azure, GCP).
- Automate repetitive data engineering tasks and contribute to continuous improvement of data infrastructure.
Good to Have :
Knowledge of any of the cloud platforms (AWS, Azure, GCP).Understanding of data security, encryption, and compliance best practices.Qualifications :
Bachelor’s or Master’s degree in Computer Science, Information Systems, Data Engineering, or a related field.Good programming skills in Python and SQLGood problem-solving, analytical, and communication skills.Ability to work collaboratively in a fast-paced environment.