Senior Data Engineer
Charlotte - Hybrid (4 days onsite, 1 day remote)
6 months, with possible extension
2 Openings - $40-65/hr, based on experience level
This position is for a Senior AWS Data Engineer working on a Master Data Management (MDM) project. The goal of the project is to create a single, trusted view of business data by cleaning up duplicate and inconsistent information from multiple sources. You'll be building scalable data pipelines on AWS, improving data quality, and working on advanced features like entity resolution and machine learning-assisted matching. It's a hands-on role where you'll own production-grade pipelines and work with large datasets. If you enjoy solving complex data challenges and making systems more efficient, this is a great fit.
Main Responsibilities:
Build and maintain data pipelines on AWS
Develop ETL jobs using AWS Glue (PySpark) and Amazon EMR
Orchestrate workflows using Apache Airflow
Support full and incremental data processing
Implement data matching, deduplication, and entity resolution
Monitor, troubleshoot, and support production pipelines
Partner with analytics and business teams
Must Haves:
6+ years of AWS data engineering experience
Python and PySpark development
Hands-on with AWS Glue and Amazon EMR
Experience using Apache Airflow
Strong SQL skills
Experience working with large datasets
Familiarity with ML concepts for data quality or matching
Nice to Have:
Experience with entity resolution, fuzzy matching, or deduplication
Experience with AWS Entity Resolution
Experience in Business MDM programs
Data Engineer • Charlotte, NC, United States