Talent.com
Data EngineerData Platform Engineer
Data EngineerData Platform EngineerVDart Inc • Plano, Texas, USA
Data EngineerData Platform Engineer

Data EngineerData Platform Engineer

VDart Inc • Plano, Texas, USA
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Job Title : Data Engineer / Data Platform Engineer

Location : Dallas TX

Duration : 3 Months CTH

Role Summary

As a Data Platform Engineer you will be responsible for the design development and maintenance of our high-scale cloud-based data platform treating data as a strategic product. You will lead the implementation of robust optimized data pipelines using PySpark and the Databricks Unified Analytics Platform-leveraging its full ecosystem for Data Engineering Data Science and ML workflows. You will also establish best-in-class DevOps practices using CI / CD and GitHub Actions to ensure automated deployment and reliability. This role demands expertise in large-scale data processing and a commitment to modern scalable data engineering and AWS cloud infrastructure practices.

Key Responsibilities

Platform Development : Design build and maintain scalable efficient and reliable ETL / ELT data pipelines to support data ingestion transformation and integration across diverse sources.

Big Data Implementation : Serve as the subject matter expert for the Databricks environment developing high-performance data transformation logic primarily using PySpark and Python. This includes utilizing Delta Live Tables (DLT) for declarative pipeline construction and ensuring governance through Unity Catalog.

Cloud Infrastructure Management : Configure maintain and secure the underlying AWS cloud infrastructure required to run the Databricks platform including virtual private clouds (VPCs) network endpoints storage (S3) and cross-account access mechanisms.

DevOps & Automation (CI / CD) : Own and enforce Continuous Integration / Continuous Deployment (CI / CD) practices for the data platform. Specifically design and implement automated deployment workflows using GitHub Actions and modern infrastructure-as-code concepts to deploy Databricks assets (Notebooks Jobs DLT Pipelines and Repos).

Data Quality & Testing : Design and implement automated unit integration and performance testing frameworks to ensure data quality reliability and compliance with architectural standards.

Performance Optimization : Optimize data workflows and cluster configurations for performance cost efficiency and scalability across massive datasets.

Technical Leadership : Provide technical guidance on data principles patterns and best practices (e.g. Medallion Architecture ACID compliance) to promote team capabilities and maturity. This includes leveraging Databricks SQL for high-performance analytics.

Documentation & Review : Draft and review architectural diagrams design documents and interface specifications to ensure clear communication of data solutions and technical requirements.

Required Qualifications

Experience : 5 years of professional experience in Data Engineering focusing on building scalable data platforms and production pipelines.

Big Data Expertise : Minimum 3 years of hands-on experience developing deploying and optimizing solutions within the Databricks ecosystem. Deep expertise required in :

Delta Lake (ACID transactions time travel optimization).

Unity Catalog (data governance access control metadata management).

Delta Live Tables (DLT) (declarative pipeline development).

Databricks Workspaces Repos and Jobs.

Databricks SQL for analytics and warehouse operations.

AWS Infrastructure & Security : Proven hands-on experience (3 years) with core AWS services and infrastructure components including :

Networking : Configuring and securing VPCs VPC Endpoints Subnets and Route Tables for private connectivity.

Security & Access : Defining and managing IAM Roles and Policies for secure cross-account access and least privilege access to data.

Storage : Deep knowledge of Amazon S3 for data lake implementation and governance.

Programming : Expert proficiency (4 years) in Python for data manipulation scripting and pipeline development.

Spark & SQL : Deep understanding of distributed computing and extensive experience (3 years) with PySparkand advanced SQL for complex data transformation and querying.

DevOps & CI / CD : Proven experience (2 years) designing and implementing CI / CD pipelines including proficiency with GitHub Actions or similar tools (e.g. GitLab CI Jenkins) for automated testing and deployment.

Data Concepts : Full understanding of ETL / ELT Data Warehousing and Data Lake concepts.

Methodology : Strong grasp of Agile principles (Scrum).

Version Control : Proficiency with Git for version control.

Preferred Qualifications

AWS Data Ecosystem Experience : Familiarity and experience with AWS cloud-native data services such as AWS Glue Amazon Athena Amazon Redshift Amazon RDS and Amazon DynamoDB.

Knowledge of real-time or near-real-time streaming technologies (e.g. Kafka Spark Structured Streaming).

Experience in developing feature engineering pipelines for machine learning (ML) consumption.

Background in performance tuning and capacity planning for large Spark clusters.

Keywords : PySpark ETL Databricks Terraform CI / CD SQL AWS EC2

Key Skills

Apache Hive,S3,Hadoop,Redshift,Spark,AWS,Apache Pig,NoSQL,Big Data,Data Warehouse,Kafka,Scala

Employment Type : Full Time

Experience : years

Vacancy : 1

[job_alerts.create_a_job]

Data EngineerData Platform Engineer • Plano, Texas, USA

[internal_linking.similar_jobs]
Data Engineer

Data Engineer

Protagona • Dallas, TX, US
[job_card.full_time]
[filters_job_card.quick_apply]
As a Data Engineer, you will be part of a talented team of engineers responsible for the deployment and configuration of cloud resources to meet individual client business needs in AWS.Client engag...[show_more]
[last_updated.last_updated_30]
Data Integration Engineer

Data Integration Engineer

Blue Acorn iCi • Dallas, TX, US
[job_card.full_time]
[filters_job_card.quick_apply]
Bachelor’s or master’s degree in computer science, Information Systems, or related field 12+ years of experience in Software engineering, with at least 3-5 years focused on Creating Mic...[show_more]
[last_updated.last_updated_variable_days]
Analytics Engineer (Observability Tooling)

Analytics Engineer (Observability Tooling)

Sepal • Dallas, TX, United States
[job_card.full_time]
Sepal AI builds the world’s hardest tests for AI grounded in real-world software systems.We’re hiring a Data Engineer with 3+ years of experience and a strong systems mindset to help us build evalu...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Principal Data Modeler and Database Engineer (Onsite)

Principal Data Modeler and Database Engineer (Onsite)

Raytheon • Richardson, TX, US
[job_card.full_time]
TX234 : Richardson 1717 CityLine 1717 East CityLine Drive Building C17, Richardson, TX, 75082 USA.Person, or Immigration Status Requirements : . At Raytheon, the foundation of everything we do is roote...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Lead Data Engineer - AWS

Lead Data Engineer - AWS

Tiger Analytics Inc. • Dallas, TX, US
[filters.remote]
[job_card.full_time]
[filters_job_card.quick_apply]
Tiger Analytics is a fast-growing advanced analytics consulting firm.Our consultants bring deep expertise in Data Science, Machine Learning and AI. We are the trusted analytics partner for multiple ...[show_more]
[last_updated.last_updated_30]
Senior Python Data Engineer

Senior Python Data Engineer

Purple Drive • Dallas, TX, Texas, USA
[job_card.full_time]
Job Title Senior Python Data Engineer &...[show_more]
[last_updated.last_updated_variable_days]
Data Engineer-2

Data Engineer-2

eTeam Inc • Plano, Texas, United States
[job_card.full_time]
[filters_job_card.quick_apply]
Extensive experience in designing, configuring, deploying, managing and automating AWS Core Services like S3, IAM, EC2, Route53, SNS, SQS, ELB, CloudWatch, Lambda and VPC.Experience in automating c...[show_more]
[last_updated.last_updated_30]
Senior Data Engineer

Senior Data Engineer

GalaxEsystems • Plano, TX, US
[job_card.full_time]
[filters_job_card.quick_apply]
Senior Data Engineer Location : .Remote Visa : USC / GC / Ead We are seeking a skilled Data Engineer to design, build, and maintain scalable data pipelines and analytics solutions.The ideal candidate will...[show_more]
[last_updated.last_updated_30]
Azure Data Engineer

Azure Data Engineer

Connvertex Technologies Inc. • Dallas, TX, United States
[job_card.full_time]
[filters_job_card.quick_apply]
Job Title : Azure Data Engineer Location : Dallas, TX (Local only with DL) Work Type : Hybrid (3 days a week onsite) Job Type : Contract (1 year+) Rate : ...[show_more]
[last_updated.last_updated_1_day]
Data Center Engineer

Data Center Engineer

OTSI • Richardson, TX, us
[job_card.full_time]
[filters_job_card.quick_apply]
Object Technology Solutions, Inc (OTSI).We are seeking a highly motivated and skilled Data Center Optical Engineer to join our team. The ideal candidate will have experience in racking, cabling, and...[show_more]
[last_updated.last_updated_variable_days]
Data Engineer

Data Engineer

Ascentt • Plano, Texas, United States, 75024
[job_card.full_time]
We are looking for a highly skilled and motivated.In this role, you will design and implement scalable data pipelines, build robust data infrastructure, and work closely with cross-functional teams...[show_more]
[last_updated.last_updated_variable_days]
Azure Data Engineer

Azure Data Engineer

ATTAINX INC • Dallas, Texas, United States, 75243
[job_card.full_time]
US Citizen w / Active Secret Clearance.We are seeking an experienced Azure Data Engineer with proven ability to evaluate, optimize, and modernize enterprise data warehouse environments.The role comb...[show_more]
[last_updated.last_updated_variable_days]
Senior Solutions Engineer : AI & Data Cloud Pre-Sales

Senior Solutions Engineer : AI & Data Cloud Pre-Sales

Snowflake • Dallas, TX, US
[job_card.full_time]
A leading cloud data platform provider is seeking a Senior Solution Engineer to solve complex customer problems and support the sales team. The ideal candidate will have 7-8 years of industry experi...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Data Engineer Lead : 24-02271 (No C2C)

Data Engineer Lead : 24-02271 (No C2C)

Akraya Inc • Dallas, Texas, United States
[job_card.full_time]
[filters_job_card.quick_apply]
AWS, SQL, Python, Redshift, S3.Dallas, TX OR Seattle, WA (Hybrid.Plan, design, implement, and manage a deployment of self-service data platform using native AWS services. Design, build, and maintain...[show_more]
[last_updated.last_updated_30]
Azure Data Engineer

Azure Data Engineer

Akaasa Technologies • Dallas, TX, United States
[job_card.full_time]
[filters_job_card.quick_apply]
Required Candidate Location : Dallas, TX Hybrid / 3 days a week We need a senior (10+ years) Azure Data engineer with recent experience in Banking, Capital Market...[show_more]
[last_updated.last_updated_1_day]
Senior Data Platform Lead – Hybrid (Snowflake / Azure)

Senior Data Platform Lead – Hybrid (Snowflake / Azure)

Accordion • Town of Texas, WI, United States
[job_card.full_time]
A consulting firm is seeking a leader in data platform delivery with over 10 years of experience.This hybrid role requires command of Snowflake and Azure solutions, and the ability to manage comple...[show_more]
[last_updated.last_updated_30] • [promoted]
Data Architect

Data Architect

eTeam Inc • Plano, Texas, United States
[job_card.temporary]
[filters_job_card.quick_apply]
Job Type : Contract; 3 months contract.Data Vault, Data modelling, AWS, Terraform.Implement business and IT data requirements through new data strategies and designs across all data platforms (relat...[show_more]
[last_updated.last_updated_30]
Junior Data Engineer

Junior Data Engineer

Mod Op • Dallas, TX, US
[job_card.full_time]
[filters_job_card.quick_apply]
Data Engineer Job Description : .Mod Op is a full-service advertising agency with offices across several US locations, Panama City, Panama, and Canada. With continued growth and a dynamic leade...[show_more]
[last_updated.last_updated_30]