Talent.com
Big Data Engineer
SGA • Rockville, MD

Software Guidance & Assistance, Inc. (SGA) is searching for a Big Data Engineer for a contract assignment with one of our premier regulatory clients in Rockville, MD or Tysons, VA.

This role is hybrid, with onsite days each week in either the Rockville or Tysons office.

Responsibilities :

  • Design, develop, and maintain large-scale data processing pipelines using Big Data technologies (e.g., Hadoop, Spark, Python, Scala).
  • Implement data ingestion, storage, transformation, and analysis solutions that are scalable, efficient, and reliable.
  • Stay current with industry trends and emerging Big Data technologies to continuously improve the data architecture.
  • Collaborate with cross-functional teams to understand business requirements and translate them into technical solutions.
  • Optimize and enhance existing data pipelines for performance, scalability, and reliability.
  • Develop automated testing frameworks and implement continuous testing for data quality assurance.
  • Conduct unit, integration, and system testing to ensure the robustness and accuracy of data pipelines.
  • Work with data scientists and analysts to support data-driven decision-making across the organization.
  • Write and maintain automated unit, integration, and end-to-end tests.
  • Monitor and troubleshoot data pipelines in production environments to identify and resolve issues.

Requirements :

  • Bachelor's degree in Computer Science, Information Systems, or a related discipline with at least five (5) years of related experience, or equivalent training and/or work experience; Master's degree and past Financial Services industry experience preferred.
  • Demonstrated technical expertise in object-oriented and database technologies / concepts that resulted in the deployment of enterprise-quality solutions.
  • Past experience developing enterprise-quality solutions in an iterative or Agile environment.
  • Extensive knowledge of industry-leading software engineering approaches, including Test Automation, Build Automation, and Configuration Management frameworks.
  • Strong written and verbal technical communication skills.
  • Demonstrated ability to develop effective working relationships that improved the quality of work products.
  • Should be well organized, thorough, and able to handle competing priorities.
  • Ability to maintain focus and develop proficiency in new skills rapidly.
  • Ability to work in a fast-paced environment.
  • Experience with object-oriented programming languages such as Java, Scala, or Python.

Essential Technical Skills :

Big Data technologies

  • Experience with Big Data technologies such as Hadoop, Spark, Hive, and Trino.
  • Evaluate understanding of common issues like :
    ◦ Data skew and strategies to mitigate it.
    ◦ Working with massive data volumes at petabyte scale.
    ◦ Troubleshooting job failures due to resource limitations, bad data, and scalability challenges.
  • Look for real-world debugging and mitigation stories.

AI Skills

  • Prompt Engineering : Proficiency in crafting effective prompts for AI coding assistants and analysis tools
  • AI Workflow Design : Experience redesigning development processes to leverage AI capabilities
  • Data Analysis : Ability to interpret AI-generated insights and translate them into actionable team improvements
  • Change Management : Experience leading teams through AI adoption and workflow transformation

SQL Skills (Window Functions, Joins, Complex Queries)

  • Assess comfort with SQL window functions, multi-table joins, aggregations.
  • Provide examples or ask them to write / optimize SQL queries on the spot.
  • Probe how they handle edge cases like NULLs, duplicates, ordering, etc.

Apache Spark (Development, Internals & Tuning)

  • Test their understanding of Spark's core architecture — executors, tasks, stages, DAG.
  • Focus on Spark performance tuning techniques : partitioning, caching, broadcast joins, etc.
  • Ask scenario-based questions on troubleshooting slow running / stuck jobs or resource issues in Spark.
  • Explore their experience optimizing Spark jobs for large-scale datasets.

Cloud Technologies

  • Check exposure to AWS services like S3, EMR, Glue, Lambda, Athena, etc.
  • Ask how they've used S3 with Spark (e.g., dealing with file formats, consistency issues).
  • EKS, Serverless knowledge, etc.

Programming - Python or Scala

  • Assess ability to write clean, modular, and performant code.
  • Look for experience in functional programming concepts (e.g., immutability, higher-order functions).
  • Ask about real-world use cases where they wrote scalable data processing code.
  • Evaluate understanding of collections, concurrency, and memory management.

Preferred Skills :

  • Experience with managing production data pipelines / ETL systems
  • Experience with CI / CD
  • Experience writing test cases
  • AWS certifications