Talent.com
AI Data Engineer
C the Signs • NJ, US
Posted 1 day ago
Job type
  • Full-time
  • Remote
Job description

Position Summary

The Data Engineer will play a crucial role in developing and curating the data used to train and fine-tune our LLMs and machine learning models. This individual will be responsible for the entire data lifecycle, including gathering, cleaning, structuring, and optimizing large, diverse healthcare datasets. The ideal candidate will have a strong background in data engineering principles, experience with big data technologies, and a keen understanding of the unique challenges and requirements of healthcare data.

You will design, build, and maintain scalable data pipelines that source, preprocess, and deliver high-quality, high-volume datasets to our machine learning engineers. This role requires a deep understanding of data engineering best practices coupled with specific knowledge of the data requirements for LLM training and refinement.

Key Responsibilities

  • Collaborate with data scientists and machine learning engineers to understand data requirements for LLM and machine learning model fine-tuning.
  • Design, build, and maintain scalable data pipelines to ingest, process, and store massive and diverse healthcare datasets.
  • Implement robust data validation and monitoring to ensure the integrity, accuracy, and consistency of all training datasets.
  • Implement robust data cleaning and transformation processes to ensure data quality.
  • Develop and optimize data structures and schemas for efficient access and utilization by LLMs and machine learning models.
  • Work with the team to identify and acquire new data sources, ensuring compliance with relevant healthcare regulations (e.g., HIPAA).
  • Monitor data pipeline performance, troubleshoot issues, and implement optimizations to improve efficiency and reliability.
  • Document data engineering processes, data models, and data dictionaries.
  • Stay up-to-date with the latest advancements in data engineering, big data technologies, and machine learning.

Requirements

Required

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • Proven experience as a Data Engineer, with a focus on big data technologies.
  • Strong proficiency in programming languages such as Python, Scala, or Java.
  • Extensive experience with data warehousing, ETL processes, and data modeling.
  • Experience with major cloud providers (e.g., AWS, GCP, Azure) and their data storage and processing services.
  • Hands-on experience with big data frameworks like Apache Spark for distributed processing.
  • Excellent problem-solving skills and the ability to work independently and as part of a team.
  • Strong communication and interpersonal skills.

Preferred

  • Master's degree in a related field.
  • Experience with healthcare data and a good understanding of healthcare data standards (e.g., FHIR, HL7).
  • Familiarity with machine learning concepts and LLM fine-tuning processes.
  • Experience with data orchestration tools (e.g., Apache Airflow).

Work Authorization

  • Must be a US citizen, a Green Card holder, or currently in the US with a valid H-1B visa.

Why Join Us?

Joining C the Signs is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.

Benefits

  • Competitive salary and benefits package.
  • Flexible working arrangements (remote or hybrid options available).
  • The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
  • Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity.
  • Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.