Location : Hybrid / Atlanta, GA
Experience Level : Entry-level (Masters preferred)
About OrcaWorks AI
At OrcaWorks AI, were building next-generation AI systems that empower businesses to make data-driven decisions with intelligence and speed. Were seeking passionate Data Engineers who love solving real-world data challenges and want to be part of a growing team building cutting-edge AI infrastructure.
Key Responsibilities
- Design, develop, and maintain data pipelines using tools like Airbyte and Prefect to feed AI and machine learning models.
- Integrate data from multiple structured and unstructured sources into unified and queryable layers using ElasticSearch or Vespa.
- Implement data validation, transformation, and storage solutions using modern ETL frameworks.
- Collaborate with AI, LLM, and data science teams to ensure reliable and optimized data flow for model training.
- Support database management, SQLModel, and data governance practices across services.
Required Skills & Qualifications
Masters degree (or Bachelors with equivalent experience) in Computer Science, Information Systems, or Data Engineering.Proficiency in Python and SQL; experience with PySpark or equivalent ETL frameworks.Hands-on experience with Airbyte, Prefect, and DBT.Familiarity with search and indexing systems like Vespa or ElasticSearch.Knowledge of cloud data platforms (AWS, GCP, or Azure) and API integration.Strong understanding of data security and applied AI workflows.