Talent.com
AWS Data Architect (Hayward)
AWS Data Architect (Hayward)Fractal • Hayward, CA, US
AWS Data Architect (Hayward)

AWS Data Architect (Hayward)

Fractal • Hayward, CA, US
[job_card.1_day_ago]
[job_preview.job_type]
  • [job_card.full_time]
  • [job_card.part_time]
[job_card.job_description]

Fractal is a strategic AI partner to Fortune 500 companies with a vision to power every human decision in the enterprise. Fractal is building a world where individual choices, freedom, and diversity are the greatest assets; an ecosystem where human imagination is at the heart of every decision. Where no possibility is written off, only challenged to get better. We believe that a true Fractalite is the one who empowers imagination with intelligence. Fractal has been featured as a Great Place to Work by The Economic Times in partnership with the Great Place to Work Institute and recognized as a Cool Vendor and a Vendor to Watch by Gartner .

Please visit Fractal | Intelligence for Imagination for more information about Fractal.

Fractal is looking for a proactive and driven AWS Lead Data Architect / Engineer to join our cloud and data tech team. In this role, you will work on designing the system architecture and solution, ensuring the platform is scalable while performant, and creating automated data pipelines.

Responsibilities :

  • Design & Architecture of Scalable Data Platforms
  • Design, develop, and maintain large-scale data processing architectures on the Databricks Lakehouse Platform to support business needs
  • Architect multi-layer data models including Bronze (raw), Silver (cleansed), and Gold (curated) layers for various domains (e.g., Retail Execution, Digital Commerce, Logistics, Category Management).
  • Leverage Delta Lake, Unity Catalog, and advanced features of Databricks for governed data sharing, versioning, and reproducibility.
  • Client & Business Stakeholder Engagement
  • Partner with business stakeholders to translate functional requirements into scalable technical solutions.
  • Conduct architecture workshops and solutioning sessions with enterprise IT and business teams to define data-driven use cases
  • Data Pipeline Development & Collaboration
  • Collaborate with data engineers and data scientists to develop end-to-end pipelines using Python, PySpark, SQL
  • Enable data ingestion from diverse sources such as ERP (SAP), POS data, Syndicated Data, CRM, e-commerce platforms, and third-party datasets.
  • Performance, Scalability, and Reliability
  • Optimize Spark jobs for performance tuning, cost efficiency, and scalability by configuring appropriate cluster sizing, caching, and query optimization techniques.
  • Implement monitoring and alerting using Databricks Observability, Ganglia, Cloud-native tools
  • Security, Compliance & Governance
  • Design secure architectures using Unity Catalog, role-based access control (RBAC), encryption, token-based access, and data lineage tools to meet compliance policies.
  • Establish data governance practices including Data Fitness Index, Quality Scores, SLA Monitoring, and Metadata Cataloging.
  • Adoption of AI Copilots & Agentic Development
  • Utilize GitHub Copilot, Databricks Assistant, and other AI code agents for
  • Writing PySpark, SQL, and Python code snippets for data engineering and ML tasks.
  • Generating documentation and test cases to accelerate pipeline development.
  • Interactive debugging and iterative code optimization within notebooks.
  • Advocate for agentic AI workflows that use specialized agents for
  • Data profiling and schema inference.
  • Automated testing and validation.
  • Innovation and Continuous Learning
  • Stay abreast of emerging trends in Lakehouse architectures, Generative AI, and cloud-native tooling.
  • Evaluate and pilot new features from Databricks releases and partner integrations for modern data stack improvements.

Requirements :

  • Bachelors or masters degree in computer science, Information Technology, or a related field.
  • 8-12 years of hands-on experience in data engineering, with at least 5+ years on Python and Apache Spark.
  • Expertise in building high-throughput, low-latency ETL / ELT pipelines on AWS / Azure / GCP using Python, PySpark, SQL.
  • Excellent hands on experience with workload automation tools such as Airflow, Prefect etc.
  • Familiarity with building dynamic ingestion frameworks from structured / unstructured data sources including APIs, flat files, RDBMS, and cloud storage
  • Experience designing Lakehouse architectures with bronze, silver, gold layering.
  • Strong understanding of data modelling concepts, star / snowflake schemas, dimensional modelling, and modern cloud-based data warehousing.
  • Experience with designing Data marts using Cloud data warehouses and integrating with BI tools (Power BI, Tableau, etc.).
  • Experience CI / CD pipelines using tools such as AWS Code commit, Azure DevOps, GitHub Actions.
  • Knowledge of infrastructure-as-code (Terraform, ARM templates) for provisioning platform resources
  • In-depth experience with AWS Cloud services such as Glue, S3, Redshift etc.
  • Strong understanding of data privacy, access controls, and governance best practices.
  • Experience working with RBAC, tokenization, and data classification frameworks
  • Excellent communication skills for stakeholder interaction, solution presentations, and team coordination.
  • Proven experience leading or mentoring global, cross-functional teams across multiple time zones and engagements.
  • Ability to work independently in agile or hybrid delivery models, while guiding junior engineers and ensuring solution quality
  • Must be able to work in PST time zone.
  • Pay :

    The wage range for this role takes into account the wide range of factors that are considered in making compensation decisions including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. The disclosed range estimate has not been adjusted for the applicable geographic differential associated with the location at which the position may be filled. At Fractal, it is not typical for an individual to be hired at or near the top of the range for their role and compensation decisions are dependent on the facts and circumstances of each case. A reasonable estimate of the current range is : $150k - $180k. In addition, you may be eligible for a discretionary bonus for the current performance period.

    Benefits :

    As a full-time employee of the company or as an hourly employee working more than 30 hours per week, you will be eligible to participate in the health, dental, vision, life insurance, and disability plans in accordance with the plan documents, which may be amended from time to time. You will be eligible for benefits on the first day of employment with the Company. In addition, you are eligible to participate in the Company 401(k) Plan after 30 days of employment, in accordance with the applicable plan terms. The Company provides for 11 paid holidays and 12 weeks of Parental Leave. We also follow a free time PTO policy, allowing you the flexibility to take the time needed for either sick time or vacation.

    Fractal provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

    [job_alerts.create_a_job]

    Aws Architect • Hayward, CA, US

    [internal_linking.related_jobs]
    Data Architect – Databricks

    Data Architect – Databricks

    Veracity Software Inc • Santa Clara, CA, United States
    [job_card.full_time]
    We're seeking a visionary Data Architect with deep expertise in Databricks to lead the design, implementation, and optimization of our enterprise data architecture. You'll be instrumental in shaping...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Data Engineer (Hayward)

    Data Engineer (Hayward)

    Midjourney • Hayward, CA, US
    [job_card.part_time]
    Midjourney is a research lab exploring new mediums to expand the imaginative powers of the human species.We are a small, self-funded team focused on design, human infrastructure, and AI.We have no ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Architect

    Data Architect

    Next Level Business Services, Inc. • Sunnyvale, CA, United States
    [job_card.full_time]
    My Name is Ajay Singh and I'm a Resource Manager at Next Level Business Services, Inc.Please find the Job Description below and respond with an expected salary range?. Also, attach a copy of your up...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Data analytics Architect

    Data analytics Architect

    MOTOCOL • San Jose, CA, United States
    [job_card.full_time]
    Our client is seeking a senior software architect with at least 8 years of experience in designing scalable, distributed systems to join the Data Analytics Team. This role is expected to provide arc...[show_more]
    [last_updated.last_updated_30] • [promoted]
    AWS Architect

    AWS Architect

    TechDigital Group • San Jose, CA, United States
    [job_card.full_time]
    Knowledge of Google Kubernetes Engine (and.Experience of Prompt development for Large Language Model (LLM) interactions.Understand and enhance the existing build processes.NET core, C#, developing ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    AWS Architect

    AWS Architect

    RAPID EAGLE INC • Sunnyvale, CA, US
    [job_card.full_time]
    MVP, translating requirements into a pragmatic,.Cloud Engineers, DevOps Engineer).Architecture ownership : Map requirements to AWS managed services. maintain Architecture.Content delivery : Design Cl...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Enterprise Solution Architect – Master Data & Governance (Remote / Hybrid – Bay Area Preferred)

    Enterprise Solution Architect – Master Data & Governance (Remote / Hybrid – Bay Area Preferred)

    Talent Connection • Pleasanton, CA, United States
    [filters.remote]
    [job_card.full_time]
    Own end-to-end architecture for.APIs, event streaming, and ETL integrations with CRM, ERP, and Data Warehouse systems.Partner cross-functionally with engineering, product, and business teams to emb...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Architect, AI

    Data Architect, AI

    Cisco Systems, Inc. • San Jose, CA, United States
    [job_card.full_time]
    Applications are accepted until further notice.You will build and maintain the data infrastructure that powers our LLM guardrail and red teaming product's detection and validation engine.You will o...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Architect

    Data Architect

    Compunnel Inc. • San Jose, CA, United States
    [job_card.full_time]
    This resource will work hands‑on to deliver scalable, secure, and high‑performing systems for analytics, reporting, and machine learning on a petabyte scale. Cloud experience : Azure preferred, AWS a...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Architect, AI

    Data Architect, AI

    Cisco Systems • San Jose, CA, United States
    [job_card.full_time]
    Applications are accepted until further notice.You will build and maintain the data infrastructure that powers our LLM guardrail and red teaming product's detection and validation engine.You will o...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Principal Enterprise Data Architect

    Principal Enterprise Data Architect

    Smarsh • Pleasanton, CA, US
    [job_card.full_time]
    Smarsh empowers its customers to manage risk and unleash intelligence in their digital communications.Our growing community of over 6500 organizations in regulated industries counts on Smarsh every...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    (US)Databricks Solution Architect

    (US)Databricks Solution Architect

    Codvo Private Limited • Santa Clara, CA, United States
    [job_card.full_time]
    At Codvo, we are committed to building scalable, future-ready data platforms that power business impact.We believe in a culture of innovation, collaboration, and growth, where engineers can experim...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Manager, REMS Data Programmer

    Senior Manager, REMS Data Programmer

    Jazz Pharmaceuticals • Fremont, California, USA
    [job_card.full_time]
    If you are a current Jazz employee please apply via the Internal Career site.Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Databricks Data Architect

    Databricks Data Architect

    DynPro Inc. • San Jose, CA, United States
    [job_card.full_time]
    This range is provided by DynPro Inc.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Direct message the job poster from DynPro Inc.Tech Recruiti...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Lead Data Engineer (Hayward)

    Lead Data Engineer (Hayward)

    Mentor Talent Acquisition • Hayward, CA, United States
    [job_card.full_time]
    Were looking for a Lead Data Engineer to spearhead the design, implementation, and iteration of a world-class, modern data infrastructure that powers analytics, data science, and ML / AI systems.You ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    GCP Data Architect ONSITE INDEPENDENT visa only

    GCP Data Architect ONSITE INDEPENDENT visa only

    TestingXperts Inc. DBA Damcosoft • Santa Clara, CA, United States
    [job_card.full_time] +2
    [filters_job_card.quick_apply]
    Role : GCP Data Architect Location : Santa Clara CA (100% Onsite) [show_more]
    [last_updated.last_updated_variable_days]
    Principal Data Center Solutions Architect

    Principal Data Center Solutions Architect

    Supermicro • San Jose, CA, United States
    [job_card.full_time]
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Remote Sales & Trading Associate - AI Trainer ($50-$60 / hour)

    Remote Sales & Trading Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • Pittsburg, California
    [filters.remote]
    [job_card.full_time] +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]