Talent.com
Data Architect II
GSK • San Francisco, California, USA

The Onyx Research Data Tech organization is GSK's Research data ecosystem, which has the capability to bring together, analyze, and power the exploration of data at scale. We partner with scientists across GSK to define and understand their challenges and develop tailored solutions that meet their needs. The goal is to ensure scientists have the right data and insights when they need them, to give them a better starting point for, and accelerate, medical discovery. Ultimately, this helps us get ahead of disease in more predictive and powerful ways.

Onyx is a full-stack shop consisting of product and portfolio leadership, data engineering, infrastructure and DevOps, data / metadata / knowledge platforms, and AI / ML and analysis platforms, all geared toward :

Building a next-generation, metadata- and automation-driven data experience for GSK's scientists, engineers, and decision-makers, increasing productivity and reducing time spent on data mechanics

Providing best-in-class AI / ML and data analysis environments to accelerate our predictive capabilities and attract top-tier talent

Aggressively engineering our data at scale as one unified asset to unlock the value of our unique collection of data and predictions in real time

The Onyx Data Architecture team sits within the Data Engineering team, which is responsible for the design, delivery, support, and maintenance of industrialized, automated, end-to-end data services and pipelines. They apply standardized data models and mapping to ensure data is accessible for end users in end-to-end user tools through use of APIs. They define and embed best practices and ensure compliance with Quality Management practices and alignment to automated data governance. They also acquire and process internal and external structured and unstructured data in line with Product requirements.

As a Data Architect II, you'll apply your expertise in big data and AI / GenAI workflows to support GSK's complex, regulated R&D environment.

You'll contribute to designing Data Mesh / Data Fabric architectures while enabling modern AI and machine learning capabilities across our platform.

In this role you will :

Partner with the Scientific Knowledge Engineering team to develop physical data models to build fit-for-purpose data products

Design data architecture aligned with enterprise-wide standards to promote interoperability

Collaborate with the platform teams and data engineers to maintain architecture principles, standards, and guidelines

Design data foundations that support GenAI workflows, including RAG (Retrieval-Augmented Generation), vector databases, and embedding pipelines

Work across business areas and stakeholders to ensure consistent implementation of architecture standards

Lead reviews and maintain architecture documentation and best practices for Onyx and our stakeholders

Adopt security-first design with robust authentication and resilient connectivity

Provide best practices and leadership, subject matter, and GSK expertise to architecture and engineering teams composed of GSK FTEs, strategic partners, and software vendors

Why you

Basic Qualifications :

We are looking for professionals with these required skills to achieve our goals :

Bachelor's degree in computer science, engineering, data science, or a similar discipline

5 years of experience in data architecture, data engineering, or related fields in pharma, healthcare, or life sciences R&D

3 years of experience defining architecture standards and patterns on big data platforms

3 years of experience with data warehouse, data lake, and enterprise big data platforms

3 years of experience with enterprise cloud data architecture (preferably Azure or GCP) and delivering solutions at scale

3 years of hands-on relational, dimensional, and / or analytic experience (using RDBMS, dimensional, and NoSQL data platform technologies, and ETL and data ingestion protocols)

Preferred Qualifications :

If you have the following characteristics, it would be a plus :

  • Master's or PhD in computer science, engineering, data science, or a similar discipline
  • Deep knowledge and use of at least one common programming language : e.g. Python, Scala, Java
  • Experience with AI / ML data workflows : feature stores, vector databases, embedding pipelines, model serving architectures
  • Familiarity with GenAI / LLM data patterns : RAG architectures, prompt engineering data requirements, fine-tuning data preparation
  • Experience with GCP data / analytics stack : Spark, Dataflow, Dataproc, GCS, BigQuery
  • Experience with enterprise data tools : Ataccama, Collibra, Acryl
  • Experience with Agile frameworks : SAFe, Jira, Confluence, Azure DevOps
  • Experience applying CI / CD principles to data solutions

  • Experience with Spark and RAG-based architectures for data science and ML use cases
  • Strong communication skills, with the ability to explain technical concepts to non-technical stakeholders
  • Pharmaceutical, healthcare, or life sciences background

#GSKOnyx

#LI-GSK

If you are based in Cambridge, MA; Waltham, MA; Rockville, MD; or San Francisco, CA, the annual base salary for new hires in this position ranges from $109,725 to $182,875. The US salary ranges take into account a number of factors, including work location within the US market, the candidate's skills, experience, education level, and the market rate for the role. In addition, this position offers an annual bonus and eligibility to participate in our share-based long-term incentive program, which is dependent on the level of the role. Available benefits include health care and other insurance benefits (for employee and family), retirement benefits, paid holidays, vacation, and paid caregiver / parental and medical leave. If salary ranges are not displayed in the job posting for a specific country, the relevant compensation will be discussed during the recruitment process.

Please visit GSK US Benefits Summary to learn more about the comprehensive benefits program GSK offers US employees.

Why GSK

Uniting science, technology, and talent to get ahead of disease together.

GSK is a global biopharma company with a purpose to unite science, technology, and talent to get ahead of disease together. We aim to positively impact the health of 2.5 billion people by the end of the decade, as a successful, growing company where people can thrive. We get ahead of disease by preventing and treating it with innovation in specialty medicines and vaccines. We focus on four therapeutic areas : respiratory, immunology and inflammation; oncology; HIV; and infectious diseases, to impact health at scale.

People and patients around the world count on the medicines and vaccines we make, so we're committed to creating an environment where our people can thrive and focus on what matters most. Our culture of being ambitious for patients, accountable for impact, and doing the right thing is the foundation for how, together, we deliver for patients, shareholders, and our people.

Should you require any adjustments to our process to assist you in demonstrating your strengths and capabilities contact us at where you can also request a call.

Please note, should your inquiry not relate to adjustments, we will not be able to support you through these channels. However, we have created a Recruitment FAQ guide. Click the link where you will find answers to multiple questions we receive.

GSK is an Equal Opportunity Employer. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, religion, sex (including pregnancy, gender identity, and sexual orientation), parental status, national origin, age, disability, genetic information (including family medical history), military service, or any basis prohibited under federal, state, or local law.

Important notice to Employment businesses / Agencies

GSK does not accept referrals from employment businesses and / or employment agencies in respect of the vacancies posted on this site. All employment businesses / agencies are required to contact GSK's commercial and general procurement / human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business / agency and GSK. In the absence of such written authorization being obtained, any actions undertaken by the employment business / agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses / agencies in respect of the vacancies posted on this site.

Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs on your behalf in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSK's compliance with all federal and state US Transparency requirements. For more information, please visit the Centers for Medicare and Medicaid Services (CMS) website.

Experience : Staff IC

Key Skills

Fund Management, Drafting, End User Support, Infrastructure, Airlines, Catia

Employment Type : Full-Time

Vacancy : 1

Annual Salary : $109,725 - $182,875
