Talent.com
Senior/Staff Big Data Storage and Computing Engineer, Recommendation Data Ecosystem

TikTok • Seattle
Job type: Full-time

Job Description

Our team plays a crucial role in the data ecosystem of the TikTok Recommendation System, focusing on creating offline and real-time data storage solutions for large-scale recommendation, search, and advertising businesses, serving over 1 billion users. The core goals of the team are to ensure high system reliability, uninterrupted service, and smooth data processing. We are committed to building a storage and computing infrastructure that can adapt to various data sources and meet diverse storage requirements, ultimately providing efficient, cost-effective, and user-friendly data storage and management tools for the business.

Responsibilities

1. Architecture Design and Implementation: Design and implement offline and real-time data architectures for large-scale recommendation, search, and advertising systems based on Paimon and Flink. Ensure efficient data processing and storage to meet the strict requirements of the business for data timeliness and accuracy.

2. System Construction and Optimization: Design and implement flexible, scalable, stable, and high-performance storage systems and computing models. Use Paimon as the storage foundation and combine it with the powerful computing capabilities of Flink. Continuously optimize system performance to cope with the challenges brought by business growth.

3. Troubleshooting and Stability Assurance: Be responsible for troubleshooting production systems. For problems that occur in the Paimon-Flink architecture during operation, design and implement necessary mechanisms and tools, such as data consistency assurance and exception recovery, to ensure the overall stability of the production system.

4. Distributed System Construction: Build industry-leading distributed systems, including offline and online storage based on Paimon and batch and stream processing frameworks based on Flink, providing solid and reliable infrastructure support for massive data and large-scale business systems.

Minimum Qualifications:

  • A bachelor's degree or above in computer science, software engineering, or related fields, with more than 2 years of experience in building scalable systems.
  • Technical Skills:

1. Paimon-Flink Technology Stack: Have a thorough understanding of Paimon and Flink, and be able to understand and use them at the source-code level. Experience in customizing or extending these two systems is preferred.

2. Data Lake Technology: Have an in-depth understanding of at least one data lake technology (such as Paimon), with practical implementation and customization experience, which should be highlighted in the resume.

3. Storage Knowledge: Be familiar with the principles of HDFS; knowledge of columnar storage formats such as Parquet and ORC is preferred.

4. Programming Languages: Be proficient in programming languages such as Java, C++, and Scala, with strong coding and problem-solving abilities.

  • Project Experience: Have experience in data warehouse modeling and be able to design efficient data models that meet complex business scenarios.
  • Experience in using other big-data systems and frameworks (such as Hive, HBase, and Kudu) is preferred.
  • Comprehensive Qualities: Have the courage to take on complex problems and be willing to explore problems without clear solutions.
  • Be passionate about learning new technologies and be able to quickly master and apply them to practical work.
  • Experience in handling large-scale data (PB-level and above) is preferred.