Talent.com
Sr Big Data Engineer - Oozie and Pig (GCP)
Sr Big Data Engineer - Oozie and Pig (GCP)Rackspace • Remote, Remote, United States
Sr Big Data Engineer - Oozie and Pig (GCP)

Sr Big Data Engineer - Oozie and Pig (GCP)

Rackspace • Remote, Remote, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
  • [filters.remote]
[job_card.job_description]

About the Role

We are seeking a Senior Big Data Engineer with deep expertise in distributed systems, batch data processing, and large-scale data pipelines. The ideal candidate has strong hands-on experience with Oozie, Pig, the Apache Hadoop ecosystem, and programming proficiency in Java (preferred) or Python. This role requires a deep understanding of data structures and algorithms, along with a proven track record of writing production-grade code and building robust data workflows.

This is a fully remote position and requires an independent, self-driven engineer who thrives in complex technical environments and communicates effectively across teams.

Work Location : US-Remote, Canada-Remote

Key Responsibilities :

  • Design and develop scalable batch processing systems using technologies like Hadoop, Oozie, Pig, Hive, MapReduce, and HBase, with hands-on coding in Java or Python (Java is a must).
  • Must be able to lead Jira Epics
  • Write clean, efficient, and production-ready code with a strong focus on data structures and algorithmic problem-solving applied to real-world data engineering tasks.
  • Develop, manage, and optimize complex data workflows within the Apache Hadoop ecosystem, with a strong focus on Oozie orchestration and job scheduling.
  • Leverage Google Cloud Platform (GCP) tools such as Dataproc, GCS, and Composer to build scalable and cloud-native big data solutions.
  • Implement DevOps and automation best practices, including CI / CD pipelines, infrastructure as code (IaC), and performance tuning across distributed systems.
  • Collaborate with cross-functional teams to ensure data pipeline reliability, code quality, and operational excellence in a remote-first environment.

Qualifications :

  • Bachelors's degree in Computer Science, software engineering or related field of study.
  • Experience with managed cloud services and understanding of cloud-based batch processing systems are critical.
  • Must be able to lead Jira Epics is MUST
  • Proficiency in Oozie, Airflow, Map Reduce, Java are MUST haves.
  • Strong programming skills with Java (specifically Spark), Python, Pig, and SQL.
  • Expertise in public cloud services, particularly in GCP.
  • Proficiency in the Apache Hadoop ecosystem with Oozie, Pig, Hive, Map Reduce.
  • Familiarity with BigTable and Redis.
  • Experienced in Infrastructure and Applied DevOps principles in daily work. Utilize tools for continuous integration and continuous deployment (CI / CD), and Infrastructure as Code (IaC) like Terraform to automate and improve development and release processes.
  • Proven experience in engineering batch processing systems at scale.
  • Must Have : (Important)

  • 5+ years of experience in customer-facing software / technology or consulting.
  • 5+ years of experience with “on-premises to cloud” migrations or IT transformations.
  • 5+ years of experience building, and operating solutions built on GCP
  • Proficiency in Oozie andPig
  • Must be able to lead Jira Epics
  • Proficiency in Java or Python
  • The following information is required by pay transparency legislation in the following states : CA, CO, HI, NY, and WA. This information applies only to individuals working in these states.

  • The anticipated starting pay range for Colorado is : $116,100 - $170,280.
  • The anticipated starting pay range for the states of Hawaii and New York (not including NYC) is : $123,600 - $181,280.
  • The anticipated starting pay range for California, New York City and Washington is : $135,300 - $198,440.
  • Unless already included in the posted pay range and based on eligibility, the role may include variable compensation in the form of bonus, commissions, or other discretionary payments. These discretionary payments are based on company and / or individual performance and may change at any time. Actual compensation is influenced by a wide array of factors including but not limited to skill set, level of experience, licenses and certifications, and specific work location. Information on benefits   offered is here.

    About Rackspace Technology

    We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies — across applications, data and security — to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work, year after year according to Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.

    More on Rackspace Technology

    Though we’re all different, Rackers thrive through our connection to a central goal : to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.

    #LI-VM1

    We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

    [job_alerts.create_a_job]

    Sr Data Engineer • Remote, Remote, United States

    [internal_linking.related_jobs]
    Senior Big Data Engineer (9892)

    Senior Big Data Engineer (9892)

    Extreme Networks • United States, United States, United States
    [job_card.full_time]
    Over 50,000 customers globally trust our end-to-end, cloud-driven networking solutions.They rely on our top-rated services and support to accelerate their digital transformation efforts and deliver...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff / Principal Data Engineer (NARA)

    Staff / Principal Data Engineer (NARA)

    Skylight • United States, United States, United States
    [job_card.full_time]
    Skylight is a digital consultancy using design and technology to help government agencies deliver better public services. We’re at the forefront of a civic movement to reinvent how all levels of gov...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Principal Data Engineer

    Principal Data Engineer

    Blackcloak • United States, United States, United States
    [filters.remote]
    [job_card.full_time]
    BlackCloak’s mission is to protect corporate executives and high-profile individuals in their personal lives, mitigating risks to their families, companies, reputation, and finances.We defend our c...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Data Engineer

    Data Engineer

    Mission Lane • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Mission Lane is combining the power of data, technology, and exceptional service to pave a clear way forward for millions of people on the path to financial success. By attracting top talent and lev...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Staff Data Engineer

    Staff Data Engineer

    Referral Board • United States, United States, United States
    [job_card.full_time]
    Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people.The Elastic Search AI...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Data Engineer (Remote)

    Senior Data Engineer (Remote)

    Lifecheq • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    LifeCheq is a fintech company changing how South Africans manage their personal.Our platform combines smart tech, deep financial expertise, and a unique. We're growing rapidly, backed by major inves...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Data Engineer (Snowflake)

    Senior Data Engineer (Snowflake)

    Exadel • Remote, Remote, United States
    [job_card.full_time]
    We’re an AI-first global tech company with 25+ years of engineering leadership, 2,000+ team members, and 500+ active projects powering Fortune 500 clients, including HBO, Microsoft, Google, and Sta...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Big Data Engineer

    Big Data Engineer

    Inherent Technologies • United States
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Position : Data Engineer Location : Secaucus, NJ / Remote Duration : 1 Years Job Detail...[show_more]
    [last_updated.last_updated_1_day]
    Sr Data Engineer (Asset-Wealth Management)

    Sr Data Engineer (Asset-Wealth Management)

    Ccube • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Job Title : Sr Data Engineer – Asset / Wealth Management.Scottsdale, AZ (Onsite / Hybrid 3 days a week).The ideal candidate will have a solid understanding of how data flows across asset management syst...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Principal Data Engineer

    Principal Data Engineer

    Future Tech Enterprise • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    ERP systems through Google Cloud (BigQuery, Dataflow) to Looker.As the sole data engineer, you will ensure the reliability, scalability, and performance of our analytics platform while collaboratin...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Principal Data Engineer

    Principal Data Engineer

    Trella Health • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    At Trella Health, we are passionate and committed to our mission : empowering meaningful change in healthcare.Since our founding in 2015, we continue to grow our team, enhance our solution and servi...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Data Engineer

    Senior Data Engineer

    Performline • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    PerformLine is a category-leading SaaS company that empowers leaders with end-to-end marketing compliance technology, from automated review of documents to discovery and live monitoring across cons...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Staff Data Engineer

    Staff Data Engineer

    Elastic • United States, United States, United States
    [job_card.full_time]
    We are seeking a Staff Data Engineer to join our Data Engineering & Architecture team.In this role, you will contribute to our mission of building a world-class Data Platform.Your work will directl...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Lead Data Engineer

    Lead Data Engineer

    Fusemachines • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Fusemachines is a leading AI strategy, talent, and education services provider.Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI.With a presenc...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Scientist

    Data Scientist

    Planet Technologies • US
    [job_card.full_time]
    Connects to APIs, system-to-system connections, and manual data transfers for ETL.Applies AI / ML to extract insights from unstructured documentation. Develops and carries out full-service data calls,...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Sr. Data Engineer

    Sr. Data Engineer

    Ninjatrader • Remote, Remote, United States
    [job_card.full_time]
    Please be advised that the most accurate and up-to-date information about our open roles—including job descriptions, compensation, and benefits—can only be guaranteed on our official job board.For ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Data Engineer

    Senior Data Engineer

    Linkup • Remote, Remote, United States
    [job_card.full_time]
    Your core responsibilities will be to : .Build and operate large-scale, high-throughput.Design and maintain distributed.SQL, NoSQL, and object stores) for performance and reliability.Collaborate with...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Data Engineer

    Data Engineer

    Ibis Public Sector • Remote, Remote, United States
    [filters.remote]
    [job_card.full_time]
    Ibis Public Sector is seeking a Data Engineer to provide Python development and database management support for enhancing and extending capabilities within open-source and inherited data environmen...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]