Talent.com
Big Data Engineer

Sapphire Software Solutions • Remote, United States
Posted: 30 days ago
Job type: Full-time, Remote

Hello!

Hope you're doing well. I’ll get right to it. I am looking to hire a Sr. Big Data Developer who has experience working with Spark, Scala, Databricks, SQL, and Azure (nice to have). This is a project with our partner in IN. We already have someone working on this team and are looking to add one more person. If this role isn’t up your alley, I have a plethora of other roles with our clients/vendors that I can review with you.

If you're in the market, please feel free to reach out to me and I'll go over the details with you. Lastly, I’d also like to mention that, depending on your immigration status, our agency sponsors individuals who require H-1B visas and Green Cards. We cannot work through any layers and will be hiring the candidate on our payroll directly. Looking forward to hearing back from you.

Job Title: Sr. Big Data Engineer

Location: San Francisco, CA (open to remote)

Duration: 6 months (will extend; we have multiple consultants on this team who have been there 2+ years)

Interview: 2 rounds (1st round: 1-hour technical video interview; 2nd round: 30-minute personality call, a formality)

I have 8 openings in San Francisco, CA. These roles are open to remote candidates but they have to work PST hours. We have direct access to hiring managers with quick turnarounds on interviews. If candidates can crack the first interview they will get the job.

Top Skills Details

Spark

Scala

SQL

Databricks

Azure (nice to have)

Job Description

Looking for a strong Big Data Engineer with Spark, Scala, SQL, and Azure

The Architecture and Platform organizations are looking for an experienced Big Data Engineer to build analytics and ML platforms that collect, store, process, and analyze huge sets of data spread across the organization. The platform will provide frameworks for quickly rolling out new data analysis for data-driven products and microservices.

The platform will also enable machine/deep learning infrastructure that operationalizes data science models for broad consumption. You'll partner end-to-end with Product Managers and Data Scientists to understand customer requirements, design prototypes, and bring ideas into production. You'll be developing real products. You need to be an expert in design, coding, and scripting. You'll be writing high-quality code that is consistent with our standards, creating new standards as necessary, and demonstrating correctness with pragmatic automated tests. You'll review the work of other engineers to improve quality and engineering practices, and participate in continuing education programs to grow your skills. You'll serve as a member of an Agile Engineering team and participate in the team's workflow.

Ideally, you have 5-8 years of experience as a Software Engineer and are experienced in building distributed, scalable, and reliable data pipelines that ingest and process data at scale, in both batch and real time. Strong knowledge of programming languages/tools including Java, Scala, Spark, SQL, Hive, and Elasticsearch. Most tools within the Hadoop ecosystem are necessary, but we're mainly looking for Spark and Scala (Java if not Scala). Experience with streaming technologies such as Spark Streaming, Flink, or Apache Beam. Experience with Kafka is a plus. Working experience with various NoSQL databases such as Cassandra, HBase, MongoDB, and/or Couchbase. Prior Machine Learning or Deep Learning knowledge would be a plus (this will be learned on the job).

You will be working with the Marketing and Supply Chain side on a Personalization initiative, getting data feeds to and from 3rd-party vendors that do the analytics, marketing, and operations for email campaigns and catalog campaigns. Eventually, you will get into Machine Learning in areas such as product recommendations on the site.

The team is working in Spark with Scala to ingest transaction and clickstream data and come up with associations and product recommendations. You'll be working on batch processing and real-time streaming projects. In batch processing, you'll be creating Spark jobs in the Azure cloud, using Azure tools for some of the scheduling and workflow management of the batch jobs. The team is currently migrating from Teradata to Microsoft Azure. Overall, you'll be building a new data platform using Spark and building out a data pipeline that takes data from transactional systems and processes it in Spark; the framework is written in Scala (or Java).

1) Basic transformations like filter and map, actions like count, group by, etc., using the DataFrame API
2) Iterating over Scala collections
3) Spark parallelism – data ingestion from an external RDBMS, local transformations
4) Data warehouse – dimensions, facts, when to do a full load vs. an incremental load, etc.
5) Basic software engineering principles

Regards,

Rakesh Kumar
