Talent.com
Big Data Engineer
Sapphire Software Solutions • Remote, United States
No longer accepting applications
30+ days ago
Job type
  • Full-time
  • Remote
Job description

Hello!

Hope you're doing well. I’ll get right to it. I am looking to hire a Sr. Big Data Developer with experience in Spark, Scala, Databricks, SQL, and Azure (nice to have). This is a project with our partner in IN. We already have someone working on this team and are looking to add one more person. If this role isn’t up your alley, I have a plethora of other roles with our clients/vendors that I can review with you.

If you're in the market, please feel free to reach out to me and I'll go over the details with you. Lastly, depending on your immigration status, our agency also sponsors individuals who require H1B visas and Green Cards. We cannot work through any layers and will hire the candidate directly on our payroll. Looking forward to hearing back from you.

Job Title: Sr. Big Data Engineer

Location: San Francisco, CA (open to remote)

Duration: 6 months (will extend; we have multiple consultants on this team who have been there 2+ years)

Interview: 2 rounds (1st round: 1-hour technical video interview; 2nd round: 30-minute personality call, a formality)

I have 8 openings in San Francisco, CA. These roles are open to remote candidates but they have to work PST hours. We have direct access to hiring managers with quick turnarounds on interviews. If candidates can crack the first interview they will get the job.

Top Skills

Spark

Scala

SQL

Databricks

Azure (nice to have)

Job Description

Looking for a strong Big Data Engineer with Spark, Scala, SQL, and Azure

Architecture and Platform Organizations are looking for an experienced Big Data Engineer to build analytics and ML platforms to collect, store, process, and analyze huge sets of data spread across the organization. The platform will provide frameworks for quickly rolling out new data analysis for data-driven products and micro-services.

The platform will also enable machine/deep learning infrastructure that operationalizes data science models for broad consumption. You'll partner with end-to-end Product Managers and Data Scientists to understand customer requirements, design prototypes, and bring ideas into production. You'll be developing real products. You need to be an expert in design, coding, and scripting. You'll write high-quality code that is consistent with our standards, create new standards as necessary, and demonstrate correctness with pragmatic automated tests. You'll review the work of other engineers to improve quality and engineering practices, and participate in continuing education programs to grow your skills. You'll serve as a member of an Agile Engineering team and participate in the team's workflow.

Ideally, you have 5-8 years of experience as a Software Engineer building distributed, scalable, and reliable data pipelines that ingest and process data at scale, in both batch and real time. Strong knowledge of programming languages/tools including Java, Scala, Spark, SQL, Hive, and ElasticSearch. Most tools within the Hadoop ecosystem are necessary, but we're mainly looking for Spark and Scala (Java if not Scala). Experience with streaming technologies such as Spark Streaming, Flink, or Apache Beam. Experience with Kafka is a plus. Working experience with various NoSQL databases such as Cassandra, HBase, MongoDB, and/or Couchbase. Prior Machine Learning or Deep Learning knowledge would be a plus (this will be learned on the job).

You will be working with the Marketing and Supply Chain side on a Personalization initiative, moving data feeds to and from 3rd-party vendors that handle analytics, marketing, and operations for email campaigns and catalog campaigns. Eventually the work will move into Machine Learning for product recommendations on the site.

The team is working in Spark with Scala to ingest transaction and clickstream data and come up with associations and product recommendations. You'll be working on batch processing and real-time streaming projects. In batch processing, you'll be creating Spark jobs on Azure cloud, using Azure tools for some of the scheduling and workflow management of the batch jobs. The team is currently migrating from Teradata to Microsoft Azure. Overall, you'll be building a new data platform using Spark and building out a data pipeline that takes data from transactional systems and processes it in Spark; the framework is written in Scala (or Java).

1) Basic transformations such as filter, map, and group by, and actions such as count, using the DataFrame API
2) Iterating over Scala collections
3) Spark parallelism: data ingestion from an external RDBMS, local transformations
4) Data warehouse: dimensions, facts, when to do a full load vs. an incremental load, etc.
5) Basic software engineering principles
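
To illustrate item 4, here is a minimal sketch of the difference between a full load and an incremental load. It is written in plain Python rather than the team's Spark/Scala stack so that it runs without a cluster. The function names, the `updated_at` watermark column, and the sample rows are all hypothetical and are not taken from the actual project.

```python
from datetime import datetime


def full_load(source_rows):
    """Full load: rebuild the target from every source row."""
    return {row["id"]: row for row in source_rows}


def incremental_load(target, source_rows, watermark):
    """Incremental load: upsert only rows modified after the watermark.

    Returns the updated target and the new watermark to store for the next run.
    """
    changed = [r for r in source_rows if r["updated_at"] > watermark]
    for row in changed:
        target[row["id"]] = row  # insert new keys, overwrite existing ones
    new_watermark = max((r["updated_at"] for r in changed), default=watermark)
    return target, new_watermark


# Hypothetical source table at the time of the first (full) load.
source = [
    {"id": 1, "amount": 10, "updated_at": datetime(2024, 1, 1)},
    {"id": 2, "amount": 20, "updated_at": datetime(2024, 1, 2)},
]
target = full_load(source)

# Later, one row changes and one row is added in the source system.
source[1] = {"id": 2, "amount": 25, "updated_at": datetime(2024, 1, 3)}
source.append({"id": 3, "amount": 30, "updated_at": datetime(2024, 1, 4)})

# Only rows newer than the last watermark are read and merged.
target, watermark = incremental_load(target, source, datetime(2024, 1, 2))
```

A full load is simpler and self-correcting but rereads everything. An incremental load touches only changed rows, but it depends on a reliable modification timestamp and does not detect deletes on its own.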

Regards,

Rakesh Kumar

