Talent.com
Big Data Engineer
Big Data EngineerSapphire Software Solutions • Remote, Remote, United States
Big Data Engineer

Big Data Engineer

Sapphire Software Solutions • Remote, Remote, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
  • [filters.remote]
[job_card.job_description]

Hello !!!

Hope you're doing well. I’ll get right to it. I am looking to hire an SR. Big Data Developer who has exp working with,
Spark, Scala, Databricks, SQL, Azure (nice to have). This is a project with our partner In IN. We already have someone working on this team and are looking to add one more to the existing team. If this role isn’t up your alley, I have a plethora of other roles with our Clients/vendors that I can review with you.

If you're in the market, please feel free to reach out to me and I'll go over the details with you. Lastly, I’d like to also mention, depending on your immigration status, our agency also sponsors individuals that require H1B & Green Cards. We cannot work through any layers and will be hiring the candidate on our payroll directly. Looking forward to hearing back from you.

Job Title: Sr. Big Data Engineer

Location: San Francisco, CA (open to remote)

Duration: 6 months (will extend we have multiple consultants on this team that have been there 2+ years)

Interview: 2 rounds (1st round 1-hour video technical interview, 2nd round 30 min formality personality call)

I have 8 openings in San Francisco, CA. These roles are open to remote candidates but they have to work PST hours. We have direct access to hiring managers with quick turnarounds on interviews. If candidates can crack the first interview they will get the job.

Top Skills' Details

Spark

Scala

SQL

Databricks

Azure (nice to have)

Job Description

Looking for a strong Big Data Engineer with Spark, Scala, SQL, and Azure

Architecture and Platform Organizations are looking for an experienced Big Data Engineer to build analytics and ML platforms to collect, store, process, and analyze huge sets of data spread across the organization. The platform will provide frameworks for quickly rolling out new data analysis for data-driven products and micro-services.

The platform will also enable machine/deep learning infrastructure that operationalizes data science models for broad consumption. You'll partner with end-to-end Product Managers and Data Scientists to understand customer requirements and design prototypes and bring ideas into production. You'll be developing real products. You need to be an expert in design, coding, and scripting. You'll be writing high-quality code that is consistent with our standards, creating new standards as necessary, and demonstrate correctness with pragmatic automated tests. You'll review the work of other engineers to improve quality and engineering practices and participate in continuing education programs to grow your skills. You'll be serving as a member of an Agile Engineering team and participate in the team's workflow.

Ideally 5-8 years of experience as a Software Engineer, experienced in building distributed, scalable, and reliable data pipelines that ingest and process data at scale and in batch and real-time. Strong knowledge of programming languages/tools including Java, Scala, Spark, SQL, Hive, ElasticSearch. Most tools within the Hadoop Ecosystem are necessary, but we're mainly looking for Spark and Scala (Java if not Scala). Experience with streaming technologies such as Spark Streaming, Flink, or Apache Bean. Experience with Kafka is a plus. Working experience with various NoSQL databases such as Cassandra, HBase, MongoDB, and/or Couchbase. Would be a plus if you have prior Machine Learning or Deep Learning knowledge (this will be learned in the job).

You will be working with the Marketing and Supply Chain side working on a Personalization initiative and getting data feed work to and from 3rd party vendors doing the analytics, marketing, and operations for email campaigns and catalog campaigns. Eventually will get into Machine Learning in areas of Product Recommendations on the site.

The team is working in Spark in Scala to ingest transaction and clickstream data to come up with associations and product recommendations. You'll be working on batch processing and real-time streaming projects. In batch processing, you'll be creating Spark Jobs & Azure Cloud using Azure tools to do some of the scheduling and workflow management for the batch jobs. Currently migrating from Teradata to Microsoft Azure. Overall, you'll be building a new Data Platform using Spark and building out a data pipeline from transactional systems and process them in Spark and the framework is written in Scala. (or Java)

1) Basic Transformations like filter, map & Actions like count, Group by, etc using Dataframe API
2) Iterating over Scala collections
3) Spark Parallelism – Data Ingestion from External RDBMS, Local Transformations
4) Datawarehouse – Dimensions, Facts when to do full load vs Incremental, etc’s
5) Basic software engineering principles.

Regards,

Rakesh Kumar

APPLY
[job_alerts.create_a_job]

Big Data Engineer • Remote, Remote, United States

[internal_linking.similar_jobs]
AI Data Manager

AI Data Manager

Luma AI • United States
[filters.remote]
[job_card.full_time]
Luma's mission is to build multimodal AI to expand human imagination and capabilities.We believe that multimodality is critical for intelligence.To go beyond language models and build more aware, c...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Data Engineer (Remote)

Data Engineer (Remote)

Jobgether • Cherryvale, KS, United States
[filters.remote]
[job_card.full_time]
This position is posted by Jobgether on behalf of a partner company.We are currently looking for a Data Engineer in the United States.This role offers the opportunity to design, implement, and depl...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Information Technology Professional

Information Technology Professional

U.S. Navy • Independence, KS, US
[job_card.full_time]
Information Technology Professional (IT/CTN/IS).Information Systems Technicians, Cryptologic Technician Networks, and Intelligence Specialists keep the Fleet connected, informed, and secure by oper...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Data Engineer (HealthTech)

Data Engineer (HealthTech)

AssistRx • US
[filters.remote]
[job_card.full_time]
[filters_job_card.quick_apply]
AssistRx transforms the patient journey through technology, helping patients access life-saving therapies faster and more efficiently.Our platform connects the healthcare ecosystem—patients, provid...[show_more]
[last_updated.last_updated_30]
Remote - NLP developer / Engineer

Remote - NLP developer / Engineer

Sage IT Inc • Nowata, OK, United States
[filters.remote]
[job_card.full_time]
JD :Must need Regex exp We are looking for NLP developer, someone who can manipulate text, understands linguistics (degree in linguistics or computational linguistics, or CS degree with minor in li...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Big Id

Big Id

Saxon Global • United States
[filters.remote]
[job_card.full_time]
Experience with "Big ID" is HIGHLY desired.AWS Experience is most important.Big Id Deletion and Retention Module.Experience building and orchestrating Glue jobs for ETL and data lifecycle managemen...[show_more]
[last_updated.last_updated_30] • [promoted]
Sr. Data Engineer (Informatica/PowerBI) Position - Minneapolis, MN!

Sr. Data Engineer (Informatica/PowerBI) Position - Minneapolis, MN!

American IT Systems • United States
[job_card.full_time]
[filters_job_card.quick_apply]
Data Engineer (Informatica/PowerBI) Position - Minneapolis, MN ! Location: Minneapolis, MN (REMOTE) Duration: 6-12 Months ...[show_more]
[last_updated.last_updated_variable_days]
Data Mesh Director

Data Mesh Director

Zimmer Biomet • United States
[job_card.full_time]
Zimmer Biomet is a global medical technology leader.Our team members are part of a company with a heritage of leadership, a focus on shaping the future, and a mission dedicated to alleviating pain ...[show_more]
[last_updated.last_updated_variable_days]
Senior Data Operations (DataOps) Engineer P-134

Senior Data Operations (DataOps) Engineer P-134

Smash CR • US
[job_card.full_time]
[filters_job_card.quick_apply]
We believe in long-lasting relationships with our talent.We invest time getting to know them and understanding what they seek as their professional next step.We aim to find the perfect match.As age...[show_more]
[last_updated.last_updated_30]
Lead AI Engineer - Platform

Lead AI Engineer - Platform

Tribe AI • US
[job_card.full_time]
[filters_job_card.quick_apply]
At Tribe, we’re on a mission to help enterprises realize the value of AI for their business.Today, every large enterprise wants to use AI to transform its business, but they often lack the capabili...[show_more]
[last_updated.last_updated_30]
Azure Data QA Engineer (Remote) with Security Clearance

Azure Data QA Engineer (Remote) with Security Clearance

Advantech GS Enterprises Inc • United States
[filters.remote]
[job_card.full_time]
Job Title: Azure Data QA Engineer Program: DISA (Advantech GS Enterprises) Location: Remote (Supporting DISA environments) Clearance: Active Secret or ability to obtain Position Overview Advantech ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Travel CT Tech - $2,536 per week in Neodesha, KS

Travel CT Tech - $2,536 per week in Neodesha, KS

AlliedTravelCareers • Neodesha, KS, US
[job_card.full_time]
AlliedTravelCareers is working with Blu Medstaff LLC to find a qualified CT Tech in Neodesha, Kansas, 66757!.At Blu MedStaff, we truly value our nurses and are dedicated to supporting you every ste...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Data Engineer - 100% Remote

Senior Data Engineer - 100% Remote

Genesis10 • United States
[filters.remote]
[job_card.temporary]
Genesis10 is currently seeking a Senior Data Engineer for a 6+ month contract to hire opportunity located in Minneapolis, MN.This is a remote position with our client in the financial services indu...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Data & Analytics Engineer - MS Fabric

Senior Data & Analytics Engineer - MS Fabric

TheStaffed • United States
[filters.remote]
[job_card.full_time]
Our client is seeking a hands-on Senior Data & Analytics Engineer to design and deliver enterprise-grade Microsoft Fabric Lakehouse and Power BI solutions supporting marketing and customer analytic...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Software Engineer - Data Acquisition

Senior Software Engineer - Data Acquisition

IntelliPro Group Inc. • (Multiple States), US
[job_card.full_time]
[filters_job_card.quick_apply]
Senior Software Engineer - Data Acquisition Position Type: Full time Location: Remote—Must reside within 30 miles of Portland, ME; Boston, MA; Chicago, IL; or San Francisco Bay Area, CA, Seattle, W...[show_more]
[last_updated.last_updated_30]
Remote Senior SQL Engineer - AI Trainer

Remote Senior SQL Engineer - AI Trainer

SuperAnnotate • Independence, Kansas, US
[filters.remote]
[job_card.full_time]
As a Senior SQL Engineer, you will work remotely on an hourly paid basis to review AI-generated SQL queries, database designs, and data-processing logic, as well as generate high-quality reference ...[show_more]
[last_updated.last_updated_30]
REMOTE Data Engineer in MN

REMOTE Data Engineer in MN

Insight Global • United States
[filters.remote]
[job_card.full_time]
Data Engineers to join a Personal Campaign Operations team.This is a known, visible and well-funded team bringing in millions of dollars for the company.This team supports the entire enterprise (ca...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
529601641-Software Engineer 3

529601641-Software Engineer 3

Lorven technologies • United States
[job_card.full_time]
[filters_job_card.quick_apply]
Job description: Level Description 8 or more years of experience, relies on experience and judgment to plan and accomplish goals, independently performs a variety of complicated tasks, may lead and...[show_more]
[last_updated.last_updated_variable_days]