Job Description
About Us
Kalamata Capital Group is a forward-thinking financial technology company committed to leveraging data-driven intelligence to support small business growth. We are seeking a highly skilled Data Scientist to develop predictive models, perform robust exploratory data analysis, and build scalable data pipelines that power key business decisions across the organization.
Summary
The ideal candidate is an experienced data scientist with deep technical expertise in machine learning, data engineering workflows, and statistical modeling. This role will work closely with engineering, product, and analytics teams to design, validate, and deploy ML solutions that improve decision-making efficiency. Strong proficiency in Pandas, PySpark, and MongoDB is essential, along with the ability to write clean, reproducible, production-ready code. The successful candidate will be equally comfortable communicating complex analytical insights to non-technical stakeholders.
Key Responsibilities
Exploratory Analysis & Data Profiling: Conduct EDA on large, complex datasets using Pandas and PySpark; assess data quality and structure.
Model Development: Build, tune, and evaluate supervised and unsupervised machine learning models (e.g., tree-based methods, regression, boosting algorithms).
Pipeline Engineering: Design and implement reliable, maintainable machine learning pipelines and preprocessing workflows for production environments.
Data Management: Query and integrate MongoDB datasets; design efficient schemas and aggregation pipelines that support analytical and operational workloads.
Visualization: Create intuitive visualizations using seaborn, plotly, and matplotlib to support model diagnostics and business storytelling.
Reproducible Code: Write clean, modular, well-documented Python code (PEP 8 compliant); maintain version control using Git.
Model Explainability: Apply model interpretation tools such as SHAP and LIME to evaluate feature impact and improve transparency.
Cross-Functional Collaboration: Partner with engineering, analytics, and product teams to translate business needs into actionable model-driven solutions.
Documentation: Produce clear technical memos, reports, and model documentation for internal stakeholders.
Required Skills & Qualifications
Technical Expertise:
Core Skills:
Preferred (Bonus) Skills
Flexible work from home options available.
Scientist Machine Learning • San Francisco, CA, US