Data Scientist

Fusemachines is a leading AI company focused on democratizing artificial intelligence and providing tailored solutions for enterprises worldwide.

Fusemachines

Employee count: 201-500

India only

About Fusemachines
Fusemachines is a 10+ year old AI company, dedicated to delivering state-of-the-art AI products and solutions to a diverse range of industries. Founded by Sameer Maskey, Ph.D., an Adjunct Associate Professor at Columbia University, our company is on a steadfast mission to democratize AI and harness the power of global AI talent from underserved communities. With a robust presence in four countries and a dedicated team of over 400 full-time employees, we are committed to fostering AI transformation journeys for businesses worldwide. At Fusemachines, we not only bridge the gap between AI advancement and its global impact but also strive to deliver the most advanced technology solutions to the world.

About the Role:

Location: Remote | Contractual Full-time

We are seeking a Data Scientist with hands-on Python experience and proven abilities to support software activities in an Agile software development lifecycle. We are seeking a well-rounded developer to lead a cloud-based big data application using a variety of technologies. The ideal candidate will possess strong technical, analytical, and interpersonal skills. In addition, the candidate will lead developers on the team to achieve architecture and design objectives as agreed with stakeholders.

Role Description

Work with developers on the team to meet product deliverables.
Work independently and collaboratively on a multi-disciplined project team in an Agile development environment.
Contribute detailed design and architectural discussions as well as customer requirements sessions to support the implementation of code and procedures for our big data product.
Design and develop clear and maintainable code with automated open-source test functions
Ability to identify areas of code/design optimization and implementation.
Learn and integrate with a variety of systems, APIs, and platforms.
Interact with a multi-disciplined team to clarify, analyze, and assess requirements.
Be actively involved in design, development, and testing activities in big data applications.

Key Responsibilities

Data Engineering & Processing:

Develop scalable data pipelines using PySpark for processing large datasets.
Work extensively in Databricks for collaborative data science workflows and model deployment.
Handle messy, unstructured, and semi-structured data, performing thorough Exploratory Data Analysis (EDA).
Apply appropriate statistical measures and hypothesis testing to derive insights and validate assumptions.

Data Analysis & Modeling:

Write complex SQL queries for data extraction, transformation, and analysis.
Build and validate predictive models using techniques such as: Gradient Boosting Machines (GBMs) (e.g., XGBoost, LightGBM), Generalized Linear Models (GLMs) (e.g., logistic regression, Poisson regression)
Apply unsupervised learning techniques like clustering (K-Means, DBSCAN), PCA, and anomaly detection.

Automation & Optimization:

Automate data workflows and model training pipelines using scheduling tools (e.g., Airflow, Databricks Jobs).
Optimize model performance and data processing efficiency.

Cloud & Deployment:

Basic experience with Azure or other cloud platforms (AWS, GCP) for data storage, compute, and model deployment.
Familiarity with cloud-native tools like Azure Data Lake, Azure ML, or equivalent.

Required Skills:

Programming Languages: Python (with PySpark), SQL
Tools & Platforms: Databricks, Azure (or other cloud platforms), Git
Libraries & Frameworks: scikit-learn, pandas, numpy, matplotlib/seaborn, XGBoost/LightGBM
Statistical Knowledge: Hypothesis testing, confidence intervals, correlation analysis
Machine Learning: Supervised and Unsupervised learning, model evaluation metrics
Data Handling: EDA, feature engineering, dealing with missing/outlier data
Automation: Experience with job scheduling and pipeline automation.

Required Experience:

Minimum 5+ years in Data Science or related fields
Hands on experience with Databricks.
Experience with data cleansing, transformation, and validation.
Proven technical leadership on prior development projects.
Hands-on experience with versioning tools such as GitHub, Azure Devops, Bitbucket, etc.
Hands-on experience building pipelines in GitHub (or Azure Devops, etc.)
Hands-on experience using Relational Databases, such as Oracle, SQL Server, MySQL, Postgres or similar.
Experience using Markdown to document code in repositories or automated documentation tools like PyDoc.
Strong written and verbal communication skills.

Preferred Qualifications:

Experience with data visualization tools such as Power BI or Tableau.
Experience with MLOps, DEVOPS CI/CD tools and automation processes (e.g., Azure DevOPS, GitHub, BitBucket).
Containers and their environments (Docker, Podman, Docker-Compose, Kubernetes, Minikube, Kind, etc.)
Experience working in cross-functional teams and communicating insights to stakeholders.

Education
Master of Science/B. Tech degree from an accredited university

Fusemachines is an Equal Opportunities Employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local laws.

Apply now

Please let Fusemachines know you found this job on Himalayas. This helps us grow!

Apply now

About the job

Apply before

Oct 30, 2025

Posted on

Aug 31, 2025

Job type

Contractor

Experience level

Senior

Location requirements

India

Hiring timezones

India +/- 0 hours

About Fusemachines

Learn more about Fusemachines and their company culture.

View company profile

Fusemachines is an innovative leader in the field of Artificial Intelligence, providing cutting-edge AI products and solutions to a wide range of industries. Established in 2013, Fusemachines has dedicated over a decade to transforming enterprises by leveraging its proprietary AI technologies and tailored solutions. Our mission is to 'Democratize AI' by making advanced technology accessible to everyone, particularly underserved communities. We believe that AI has the potential to revolutionize industries, enhance efficiency, and create new opportunities for growth and innovation.

Our flagship offering, the Fusemachines AI Studio™, is designed to empower enterprises to develop, launch, and manage AI applications using our GenAI Engines™ and Predictive AI Engines™. These tools enable organizations to streamline their data processes, improve decision-making, and create personalized customer experiences. Through our comprehensive AI transformation journey, we assist businesses in enhancing their operational capabilities while nurturing AI talent from diverse backgrounds. Under the leadership of Dr. Sameer Maskey, our founder and CEO, we are not only striving to provide AI solutions but also contributing to AI education and job opportunities across the globe.

Apply now

Please let Fusemachines know you found this job on Himalayas. This helps us grow!

Apply now

About the job

Apply before

Oct 30, 2025

Posted on

Aug 31, 2025

Job type

Contractor

Experience level

Senior

Location requirements

India

Hiring timezones

India +/- 0 hours

Claim this profile

Fusemachines

Company size

201-500 employees

Founded in

2013

Chief executive officer

Dr. Sameer Maskey

Markets

Artificial Intelligence AI Solutions AI Education Machine Learning Predictive Analytics Enterprise Software Data Science Technology Consulting Software Development Digital Transformation

Employees live in

United States

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

India only

Senior Data Scientist

HighLevel

Employee count: 1001-5000

Full Time

Revenue Operations

37 remote jobs at Fusemachines

Explore the variety of open remote roles at Fusemachines, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Fusemachines

Mexico only

Solution Data Architect

Fusemachines

Employee count: 201-500

Full Time

Top remote companies

Remote companies like Fusemachines

Find your next opportunity by exploring profiles of companies that are similar to Fusemachines. Compare culture, benefits, and job openings on Himalayas.

View all companies

InfuseAI

Benefits Tech stack

InfuseAI builds MLOps tools to streamline ML and data workflow.

Machine Learning AI

Obviously AI

Tech stack

Obviously AI is a no-code AI platform that enables businesses to build and deploy predictive machine learning models quickly and without coding expertise. Their mission is to make AI accessible to every company, empowering non-technical users to make data-driven decisions.

Artificial Intelligence Machine Learning

Top remote companies

Remote companies like Fusemachines

Find your next opportunity by exploring profiles of companies that are similar to Fusemachines. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Data Scientist

Role Description

Key Responsibilities

Required Skills:

Apply now

About the job

Apply before

Posted on

Job type

Experience level

Location requirements

Hiring timezones

Job categories

Skills

About Fusemachines

Apply now

About the job

Apply before

Posted on

Job type

Experience level

Location requirements

Hiring timezones

Job categories

Skills

Fusemachines

Company size

Founded in

Chief executive officer

Markets

Employees live in

Similar remote jobs

Senior Data Scientist

37 remote jobs at Fusemachines

Solution Data Architect

Remote companies like Fusemachines

Remote companies like Fusemachines

Find your dream job

Find your dream job

Find your dream job

Senior Data Scientist

Solution Data Architect

Data Scientist

Data Scientist

Senior Data Scientist

Data Scientist

Data Scientist

Solution Data Architect

Solution Data Architect

Solution Data Architect

Solution Data Architect

Senior Data Scientist with LLM experience

Remote companies like Fusemachines