Himalayas logo
SZ
Open to opportunities

Soumya Zacharia

@soumyazacharia

AI and Data Engineer building scalable pipelines and production AI solutions.

Qatar
Message

What I'm looking for

I seek a role building production AI/data systems where I can create scalable pipelines, deploy models, collaborate cross-functionally, and deliver measurable product or research impact in a fast-paced, engineering-driven team.

I am an AI and Data Engineer with a Master’s in Data Analytics, experienced in end-to-end AI application development, backend engineering, data analysis, visualization, and statistical modeling. I have delivered measurable outcomes across supply chain, genomics, and clinical research by integrating AI into production systems.

At Beebolt I architected data pipelines, built LLM-based chatbots, and implemented document parsing and automation that reduced manual input by 80% and cut accounting reconciliation time by 70%. Previously, I developed parallel bioinformatics pipelines and predictive models at Dana-Farber and Brigham & Women's Hospital to support genomic research and published results in leading medical journals.

I combine strong software engineering practices, cloud deployment experience, and domain expertise in bioinformatics to solve complex problems and drive product and research impact. I’m collaborative, quality-focused, and passionate about building scalable, maintainable systems that translate data into actionable insights.

Experience

Work history, roles, and key accomplishments

BE

Software and AI Engineer

Beebolt

Jul 2022 - Sep 2024 (2 years 2 months)

Integrated AI capabilities into a supply-chain platform to automate document parsing, invoice reconciliation, and order creation, reducing manual input by 80% and cutting accounting reconciliation time by 70%. Led full-stack feature development including APIs, PostgreSQL backends, CI/CD and chatbots to improve user engagement and support efficiency.

Dana-Farber Cancer Institute logoDI

Bioinformatics Data Analyst

Oct 2020 - Jun 2022 (1 year 8 months)

Developed parallel processing pipelines and predictive models for high-throughput DNA sequencing to analyze cancer vs. healthy samples, supporting research and contributing to published studies. Implemented statistical analyses and visualization workflows to detect mutational signatures and microsatellite instability.

BH

Data Science Intern

Brigham & Women's Hospital

Jan 2019 - Jul 2019 (6 months)

Built reproducible genome analysis pipelines and created Tableau dashboards to analyze clinical study data, enabling association tests and variant-gene mapping that accelerated decision making and delivered a 93% accuracy regression result for phenotype associations.

Education

Degrees, certifications, and relevant coursework

Northeastern University logoNU

Northeastern University

Master of Science, Data Analytics Engineering

2018 - 2020

Grade: 3.92/4

Completed a Master of Science in Data Analytics Engineering with coursework in statistics, databases, machine learning, and visualization, achieving a 3.92 GPA.

University of Kerala logoUK

University of Kerala

Bachelor of Engineering, Electronics & Communication

2012 - 2016

Grade: 8.34/10

Activities and societies: IEEE Women In Engineering student branch leadership and organizing technical workshops and outreach programs.

Earned a Bachelor of Engineering in Electronics & Communication with coursework in C++, signal processing, and computer architecture, graduating with an 8.34/10 GPA.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Soumya Zacharia - Software and AI Engineer - Beebolt | Himalayas