Soumya Zacharia
@soumyazacharia
AI and Data Engineer building scalable pipelines and production AI solutions.
What I'm looking for
I am an AI and Data Engineer with a Master’s in Data Analytics, experienced in end-to-end AI application development, backend engineering, data analysis, visualization, and statistical modeling. I have delivered measurable outcomes across supply chain, genomics, and clinical research by integrating AI into production systems.
At Beebolt I architected data pipelines, built LLM-based chatbots, and implemented document parsing and automation that reduced manual input by 80% and cut accounting reconciliation time by 70%. Previously, I developed parallel bioinformatics pipelines and predictive models at Dana-Farber and Brigham & Women's Hospital to support genomic research and published results in leading medical journals.
I combine strong software engineering practices, cloud deployment experience, and domain expertise in bioinformatics to solve complex problems and drive product and research impact. I’m collaborative, quality-focused, and passionate about building scalable, maintainable systems that translate data into actionable insights.
Experience
Work history, roles, and key accomplishments
Software and AI Engineer
Beebolt
Jul 2022 - Sep 2024 (2 years 2 months)
Integrated AI capabilities into a supply-chain platform to automate document parsing, invoice reconciliation, and order creation, reducing manual input by 80% and cutting accounting reconciliation time by 70%. Led full-stack feature development including APIs, PostgreSQL backends, CI/CD and chatbots to improve user engagement and support efficiency.
Developed parallel processing pipelines and predictive models for high-throughput DNA sequencing to analyze cancer vs. healthy samples, supporting research and contributing to published studies. Implemented statistical analyses and visualization workflows to detect mutational signatures and microsatellite instability.
Data Science Intern
Brigham & Women's Hospital
Jan 2019 - Jul 2019 (6 months)
Built reproducible genome analysis pipelines and created Tableau dashboards to analyze clinical study data, enabling association tests and variant-gene mapping that accelerated decision making and delivered a 93% accuracy regression result for phenotype associations.
Optimized SQL queries and led testing for portal and CRM applications, improving task completion time by 40% and backlog delivery by 25% through operational dashboards and Agile test strategies.
Education
Degrees, certifications, and relevant coursework
Northeastern University
Master of Science, Data Analytics Engineering
2018 - 2020
Grade: 3.92/4
Completed a Master of Science in Data Analytics Engineering with coursework in statistics, databases, machine learning, and visualization, achieving a 3.92 GPA.
University of Kerala
Bachelor of Engineering, Electronics & Communication
2012 - 2016
Grade: 8.34/10
Activities and societies: IEEE Women In Engineering student branch leadership and organizing technical workshops and outreach programs.
Earned a Bachelor of Engineering in Electronics & Communication with coursework in C++, signal processing, and computer architecture, graduating with an 8.34/10 GPA.
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Soumya?
You can contact Soumya and 90k+ other talented remote workers on Himalayas.
Message SoumyaFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
