Vikram Tharakan
@vikramtharakan
Machine Learning Engineer specializing in large-scale graph + NLP systems for production analytics.
What I'm looking for
I’m a Machine Learning Engineer who builds production-ready intelligence systems that turn messy, heterogeneous data into actionable insights—often using graph models, NLP, and real-time entity resolution.
At Booz Allen Hamilton, I architected and led delivery of a large-scale data fusion platform that ingests vendor datasets, normalizes content via Apache Tika, and transforms it into a unified graph data model with Databricks, Apache Spark, NiFi, AWS, Elasticsearch, and JanusGraph. Within a ~10-person team, I drove end-to-end delivery under tight deadlines—spanning data engineering, integration, and deployment—so the resulting service is actively used by agents in the field.
I’ve also focused on high-impact data intelligence pipelines: I contributed to an entity resolution workflow combining Fellegi-Sunter probabilistic linkage with rule-based candidate generation, and I engineered a scalable entity deletion service from the ground up with cascading dependency resolution across multi-source linked graphs.
Earlier, at MITRE, I developed ETL microservices for document ingestion into Elasticsearch using NiFi, Tika, OCR, caching, and neural machine translation. I fine-tuned extractive QA transformer models (NER/EQA) with an active learning loop to reach strong F1 performance, and I co-developed a deep metric learning recommender system over geospatial data—then deployed relationship graphs in Neo4j to surface cross-source linkages.
Experience
Work history, roles, and key accomplishments
Architected and led delivery of a large-scale data fusion platform ingesting heterogeneous vendor datasets into a unified graph model, enabling analyst-facing graph search at enterprise scale. Built entity resolution and cascading entity deletion services and applied NLP/NER and LLM-based approaches to accelerate time-sensitive client analysis.
Deep Learning Engineer
MITRE
Aug 2020 - Jul 2022 (1 year 11 months)
Developed and maintained ETL microservices to ingest documents using NiFi, Apache Tika, OCR (Tesseract), and neural machine translation (Opus MT) into Elasticsearch. Fine-tuned extractive QA (EQA) transformer models for NER with active learning to improve F1 to 93%, and built deep metric learning recommenders and cross-vendor relationship graphs.
Accelerator Operator Engineer
Stanford Linear Accelerator Center
Mar 2018 - Jul 2020 (2 years 4 months)
Operated LCLS and SPEAR accelerator beamlines, delivering reliable photon beam for experimental users across multiple facilities. Automated operational diagnostics and monitoring to improve shift efficiency, and redesigned the onboarding curriculum to cut operator ramp-up from 10 months to 3 months, saving an estimated ~$56,000 per trainee in person-hours.
Education
Degrees, certifications, and relevant coursework
Udacity
Data Analyst Nanodegree, Data Analytics
2019 -
Completed the Udacity Data Analyst Nanodegree in 2019.
U.S. Particle Accelerator School
Accelerator Physics Fundamentals, Accelerator Physics
2019 -
Completed Accelerator Physics Fundamentals at the U.S. Particle Accelerator School in January 2019.
Udacity
Data Scientist Nanodegree, Data Science
2019 -
Completed the Udacity Data Scientist Nanodegree in 2019.
University of California, Santa Barbara
Bachelor of Science, Physics
2013 - 2017
Grade: GPA 3.5
Earned a B.S. in Physics from the University of California, Santa Barbara (GPA 3.5).
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Vikram?
You can contact Vikram and 90k+ other talented remote workers on Himalayas.
Message VikramFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
