Dhruv Sondhi
@dhruvsondhi
I am a machine learning researcher specializing in NLP for astrophysics and scientific software.
What I'm looking for
I am pursuing a Master of Computer Science with interdisciplinary research in astrophysics and machine learning, focusing on automated identification and classification of astrophysical named entities using large language models.
I integrated the Universal Astronomy Thesaurus (UAT) to enrich an ADS dataset, developed data-augmentation techniques including duplication and substitution of context-important entities, and fine-tuned models such as AstroBERT to improve vocabulary coverage and downstream performance.
During a Mitacs Global Research Internship, I applied NLP to the NASA ADS literature to identify trends across astrophysics subdomains and to assess how machine learning is used in them. I developed ML models and researched text-processing techniques, including RNNs and other deep-learning approaches, for extracting relationships from large corpora.
As a Google Summer of Code contributor to the TARDIS scientific Python project, I developed logging and packet-tracking frameworks, used Numba for JIT-compiled performance, and contributed features to the main project branch. I have presented work at NLP for Space (ESAC), ADASS 2024, CASCA Toronto, and OGMC-UWaterloo.
Experience
Masters Researcher
Western University
Aug 2023 - Present (2 years)
Leveraged large language models to identify astrophysical named entities and classify research papers; integrated the Universal Astronomy Thesaurus to enrich the ADS dataset and developed data-augmentation methods to improve NER and disambiguation performance.
Research Intern
Western University
May 2022 - Aug 2022 (3 months)
Applied NLP and ML to the NASA ADS research literature (Kaggle dataset, ~2M papers) to identify trends across astrophysics subdomains and evaluate how ML techniques are used, under the supervision of Dr. Pauline Barmby.
Developer & Researcher
TARDIS
May 2021 - Aug 2021 (3 months)
Selected for Google Summer of Code (GSoC) from ~1,600 applicants. Developed logging and packet-tracking frameworks for the TARDIS scientific Python project, implemented JIT optimizations with Numba, and contributed features that were merged into master.
Education
Western University
Master of Computer Science, Computer Science
2023 -
Grade: 3.9/4.0
Conducting master's research applying large language models to identify astrophysical named entities and classify research papers, integrating the Universal Astronomy Thesaurus to augment the ADS dataset and improve NER performance.
Delhi Technological University
Bachelor of Technology, Computer Science and Environmental Engineering
2019 - 2023
Grade: 8.9/10.0
Completed a Bachelor of Technology in Computer Science and Environmental Engineering with coursework in machine learning, artificial intelligence, and data science.
