Aashish Rana
@aashishrana
Data Scientist delivering production LLM systems, RAG, and enterprise NLP.
What I'm looking for
I’m a Data Scientist with 3+ years of hands-on experience building production AI systems for enterprise clients. I architect LLM solutions like Retrieval-Augmented Generation (RAG), AI agents, and LLM-powered chatbots that move from prototype to deployment.
At Tatras Data, I designed production-grade RAG pipelines using Pinecone and AWS Bedrock, combining semantic + keyword hybrid search with reranking to improve response accuracy. I also built scalable LLM report generation pipelines with AWS Bedrock and OpenAI APIs.
I focus on practical, end-to-end delivery across the AI and data stack—building SQL agents and GraphRAG-based assistants, and developing ETL pipelines with PySpark, AWS Glue, and Apache Airflow into Snowflake for downstream analytics. I’ve also implemented summarization pipelines using LLMs and FAISS.
I’ve strengthened core ML/NLP performance through fine-tuning transformer models like BERT and RoBERTa for multi-label and multi-class classification, and I’ve led teams by mentoring interns to design and deploy an AI-powered research paper chatbot. Earlier, as a Junior Data Scientist and Intern, I built machine learning pipelines and worked with Sentinel-2 imagery for object detection and land-use classification.
Experience
Work history, roles, and key accomplishments
Data Scientist
Tatras Data
May 2023 - Present (3 years)
Built RAG-based LLM systems (Pinecone, AWS Bedrock) with hybrid search and reranking. Developed LLM report pipelines (Bedrock, OpenAI). Created SQL agents and chatbots on OpenSearch (millions of rows). Built ETL pipelines (PySpark, Glue, Airflow → Snowflake). Implemented summarization (FAISS) and fine-tuned BERT/RoBERTa. Led AI research chatbot team.
Jr. Data Scientist
Tatras Data
Sep 2022 - May 2023 (8 months)
Built end-to-end ML pipelines (preprocessing, feature engineering, training, evaluation) using Python, scikit-learn, and PyTorch. Fine-tuned BERT/RoBERTa for multi-class text classification on domain datasets. Mentored interns on satellite imagery classification projects.
Data Science Intern
Sabudh Foundation
Jan 2022 - Jul 2022 (6 months)
Built an object detection and land-use classification system using Sentinel-2 satellite imagery by extracting and processing multi-temporal data from Sentinel Hub with cloud filtering, noise reduction, and normalization. Trained and evaluated Random Forest and XGBoost classification models across datasets.
Education
Degrees, certifications, and relevant coursework
Uttranchal University
BCA, Computer Applications
2026 -
Availability
Location
Authorized to work in
Portfolio
github.com/RanaAashishSocial media
Job categories
Interested in hiring Aashish?
You can contact Aashish and 90k+ other talented remote workers on Himalayas.
Message AashishFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
