Swapnil Kadekar
@swapnilkadekar
AI/ML Engineer building production generative AI systems across LLMs, vision, and embedded devices.
What I'm looking for
I’m an AI/ML Engineer specializing in “Lab-to-Producon,” focusing on getting generative AI from prototypes into real, reliable systems. I’ve worked on end-to-end AI/ML solutions across LLMs, vision, and embedded platforms.
At OStream (an NVIDIA Incepon company), I reported/developed directly with the founder and CEO while tackling diverse AI technologies. I built a private Retrieval Augmented Generaon (RAG) system (vector database + DeepSeek R1) for a $7B rail/transportaon project, addressing high-speed correspondence from pilot to producon.
I’ve fine-tuned and deployed models for multimodal and NLP use cases, including VLMs and transformer-based pipelines (BERT, BART, sentence transformers) for tasks like sentiment analysis, classification, clustering, and entity recognition. I also led performance benchmarking of embedded large language and vision models on NVIDIA Orin and Rockchip NPU devices for real-time inference optimization.
I bring strong engineering rigor, including building production-grade embedded vision transformer camera systems using TensorFlow, creating reliable data pipelines with Databricks, and developing frameworks like NVIDIA Lab’s generative AI API gateway (Gapi). I’m especially driven to ship practical, high-quality AI that works under real constraints.
Experience
Work history, roles, and key accomplishments
AI / ML Engineer
OStream
Jan 2021 - Present (5 years 5 months)
Developed a private Retrieval Augmented Generation (RAG) system (vector database + DeepSeek R1) for a $7B rail/transportation project and fine-tuned vision-language models for multimodal applications. Led embedded LLM/vision model benchmarking on NVIDIA Orin and Rockchip NPUs and deployed production-grade embedded vision transformer camera solutions.
Research and Development Intern
University of Tor Vergata
Jun 2018 - Aug 2018 (2 months)
Built interactive real-time sensor data dashboards using JavaScript, Elasticsearch, and Kibana to uncover actionable patterns. Developed long-range communication protocols using LoRa for reliable aggregation of distributed sensor data into a centralized server.
Education
Degrees, certifications, and relevant coursework
Pace University (Seidenberg School)
Master of Science (MS), Computer Science
Earned an M.S. in Computer Science from Pace University.
University of Rome Tor Vergata
Internship, Research & Development
2018 -
Completed a research and development internship working on interactive real-time dashboards and long-range LoRa-based distributed sensor communication and analytics.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Swapnil?
You can contact Swapnil and 90k+ other talented remote workers on Himalayas.
Message SwapnilFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
