Skip to main content
SK
Open to opportunities

Swapnil Kadekar

@swapnilkadekar

AI/ML Engineer building production generative AI systems across LLMs, vision, and embedded devices.

United States
Message

What I'm looking for

I’m looking for a role where I can build and scale “Lab-to-Producon” generative AI—LLMs, vision, and embedded inference—working closely with product and leadership to ship reliable systems.

I’m an AI/ML Engineer specializing in “Lab-to-Producon,” focusing on getting generative AI from prototypes into real, reliable systems. I’ve worked on end-to-end AI/ML solutions across LLMs, vision, and embedded platforms.

At OStream (an NVIDIA Incepon company), I reported/developed directly with the founder and CEO while tackling diverse AI technologies. I built a private Retrieval Augmented Generaon (RAG) system (vector database + DeepSeek R1) for a $7B rail/transportaon project, addressing high-speed correspondence from pilot to producon.

I’ve fine-tuned and deployed models for multimodal and NLP use cases, including VLMs and transformer-based pipelines (BERT, BART, sentence transformers) for tasks like sentiment analysis, classification, clustering, and entity recognition. I also led performance benchmarking of embedded large language and vision models on NVIDIA Orin and Rockchip NPU devices for real-time inference optimization.

I bring strong engineering rigor, including building production-grade embedded vision transformer camera systems using TensorFlow, creating reliable data pipelines with Databricks, and developing frameworks like NVIDIA Lab’s generative AI API gateway (Gapi). I’m especially driven to ship practical, high-quality AI that works under real constraints.

Experience

Work history, roles, and key accomplishments

OS
Current

AI / ML Engineer

OStream

Jan 2021 - Present (5 years 5 months)

Developed a private Retrieval Augmented Generation (RAG) system (vector database + DeepSeek R1) for a $7B rail/transportation project and fine-tuned vision-language models for multimodal applications. Led embedded LLM/vision model benchmarking on NVIDIA Orin and Rockchip NPUs and deployed production-grade embedded vision transformer camera solutions.

UV

Research and Development Intern

University of Tor Vergata

Jun 2018 - Aug 2018 (2 months)

Built interactive real-time sensor data dashboards using JavaScript, Elasticsearch, and Kibana to uncover actionable patterns. Developed long-range communication protocols using LoRa for reliable aggregation of distributed sensor data into a centralized server.

Education

Degrees, certifications, and relevant coursework

Pace University (Seidenberg School) logoPS

Pace University (Seidenberg School)

Master of Science (MS), Computer Science

Earned an M.S. in Computer Science from Pace University.

University of Rome Tor Vergata logoUV

University of Rome Tor Vergata

Internship, Research & Development

2018 -

Completed a research and development internship working on interactive real-time dashboards and long-range LoRa-based distributed sensor communication and analytics.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan