Lokesh Jain
@lokeshjain1
Senior data engineer and machine learning practitioner specializing in scalable cloud-native data platforms.
What I'm looking for
I am a senior data engineer and machine learning practitioner with experience building scalable ETL pipelines, real-time streaming systems, and production ML services on GCP and Hadoop ecosystems. I deliver performance improvements, reduce operational incidents, and enable data-driven decisions through optimized data models and dashboarding.
My background spans end-to-end ML solutions from dataset construction and deep learning model development to deployment and monitoring; projects include YOLOv8 image models, ESRGAN-enhanced object detection, and instruction-tuning experiments with Llama-2. I have repeatedly cut pipeline runtimes, improved model accuracy, and reduced costs across roles.
I mentor and lead cross-functional teams, evangelize code quality and reliability practices, and implement repeatable CI/CD and validation frameworks to accelerate delivery while maintaining system stability and data quality.
Experience
Work history, roles, and key accomplishments
Built scalable ETL workflows on GCP using Scala and Python processing terabytes in BigQuery and Airflow, reducing manual intervention by 50% and cutting pipeline runtime by 83% via Spark and Kafka optimizations.
Data Scientist
BNMC Inc.
Mar 2024 - Nov 2024 (8 months)
Built modular preprocessing and training pipelines on GCP and engineered a YOLOv8 model on 100,000+ images achieving 95% accuracy, improving data quality and real-time detection with Kafka integrations.
Research Assistant
University at Buffalo
Jun 2023 - Dec 2023 (6 months)
Constructed training datasets and implemented a TensorFlow model to classify mythological figures in Bengal scrolls, achieving 84% accuracy through preprocessing with OpenCV and scikit-learn.
Machine Learning Engineer
Visionindia Software Exports Limited
Jan 2019 - Aug 2022 (3 years 7 months)
Built and deployed forecasting models with Python and Spark reducing holding costs by 18%, engineered ETL pipelines with PostgreSQL and Airflow improving model accuracy and reducing execution time by 20%.
Education
Degrees, certifications, and relevant coursework
University at Buffalo, The State University of New York
Master of Science, Data Science
2022 - 2024
Activities and societies: Projects: AI teaching assistant (GPT-3.5, BERT), ESRGAN+Attention with YOLOv5 for drone imagery, instruction-tuning exploration with OpenAI and HuggingFace.
Completed a Master of Science in Data Science with projects in NLP, computer vision, and deep learning, including development of AI-driven applications and super-resolution GANs for object detection.
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Lokesh?
You can contact Lokesh and 90k+ other talented remote workers on Himalayas.
Message LokeshFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
