Open to opportunities

Bhanu Thota

@bhanuthota

Award-winning AI/ML engineer building high-performance GenAI systems and scalable production ML platforms.

United States

What I'm looking for

I seek senior roles building scalable GenAI/ML platforms with strong MLOps, low-latency inference, cross-functional leadership, and impact at enterprise scale.

I am an award-winning AI/ML engineer with 6+ years architecting enterprise-scale GenAI solutions, production ML systems, and low-latency inference services. I specialize in LLMs (GPT-4, LLaMA, Mistral), RAG architectures, model optimization, and distributed systems, and have delivered measurable outcomes including $2M+ cost savings, 40% performance improvements, and 99.9% system reliability.

I design and deploy cloud-native, multi-tenant platforms (AWS/Azure/GCP) with strong MLOps practices, automated CI/CD, and observability, and I mentor engineering teams to move initiatives from POC to production at Fortune 500 scale. My work spans real-time multimodal pipelines, high-throughput APIs, and productionized fine-tuning workflows that prioritize low latency, cost-efficiency, and compliance.

Experience

Work history, roles, and key accomplishments

Current

AI/ML Engineer

CentryStone

Feb 2025 - Present (1 year)

Architected enterprise GenAI platform on AWS SageMaker/Bedrock and GCP Vertex AI serving 100K+ concurrent users with 99.99% uptime and sub-200ms P99 latency; led $5M+ client engagements and implemented low-latency C++/Python inference services that achieved 3x throughput and $50K annual GPU cost reduction.

Teaching & Research Assistant

University of Massachusetts Dartmouth

Jan 2024 - Dec 2024 (11 months)

Pioneered real-time 3D human pose estimation and multimodal action-recognition pipelines, achieving 30+ FPS and a 25% accuracy improvement; optimized transformer inference (C++/CUDA), reducing latency by 20%; and deployed NLP evaluation APIs that processed 5K+ applications with 85% accuracy.

Senior Software Engineer

Netcracker Technology

Apr 2022 - Dec 2022 (8 months)

Architected distributed Spark/Scala ETL pipelines and a Snowflake data warehouse, improving throughput by 60% and reducing query latency by 35%; led a cloud-native migration to AWS/Snowflake/Kubernetes and maintained 99.95% reliability for AI/ML data processing.

Software Engineer

Netcracker Technology

Aug 2019 - Mar 2022 (2 years 7 months)

Engineered high-throughput RESTful APIs and refactored monoliths into microservices, supporting 10M+ daily transactions and achieving a 40% scalability improvement; built an ML churn-prediction model (78% accuracy) and cut deployment time from 4 hours to 30 minutes via CI/CD automation.

Education

Degrees, certifications, and relevant coursework

University of Massachusetts Dartmouth

Master of Science, Data Science

Grade: 3.97/4.0

Activities and societies: Teaching & Research Assistant; taught Algorithms & Complexity; led multimodal AI and real-time 3D pose estimation research; Graduate Teaching Assistant Award (2024).

Completed a Master of Science in Data Science with a 3.97/4.0 GPA, focusing on multimodal AI systems and real-time inference optimization.

Vasavi College of Engineering (Osmania University)

Bachelor of Engineering, Mechanical Engineering

Grade: 3.53/4.0

Earned a Bachelor of Engineering in Mechanical Engineering with a 3.53/4.0 GPA, completing coursework and projects in engineering fundamentals.
