Skip to main content
HimalayasHimalayas logo
Bhanu ThotaBT
Open to opportunities

Bhanu Thota

@bhanuthota

Award-winning AI/ML engineer building high-performance GenAI systems and scalable production ML platforms.

United States
Message

What I'm looking for

I seek senior roles building scalable GenAI/ML platforms with strong MLOps, low-latency inference, cross-functional leadership, and impact at enterprise scale.

I am an award-winning AI/ML engineer with 6+ years architecting enterprise-scale GenAI solutions, production ML systems, and low-latency inference services. I specialize in LLMs (GPT-4, LLaMA, Mistral), RAG architectures, model optimization, and distributed systems, and have delivered measurable outcomes including $2M+ cost savings, 40% performance improvements, and 99.9% system reliability.

I design and deploy cloud-native, multi-tenant platforms (AWS/Azure/GCP) with strong MLOps practices, automated CI/CD, and observability, and I mentor engineering teams to move initiatives from POC to production at Fortune 500 scale. My work spans real-time multimodal pipelines, high-throughput APIs, and productionized fine-tuning workflows that prioritize low latency, cost-efficiency, and compliance.

Experience

Work history, roles, and key accomplishments

CE
Current

AI/ML Engineer

CentryStone

Feb 2025 - Present (1 year 4 months)

Architected enterprise GenAI platform on AWS SageMaker/Bedrock and GCP Vertex AI serving 100K+ concurrent users with 99.99% uptime and sub-200ms P99 latency; led $5M+ client engagements and implemented low-latency C++/Python inference services that achieved 3x throughput and $50K annual GPU cost reduction.

University of Massachusetts Dartmouth logoUD

Teaching & Research Assistant

University of Massachusetts Dartmouth

Jan 2024 - Dec 2024 (11 months)

Pioneered real-time 3D human pose estimation and multimodal action-recognition pipelines, achieving 30+ FPS and 25% accuracy improvement; optimized transformer inference (C++/CUDA) reducing latency 20% and deployed NLP evaluation APIs processing 5K+ applications with 85% accuracy.

Netcracker Technology logoNT

Senior Software Engineer

Netcracker Technology

Apr 2022 - Dec 2022 (8 months)

Architected distributed Spark/Scala ETL pipelines and Snowflake data warehouse, improving throughput 60% and reducing query latency 35%; led cloud-native migration to AWS/Snowflake/Kubernetes and maintained 99.95% reliability for AI/ML data processing.

Netcracker Technology logoNT

Software Engineer

Netcracker Technology

Aug 2019 - Mar 2022 (2 years 7 months)

Engineered high-throughput RESTful APIs and refactored monoliths into microservices, supporting 10M+ daily transactions and achieving 40% scalability improvement; built ML churn prediction (78% accuracy) and reduced deployments from 4 hours to 30 minutes via CI/CD automation.

Education

Degrees, certifications, and relevant coursework

University of Massachusetts Dartmouth logoUD

University of Massachusetts Dartmouth

Master of Science, Data Science

Grade: 3.97/4.0

Activities and societies: Teaching & Research Assistant; taught Algorithms & Complexity; led multimodal AI and real-time 3D pose estimation research; Graduate Teaching Assistant Award (2024).

Completed Master of Science in Data Science with a 3.97/4.0 GPA, focusing on multimodal AI systems and real-time inference optimizations.

VU

Vasavi College of Engineering (Osmania University)

Bachelor of Engineering, Mechanical Engineering

Grade: 3.53/4.0

Earned a Bachelor of Engineering in Mechanical Engineering with a 3.53/4.0 GPA, completing coursework and projects in engineering fundamentals.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan