Kumar Chodavarapu
@kumarchodavarapu
I’m an MLOps engineer and ML engineer building GPU-optimized, end-to-end production ML platforms.
What I'm looking for
I’m an MLOps engineer with 12+ years of progressive experience, starting in core IT systems engineering and evolving through DevOps, ML engineering, and now MLOps. I intentionally carry the Linux, automation, and infrastructure foundation into every ML system I build.
Since 2019, I’ve applied that depth to real AI/ML workloads—building telecom network ML models at Ericsson using real client datasets, then leading full MLOps platform engineering at Qubittron. I’m an expert in end-to-end ML lifecycle management, GPU-optimised infrastructure, LLM/GenAI serving, and production model monitoring.
At Qubittron, I designed and led a central ML platform used by 10+ data science teams, taking projects from experiment tracking to live monitoring. I built GPU-optimised Kubernetes clusters on AWS EKS, cutting GPU idle time by 40%, and automated Continuous Training with Kubeflow and MLflow—reducing model-to-production time from 3 weeks to under 6 hours.
I also engineered high-throughput real-time serving with KServe and Seldon Core (10M+ predictions/day at p99 latency under 80ms) and delivered an LLM inference layer with vLLM and HuggingFace TGI. I implemented RAG and vector search pipelines (FAISS/Pinecone, LangChain), feature stores (Feast), and drift detection with Evidently AI and Kafka—triggering retraining and reducing degradation incidents by 75%, while embedding security and governance practices like RBAC, Vault-based secrets management, and SOC2-aligned audit logging.
Experience
Work history, roles, and key accomplishments
MLOps Engineer
Qubittron Consulting Inc
Apr 2022 - Present (4 years)
Designed and led a central MLOps platform for 10+ data science teams, reducing model-to-production time from 3 weeks to under 6 hours. Built GPU-optimized AWS EKS clusters and real-time LLM serving (10M+ predictions/day) while cutting GPU idle time by 40% and reducing model degradation incidents by 75%.
ML & DevOps Engineer
Ericsson
Apr 2019 - Feb 2022 (2 years 10 months)
Managed telecom cloud platforms and built ML models on real client network datasets across 50+ production clusters. Trained anomaly and fault classification models achieving 88%+ precision and built an LSTM-based predictive maintenance model forecasting failures up to 48 hours in advance.
DevOps Engineer
Videri North
Jul 2017 - Jan 2019 (1 year 6 months)
Automated infrastructure provisioning with Terraform and CloudFormation, reducing deployment time by 40% and configuration errors by 25%. Built CI/CD pipelines with Jenkins and Git, deployed containerized workloads on AWS ECS with autoscaling, and improved peak traffic handling by 50%.
Collaborative Services Analyst
Invesco
Jun 2010 - Jan 2017 (6 years 7 months)
Provided core Linux and application/server administration, including RedHat Linux management and troubleshooting of BEA WebLogic on Solaris, RHEL, and Windows. Led migrations (AIX to RedHat/Solaris), administered SAN/NAS storage, and wrote Python/Shell automation to monitor logs, disk space, and services.
Systems Engineer
NESS Technologies
Jan 2005 - Mar 2010 (5 years 2 months)
Built AWS infrastructure automation using CloudFormation (VPC/subnets/NAT) and implemented security via IAM and Security Groups. Delivered continuous delivery with Bamboo/Bitbucket/Maven, integrated Chef for provisioning, and deployed monitoring with Splunk and AppDynamics for microservices.
Education
Degrees, certifications, and relevant coursework
Pondicherry University
MBA (MFT), MFT
Completed an MBA (MFT) at Pondicherry University in 2004.
Acharya Nagarjuna University
Master of Science (M.Sc.), Organic Chemistry
Completed an M.Sc. in Organic Chemistry at Acharya Nagarjuna University in 2002.
Acharya Nagarjuna University
Bachelor of Science, B.Sc
Completed a B.Sc. at Acharya Nagarjuna University in 1999.
Tech stack
Software and tools used professionally
Splunk
Apache Spark
GitHub
GitLab
Bitbucket
Kubernetes
Akamai
Jenkins
GitHub Actions
GitLab CI
dbt
MySQL
PostgreSQL
MongoDB
Cassandra
Gmail
Redis
Terraform
AWS CloudFormation
Pulumi
Jira
Python
XML
TensorFlow
PyTorch
MLflow
scikit-learn
Kubeflow
Kafka
FastAPI
Grafana
Prometheus
OpenTelemetry
OpenStack
etcd
Linux
Windows
Datadog
AppDynamics
Elasticsearch
Ansible
Serverless
Docker
NGINX
Airflow
containerd
s3-lambda
XGBoost
Hugging Face
LightGBM
Seldon
LangChain
LlamaIndex
Weaviate
Weights & Biases
Evidently AI
BentoML
Pinecone
WhyLabs
Feast
Ray
KServe
Delta Lake
vLLM
Great Expectations
Tekton
ArgoCD
ONNX Runtime
Bash
Faiss
Dynamic
Metaflow
Kustomize
Objective
CoreDNS
Jan
X++
Availability
Location
Authorized to work in
Salary expectations
Job categories
Skills
Interested in hiring Kumar?
You can contact Kumar and 90k+ other talented remote workers on Himalayas.
Message KumarFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
