Himalayas logo
Julian ThomasJT
Open to opportunities

Julian Thomas

@julianthomas1

Senior MLOps & AI/ML engineer building scalable, production-ready AI systems.

United States
Message

What I'm looking for

I seek senior roles where I can lead MLOps/ML engineering to productionize LLMs, improve model reliability, and scale AI systems within collaborative, growth-focused teams.

I am a Senior MLOps and AI/ML engineer with 9+ years building scalable, AI-driven web applications and production machine learning systems. I specialize in accelerating model deployment and operationalizing LLMs using modern cloud and MLOps tooling.

I've delivered measurable impact by reducing deployment times, improving inference latency, and increasing data processing capacity across enterprise environments. My work spans end-to-end pipelines, from feature engineering and distributed data processing to model serving and observability.

Technically, I build containerized microservices and APIs with Python, Node.js, Flask, and FastAPI, and I deploy and manage cloud infrastructure using AWS, Azure, Kubernetes, Docker, Terraform, and CI/CD automation. I pair these capabilities with TensorFlow, PyTorch, Hugging Face, Spark, and Kafka to support real-time and large-scale ML workloads.

I take pride in designing resilient, monitored systems that maintain high availability and scale—delivering production LLM services, integrated frontends with React/Next.js, and robust MLOps practices that help teams ship AI features faster and reliably.

Experience

Work history, roles, and key accomplishments

NT
Current

AI & React Developer

Ntiva

Dec 2024 - Present (1 year)

Engineered and deployed GPT-4 and Azure OpenAI services to accelerate LLM application deployment by 40%, designed REST/GraphQL APIs and containerized microservices to improve scalability and frontend integration.

CO

Machine Learning Engineer

Codoxo

Feb 2019 - Feb 2023 (4 years)

Implemented Kafka/RabbitMQ streaming for near-real-time fraud detection at scale, deployed secure low-latency Python inference APIs, and built Spark-based pipelines to process millions of healthcare claims daily.

Education

Degrees, certifications, and relevant coursework

Siena College logoSC

Siena College

Bachelor of Science, Computer Science

Bachelor of Science in Computer Science with coursework in data structures, databases, operating systems, networks, software engineering, and machine learning; graduated May 2014.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Julian Thomas - AI & React Developer - Ntiva | Himalayas