Benjamin Sprayberry
@benjaminsprayberry
Senior AI Engineer building low-latency, scalable AWS inference and LLM automation.
What I'm looking for
I’m a Senior AI Engineer focused on building cloud-native AI platforms that scale in production. I’ve architected large-scale inference systems on AWS, delivering measurable performance gains like reducing latency by 40% across high-volume workloads.
I design distributed microservices and streaming pipelines using Python and Kubernetes to support real-time, high-throughput applications. At Amazon, I’ve implemented fault-tolerant architectures and observability with Prometheus and Grafana, reducing downtime by 22% and improving throughput by 18% while keeping service reliability strong under peak traffic.
I also integrate LLMs and ML models into enterprise APIs—improving workflow automation accuracy by 30% and boosting internal document processing efficiency by 35% using OpenAI APIs, vector databases, and RAG-style pipelines.
Previously, I delivered AI/ML products in real-world environments: building computer vision models with PyTorch and OpenCV (27% detection accuracy lift), deploying RESTful FastAPI services for over 2M monthly requests, and strengthening data processing with Spark and Airflow (40% throughput). I bring this full-stack perspective—backend and front-end where needed—along with MLOps and DevOps discipline (Terraform, GitHub Actions, Kubernetes) to ship faster, deploy more consistently, and mentor engineers through architecture reviews.
Experience
Work history, roles, and key accomplishments
Architected large-scale AI inference platforms on AWS, processing over 15M daily transactions with consistent low-latency performance. Reduced infrastructure costs by 28% and cut inference latency by 42% by optimizing model serving, microservices, and resource allocation.
Developed computer vision and NLP models for real-time surveillance and incident reporting, improving object detection accuracy by 27% and classification accuracy by 32%. Built and operationalized scalable AWS ML pipelines and low-latency APIs serving over 2M monthly requests.
AI Full Stack Engineer
Collins Aerospace
Jan 2010 - Jun 2019 (9 years 5 months)
Built data ingestion and predictive analytics systems for aerospace telemetry, processing over 5TB of data daily and reducing equipment failure rates by 26%. Improved performance and delivery by optimizing databases and ETL workflows (reduced query latency by 35% and data availability delays by 40%).
Education
Degrees, certifications, and relevant coursework
Iowa State University
Master of Engineering, Systems Engineering
2012 - 2014
Master of Engineering in Systems Engineering at Iowa State University from 2012 to 2014.
University of Mississippi
Bachelor of Science, Electrical Engineering
2007 - 2009
Bachelor of Science in Electrical Engineering at the University of Mississippi from 2007 to 2009.
Tech stack
Software and tools used professionally
Apache Spark
Apache Flink
GitHub
GitLab
Kubernetes
Docker Swarm
Jenkins
CircleCI
GitHub Actions
GitLab CI
React Native
NumPy
Pandas
PySpark
MySQL
PostgreSQL
MongoDB
Hadoop
Node.js
Django
Laravel
Spring Boot
.NET
Tailwind CSS
Material-UI
Databricks
OpenCV
Redis
Terraform
Azure DevOps
Jira
JFrog Artifactory
Vue.js
JavaScript
Java
Kotlin
TensorFlow
PyTorch
MLflow
scikit-learn
Kafka
RabbitMQ
FastAPI
Grafana
Prometheus
Nagios
Datadog
jFrog
GraphQL
gRPC
Spring Security
Ansible
AWS Lambda
OAuth2
sso
Airflow
Time Analytics
SQL
IBM Cloud
XGBoost
SciPy
LightGBM
LangChain
AutoGen
CrewAI
Delta Lake
Scale AI
Faiss
Stack AI
Remote
Jan
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Benjamin?
You can contact Benjamin and 90k+ other talented remote workers on Himalayas.
Message BenjaminFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
