Alejandro palacios
@alejandropalacios
Senior AI engineer specializing in high-performance LLM and multimodal inference optimization.
What I'm looking for
I am a senior AI engineer with about 10 years building production ML systems and high-performance inference at scale, including roles at PayPal, OpenAI, and NVIDIA. I specialize in PyTorch, TensorRT-LLM, quantization, speculative decoding, and GPU acceleration, and have consistently delivered 20–45% performance and cost improvements.
At NVIDIA I built TensorRT-LLM AutoDeploy features and optimized long-context and vision-language workloads; at OpenAI I scaled GPT pipelines and production inference for ChatGPT; at PayPal I developed real-time fraud ML systems. I collaborate across hardware, cloud, and product teams to drive measurable throughput, latency, and safety wins.
Experience
Work history, roles, and key accomplishments
Built TensorRT-LLM AutoDeploy features and optimizations that reduced manual build time from days to hours and delivered 3–5× higher throughput on long-context decode-heavy workloads; enabled 30–40% lower latency for vision+text tasks and validated stacks on next-gen GPUs yielding 2–4× production throughput gains.
Senior Software Engineer
OpenAI
Apr 2019 - Feb 2024 (4 years 10 months)
Scaled end-to-end GPT pipelines and high-throughput inference engines that enabled ChatGPT launch and reduced serving costs ~40% via intelligent batching and runtime optimizations; contributed multimodal prototypes improving image+text reasoning ~22% and built safety/alignment layers reducing harmful outputs ~30%.
Developed backend services and ML features for real-time payment fraud detection, improving detection accuracy ~25% using behavioral signals, device fingerprinting, and transaction-graph models while contributing to scalable Java/Spring Boot microservices on AWS.
Education
Degrees, certifications, and relevant coursework
Florida International University - College of Engineering and Computing
Bachelor of Science, Computer Science
2014 - 2017
Completed a Bachelor of Science in Computer Science with coursework in software engineering, systems, and AI-related topics.
Tech stack
Software and tools used professionally
Postman
OpenAPI
D3.js
Chart.js
GitHub
ESLint
Prettier
Kubernetes
Jenkins
CircleCI
React Native
DB
MySQL
PostgreSQL
MongoDB
SQLite
MariaDB
Memcached
Gmail
LaunchDarkly
InVision
Node.js
Django
Laravel
Spring Boot
Next.js
.NET
Tailwind CSS
Nuxt.js
Material-UI
Figma
Zeplin
Okta
Redis
Jira
Ant Design
Mocha
SuperTest
Vue.js
jQuery
Svelte
React Query
Webpack
JavaScript
HTML5
Java
ES6
PHP
Kotlin
ASP.NET
TensorFlow
PyTorch
Google Maps
Mapbox
Kafka
RabbitMQ
FastAPI
Grafana
Prometheus
PayPal
Datadog
Apollo
Trello
ClickUp
GraphQL
gRPC
WordPress
Serverless
Zustand
monday.com
pytest
React Testing Library
SendGrid
Auth0
OAuth2
sso
Twilio
WebRTC
NGINX
Balsamiq
CUDA
Hugging Face
LangChain
Nx
Playwright
Pinecone
Caddy
Apigee
ArgoCD
Vite
Vitest
Framer Motion
CoreWeave
Agentic
React Hook Form
Joomla
ChatGPT
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Alejandro?
You can contact Alejandro and 90k+ other talented remote workers on Himalayas.
Message AlejandroFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
