Open to opportunities

Alejandro Palacios

@alejandropalacios

Senior AI engineer specializing in high-performance LLM and multimodal inference optimization.

United States

What I'm looking for

I seek roles focused on scalable, efficient LLM/VLM inference and multimodal systems where I can optimize performance, collaborate with hardware/cloud teams, and deliver measurable cost and latency improvements.

I am a senior AI engineer with about 10 years building production ML systems and high-performance inference at scale, including roles at PayPal, OpenAI, and NVIDIA. I specialize in PyTorch, TensorRT-LLM, quantization, speculative decoding, and GPU acceleration, and have consistently delivered 20–45% performance and cost improvements.

At NVIDIA I built TensorRT-LLM AutoDeploy features and optimized long-context and vision-language workloads; at OpenAI I scaled GPT pipelines and production inference for ChatGPT; at PayPal I developed real-time fraud ML systems. I collaborate across hardware, cloud, and product teams to drive measurable throughput, latency, and safety wins.

Experience

Work history, roles, and key accomplishments

NVIDIA
Current

Senior Software Engineer

Feb 2024 - Present (2 years 1 month)

Built TensorRT-LLM AutoDeploy features and optimizations that cut manual build time from days to hours and delivered 3–5× higher throughput on long-context, decode-heavy workloads. Enabled 30–40% lower latency for vision+text tasks and validated stacks on next-gen GPUs, yielding 2–4× production throughput gains.

Senior Software Engineer

OpenAI

Apr 2019 - Feb 2024 (4 years 10 months)

Scaled end-to-end GPT pipelines and high-throughput inference engines that supported the ChatGPT launch and reduced serving costs by ~40% through intelligent batching and runtime optimizations. Contributed multimodal prototypes that improved image+text reasoning by ~22%, and built safety/alignment layers that reduced harmful outputs by ~30%.

Education

Degrees, certifications, and relevant coursework

Florida International University - College of Engineering and Computing

Bachelor of Science, Computer Science

2014 - 2017

Completed a Bachelor of Science in Computer Science with coursework in software engineering, systems, and AI-related topics.
