Nicholas Keenan
@nicholaskeenan
Senior AI/ML engineer building production LLM, RAG, and low-latency voice systems.
What I'm looking for
I’m a senior AI/ML engineer with 9+ years building production machine learning and LLM-powered systems across finance, insurance, and enterprise SaaS. I deliver end-to-end AI solutions—data pipelines, model training and fine-tuning, RAG architectures, and real-time inference—while keeping performance, reliability, and compliance at the core.
I’ve deployed large-scale conversational AI with scalable, observable infrastructure using Python, FastAPI, Django, Docker, Kubernetes, and cloud platforms like GCP and AWS. From MCP-based LLM-agent integrations and voice AI pipelines to evaluation and latency/accuracy trade-off monitoring, I translate machine learning capabilities into real-world products through strong architecture, automation, and engineering discipline.
Experience
Work history, roles, and key accomplishments
Senior Software Engineer / Technical Lead
Gail
Oct 2023 - Present (2 years 6 months)
Fine-tuned Llama 3.3 with LoRA on Vertex AI and built end-to-end RAG pipelines using LlamaIndex, Pinecone embeddings, and retrieval evaluation for financial Q&A and document intelligence. Deployed low-latency inference services with FastAPI and Kubernetes and implemented agentic integrations via an MCP layer plus voice AI pipelines.
Senior Software Engineer / Technical Lead
Lula Technologies, Inc.
Oct 2021 - Oct 2023 (2 years)
Built API-driven insurance workflows and a driver-risk decisioning ML pipeline using FastAPI, scikit-learn, and XGBoost with real-time inference on GKE and Pub/Sub. Shipped AI-assisted claims triage and claims/policy services, improved multilingual classification accuracy by 10%, and added observability with OpenTelemetry to keep p99 latency under 200ms and errors below 0.01%.
Senior Software Engineer
Taxfyle
May 2019 - Oct 2021 (2 years 5 months)
Developed C#/.NET backend services and APIs for firm onboarding and operational workflows using PostgreSQL-backed transactional systems. Built ML-powered document automation with PyTorch and spaCy to extract tax data and cut manual effort by accelerating return processing by 40%.
Software Engineer Internship
Taxfyle
May 2017 - May 2019 (2 years)
Engineered iOS/Android mobile features using C#/Xamarin and React Native and supported QA with Jest and test workflows across 30+ app features. Built Python internal tools and automation scripts to clean, reconcile, and support scalable mobile app operations.
Education
Degrees, certifications, and relevant coursework
University of Pennsylvania
Bachelor's Degree, Computer Science
2015 - 2019
Earned a bachelor's degree in computer science at the University of Pennsylvania from 2015 to 2019.
Tech stack
Software and tools used professionally
AWS Amplify
Google Cloud Platform
GitHub
Kubernetes
Jenkins
GitHub Actions
React Native
Xamarin
Jupyter
NumPy
Pandas
PostgreSQL
MongoDB
Node.js
Django
.NET Core
Next.js
.NET
Terraform
React
JavaScript
Python
F#
TensorFlow
PyTorch
MLflow
scikit-learn
Kafka
RabbitMQ
FastAPI
OpenTelemetry
iOS
GraphQL
Google Cloud Pub/Sub
gRPC
Elasticsearch
AWS Lambda
Deepgram
Vercel
TypeScript
Docker
Twilio
Zapier
SQL
XGBoost
SciPy
Hugging Face
Supabase
LangChain
LlamaIndex
ChromaDB
Pydantic
Pinecone
ElevenLabs
ONNX Runtime
Agentic
Faiss
LangGraph
PEFT
Dynamic
Remote
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Nicholas?
You can contact Nicholas and 90k+ other talented remote workers on Himalayas.
Message NicholasFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
