A remote role on a team building real LLM products, agents, RAG, document/voice AI, or AI infra, where engineers own systems end-to-end, not ticket-by-ticket. Hand me aproblem and the outcome, and I'll own the hard half: reliability, cost, guardrails. I'm fully remote with complete EU overlap (plus solid US hours), so timezone is a non-issue. Long-term: making AI dependable in production.
Mahdi Bani
@mahdibani
AI engineer (backend roots) building production LLM systems — agents, RAG, document AI. I own the reliability, cost & guardrails that keep AI honest.
What I'm looking for
I'm an AI engineer with a backend foundation. I build and operate production LLM systems end-to-end — document AI, LLM agents, RAG, and automation, and I focus on the part most people skip: making AI dependable in production. Reliability, cost, guardrails, testing, observability, the unglamorous work that decides whether an AI feature survives contact with real users.
Most of my experience is ownership at the deep end. I solo-built and run a multi-tenant AI platform as the entire engineering function for a non-technical company: LLM agents on WhatsApp and inbound voice (Claude tool-use, Deepgram, ElevenLabs) with deterministic guardrails, two-way Microsoft 365 sync, real-time WebSockets, and a Dockerized FastAPI + Celery backend — 20 API services and 18 data models across the stack. In parallel I built and own a production document-AI pipeline (54 document types, GPT-4o vision + DSPy) where I cut LLM inference cost 5.7× while holding 88% F1, backed by 438+ automated tests in CI and a cost-tracking dashboard that made real AI spend observable.
I'm also the person who keeps AI running when the ground shifts: I've contained a DSPy major-version break, a litellm supply-chain compromise (in under an hour), and breaking voice/Meta API migrations before any of it reached users. Earlier I built a crypto-exchange end-to-end (real-time pricing, KYC, fiat-to-crypto), fine-tuned LLMs for client work, and contributed real open source — including 2 merged PRs to the Bittensor core SDK, one fixing a 2.4 GB memory leak.
Stack: Python, FastAPI, SQLAlchemy, PostgreSQL, Redis, Celery · React, Next.js, TypeScript · DSPy, LLM agents & tool-use, RAG & embeddings, fine-tuning · Claude, GPT-4o, Llama/Gemma (local) · Docker, CI/CD, Azure, multi-tenant RBAC · Deepgram, ElevenLabs, Pipecat (voice).
Experience
Work history, roles, and key accomplishments
Full-Stack & AI Engineer
Multi-Tenant SaaS Platform
Jan 2026 - Present (5 months)
Sole architect and operator of a production multi-tenant platform replacing 7+ manual tools. Built 20 API services and 18 data models with row-level multi-tenancy, React/Next.js UI, and WhatsApp/voice LLM agents with deterministic guardrails for 24/7 lead qualification and booking.
AI & Backend Engineer
Production Document-AI Platform
Jan 2025 - Present (1 year 5 months)
Built and owned a production document-AI pipeline supporting 54 document types with GPT-4o vision and DSPy on multi-tenant Azure with RBAC. Re-architected the pipeline to cut LLM inference cost 5.7x per document while holding ~2% accuracy loss (88% F1), with 438+ CI tests and real-time updates via PostgreSQL LISTEN/NOTIFY.
Freelance Full-Stack Engineer
Freelance
Built a crypto-exchange platform end-to-end, including real-time pricing, KYC onboarding, and fiat-to-crypto transaction flows with uptime and correctness as non-negotiable requirements.
Education
Degrees, certifications, and relevant coursework
Not specified
Software Engineering Program, Software Engineering
2022 - 2024
Completed a Software Engineering Program from 2022 to 2024.
Not specified
Baccalaureate, Computer Science
2021 -
Earned a Baccalaureate in Computer Science in 2021.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Website
mahdi-bani.vercel.appSalary expectations
Interested in hiring Mahdi?
You can contact Mahdi and 90k+ other talented remote workers on Himalayas.
Message MahdiFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
