Siddhant Karki
@siddhantkarki
Production-focused AI Engineer building end-to-end systems for RAG, evals, and deployment.
What I'm looking for
I’m an AI Engineer who builds production systems end to end—from data pipelines and model training to APIs, evals, and deployment. I work below the abstraction layer when needed, with hands-on depth in tokenizer internals, model adaptation, distributed processing, and retrieval augmented generation (RAG). I turn ambiguous AI problems into shipped systems, from voice pipelines and analytics to knowledge infrastructure and language model research.
In my current role at CallD.AI, I own production systems across live-call infrastructure and AI post-call analysis, from ingestion through storage, processing, APIs, and dashboards. I redesigned speaker re-labeling with turn-index maps to cut LLM token usage by 75% and improve processing speed 3x, then reduced cost further via an OpenAI-to-Bedrock migration and Nova Micro prompt caching. I also enforced schema-validated structured outputs with Pydantic to eliminate malformed responses, improved distributed reliability using PostgreSQL row locking with concurrent workers and jittered retries, and built an analytics platform with materialized-view-backed reporting and performance-focused query/index work. My project work extends this mindset—hybrid retrieval with metadata-first MCP toolchains, fully local ONNX inference, automated wiki synthesis with verification status, tokenizer cost remediation, and fine-tuned ASR with cross-domain evaluation.
Experience
Work history, roles, and key accomplishments
AI Engineer
CallD.AI
Mar 2025 - Present (1 year 3 months)
Owned two generations of AI post-call analysis pipelines for Twilio, LiveKit, and Asterisk, processing calls end-to-end via S3 → Deepgram STT → LLM analysis → PostgreSQL. Cut LLM token usage 75% and improved processing speed 3x by redesigning speaker relabeling and migrating OpenAI prompts to Bedrock with Nova prompt caching; built dashboard APIs and hardened production infrastructure.
Education
Degrees, certifications, and relevant coursework
Victoria University
Bachelor of Information Technology, Information Technology
2023 - 2025
Grade: 6.46/7.0
Completed a Bachelor of Information Technology at Victoria University (2023–2025) with concentrations in Software Development and Networking Technologies, achieving a GPA of 6.46/7.0.
Availability
Location
Authorized to work in
Website
siddhantskarki.comPortfolio
github.com/sidskarkii/LoreJob categories
Skills
Interested in hiring Siddhant?
You can contact Siddhant and 90k+ other talented remote workers on Himalayas.
Message SiddhantFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
