Skip to main content
Siddhant KarkiSK
Open to opportunities

Siddhant Karki

@siddhantkarki

Production-focused AI Engineer building end-to-end systems for RAG, evals, and deployment.

Australia
Message

What I'm looking for

I’m looking to build production AI systems end to end—APIs, evals, and deployment included—where I can work hands-on below abstractions with real constraints, ship measurable improvements, and own reliability, cost, and quality.

I’m an AI Engineer who builds production systems end to end—from data pipelines and model training to APIs, evals, and deployment. I work below the abstraction layer when needed, with hands-on depth in tokenizer internals, model adaptation, distributed processing, and retrieval augmented generation (RAG). I turn ambiguous AI problems into shipped systems, from voice pipelines and analytics to knowledge infrastructure and language model research.

In my current role at CallD.AI, I own production systems across live-call infrastructure and AI post-call analysis, from ingestion through storage, processing, APIs, and dashboards. I redesigned speaker re-labeling with turn-index maps to cut LLM token usage by 75% and improve processing speed 3x, then reduced cost further via an OpenAI-to-Bedrock migration and Nova Micro prompt caching. I also enforced schema-validated structured outputs with Pydantic to eliminate malformed responses, improved distributed reliability using PostgreSQL row locking with concurrent workers and jittered retries, and built an analytics platform with materialized-view-backed reporting and performance-focused query/index work. My project work extends this mindset—hybrid retrieval with metadata-first MCP toolchains, fully local ONNX inference, automated wiki synthesis with verification status, tokenizer cost remediation, and fine-tuned ASR with cross-domain evaluation.

Experience

Work history, roles, and key accomplishments

CallD.AI logoCA
Current

AI Engineer

CallD.AI

Mar 2025 - Present (1 year 3 months)

Owned two generations of AI post-call analysis pipelines for Twilio, LiveKit, and Asterisk, processing calls end-to-end via S3 → Deepgram STT → LLM analysis → PostgreSQL. Cut LLM token usage 75% and improved processing speed 3x by redesigning speaker relabeling and migrating OpenAI prompts to Bedrock with Nova prompt caching; built dashboard APIs and hardened production infrastructure.

Education

Degrees, certifications, and relevant coursework

Victoria University logoVU

Victoria University

Bachelor of Information Technology, Information Technology

2023 - 2025

Grade: 6.46/7.0

Completed a Bachelor of Information Technology at Victoria University (2023–2025) with concentrations in Software Development and Networking Technologies, achieving a GPA of 6.46/7.0.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan