Ashutosh Khanal
@ashutoshkhanal
Senior data engineer building secure cloud lakehouse platforms for enterprise analytics.
What I'm looking for
I’m an accomplished Senior Data Engineer with 7+ years architecting scalable, secure, high-performance data platforms across healthcare, finance, insurance, and retail. I specialize in modern cloud native lakehouse architectures that enable enterprise-wide transformation, advanced analytics, and regulatory reporting.
In my recent work on an enterprise healthcare modernization team, I led a payer-centric lakehouse on Azure Data Lake Gen2, Databricks, and Snowflake to unify claims, eligibility, and clinical quality data. I re-architected ingestion into a standardized Bronze/Silver/Gold Delta Lake framework, using Databricks Autoloader, incremental CDC, and dbt—stabilizing daily processing of 120M+ claim records and reducing data freshness delays by 38%.
I’m driven by governance and reliability: I enforce data lineage, security controls, and compliance standards aligned with HIPAA and GDP R requirements through RBAC, Managed Identities, and Azure Key Vault. I also focus on performance and cost—optimizing PySpark transformations (41% faster, 26% lower compute costs) and improving Snowflake spend (32% reduction), while mentoring teams and building observability and CI/CD automation with Terraform and Azure DevOps.
Experience
Work history, roles, and key accomplishments
Architected a scalable payer lakehouse on Azure Data Lake Gen2, Databricks, and Snowflake, reworking ingestion into a Bronze/Silver/Gold Delta Lake framework that stabilized daily processing of 120M+ claim records and reduced data freshness delays by 38%. Reduced recurring claims reporting pipeline failures by 45% by redesigning Azure Data Factory orchestration with dependency-aware triggers and d
Designed event-driven ingestion with Kinesis and Lambda, replacing batch-heavy workflows and reducing data latency for fraud detection feeds by 48%. Modernized legacy SSIS workflows into AWS Glue (PySpark) with incremental processing, cutting nightly batch processing time by 43%.
Developed and maintained large-scale retail ETL workflows using Informatica PowerCenter and SSIS, enabling centralized transformations for high-volume POS sales, inventory, and promotions. Improved ETL load performance by 35% through Teradata indexing/partitioning and reduced pipeline failures by 28% after re-architecting legacy SSIS batch workflows.
Built production-grade Python backend services with Django/Flask and performance-aware API design, reducing API latency by 38% through relational database and query optimizations. Implemented asynchronous processing (Celery/RQ with RabbitMQ/Kafka) and improved reliability by reducing transaction failures by 25% using OAuth2/JWT with idempotency and retry logic.
Education
Degrees, certifications, and relevant coursework
University of North Texas
Master’s in Information Systems and Technologies, Information Systems and Technologies
Earned a Master’s in Information Systems and Technologies from the University of North Texas in Texas.
Tech stack
Software and tools used professionally
OpenAPI
Amazon Redshift
Splunk
Azure Synapse
AWS Glue
Amazon Redshift Spectrum
Amazon Quicksight
Amazon S3
Google Cloud Storage
AWS Step Functions
GitHub
AWS CodePipeline
Jenkins
GitHub Actions
PySpark
dbt
DB
MySQL
PostgreSQL
MongoDB
Gmail
Django
Databricks
Redis
Terraform
Azure DevOps
JSON
XML
Kafka
RabbitMQ
FastAPI
Grafana
Prometheus
Azure Monitor
Linux
GraphQL
Azure Functions
Amazon RDS
Azure SQL Database
pytest
OAuth2
Airflow
Root Cause
SQL
Azure Blob Storage
Delta Lake
Great Expectations
Black
Bash
Dynamic
Unity Catalog
Task
Factory
Jan
Unify
Movement
Microsoft Purview
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Ashutosh?
You can contact Ashutosh and 90k+ other talented remote workers on Himalayas.
Message AshutoshFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
