Skip to main content
AS
Open to opportunities

Anuska Shrestha

@anuskashrestha1

Data Engineer/Data Analyst with 5+ years building lakehouse platforms and real-time insights for financial and manufacturing teams.

United States
Message

What I'm looking for

I want to build secure, governed lakehouse and analytics platforms—mixing batch and real-time pipelines, automated validation, and CI/CD—while partnering with stakeholders in regulated domains to deliver trustworthy insights that scale.

I’m a Data Engineer / Data Analyst with 5+ years of experience building scalable data platforms and delivering actionable insights across financial and manufacturing domains. I focus on translating complex business requirements into high-performance, governed data solutions that teams can trust.

In my most recent role, I designed and implemented a domain-oriented lakehouse architecture consolidating trading, liquidity, and risk datasets into governed analytical layers using Databricks and Snowflake. I built event-driven ingestion pipelines with Kafka and Spark Structured Streaming, then developed advanced analytical datasets with SQL and PySpark to support P&L analysis, capital adequacy modeling, and stress testing.

I partner directly with risk, treasury, and analytics stakeholders to turn regulatory and trading requirements into scalable data transformations and curated reporting tables. I also run deep data profiling and root-cause analysis to identify upstream gaps, and I strengthen reliability with automated data validation using dbt tests and custom Python checks.

I bring strong cloud and engineering discipline to analytics delivery—CI/CD and Infrastructure-as-Code with Terraform and GitHub Actions, performance tuning across Databricks clusters, and security-by-design with IAM/RBAC, encryption, and data masking. I also leverage Generative AI tools like GitHub Copilot, AWS Bedrock, and LangChain to accelerate transformation development, documentation, and data exploration.

Experience

Work history, roles, and key accomplishments

Citi logoCI
Current

Data Engineer / Data Analyst

Oct 2023 - Present (2 years 8 months)

Designed and implemented a domain-oriented lakehouse on Databricks and Snowflake to consolidate trading, liquidity, and risk datasets into governed analytical layers. Built event-driven Kafka/Spark streaming ingestion, advanced SQL/PySpark datasets, automated dbt-based validation, and BI dashboards for risk and treasury reporting.

First Citizens Bank logoFB

Data Engineer

Feb 2021 - Sep 2023 (2 years 7 months)

Engineered secure, enterprise data integration pipelines consolidating core banking, mortgage, and treasury data into a centralized cloud analytics platform. Developed PySpark/SQL transformations, Kafka ingestion for near real-time transactions, Spark jobs on EMR for portfolio reporting, and implemented RBAC, encryption, and data masking with AWS IAM and Snowflake.

General Motors logoGM

Data Engineer

Nov 2020 - Jan 2021 (2 months)

Built ETL pipelines ingesting manufacturing and supply chain datasets into centralized reporting systems using Python and SQL. Developed batch and near real-time processing with Spark/Hadoop on AWS, designed dimensional models for operational dashboards, and implemented incremental loading (CDC) plus validation/reconciliation and monitoring via CloudWatch/EMR logs.

Education

Degrees, certifications, and relevant coursework

Southeast Missouri State University logoSU

Southeast Missouri State University

Bachelor's in Computer Science, Computer Science

Completed a Bachelor's in Computer Science at Southeast Missouri State University.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan