Huzz Sindhu
@huzzsindhu
Staff Data Engineer specializing in multi-cloud lakehouses, real-time streaming, and AI-ready data platforms that cut costs and speed insights.
What I'm looking for
I’m a Staff Data Engineer with 11 years of experience designing and scaling enterprise data platforms across fintech, healthcare, SaaS, supply chain, and retail. I own end-to-end platform strategy, from architecture and data governance through to team mentorship and stakeholder alignment, and I'm comfortable driving technical decisions solo or leading cross-functional engineering teams in fast-moving environments.
I build lakehouse and streaming systems that deliver measurable impact: multi-cloud medallion architectures for real-time financial analytics, CDC + Kafka + Spark Structured Streaming pipelines with sub-second latency, and dbt semantic/metric layers that unlock self-service analytics at scale. I also enforce compliance and reliability with Unity Catalog RBAC, PII masking, lineage/observability, automated quality checks (e.g., Great Expectations), and SLA monitoring. Results include cutting cloud costs by 30%, reducing mean time to detect by 60%, and shortening ML training cycle time by 40% through AI-ready feature engineering and RAG-ready data layers.
Experience
Work history, roles, and key accomplishments
Staff Data Engineer
Lateetud
Sep 2023 - Present (2 years 7 months)
Architected a multi-cloud lakehouse on AWS and Azure for real-time financial analytics, delivering CDC + Kafka + Spark Structured Streaming pipelines with sub-second latency. Reduced cloud costs by 30%, reduced pipeline MTTR by 60%, and enforced SOC 2 and GDPR compliance using Unity Catalog RBAC, PII masking, and automated data quality checks.
Senior Data Engineer
Analytics8
Jul 2021 - Aug 2023 (2 years 1 month)
Designed a HIPAA-compliant lakehouse on Azure using ADLS Gen2, Databricks, and Snowflake, building batch/incremental/CDC pipelines into a medallion architecture. Implemented SCD Type 2 modeling and automated data quality frameworks, and delivered dbt semantic/metric layers for Power BI while reducing ML training cycle time by 40%.
Data Engineer
Creole Studios
Jun 2018 - Jun 2021 (3 years)
Built cloud-native data pipelines on AWS using S3, Redshift, and Airflow to ingest application events, clickstream, and SaaS data at scale. Developed near real-time Kafka + Spark Structured Streaming pipelines for live product analytics, implemented dbt analytics engineering, and supported churn/LTV feature pipelines while reducing SLA breaches through more reliable ETL workflows.
Associate Data Engineer
BigChalk
Aug 2015 - May 2018 (2 years 9 months)
Built ETL pipelines ingesting ERP, POS, and logistics data into a centralized warehouse for supply chain and retail analytics. Designed dimensional star-schema models, contributed to cloud migration, and implemented automated validation/reconciliation to reduce manual correction effort and improve operational reporting.
Education
Degrees, certifications, and relevant coursework
Huzz hasn't added their education
Tech stack
Software and tools used professionally
Amazon Redshift
Airbyte
Fivetran
Azure Synapse
AWS Glue
Apache Flink
Talend
Superset
Metabase
GitHub
GitLab
Kubernetes
Jenkins
GitHub Actions
GitLab CI
Salesforce
Jupyter
NumPy
Pandas
PySpark
Debezium
dbt
MySQL
PostgreSQL
MongoDB
Cassandra
HBase
Gmail
Databricks
Neo4j
Redis
Terraform
Jira
MLflow
Kafka
Apache NiFi
Elasticsearch
Ansible
Airflow
SQL
Dagster
LangChain
Weaviate
Pinecone
Hex
Monte Carlo
Soda
Delta Lake
OpenAI API
Great Expectations
Apache Hudi
dbt Cloud
Bash
Microsoft Fabric
Column
Unity Catalog
Factory
Safe
