Open to opportunities

Tariq User

@tariqfq

Lead Data Engineer building scalable, secure lakehouse and real-time data platforms.

United States

What I'm looking for

I seek a senior/platform role building secure, observable, multi-cloud data platforms with strong CI/CD, mentorship opportunities, and clear SLAs.

I am a Lead Data Engineer with 10+ years designing and managing scalable data platforms across healthtech, retail, fintech, and SaaS, focused on transforming complex business needs into high-performing, well-documented solutions.

I build streaming and batch systems using Databricks, Delta, Iceberg, Kafka, Flink, and Spark, and standardize ELT with dbt and Airflow to accelerate analytics delivery while optimizing compute and storage.

I prioritize security, compliance (HIPAA, GDPR, SOC 2), observability, and CI/CD automation with Terraform and GitHub Actions. I also mentor engineers and deliver measurable business outcomes: reduced MTTR, cost savings, and faster onboarding.

Experience

Work history, roles, and key accomplishments

Current

Lead Data Engineer

Truveta

May 2022 - Present (4 years 2 months)

Architected and maintained a HIPAA-compliant Databricks lakehouse processing 2.3M patient records daily with 99.9% uptime, reduced cloud costs by $1.2M annually, and enabled real-time clinical decision support across 150+ hospital networks.

Databricks, Delta Lake, Spark Structured Streaming, HIPAA, Data Governance, Kafka, Great Expectations, MLflow

Current

Lead Data Engineer

Truveta, Inc

May 2022 - Present (4 years 2 months)

Led design and operation of large-scale, HIPAA-compliant lakehouse platforms on Databricks and Delta Lake across AWS/Azure/GCP; built batch and streaming CDC pipelines that enabled near-real-time analytics and supported ML feature workflows.

Databricks, Apache Spark, Delta Lake, Spark Structured Streaming, Airflow, dbt, AWS, GCP, HIPAA, Kafka

Current

Lead Data Engineer

Vodworks

Feb 2022 - Present (4 years 5 months)

Built a multi-cloud lakehouse and real-time pipelines for clinical and claims data, reducing time-to-insight by 40% and achieving 99%+ pipeline uptime while ensuring HIPAA compliance.

Databricks, Snowflake, Kafka, dbt, Airflow, Delta Lake, Terraform, OpenTelemetry, Apache Flink

Senior Data Engineer

Starschema

Jan 2017 - Apr 2022 (5 years 3 months)

Contributed to scaling cloud analytics platforms using Spark, Snowflake, Redshift, and BigQuery; optimized ETL pipelines and warehouses to improve query performance, observability, and cost efficiency for BI consumers.

Apache Spark, Snowflake, BigQuery, Redshift, Python, SQL, Airflow, dbt, Data Modeling, Observability

Senior Data Engineer

Chewy

Jul 2017 - Dec 2021 (4 years 5 months)

Scaled ingestion and streaming pipelines across Databricks, Glue, and EMR, reducing failure rates by over 50% and cutting costs by ~18%; built RAG-ready vector datasets for semantic search.

Databricks, AWS Glue, EMR, Kafka, Pinecone, Faiss, Airflow, Terraform, Data Lineage

Data Engineer

Anblicks

Jan 2014 - Jun 2017 (3 years 5 months)

Consolidated APIs and databases into governed data lakes/warehouses, improving dashboard performance by over 50%; implemented GDPR/CCPA controls and reusable Spark ETL frameworks.

Apache Spark, Snowflake, Redshift, Synapse, GDPR, ETL, SQL Optimization, Monitoring, Data Governance

Data Engineer

United Techno

Jan 2014 - Dec 2016 (2 years 11 months)

Built ingestion workflows and a centralized Data Vault 2.0 warehouse using the Hadoop ecosystem and relational databases; automated infrastructure with Terraform and delivered Power BI reporting to stakeholders.

Hadoop, HDFS, Hive, PostgreSQL, MySQL, Vault, Terraform, ETL, Power BI, SQL

Data Engineer

UnitedTechno

Jan 2014 - Dec 2016 (2 years 11 months)

Built a centralized Data Vault 2.0 warehouse and Hadoop-based pipelines processing multi-terabyte workloads, automated IaC with Terraform (reducing environment setup from 3 days to 2 hours), and implemented event-driven pipelines capturing 2M+ events daily.

Hadoop, Hive, Terraform, PostgreSQL, MySQL, ETL, Monitoring, Power BI