Skip to main content
HimalayasHimalayas logo
SK
Looking for a job

Saqib Khan

@saqibk2

Lead Data Engineer | Cloud Data Platform Architect | Real-Time & AI Data Systems

United States
Message

What I'm looking for

I’m looking to lead a product-minded cloud data platform team—owning lakehouse/streaming architecture, strengthening reliability and governance, and mentoring engineers to deliver measurable business outcomes.

I’m a Lead Data Engineer with 9+ years of experience building data platforms that scale—and engineering teams that last. I own the full arc from whiteboarding multi-cloud architecture to shipping production pipelines processing 10TB+ daily, while setting org-wide engineering standards and growing engineers into senior contributors.

I specialize in modern lakehouse design (Delta Lake, Iceberg), real-time streaming (Kafka, Spark), and dbt-driven transformation across AWS and GCP. I treat data platforms as products—reliable, observable, and trusted by the teams that depend on them—especially in regulated environments (HIPAA, SOC2, fintech) where data quality directly impacts business and compliance outcomes.

In my roles, I’ve led multi-cloud lakehouse and hybrid processing architectures, using Medallion design, dbt modeling/testing, and advanced SQL to improve data accessibility and delivery efficiency. I’ve also driven DataOps transformation with CI/CD, automated testing, and observability, reducing failures and costs, while mentoring teams and partnering with stakeholders to deliver measurable outcomes.

Experience

Work history, roles, and key accomplishments

VE
Current

Lead Data Engineer

Verato

May 2024 - Present (2 years 1 month)

Led and scaled a team of 6+ data engineers, defining architecture standards and improving delivery efficiency by 30% through Python/SQL pipelines and dbt-driven ELT. Owned a multi-cloud lakehouse vision on AWS/GCP, improving data accessibility by 40% and advancing AI data products that increased data accuracy by 28%.

ST

Senior Data Engineer

Stord

Jan 2021 - Apr 2024 (3 years 3 months)

Architected and scaled a cloud-native GCP data platform supporting $10B+ annual transactions, reducing data latency by 25% with hybrid event-driven and batch processing (Kafka/Airflow). Drove dbt adoption and DataOps/CI-CD, improving release efficiency by 30% and optimizing BigQuery to reduce costs by 15% while improving query performance.

VA

Cloud Data Engineer

Vanta

Apr 2018 - Nov 2020 (2 years 7 months)

Designed and scaled ETL/ELT pipelines for security and compliance data from 400+ integrations, supporting 12,000+ customers and improving scalability by 30%. Built audit-ready “golden datasets,” improving audit readiness and data accuracy by 25%, and reduced manual audit effort by 35% with real-time monitoring and alerting.

IM

Big Data Engineer

Imply

Aug 2016 - Mar 2018 (1 year 7 months)

Contributed to real-time streaming architecture using Kafka and Druid to enable high-throughput analytics for distributed systems. Built and optimized streaming/batch pipelines in Python and SQL, increasing throughput by 25% and improving query performance and concurrency by 30%, while reducing downtime and data inconsistencies by 15%.

Education

Degrees, certifications, and relevant coursework

New Jersey Institute of Technology logoNT

New Jersey Institute of Technology

Bachelor in Information System, Information Systems

Earned a bachelor’s degree in information systems from New Jersey Institute of Technology.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan