I’m looking to lead a product-minded cloud data platform team—owning lakehouse/streaming architecture, strengthening reliability and governance, and mentoring engineers to deliver measurable business outcomes.
Saqib Khan
@saqibk2
Lead Data Engineer | Cloud Data Platform Architect | Real-Time & AI Data Systems
What I'm looking for
I’m a Lead Data Engineer with 9+ years of experience building data platforms that scale—and engineering teams that last. I own the full arc from whiteboarding multi-cloud architecture to shipping production pipelines processing 10TB+ daily, while setting org-wide engineering standards and growing engineers into senior contributors.
I specialize in modern lakehouse design (Delta Lake, Iceberg), real-time streaming (Kafka, Spark), and dbt-driven transformation across AWS and GCP. I treat data platforms as products—reliable, observable, and trusted by the teams that depend on them—especially in regulated environments (HIPAA, SOC2, fintech) where data quality directly impacts business and compliance outcomes.
In my roles, I’ve led multi-cloud lakehouse and hybrid processing architectures, using Medallion design, dbt modeling/testing, and advanced SQL to improve data accessibility and delivery efficiency. I’ve also driven DataOps transformation with CI/CD, automated testing, and observability, reducing failures and costs, while mentoring teams and partnering with stakeholders to deliver measurable outcomes.
Experience
Work history, roles, and key accomplishments
Lead Data Engineer
Verato
May 2024 - Present (2 years 1 month)
Led and scaled a team of 6+ data engineers, defining architecture standards and improving delivery efficiency by 30% through Python/SQL pipelines and dbt-driven ELT. Owned a multi-cloud lakehouse vision on AWS/GCP, improving data accessibility by 40% and advancing AI data products that increased data accuracy by 28%.
Senior Data Engineer
Stord
Jan 2021 - Apr 2024 (3 years 3 months)
Architected and scaled a cloud-native GCP data platform supporting $10B+ annual transactions, reducing data latency by 25% with hybrid event-driven and batch processing (Kafka/Airflow). Drove dbt adoption and DataOps/CI-CD, improving release efficiency by 30% and optimizing BigQuery to reduce costs by 15% while improving query performance.
Cloud Data Engineer
Vanta
Apr 2018 - Nov 2020 (2 years 7 months)
Designed and scaled ETL/ELT pipelines for security and compliance data from 400+ integrations, supporting 12,000+ customers and improving scalability by 30%. Built audit-ready “golden datasets,” improving audit readiness and data accuracy by 25%, and reduced manual audit effort by 35% with real-time monitoring and alerting.
Big Data Engineer
Imply
Aug 2016 - Mar 2018 (1 year 7 months)
Contributed to real-time streaming architecture using Kafka and Druid to enable high-throughput analytics for distributed systems. Built and optimized streaming/batch pipelines in Python and SQL, increasing throughput by 25% and improving query performance and concurrency by 30%, while reducing downtime and data inconsistencies by 15%.
Education
Degrees, certifications, and relevant coursework
New Jersey Institute of Technology
Bachelor in Information System, Information Systems
Earned a bachelor’s degree in information systems from New Jersey Institute of Technology.
Availability
Location
Authorized to work in
Salary expectations
Social media
Skills
Interested in hiring Saqib?
You can contact Saqib and 90k+ other talented remote workers on Himalayas.
Message SaqibFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
