Shawn Iqbal
@shawniqbal2
Senior Data Solution Architect specializing in real-time, scalable data platforms and ETL excellence.
What I'm looking for
I am a results-driven Senior Data Solution Architect with over 11 years of experience designing and optimizing scalable data solutions across cloud platforms. I specialize in latency-conscious data engineering, real-time analytics, and high-throughput systems.
I have architected cloud-native data platforms and led large migrations to Snowflake, Databricks Lakehouse, and Delta Lake, achieving substantial performance and cost improvements. I build and optimize ETL/ELT workflows using Talend, Apache NiFi, Apache Airflow, and Informatica.
I focus on data governance, compliance (HIPAA, GDPR), and data quality while integrating ML models with Databricks, Scikit-learn, TensorFlow, and PyTorch for predictive analytics. My work has improved forecasting accuracy, reduced inventory stockouts, and accelerated model iteration.
I mentor cross-functional teams, implement DevOps best practices (Docker, Kubernetes, CI/CD, Terraform), and deliver production-grade streaming architectures with Kafka, Flink, and AWS Kinesis to support real-time business needs.
Experience
Work history, roles, and key accomplishments
Architected and deployed real-time streaming infrastructures and cloud-native data platforms across AWS, Azure, and GCP, reducing infrastructure costs by 50% and improving operational responsiveness by 35%. Led migration to Snowflake/Databricks, improving query performance by 60% and cutting maintenance overhead by 70%.
Developed and optimized large-scale Hadoop/Spark data pipelines and automated ETL workflows, reducing processing latency by 35% and enabling real-time analytics via Snowflake integrations. Implemented data governance and CI/CD for data workflows.
Data Engineer
DataArt
Jun 2017 - Jul 2019 (2 years 1 month)
Designed scalable ETL pipelines with Apache Beam and Dataflow and built a GCP data lake, improving data delivery times by 30% and reducing data discrepancies by 40% through automated validation frameworks.
ETL & Data Warehouse Engineer
Steer Health
Feb 2014 - May 2017 (3 years 3 months)
Designed and maintained ETL pipelines and dimensional data warehouse solutions using Talend, SSIS, and NiFi, enabling real-time healthcare analytics and ensuring HIPAA-compliant data governance.
Education
Degrees, certifications, and relevant coursework
University of California
Bachelor of Science, Computer Science
Bachelor of Science in Computer Science obtained from the University of California.
Tech stack
Software and tools used professionally
Amazon Redshift
Azure Synapse
Apache Spark
AWS Glue
Apache Flink
Talend
D3.js
Google Cloud Platform
GitHub
GitLab
Kubernetes
Jenkins
CircleCI
GitLab CI
Jupyter
dbt
MySQL
PostgreSQL
MongoDB
SQLite
Cassandra
Hadoop
HBase
Gmail
.NET
Databricks
Redis
Terraform
Jira
Java
TensorFlow
PyTorch
MLflow
scikit-learn
Kafka
Apache NiFi
Trello
Google Cloud Dataflow
Ansible
Kafka Streams
Apache Storm
Airflow
Apache Beam
Time Analytics
Google BigQuery
SQL
Feast
Delta Lake
Great Expectations
Bash
Transform
Availability
Location
Authorized to work in
Portfolio
shawn-iqbal-tech.github.io/portfolioJob categories
Skills
Interested in hiring Shawn?
You can contact Shawn and 90k+ other talented remote workers on Himalayas.
Message ShawnFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
