Open to opportunities

A Shaw

@ashaw1

I’m a senior data engineer and data architect specializing in cloud-native streaming, lakehouses, and analytics.

United States

What I'm looking for

I’m looking for a role where I can build scalable cloud data platforms, own real-time streaming and lakehouse architecture, and partner with AI/ML and analytics teams, while strengthening data governance, reliability, and performance.

I’m a Data Engineer and Data Architect with 9+ years of experience delivering end-to-end big data solutions across AWS, Azure, and GCP. I build cloud-native data platforms that turn operational complexity into reliable, scalable analytics.

I specialize in real-time streaming pipelines with Kafka, Flink, and Spark, and I orchestrate ETL/ELT workflows using Databricks, Airflow, and Python. My work centers on SQL, data modeling, and modern data warehouses such as Snowflake, Redshift, and BigQuery.

I design high-performance architectures that unify structured and unstructured data from RDBMS and NoSQL systems, so teams can power BI, machine learning, and executive analytics. I also lead data governance and schema standardization to support compliance with regulations such as HIPAA and GDPR.

As a team lead, I mentor junior engineers and partner cross-functionally to align data architecture with AI/ML and business priorities. I love building lakehouse patterns, optimizing cost and performance, and delivering trustworthy pipelines that reduce latency and improve decision-making.

Experience

Work history, roles, and key accomplishments

Current

Lead Data Engineer (Arch)

StealthAI (StitchVision)

Aug 2023 - Present (2 years 11 months)

Architected a real-time inventory and supply chain data platform using Kafka, Flink, and Databricks to enable sub-second product availability updates. Led multi-cloud (AWS and GCP) lakehouse and streaming ETL/ELT designs, including governance aligned to HIPAA and GDPR.

Data Engineering Team Lead

Horizon Technologies (Addo AI)

Jan 2022 - Jul 2023 (1 year 6 months)

Conceptualized and built a user health data platform using Kafka streams and cloud data lakes on Google Cloud. Led enterprise data architecture and migration to AWS Redshift, Snowflake, and BigQuery, supporting machine learning and business intelligence.

Kafka, Google Cloud Platform, Data Lake, ETL, Python, Apache Spark, SQL, PostgreSQL, Apache NiFi, Cloud Migration, AWS Redshift, Snowflake, BigQuery, Clustering

Data Engineer (Remote Contract)

Amazon Web Services (AWS)

May 2020 - Dec 2021 (1 year 7 months)

Built scalable ETL pipelines with AWS Glue, PySpark, and Lambda to ingest and transform customer behavior and transaction data. Designed Redshift data marts and a centralized S3 data lake with Glue Data Catalog and schema versioning, supporting analytics via QuickSight and Athena.

S3, AWS Glue, PySpark, AWS Lambda, Redshift, Amazon QuickSight, Athena, Glue Data Catalog, Terraform, Python, Data Quality, Anomaly Detection, Redshift Spectrum, Kinesis, SQL, APIs

Data Engineer

Mercurial Minds

Jan 2015 - Apr 2020 (5 years 3 months)

Developed a document management system with real-time indexing and retrieval using Kafka and Flink, including OCR for searchable content. Built end-to-end data architectures integrating relational and NoSQL systems, and implemented collaboration analytics and data backup using AWS and Azure storage services.