Shaun K
@shaunk
Senior Data Engineer specializing in cloud-native big data, ETL, and real-time analytics.
What I'm looking for
I am a Senior Data Engineer with 9+ years delivering cloud-native, scalable data platforms across healthcare, finance, and manufacturing. I design and implement end-to-end ETL/ELT pipelines, real-time ingestion, and data warehouses using tools such as Airflow, dbt, Talend, Informatica, Snowflake, and Databricks.
I built a serverless pipeline processing 500M+ daily events with 99.99% uptime, reduced data latency by 45%, and improved reporting speed and accuracy through Data Vault and Kimball modeling. I drive data governance and compliance (HIPAA, GDPR, SOC2), automate data quality, and deploy ML models with MLflow, TensorFlow, and scikit-learn.
I lead teams to deliver production-ready analytics, automate CI/CD and IaC with GitLab, Terraform, Docker, and Kubernetes, and create BI visualizations with Tableau and Power BI to enable data-driven decisions and measurable operational gains.
Experience
Work history, roles, and key accomplishments
Lead Data Engineer
Lark Health
Jan 2022 - Present (3 years 10 months)
Built cloud-native data pipelines for EHR and claims integration, reducing data latency by 45% and improving reporting speed; automated data quality across 25 datasets and deployed ML readmission models with CI/CD.
Senior Data Engineer
EE-Medix
Aug 2018 - Dec 2021 (3 years 4 months)
Designed enterprise financial data warehouse and migrated legacy systems to cloud platforms, cutting reporting time by 50% and infra costs by 35% while ensuring SOC2/GDPR compliance.
ETL & Data Warehouse Engineer
Azumo
Jun 2015 - Jul 2018 (3 years 1 month)
Built scalable big data platforms processing 15+ TB of sensor data and enabled real-time analytics, improving insight velocity by 40% and detecting 90% of anomalies via deep learning pipelines.
Education
Degrees, certifications, and relevant coursework
New Jersey Institute of Technology
Bachelor of Arts, Information Systems
Completed a Bachelor of Arts in Information Systems focused on information systems theory and practical IT applications.
Tech stack
Software and tools used professionally
Amazon Redshift
Azure Synapse
AWS Glue
Apache Flink
Talend
D3.js
GitLab
Kubernetes
Jenkins
CircleCI
GitLab CI
Jupyter
dbt
MySQL
PostgreSQL
MongoDB
SQLite
Cassandra
Hadoop
HBase
Gmail
.NET
Databricks
Redis
Terraform
Jira
Java
TensorFlow
PyTorch
MLflow
scikit-learn
Kafka
Apache NiFi
Trello
Ansible
Serverless
Kafka Streams
Apache Storm
Airflow
Time Analytics
Redis Cloud
Google BigQuery
SQL
Delta Lake
Lark
Bash
Transform
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Shaun?
You can contact Shaun and 90k+ other talented remote workers on Himalayas.
Message ShaunFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
