Kevin Chen
@kevinchen6
Senior Data Engineer building scalable streaming and AI/ML data platforms.
What I'm looking for
I’m a Senior Data Engineer with 10 years of experience building scalable data platforms and distributed processing systems across streaming media, healthcare, and SaaS environments. I focus on real-time analytics, recommendation systems, AI/ML data infrastructure, and platform reliability that helps teams make business-critical decisions.
At Spotify, I developed a Streaming Personalization & Recommendation Analytics Platform that supports large-scale experimentation and machine learning workloads. I rebuilt ingestion and orchestration workflows with Airflow and incremental processing to reduce end-to-end data latency from hours to under 15 minutes, and I automated infrastructure provisioning and deployment with Terraform, Kubernetes, and GitHub Actions—cutting deployment lead time by 60%.
I also drive measurable improvements in trust and stability: I introduced data quality checks and lineage validation across critical pipelines, lowering production data incidents by 30%+ and improving downstream reporting stability. I optimized Snowflake workloads through clustering, partitioning, and tuning to reduce annual compute spend by ~20%, and I strengthened operational reliability for business-critical pipelines to reduce SLA breaches by 35%.
Previously at Included Health, I built a HIPAA-compliant Unified Healthcare Data & Patient Analytics Platform by developing Airflow and Spark pipelines ingesting patient eligibility, claims, and clinical records from Kafka streams and systems like Salesforce and microservices. I consolidated fragmented pipelines into a centralized AWS data platform, created reusable ingestion/transformation frameworks that reduced onboarding time and increased self-service adoption by 30%, and maintained 99%+ DAG success rates through automated validation, retry handling, and monitoring.
Experience
Work history, roles, and key accomplishments
Developed a streaming personalization and recommendation analytics platform processing billions of playback, session, and engagement events daily. Reduced end-to-end data latency from hours to under 15 minutes, cut deployment lead time by 60%, lowered production data incidents by 30%+, reduced annual Snowflake compute spend by ~20%, and decreased SLA breaches by 35%.
Built a Unified Healthcare Data and Patient Analytics platform supporting HIPAA-compliant clinical analytics and machine learning workflows. Developed Kafka-driven Airflow/Spark pipelines, consolidated AWS reporting pipelines, increased analyst self-service adoption by 30%+, maintained 99%+ DAG success rates, and strengthened governance, auditability, and access controls.
Created a Revenue and GTM Analytics Reporting platform for finance, sales operations, and executive reporting across SaaS stakeholders. Automated recurring KPI/revenue reporting to eliminate 10+ hours of weekly manual work, tuned Snowflake workloads to reduce report runtime by 25%+ and compute consumption, and decreased reporting discrepancies by ~30%.
Education
Degrees, certifications, and relevant coursework
Brandeis University
Bachelor's degree in Computer Science, Computer Science
2011 - 2015
Earned a bachelor's degree in Computer Science at Brandeis University from 2011 to 2015.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Kevin?
You can contact Kevin and 90k+ other talented remote workers on Himalayas.
Message KevinFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
