Roy Asher
@royasher
Principal data engineer delivering scalable lakehouse and real-time platforms.
What I'm looking for
I’m a Principal Data Engineer with 10+ years of experience designing and delivering scalable, high-performance data platforms across E-Commerce, Healthcare, FinTech, and AI-driven environments. I build end-to-end cloud-native data ecosystems—real-time streaming and modern lakehouse solutions—leveraging AWS, Snowflake, and Databricks to enable advanced analytics and machine learning at scale.
I’m also deeply focused on reliability, scalability, and cost efficiency, translating business requirements into production-ready data products. From establishing enterprise-grade data governance and observability to optimizing large-scale pipelines and leading teams of 8 engineers, I consistently deliver measurable impact—like reducing infrastructure costs by 30%+ and improving accuracy and readiness across regulated environments.
Experience
Work history, roles, and key accomplishments
Architected and delivered an AWS-based lakehouse and streaming platform supporting high-volume e-commerce analytics. Built event-driven ingestion with Kafka and Kinesis, optimized Spark/ETL workflows to cut infrastructure costs by 30%+, and led/mentored a team of 8 engineers.
Designed and implemented scalable ETL/ELT pipelines for enterprise analytics and AI workloads on Databricks. Built a centralized feature store, automated ML lifecycle processes with MLflow and CI/CD, and improved pipeline reliability and BI reporting accuracy.
Data Engineer
BlackBird Health
Apr 2018 - May 2021 (3 years 1 month)
Built HIPAA-compliant ETL pipelines integrating EHR/EMR and claims data to enable longitudinal patient analytics and secure clinical reporting. Improved data accuracy by 40%+ through automated validation and data quality frameworks while implementing encryption, access control, and audit-ready governance.
Developed high-performance batch and streaming ETL pipelines for large-scale financial transaction systems, including real-time fraud detection and risk analytics. Migrated legacy workflows to scalable Spark-based distributed architectures and improved reconciliation accuracy and processing speed through optimized data pipelines.
Education
Degrees, certifications, and relevant coursework
Roy hasn't added their education
Don't worry, there are 90k+ talented remote workers on Himalayas
Tech stack
Software and tools used professionally
Amazon Redshift
Matillion
Azure Synapse
Apache Spark
AWS Glue
Talend
GitLab
Kubernetes
Jenkins
GitLab CI
PySpark
dbt
MySQL
PostgreSQL
MongoDB
Cassandra
Hadoop
Gmail
Databricks
Redis
Terraform
Java
MLflow
Kafka
Grafana
Prometheus
Datadog
GraphQL
Amazon Kinesis
Kafka Streams
Toolkit
Time Analytics
Google BigQuery
SQL
Apache Iceberg
Pinecone
Delta Lake
Bash
Faiss
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Roy?
You can contact Roy and 90k+ other talented remote workers on Himalayas.
Message RoyFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
