Loading...
Loading...
Himalayas
About usHimalayas PlusCommunityTech stackEmployee benefitsTerms and conditionsPrivacy policyContact usFor job seekers
Create your profileBrowse remote jobsDiscover remote companiesJob description keyword finderRemote work adviceCareer guidesJob application trackerAI resume builderResume examples and templatesAI cover letter generatorCover letter examplesAI headshot generatorAI interview prepInterview questions and answersAI interview answer generatorAI career coachFree resume builderResume summary generatorResume bullet points generatorResume skills section generator© 2025 Himalayas. All rights reserved. Built with Untitled UI. Logos provided by Logo.dev. Voice powered by Elevenlabs Grants
Join the remote work revolution
Join over 100,000 job seekers who get tailored alerts and access to top recruiters.
@ryanwillson1
Senior Data Engineer and architect delivering scalable lakehouse platforms and real-time pipelines.
I am a Senior Data Engineer and Big Data Architect with 10+ years building real-time pipelines and lakehouse platforms across healthcare, finance, and retail. I specialize in reducing latency and cloud costs while ensuring compliance with HIPAA, GDPR, and SOX.
I've designed Kafka and Spark streaming at scale, built multi-zone lakehouses on AWS S3, and standardized onboarding to cut time-to-data by ~50%. I've led Fortune-level deployments, authored Python and SQL frameworks for cleansing and identity graphs, and tuned EMR/Glue/Redshift to lower ETL runtimes and cloud spend.
I deliver measurable business outcomes—cutting inventory discrepancies, shrinking batch windows, and improving match rates—while mentoring teams and implementing infrastructure-as-code and observability best practices.
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Work history, roles, and key accomplishments
SecureHavenCo
Feb 2025 - Aug 2025 (6 months)
Built a unified retail data model and automated 15 near-real-time pipelines (under 15-minute SLAs), cutting inventory discrepancies by 35% and reducing cloud costs by 20%. Delivered self-service dashboards that reduced prep time from two hours to ten minutes and shrank batch windows by 40%.
Secure HavenCo
Feb 2025 - Aug 2025 (6 months)
Designed and maintained scalable ETL/ELT pipelines integrating inventory, orders, customers, and shipping; migrated legacy RDBMS and flat-file workflows into AWS analytics stacks to accelerate financial and billing reports and enable real-time sales insights.
Designed Kafka and Spark streaming processing 500M+ events/day, reducing latency ~45% and built a multi-zone lakehouse on S3 that halved time-to-data and improved match rates by 20%. Tuned EMR/Glue/Redshift to cut ETL runtimes and cloud spend by ~30% while ensuring compliance for Fortune-level customers.
Led enterprise RDBMS-to-PostgreSQL migrations and architected petabyte-scale real-time and batch data pipelines using Spark and AWS (Glue/EMR), reducing downstream data errors by 40% and cutting infrastructure costs and latency.
Implemented end-to-end tagging achieving >98% event capture and reduced defects 40%; built A/B testing pipelines that shortened analysis from days to hours and increased conversion by 7%. Modeled unified profiles in AEP to boost activation reach 20% while operationalizing GDPR consent and deletion workflows.
Blue Medora
Feb 2017 - Apr 2019 (2 years 2 months)
Led two agile squads to deliver 10 cloud monitoring integrations in Kotlin/Java, doubling coverage and cutting time-to-market per integration by 50%. Delivered a serverless public API ingesting millions of metrics/day and drove CI/CD improvements that reduced incidents 30% and increased deployment cadence to weekly.
Delivered governance roadmaps and implemented lineage for 90% of critical datasets to meet HIPAA/GDPR/SOX; built InfoSphere/DataStage pipelines that shrank nightly batches 25% and led MDM consolidation of 20 sources to improve report accuracy and reduce duplicates.
Degrees, certifications, and relevant coursework
Master of Science, Computer Science
Completed a Master of Science in Computer Science focusing on advanced topics in software and data engineering.
Bachelor of Science, Electrical Engineering & Computer Science
Earned a Bachelor of Science in Electrical Engineering & Computer Science with foundational training in software and systems engineering.
Software and tools used professionally
You can contact ryan and 90k+ other talented remote workers on Himalayas.
Message ryanAli Butt
Lead Data Engineer, Kyruus Health
Raza Ash
Senior Data Architect, DataBridge Solutions
Nayla R
Senior Data Engineer, Sepsis Scout
Ketan Boro
Senior Data Engineer, Pfizer
Alay Bangash
Lead Data Engineer, Anveta
Saify K User
Lead Data Engineer, Etleap
Rana Jalil
Senior Data Engineer, NextGen Analytics
Ruohan Liu
Senior Data Engineer, Uber
Tirtha Mandal
Sr Data Engineer & Architect, Point C
Saqib Farooq
Senior Data Engineer, Confiz