David Pavao
@davidpavao
Staff software engineer specializing in data platforms and analytics.
What I'm looking for
I am a staff-level software engineer with 15+ years designing and operating data platforms, streaming pipelines, and analytics infrastructure across marketplace, e-commerce, and healthcare domains. I have led end-to-end platform builds that standardized Kafka topics and schemas, implemented dbt dimensional models, and delivered near-real-time data freshness for critical business domains.
My work has driven measurable improvements: reduced ad-hoc reporting by ~40%, cut monthly data platform costs by ~25%, accelerated reporting from days to minutes, and improved detection of operational issues through streaming dashboards. I prioritize reliability, data contracts, performance optimization, and close collaboration with product and analytics teams to enable self-serve insights and dependable metrics.
Experience
Work history, roles, and key accomplishments
Led design and build of a unified event-driven data platform (Kafka → Flink/Spark → Snowflake/S3) to deliver sub-5-minute data freshness, standardized schemas/contracts, and reduced downstream failures while cutting platform costs ~25%.
Designed and implemented the consolidated event collection pipeline and ETL jobs (Airflow + Python + Spark) to normalize marketplace activity into Snowflake/Redshift, reducing report generation time from days to minutes and improving metric consistency.
Owned order, inventory, and pricing pipelines and built ETL (Airflow + Python + Spark) and dimensional models to move data into Redshift/S3, improving data freshness to hourly/near-real-time and reducing OLTP load ~30%.
Senior Software Engineer
Intermedix
May 2008 - Feb 2016 (7 years 9 months)
Designed and maintained ETL processes and a SQL Server data warehouse for healthcare/emergency management, reducing batch processing time ~60% via indexing, partitioning, and parallelization and enabling operational reporting.
Education
Degrees, certifications, and relevant coursework
University of Houston
Bachelor of Science, Computer Science
2004 - 2009
Completed a Bachelor’s Degree in Computer Science, covering foundational coursework in programming, algorithms, and systems.
Tech stack
Software and tools used professionally
Postman
OpenAPI
AWS CLI
Google Cloud Platform
GitHub
GitLab
ESLint
SonarQube
Kubernetes
GitLab CI
dbt
MySQL
PostgreSQL
MongoDB
Memcached
Cassandra
Gmail
Node.js
Django
Laravel
Spring Boot
Yarn
Tailwind CSS
Redis
Terraform
AWS CloudFormation
IntelliJ IDEA
Gradle
Mocha
Webpack
JavaScript
Java
Neptune
Kafka
FastAPI
Grafana
Prometheus
Linux
macOS
Windows
Datadog
GraphQL
Amazon Kinesis
gRPC
Elasticsearch
Solr
Ansible
AWS Lambda
Serverless
pytest
JUnit
React Testing Library
NGINX
Airflow
SQL
npm
Core Data
Vite
Phoenix LiveView
Remote
Jan
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring David ?
You can contact David and 90k+ other talented remote workers on Himalayas.
Message DavidFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
