Dev Patel
@devpatel6
Senior Data Engineer building scalable Azure and Big Data platforms for trusted analytics.
What I'm looking for
I’m a Senior Data Engineer with 8+ years of experience building large-scale enterprise data platforms across healthcare, financial, retail, telecom, and government datasets. I design scalable data structures—data lakes, data warehouses, ingestion pipelines, transformation layers, and curated datasets—to power enterprise reporting and decision systems.
I’ve built ingestion frameworks that collect data from relational databases, flat files, APIs, and streaming platforms, then transform it with robust ETL/ELT patterns. I perform deep data profiling and quality checks (null analysis, duplicates, schema inconsistencies, referential integrity) to ensure reliable pipelines before publishing analytical layers.
I architect and implement end-to-end processing: structured lake zones (landing/standardized/curated), dimensional modeling (star schema, SCD Type 2), and orchestration frameworks to schedule ingestion, transformation, validation, and warehouse refresh cycles. I also deliver monitoring and logging so teams can troubleshoot quickly and keep platforms running.
My work spans modern cloud and engineering practices, including Azure and GCP services, Spark (PySpark/Scala/Spark SQL), and CI/CD automation for repeatable deployments. I’m driven by data governance, metadata/lineage management, and security—so analytics teams get trusted, well-governed datasets they can confidently act on.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
BMO
Jun 2024 - Present (2 years)
Designed a GCP-based layered data lake and BigQuery dimensional warehouse for workers’ compensation claims, building ingestion pipelines, PySpark transformations, data quality validations, and metadata documentation. Implemented Cloud Composer orchestration, monitoring/logging, security controls, and Looker reporting datasets for enterprise analytics stakeholders.
Data Engineer
Toyota Motors
Nov 2021 - Jun 2024 (2 years 7 months)
Built Google Cloud data lake ingestion and real-time streaming pipelines for retail transactions, using Pub/Sub and Apache Kafka to power near real-time analytics. Developed BigQuery star-schema models and dbt-based ELT transformations, added data validation (Deequ), and orchestrated workflows with Airflow/Cloud Composer for curated reporting datasets.
Data Warehouse Developer
KPMG
Sep 2018 - Nov 2021 (3 years 2 months)
Developed Oracle-based dimensional data warehouse solutions for retail analytics, implementing star schemas and SCD Type 2 logic with Informatica PowerCenter, SQL, and PL/SQL. Built and supported Hadoop/Hive telecom CDR processing pipelines using Oozie and Sqoop, secured with Kerberos and Apache Ranger, and enabled BI consumption through Tableau dashboards.
Education
Degrees, certifications, and relevant coursework
Dev hasn't added their education
Tech stack
Software and tools used professionally
Splunk
Azure Synapse
Apache Spark
Apache Hive
Google Cloud Platform
Google Cloud Storage
Azure Storage
GitHub
GitLab
Kubernetes
Jenkins
CircleCI
GitHub Actions
PySpark
dbt
Sqoop
MySQL
PostgreSQL
Hadoop
HBase
Sybase
Gmail
Yarn
Databricks
Terraform
AWS CloudFormation
Azure DevOps
JSON
Apache Flume
Azure Machine Learning
Kafka
Apache NiFi
Prometheus
Ambari
Google Cloud Dataflow
Google Cloud Pub/Sub
Avro
AWS Lambda
Serverless
Azure Functions
Airflow
Apache Oozie
Apache Ranger
Time Analytics
Amazon Web Services (AWS)
SQL
Google Kubernetes Engine
Delta Lake
Great Expectations
Azure Logic Apps
Collibra
Bash
Deequ
Unity Catalog
Factory
Movement
