Rehan User
@rehan
Experienced Lead Data Engineer specializing in cloud data solutions.
What I'm looking for
As a Lead Data Engineer with over 8 years of experience, I specialize in designing and optimizing scalable data architectures. My expertise lies in ETL workflows, cloud data solutions, and big data technologies. I leverage Python, SQL, Spark, and major cloud platforms like AWS, Azure, and GCP to build robust, high-performance data pipelines. I excel at transforming complex business requirements into efficient technical solutions that drive actionable insights and reduce costs.
Throughout my career, I have consistently improved data processing efficiency and reliability, focusing on real-time analytics, data governance, and automation. As a leader, I foster collaborative, high-performing teams and mentor engineers to ensure project success and innovation. I am passionate about leveraging data to enable smarter decision-making and am always exploring new technologies to enhance data architecture and performance.
Experience
Work history, roles, and key accomplishments
Lead Data Engineer
Avanade
Feb 2022 - Present (3 years 6 months)
Architected a cloud-native data lakehouse on AWS (S3 + Databricks Delta Lake), centralizing over 100 TB of structured and unstructured data for analytics teams. Designed and implemented over 150 Airflow DAGs and PySpark jobs, reducing batch processing times and data latency.
Senior Data Engineer
Themesoft Inc.
Jul 2019 - Present (6 years 1 month)
Built and optimized a Snowflake data warehouse with dbt-driven transformations, significantly improving query performance. Developed real-time ingestion pipelines using Kafka and Spark Streaming, enabling near real-time reporting.
Junior Data Engineer
Trellix
Sep 2017 - Present (7 years 11 months)
Developed Python-based ETL scripts and Informatica workflows to integrate sales and marketing data into a centralized Redshift warehouse. Automated over 60 daily jobs with Apache Airflow, ensuring 99.5% on-time execution reliability.
Data Engineer
ApTask
Mar 2016 - Present (9 years 5 months)
Developed and maintained ETL workflows using Informatica to integrate data from Oracle and MySQL into a centralized data warehouse. Optimized SQL queries and indexes, resulting in a 20% improvement in report generation times.
Education
Degrees, certifications, and relevant coursework
Bachelor's Degree, Computer Science
2012 - 2016
Completed a Bachelor's Degree in Computer Science, focusing on core computer science principles and practices.
Tech stack
Software and tools used professionally
Amazon Redshift
Apache Spark
Microsoft Azure
Google Cloud Platform
GitHub
GitLab
Kubernetes
Jenkins
GitHub Actions
Jupyter
NumPy
Pandas
PySpark
Dask
dbt
MySQL
PostgreSQL
MongoDB
Cassandra
Hadoop
Gmail
Databricks
Neo4j
Terraform
Jira
Java
TensorFlow
PyTorch
MLflow
scikit-learn
Keras
NLTK
Kafka
FastAPI
Prometheus
Airflow
Google BigQuery
Optimizely
SQL
XGBoost
SciPy
LightGBM
Seldon
