I seek Data Engineering roles where I’d be engaged in building scalable, secure data platforms, whether on-prem or on cloud platforms, with strong CI/CD and IaC practices. I am a result-driven individual, so I am looking to make an impact with my domain experience across various sectors
Yasir Yusuf
@yasiryusuf
Data Engineer specialising in scalable cloud data pipelines and real-time analytics.
What I'm looking for
I am a senior big data engineer with over eight years of experience designing, developing, and optimising large-scale data solutions across AWS, Azure, GCP, and on-premises Hadoop ecosystems.
I have led migrations to cloud platforms, architected serverless and streaming pipelines using AWS Glue, Lambda, Kinesis, Azure Data Factory, Event Hubs and Google Pub/Sub, and improved query and ETL performance using Redshift, Synapse, BigQuery and dbt.
Throughout my career, I have been engaged in cost-efficient, secure data architectures, infrastructure-as-code (Terraform, CloudFormation), CI/CD automation, and cross-functional collaboration to deliver production-ready data platforms that enable analytics, ML workflows, and real-time use cases.
Experience
Work history, roles, and key accomplishments
Data Engineer
British Petroleum
Nov 2024 - Present (1 year 2 months)
Manage and automate data pipelines across Azure and AWS, improving ETL performance and building scalable data warehouse and real-time streaming solutions that enhanced query performance by 40%.
AWS Data Engineer
British Airways
Feb 2024 - Oct 2024 (8 months)
Designed AWS data lakes and ETL pipelines using S3, Glue and EMR, optimized querying with Athena and Redshift, and automated infrastructure and deployments to improve data integration and operational efficiency.
Data Engineer
Reckitt Benckiser
Aug 2022 - Feb 2024 (1 year 6 months)
Built and automated ETL pipelines and data lakes on Azure, implemented CI/CD for dbt projects and Terraform modules, and improved real-time ingestion and query performance across Synapse and ADLS.
Senior Data Engineer
Citigroup
Jan 2021 - Jul 2022 (1 year 6 months)
Optimized ETL workflows and integrated multi-source data into AWS Redshift, reducing processing times by 30% and implementing serverless pipelines for near-real-time transaction processing and governance with dbt/Snowflake.
Big Data Engineer
NFU Mutual
Oct 2019 - Dec 2020 (1 year 2 months)
Developed and managed Dataproc/Spark and serverless workflows on GCP, built BigQuery models with dbt and Pub/Sub streaming solutions to enable scalable analytics and automated CI/CD pipelines.
Azure Data Engineer
Prudential
Dec 2017 - Sep 2019 (1 year 9 months)
Engineered big data solutions with Azure Data Factory, Synapse and HDInsight, built serverless ETL and real-time streaming with Event Hubs, and delivered analytics dashboards to drive data-driven decisions.
Data Engineer
Morrisons
Mar 2015 - Nov 2017 (2 years 8 months)
Developed Spark Structured Streaming analytics and migrated legacy systems to Cloudera, improving processing speeds by 40% and automating ingestion and orchestration with Sqoop and Oozie.
Education
Degrees, certifications, and relevant coursework
University of Exeter
Master of Science, Applied Science and Statistics
Completed an MSc in Applied Science and Statistics focused on advanced statistical methods and applied data analysis for scientific contexts.
Tech stack
Software and tools used professionally
Amazon Redshift
Azure HDInsight
Azure Synapse
Apache Spark
AWS Glue
AWS IAM
Google Compute Engine
Amazon CloudWatch
Amazon S3
Google Cloud Storage
Azure Storage
GitHub
Kubernetes
Azure Kubernetes Service
Jenkins
GitHub Actions
Azure Pipelines
Pandas
PySpark
dbt
DB
Sqoop
MySQL
PostgreSQL
MongoDB
Microsoft SQL Server
Hadoop
HBase
Databricks
Terraform
AWS CloudFormation
Azure DevOps
Jira
JSON
Kafka
Grafana
Azure Monitor
Zookeeper
Datadog
Google App Engine
Google Cloud Dataflow
Amazon Kinesis
Google Cloud Pub/Sub
Amazon Macie
Avro
AWS Lambda
Serverless
Google Cloud Functions
Azure Functions
Azure SQL Database
Google Cloud SQL
pytest
Zero Server
Airflow
Time Analytics
Root Cause
Google BigQuery
Amazon EMR
SQL
Google Kubernetes Engine
Azure Cosmos DB
Azure Blob Storage
Google Cloud Dataproc
ScalaTest
Azure Logic Apps
Cosmos
Bash
Transform
Google Cloud Deployment Manager
Enhance
Phase
Factory
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Yasir?
You can contact Yasir and 90k+ other talented remote workers on Himalayas.
Message YasirFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
