Fatkhullakh Turakhonov
@fatkhullakhturakhono
Data Engineer building end-to-end lakehouses, pipelines, and BI with PySpark and Airflow.
What I'm looking for
I’m a Data Engineer with hands-on experience building end-to-end data pipelines, lakehouses, and analytics platforms across e-commerce, mobility, and retail. I focus on practical, business-ready delivery: modeling, orchestration, and reliable reporting that teams can trust.
In my recent role, I built an end-to-end lakehouse for a Central Asian e-commerce client by ingesting multi-source data into a Medallion architecture on S3-compatible object storage with Delta Lake. I designed and ran a 2-worker PySpark cluster for deduplication, currency normalization, and timezone fixes across 5M+ records, then strengthened the Gold layer with dbt (15 models, 50+ tests) and CI/CD via GitHub Actions.
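As a rough illustration of the three cleaning steps named above (deduplication, currency normalization, timezone fixes), here is a minimal plain-Python sketch rather than the actual PySpark job. The field names, the `RATES_TO_USD` table, the exchange rate, and the UTC+5 source offset are all assumptions made for the example.

```python
from datetime import datetime, timedelta, timezone
from decimal import Decimal

# Hypothetical fixed rates to USD; a real pipeline would load dated rates.
RATES_TO_USD = {"USD": Decimal("1"), "UZS": Decimal("0.00008")}
# Assumed source timezone: a fixed UTC+5 offset with no daylight saving.
SOURCE_TZ = timezone(timedelta(hours=5))


def clean_orders(rows):
    """Deduplicate by order_id (latest update wins), convert amounts to USD,
    and convert naive local timestamps to UTC."""
    latest = {}
    for row in rows:
        seen = latest.get(row["order_id"])
        # ISO-8601 strings of the same shape sort chronologically.
        if seen is None or row["updated_at"] > seen["updated_at"]:
            latest[row["order_id"]] = row

    cleaned = []
    for row in latest.values():
        local = datetime.fromisoformat(row["updated_at"]).replace(tzinfo=SOURCE_TZ)
        cleaned.append({
            "order_id": row["order_id"],
            "amount_usd": Decimal(row["amount"]) * RATES_TO_USD[row["currency"]],
            "updated_at_utc": local.astimezone(timezone.utc),
        })
    return cleaned


orders = [
    {"order_id": 1, "amount": "100000", "currency": "UZS", "updated_at": "2025-01-01T10:00:00"},
    {"order_id": 1, "amount": "125000", "currency": "UZS", "updated_at": "2025-01-01T12:00:00"},
    {"order_id": 2, "amount": "15.50", "currency": "USD", "updated_at": "2025-01-02T01:30:00"},
]
result = clean_orders(orders)
```

In a PySpark version the same idea would typically be expressed with a window over the order key, a join against a rates table, and a timezone conversion function, but the logic per record is the same.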
I deliver resilience as well as performance: I used Delta Lake time travel to recover from a payment schema incident with zero data loss and zero pipeline downtime. I also orchestrated pipelines end-to-end with Apache Airflow (DAG dependencies, retries, SLA monitoring), applied Parquet partitioning to reduce query time by 60%, and delivered Power BI dashboards tracking regional revenue, cohort retention, marketing ROAS, and payment health across all clients.
Before that, I designed a hybrid data warehouse following Inmon (3NF) and Kimball principles (Landing/Staging/Mart) and built ETL pipelines in PL/pgSQL with full Slowly Changing Dimension (SCD Type 2) logic. My experience spans Python/SQL ETL, data quality testing, and BI delivery (Power BI/Tableau), and I’ve also built production-ready application backends (e.g., Spring Boot + PostgreSQL) where clean data and reliable integrations matter.
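For readers unfamiliar with SCD Type 2, the idea is that a changed attribute closes the current dimension row and opens a new one, so history is preserved. The sketch below shows that idea in plain Python, not the PL/pgSQL used in the project; the row layout (`key`, `attrs`, `valid_from`, `valid_to`, `is_current`) and the function name `apply_scd2` are hypothetical.

```python
from datetime import date


def apply_scd2(dimension, incoming, today):
    """Apply an SCD Type 2 update to `dimension` in place and return it.

    dimension: list of row dicts (key, attrs, valid_from, valid_to, is_current)
    incoming:  dict mapping business key -> latest attribute dict
    """
    current = {row["key"]: row for row in dimension if row["is_current"]}
    for key, attrs in incoming.items():
        row = current.get(key)
        if row is not None and row["attrs"] == attrs:
            continue  # nothing changed, keep the current row open
        if row is not None:
            # Close the old version instead of overwriting it.
            row["valid_to"] = today
            row["is_current"] = False
        # Open a new current version for new or changed keys.
        dimension.append({
            "key": key,
            "attrs": attrs,
            "valid_from": today,
            "valid_to": None,
            "is_current": True,
        })
    return dimension


dim = []
apply_scd2(dim, {"C1": {"city": "Tashkent"}}, date(2024, 1, 1))
apply_scd2(dim, {"C1": {"city": "Samarkand"}}, date(2024, 6, 1))
apply_scd2(dim, {"C1": {"city": "Samarkand"}}, date(2024, 7, 1))  # no change
```

After these three calls `dim` holds two rows for `C1`: the closed Tashkent version and the open Samarkand version. The third call adds nothing because the attributes are unchanged.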
Experience
Work history, roles, and key accomplishments
Data Engineer
Ferret Labs
Nov 2025 - Jun 2026 (7 months)
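Built a lakehouse ingesting WooCommerce, Ads, and Payme/Click data into a Medallion architecture on S3-compatible storage with Delta Lake.
Ran a PySpark cluster over 5M+ rows for deduplication, currency normalization, and timezone fixes. Built the Gold layer in dbt (15 models, 50+ tests, CI/CD). Used Delta Lake time travel to recover from a schema incident with zero data loss.
Built a DuckDB/Postgres/PySpark pipeline.
Orchestrated pipelines with Airflow; Parquet partitioning cut query time by 60%; delivered Power BI dashboards.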
Designed a hybrid data warehouse using Inmon (3NF) and Kimball dimensional models and built ETL pipelines in PL/pgSQL, including SCD Type 2 logic for historical tracking. Created data quality test plans and delivered interactive Power BI dashboards for Adidas sales analytics.
Data Engineer (Part-time)
Cognilabs
Feb 2024 - May 2025 (1 year 3 months)
Built and maintained ETL pipelines using Python and SQL, optimizing reporting queries with indexing and query restructuring to improve performance. Developed Power BI and Tableau reports and automated recurring data cleaning workflows while collaborating with backend/AI teams to deliver structured datasets for ML training.
Data Engineering Intern
Cognilabs
Oct 2023 - Jan 2024 (3 months)
Supported data cleaning and ETL scripting in Python/SQL for client data projects and assisted with SQL queries for reporting pipelines and backend data feeds. Built foundational knowledge of database optimization and delivered BI work using Power BI and Tableau.
Education
Degrees, certifications, and relevant coursework
Gdańsk University of Technology
Bachelor of Science, Data Engineering
Grade: 4.5 / 5
Earned a B.Sc. in Data Engineering focused on big data solutions, data warehousing, database systems, and software engineering.
