Aley Ban
@aleyban
Lead Data Engineer and Data Architect specializing in cloud AI/ML platforms.
What I'm looking for
I am a Lead Data Engineer and Data Architect with 10+ years designing and scaling cloud-native, AI-powered, analytics-ready data ecosystems across AWS, Azure, and GCP.
I have built Lakehouse and Data Mesh platforms and real-time streaming pipelines, and integrated LLMs and vector databases to drive fraud detection, predictive models, and cost-optimized ML workflows, delivering measurable reductions in latency and training time.
I lead governance, compliance (HIPAA, GDPR, SOC 2, PCI DSS), and observability efforts, mentor engineering teams, and champion self-service BI and modern data stacks to accelerate enterprise digital transformation.
Experience
Work history, roles, and key accomplishments
Architected enterprise Lakehouse and Data Mesh platforms (Databricks, Iceberg) supporting large-scale AI/ML workloads; built real-time pipelines that reduced critical trade latency by 65% and cut fraud detection times from hours to under a second via LLM and vector DB integrations.
Senior Data Engineer
BlackBird
Jul 2019 - Feb 2022 (2 years 7 months)
Designed enterprise warehouses on Snowflake, Redshift, and BigQuery that delivered 50% faster queries; built streaming analytics (Kafka + Spark Streaming) that reduced latency by 60%; and automated ETL/ELT with Airflow and dbt to cut manual intervention by 30%.
Built scalable ETL pipelines and optimized Postgres/MySQL performance to improve query speeds by 50%; developed BI dashboards for executive KPIs and enhanced data security and validation frameworks for regulated financial datasets.
Associate Data Engineer
ApTask
Sep 2015 - Aug 2017 (1 year 11 months)
Designed ETL workflows with Informatica and SQL Server to reduce pipeline runtime by 50% and built fraud-detection pipelines integrating early ML models to improve anomaly detection accuracy.
Education
Degrees, certifications, and relevant coursework
Amazon Web Services (certification)
Certification, Cloud Architecture
AWS Certified Solutions Architect – Professional certification listed among professional credentials.
Google Cloud (certification)
Certification, Data Engineering
Google Cloud Professional Data Engineer certification listed among professional credentials.
Microsoft (certification)
Certification, Cloud Data Engineering
Microsoft Azure Data Engineer Associate certification listed among professional credentials.
Databricks (certification)
Certification, Data Engineering
Databricks Certified Data Engineer credential listed among professional certifications.
Snowflake (certification)
Certification, Cloud Data Platform
SnowPro Certified credential listed among professional certifications.
Tech stack
Software and tools used professionally
Matillion
Talend
Superset
Metabase
D3.js
Kubernetes
Jenkins
NumPy
Pandas
PySpark
Debezium
dbt
MySQL
PostgreSQL
MongoDB
Cassandra
Gmail
Databricks
Neo4j
Terraform
Java
TensorFlow
PyTorch
MLflow
Kubeflow
Kafka
FastAPI
Grafana
Prometheus
OpenTelemetry
Datadog
Elasticsearch
Ansible
Airflow
SQL
Dagster
LangChain
Weaviate
Pinecone
DataHub
OpenMetadata
ArgoCD
Amundsen
Collibra
