James Malik
@jamesmalik
I am a Senior Data Engineer focused on scalable cloud data platforms and ETL.
What I'm looking for
I am a Senior Data Engineer with over 10 years of experience designing, building, and optimizing large-scale data infrastructure across cloud and hybrid environments. I specialize in architecting end-to-end ETL and streaming pipelines using Apache Spark, Kafka, Airflow, dbt, and modern data warehouse technologies.
At recent roles I architected cloud-native platforms for AI/ML workloads, modernized legacy pipelines to serverless AWS Glue and Step Functions, and built ETL systems processing over 5TB daily, leading migrations to BigQuery and reducing infrastructure costs substantially. I implemented observability and governance using OpenTelemetry, CloudWatch, Great Expectations, and data cataloging tools while championing data mesh principles.
I lead cross-functional teams, mentor engineers, and drive CI/CD and IaC best practices with Terraform, Docker, and GitHub Actions to deliver reliable, cost-effective data platforms. I am motivated to build scalable systems that enable analytics and ML, while improving data quality, lineage, and operational excellence.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Google Cloud
Jan 2019 - Dec 2020 (1 year 11 months)
Spearheaded enterprise-scale ETL pipelines on GCP using Dataflow, BigQuery, and Pub/Sub, delivering unified batch and streaming ingestion frameworks for global product analytics.
Education
Degrees, certifications, and relevant coursework
National University of Sciences and Technology
Bachelor of Science, Computer Science
Earned a Bachelor of Science in Computer Science from the National University of Sciences and Technology.
Tech stack
Software and tools used professionally
Apache Spark
AWS Glue
Apache Flink
Apache Hive
Talend
Superset
GitHub
GitLab
Bitbucket
ESLint
Kubernetes
Docker Compose
Jenkins
CircleCI
GitHub Actions
Jupyter
PySpark
dbt
MySQL
PostgreSQL
MongoDB
Hadoop
HBase
Gmail
Databricks
pre-commit
Terraform
AWS CloudFormation
Pulumi
Java
MLflow
Kafka
Grafana
Prometheus
OpenTelemetry
Datadog
Serverless
Kafka Streams
Airflow
Apache Beam
Luigi
Microsoft Power BI
SQL
Amazon SageMaker
AWS KMS
Dagster
Apache Iceberg
DataHub
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring James?
You can contact James and 90k+ other talented remote workers on Himalayas.
Message JamesFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
