TEJA MULE
@tejamule
Data Engineer with 4+ years of experience building scalable ETL pipelines and cloud data architectures.
What I'm looking for
I’m a Data Engineer with 4+ years of experience designing, building, and optimizing data pipelines and architectures for large-scale data processing. I use modern data engineering tools and cloud platforms (AWS, Azure, GCP) to enable data-driven decision-making.
At BMO Bank, I designed and optimized pipelines using Azure Data Factory and Azure Databricks, integrating core banking, investment, and customer data to support real-time analytics. I built event-driven architectures with Apache Kafka and Azure Synapse Analytics for transaction processing, fraud detection, and regulatory compliance reporting.
I also orchestrate large-scale workflows with Apache Airflow and build interactive dashboards using Power BI and Tableau for KPIs like loan performance, credit utilization, and customer churn. To keep pipelines reliable, I’ve built data quality frameworks with validation rules, anomaly detection, reconciliation checks, and automated alerts.
Previously at Ericsson, I engineered AWS-based pipelines using AWS Glue, AWS Lambda, and Amazon S3, and provisioned infrastructure as code with AWS CloudFormation and Terraform. In my earlier role at Kotak Mahindra Bank, I used Apache Spark (PySpark/Scala) and Kafka-based ingestion to support risk assessment, regulatory reporting, and real-time analytics, and delivered secure, automated deployments with CI/CD and containerized microservices.
Experience
Work history, roles, and key accomplishments
Designed and optimized Azure Data Factory and Azure Databricks pipelines for real-time banking analytics, enabling credit scoring and risk management use cases. Built event-driven Kafka/Synapse architectures, orchestrated workflows with Airflow, and delivered Power BI and Tableau dashboards with integrated ML insights.
Built and optimized AWS Glue, Lambda, and S3 pipelines for large-scale telecom and customer data to support real-time analytics and compliance reporting. Developed Redshift/Snowflake/Aurora data models and warehouses and automated ETL orchestration with Step Functions and Airflow for reliable, scalable processing.
Designed and optimized scalable Spark-based data pipelines in Azure Synapse using PySpark and Scala to transform policyholder, claims, and premium datasets for risk assessment and regulatory reporting. Implemented batch and real-time ingestion with Kafka/Event Hubs, Sqoop, and Flume, and supported high-throughput querying across MongoDB and Cassandra.
Education
Degrees, certifications, and relevant coursework
University of North Texas
Master's in Computer Science
2023 - 2025
Completed a Master's in Computer Science at the University of North Texas from 2023 to 2025.
Tech stack
Software and tools used professionally
Amazon Redshift
Azure Synapse
Apache Spark
AWS Glue
Apache Flink
Apache Hive
Talend
Amazon S3
AWS Step Functions
GitHub
GitLab
Kubernetes
Jenkins
CircleCI
GitLab CI
PySpark
AWS Data Pipeline
Sqoop
MySQL
PostgreSQL
MongoDB
Cassandra
Hadoop
HBase
Gmail
Yarn
Databricks
Microsoft Teams
Terraform
AWS CloudFormation
Azure DevOps
Jira
Java
JSON
Logstash
Apache Flume
Kafka
Apache NiFi
FastAPI
Kibana
Zookeeper
Linux
Windows
Elasticsearch
AWS Lambda
Amazon RDS
Amazon Aurora
Kafka Streams
Airflow
Time Analytics
SQL
GitHub Copilot
Bash
Factory
