Mohan k
@mohank
Experienced Senior Data Engineer with expertise in Big Data solutions.
What I'm looking for
I am a seasoned IT professional with over 12 years of experience, including more than 7 years specializing in Big Data engineering across both on-premises and cloud platforms. My journey has equipped me with a deep understanding of the full software development lifecycle, enabling me to deliver complex, high-quality solutions on time and within budget. As a certified Solutions Architect Associate on AWS and Azure, I have a proven track record of designing, building, and optimizing data pipelines in multi-cloud environments.
Throughout my career, I have honed my skills in data ingestion, ETL pipelines, data modeling, and machine learning integration. My technical expertise includes proficiency in Spark, PySpark, Python, Scala, SQL, and various orchestration tools like Airflow and Azure Data Factory. I thrive in agile environments and have successfully led cross-functional teams while also excelling as an individual contributor. My commitment to driving data innovation and operational efficiency has consistently resulted in strategic insights that align with business needs.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Fugetron IT
Jun 2024 - Present (1 year 1 month)
Architected and led the development of end-to-end ETL/ELT pipelines following the Medallion Architecture (Bronze, Silver, Gold) in Azure Databricks. Built ingestion workflows in Azure Data Factory integrating on-premise MySQL, Oracle, and various flat-file feeds into Azure Blob Storage. Transformed and curated large-scale datasets using PySpark on Azure Databricks, implementing partitioning and op
Senior Data Engineer
EPAM Systems
May 2023 - Mar 2024 (10 months)
Architected and implemented data ingestion pipelines from on-premise Oracle and PostgreSQL databases to AWS S3, ensuring continuous and scalable data movement. Developed a robust Python-based jet parser on EC2 (Red Hat Linux) for 24/7 ingestion of XML and JSON files, leveraging auto-scaling and bash scripting for reliability. Built ETL processes using AWS Glue and PySpark to load and transform dat
Senior Data Engineer
Galax E Solutions
Nov 2021 - Mar 2023 (1 year 4 months)
Developed PySpark applications for JSON to DataFrame conversions and element-level comparisons, optimizing healthcare data transformation. Automated data ingestion and transformations using Azure Data Factory, ensuring reliable scheduling and orchestration of large-scale healthcare datasets. Executed end-to-end migrations from Oracle to Azure SQL Database and Cosmos DB, including data type convers
Data Engineer / AI Specialist
Ovato Technologies
Dec 2019 - Mar 2021 (1 year 3 months)
Migrated on-premise Oracle data to Cloud Storage using Python scripts and Storage Transfer Service, ensuring schema conversions and data consistency. Designed and orchestrated scalable ETL workflows in Dataflow for streaming and batch processing of large marketing datasets. Developed PySpark jobs on Dataproc to transform and analyze data stored in Cloud Storage, implementing best practices for job
Data Engineer
Optimum Info Systems
Jul 2019 - Nov 2019 (4 months)
Designed and architected scalable data processing pipelines on AWS (EMR, Redshift, Athena, Glue) based on business requirements. Ingested data from diverse sources (HDFS, RDBMS) using Sqoop and custom ETL scripts, loading into S3 and Redshift. Developed Spark/PySpark workflows on EMR for data cleansing, enrichment, and transformation, ensuring high performance and low latency.
Data Engineer
Infosys Technologies
Aug 2017 - Jul 2019 (1 year 11 months)
Analyzed business requirements for telematics data ingestion and real-time analytics, collaborating with stakeholders to shape a robust solution. Architected ingestion pipelines using Pub/Sub and Dataflow for streaming telematics data into BigQuery. Executed large-scale batch jobs on Dataproc (Hive, Spark, Sqoop) to transform data from on-premise systems into GCP.
Data Engineer
Infosys Technologies
May 2015 - Jul 2017 (2 years 2 months)
Implemented ETL workflows using AWS Glue to move data from HDFS to S3, ensuring data quality and security throughout. Migrated Hive-based queries to EMR and Spark for scalable data transformations, reducing processing times significantly. Built Lambda functions and Step Functions for event-driven data pipelines, enabling automated, near real-time risk alerts.
SAP BO Senior Developer
Infosys Technologies
Oct 2012 - Apr 2015 (2 years 6 months)
Designed and developed Web Intelligence reports, including sub-reports, hierarchies, filters, and prompts for advanced user interactions. Built User-Defined Objects and Report Variables (e.g., @AggregateAware) to generate summary-level and drill-down views. Optimized BO reports for performance, implementing best practices in universe design and database connectivity.
Education
Degrees, certifications, and relevant coursework
JNT University, Hyderabad
B.Tech, Electronics & Communication Engineering
Completed a Bachelor of Technology in Electronics & Communication Engineering. This program provided a strong foundation in engineering principles and problem-solving.
Tech stack
Software and tools used professionally
Azure HDInsight
Azure Synapse
AWS Glue
Google Cloud Platform
Stackdriver
AWS Step Functions
GitHub
Bitbucket
Kubernetes
Azure Kubernetes Service
Jenkins
PySpark
DB
Sqoop
MySQL
PostgreSQL
MongoDB
Hadoop
HBase
Gmail
Databricks
Terraform
Azure DevOps
JSON
XML
Kafka
Azure Monitor
Linux
Windows
Azure SQL Database
Airflow
Time Analytics
SQL
Azure Cosmos DB
Azure Blob Storage
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Mohan?
You can contact Mohan and 90k+ other talented remote workers on Himalayas.
Message MohanFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
