Tvisha Patel
@tvishapatel
Senior Data Engineer with expertise in cloud-based data solutions.
What I'm looking for
I am a seasoned Data Engineering professional with over 6 years of experience designing, developing, and optimizing large-scale data solutions. My expertise spans the Hadoop ecosystem, Apache Spark, Kafka, and cloud platforms including AWS, GCP, and Azure. I have a proven track record of building robust ETL pipelines and real-time streaming applications that improve data accessibility and business insight.
Throughout my career, I have successfully implemented data architectures that leverage advanced analytics and machine learning, resulting in significant revenue increases for my employers. My hands-on experience with tools such as Informatica, Talend, and Azure Data Factory, combined with my proficiency in SQL and Python, allows me to deliver high-quality data solutions that meet complex business needs. I am passionate about data governance and have integrated best practices in data quality and security across various projects.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer - Azure
Walmart
Apr 2023 - Present (2 years 3 months)
Designed and implemented a Personalized Customer Recommendation System, integrating advanced data collection, processing, and analytics techniques, enhancing customer engagement through tailored recommendations. Developed and maintained end-to-end ETL pipelines in Azure Data Factory (ADF), efficiently handling large-scale structured and unstructured data from both streaming and batch sources. Depl
Data Engineer
TCS
Nov 2020 - Present (4 years 8 months)
Developed and optimized Spark/PySpark-based ETL pipelines for seamless data migration into an enterprise Hadoop Data Lake, implementing partitioning, broadcast joins, and performance tuning. Designed and implemented AWS-based data architecture, integrating AWS Glue, AWS EMR, AWS Lambda, Step Functions, and S3 to automate ETL processes. Extracted, transformed, and loaded data into Azure Data Lake,
AWS Data Engineer
Synechron
Feb 2019 - Present (6 years 5 months)
Designed and developed Spark/PySpark-based ETL pipelines for seamless data migration into an enterprise Hadoop Data Lake, optimizing performance with partitioning, Spark SQL, and broadcast joins. Built and maintained scalable data pipelines using Apache Spark on AWS EMR, integrating structured and semi-structured data into Hadoop and RDBMS environments. Engineered Snowflake data warehouse solution
Education
Degrees, certifications, and relevant coursework
JNTU
Bachelor's, Computer Science and Engineering
Completed a Bachelor's degree in Computer Science and Engineering. The curriculum covered fundamental concepts and advanced topics in the field, preparing me for a career in technology.
Tech stack
Software and tools used professionally
Amazon Redshift
Azure Synapse
Apache Spark
AWS Glue
Apache Flink
Apache Hive
Talend
AtScale
SAS
QlikView
Microsoft Azure
Amazon S3
Azure Storage
Kubernetes
Jenkins
NumPy
Pandas
PySpark
AWS Data Pipeline
DB
Sqoop
MySQL
PostgreSQL
Cassandra
Hadoop
HBase
IBM DB2
Sybase
Vertica
Gmail
Django
Yarn
Databricks
Jira
Java
JSON
PowerShell
MATLAB
Logstash
Azure Machine Learning
TensorFlow
PyTorch
scikit-learn
Kafka
RabbitMQ
Kibana
Zookeeper
Linux
Windows
Windows Server
Windows 10
Azure Active Directory
GraphQL
Elasticsearch
Avro
AWS Lambda
pytest
BeautifulSoup
Airflow
Apache Oozie
Time Analytics
Google BigQuery
Amazon Athena
SQL
Azure Blob Storage
SciPy
