Sandesh Phuyal
@sandeshphuyal
Experienced Data Engineer with expertise in big data and cloud technologies.
What I'm looking for
I am a seasoned Data Engineer with over 7 years of hands-on experience in building and optimizing data systems, specializing in big data and cloud technologies. My proficiency in leveraging platforms like Google Cloud Platform (GCP), AWS, and Azure has enabled me to develop efficient, high-performance data architectures and pipelines. I excel in designing and executing complex ETL pipelines using Apache Airflow, which enhances data transformation and integration processes.
Throughout my career, I have demonstrated a strong ability to deploy machine learning models into production, utilizing data to drive decision-making and business insights. My technical skills include advanced programming in Python, Java, SQL, and Scala, along with extensive experience in managing both SQL and NoSQL databases. I am committed to ongoing professional development and actively participate in professional organizations like ACM and IEEE to stay current with industry trends.
Experience
Work history, roles, and key accomplishments
Data Engineer
Pfizer Inc.
Dec 2022 - Present (2 years 5 months)
Led the design and implementation of a multi-terabyte data warehouse on AWS Redshift, improving query performance. Implemented real-time data processing systems using Apache Kafka and Spark, and collaborated with analysts to support machine learning algorithms. Enhanced data security protocols and automated infrastructure provisioning.
Data Engineer
ServiceNow
Mar 2020 - Jul 2022 (2 years 4 months)
Designed data integration solutions using ETL tools like Informatica and AWS Glue. Managed data storage and processing with AWS services, optimized Hadoop and Spark environments, and developed API integrations for real-time data ingestion. Created dashboards using BI tools for actionable insights.
Data Engineer
Chewy Inc.
Jan 2017 - Feb 2020 (3 years 1 month)
Architected a centralized data lake on AWS, integrating data from over 20 source systems. Led the adoption of microservices architecture and developed data transformation frameworks using PySpark. Established data quality checks and provided technical leadership in database design.
Education
Degrees, certifications, and relevant coursework
Southern New Hampshire University
Master of Science, Information Technology
Tech stack
Software and tools used professionally
Apache Spark
AWS Glue
Google Cloud Platform
AWS Step Functions
Bitbucket
Kubernetes
Jenkins
NumPy
Pandas
PySpark
DB
Sqoop
MySQL
PostgreSQL
MongoDB
Cassandra
Hadoop
HBase
Gmail
Node.js
Databricks
Terraform
AWS CloudFormation
Jira
Java
Kafka
Kibana
Linux
Windows
Google Cloud Dataflow
Elasticsearch
AWS Lambda
Serverless
Airflow
Apache Beam
Root Cause
s3-lambda
Google BigQuery
SQL
ServiceNow
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Sandesh?
You can contact Sandesh and 50k+ other talented remote workers on Himalayas.
Message SandeshFind your dream job
Sign up now and join over 85,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
