Open to opportunities

Anthony S

@anthonys

Senior Data Engineer with expertise in scalable data solutions.

United States
What I'm looking for

I am looking for a role that fosters innovation and collaboration, where I can apply my data engineering skills to build impactful solutions as part of a forward-thinking team.

As a Senior Data Engineer with over 8 years of experience, I specialize in building scalable, reliable, and performant data solutions across various domains including retail, social media, and healthcare. My expertise lies in designing both real-time and batch data pipelines using modern technologies such as Apache Spark, Databricks, Snowflake, AWS, and GCP. I have a proven track record of optimizing data infrastructure, significantly reducing costs and latency while enhancing data quality and integrity.

At Walmart Global Tech, I architected a scalable data pipeline that improved reporting latency by 40% for supply chain analytics. I also developed automated data quality validation frameworks that reduced manual audits by over 60%. At Twitter (X), I engineered high-performance data pipelines that improved data ingestion latency by 35% and migrated legacy systems to cloud-native infrastructure, reducing costs by 30%. I am passionate about leveraging emerging AI and ML technologies to drive innovative, data-driven solutions.

Experience

Work history, roles, and key accomplishments

Senior Data Engineer

Walmart Global Tech

Mar 2023 - Present (2 years 4 months)

Architected a scalable data pipeline using Apache Spark (PySpark) on Databricks and AWS Glue, enabling real-time processing of over 5 TB/day of transactional data, which improved reporting latency by 40% for supply chain analytics. Designed and implemented a Delta Lake-based data lakehouse architecture on AWS S3, leveraging Apache Hudi and Databricks SQL, significantly enhancing data freshness and

Senior Data Engineer

Twitter (X)

May 2020 - Feb 2023 (2 years 9 months)

Engineered high-performance data pipelines leveraging Apache Kafka, Flink, and Apache Spark, enabling near real-time analytics and improving data ingestion latency by 35% across global tweet streams. Migrated legacy Hadoop clusters to cloud-native infrastructure on Google Cloud Platform (GCP), utilizing BigQuery, Cloud Composer (Airflow), and Dataflow, resulting in enhanced scalability and a 30% r

Data Engineer

CVS Health

Oct 2017 - Apr 2020 (2 years 6 months)

Built ETL pipelines using Apache Spark, Hive, and Informatica to efficiently process large-scale healthcare datasets, enabling analytics on pharmacy claims and customer behavior. Implemented on-premises data warehousing solutions using Teradata and Oracle, with optimized data models that improved query performance and reduced analytical report generation times by 25%.

Education

Degrees, certifications, and relevant coursework

The University of Kansas

Master's Degree, Computer Science

Completed a Master's Degree in Computer Science. The program provided advanced knowledge and skills in various areas of computer science.
