Open to opportunities

Julian Smith

@juliansmith1

Senior Data Engineer with expertise in scalable data solutions.

United States

What I'm looking for

I am looking for a role that fosters innovation and collaboration, where I can leverage my data engineering skills to drive impactful business decisions and contribute to a forward-thinking team.

I am a Senior Data Engineer with over 9 years of experience in building scalable data pipelines, cloud architectures, and machine learning solutions. I have successfully managed over 5 petabytes of data, optimizing real-time analytics and driving revenue growth through data-driven strategies. My expertise spans AWS, Azure, Python, C++, SQL, and Agile methodologies, with a strong focus on security compliance and automation.

My passion lies in data democratization: making complex data accessible and actionable for decision-makers. As a leader, I foster collaboration and innovation, empowering teams to push boundaries in data science and AI. I thrive on solving complex challenges and turning data into a powerful asset for business success. At Stripe, for example, I designed and optimized data pipelines that processed over 10 TB of data monthly.

Experience

Work history, roles, and key accomplishments

Current

Senior Data Engineer

Stripe

Jan 2023 - Present (3 years 6 months)

Designed and optimized data pipelines processing 10TB+ of data monthly using Apache NiFi, Python, and Apache Kafka, enabling real-time fraud detection and analytics. Architected scalable data infrastructure in AWS (S3, EC2, RDS, Redshift) and Databricks, reducing query response times by 50% and supporting 520+ active analysts. Developed interactive dashboards and reporting solutions in Tableau and

Apache NiFi, Python, Apache Kafka, AWS, Databricks, Tableau, Power BI, Informatica, scikit-learn, Apache Airflow

Data Integration Engineer

Liga Data

Dec 2021 - Jan 2023 (1 year 1 month)

Optimized large-scale distributed systems by supporting operations teams in capacity planning, performance tuning, and resource allocation, improving system efficiency by 30%. Designed and implemented scalable data architectures to store, process, and retrieve high-volume CDR event streams, handling billions of records daily with Apache Kafka, Spark, and HDFS. Developed ETL pipelines and real-time

Apache Kafka, Spark, HDFS, Python, SQL, ETL, Grafana, Prometheus, Power BI, Tableau

Data Engineer

Accenture PLC

Jun 2017 - Dec 2021 (4 years 6 months)

Designed and implemented scalable data management platforms for enterprise data warehousing and advanced analytics, leveraging AWS Redshift, Snowflake, and Google BigQuery. Developed real-time analytics pipelines using Apache Kafka, Spark Streaming, and Flink, enabling faster decision-making and reducing data latency by 60%. Built and optimized ETL workflows using SQL, Apache Spark, and Python, im

AWS Redshift, Snowflake, Google BigQuery, Apache Kafka, Flink, SQL, Python, ETL, Tableau

Data Scientist

Innowatts

Oct 2015 - Jun 2017 (1 year 8 months)

Automated data pipelines using Python, Apache Airflow, and SQL, reducing manual data processing time by 70% and ensuring seamless integration with cloud-based data warehouses (AWS Redshift, Snowflake). Designed and optimized machine learning models for customer segmentation and predictive analytics, leveraging scikit-learn, XGBoost, and TensorFlow, improving model accuracy by 15%. Developed intera