Julian Smith
@juliansmith1
Senior Data Engineer with expertise in scalable data solutions.
What I'm looking for
I am a Senior Data Engineer with over 9 years of experience in building scalable data pipelines, cloud architectures, and machine learning solutions. I have successfully managed over 5 petabytes of data, optimizing real-time analytics and driving revenue growth through data-driven strategies. My expertise spans AWS, Azure, Python, C++, SQL, and Agile methodologies, with a strong focus on security compliance and automation.
My passion lies in data democratization, making complex data accessible and actionable for decision-makers. As a leader, I foster collaboration and innovation, empowering teams to push boundaries in data science and AI. I thrive on solving complex challenges and turning data into a powerful asset for business success, as evidenced by my work at Stripe, where I designed and optimized data pipelines that processed over 10TB of data monthly.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Stripe
Jan 2023 - Present (2 years 5 months)
Designed and optimized data pipelines processing 10TB+ of data monthly using Apache NiFi, Python, and Apache Kafka, enabling real-time fraud detection and analytics. Architected scalable data infrastructure in AWS (S3, EC2, RDS, Redshift) and Databricks, reducing query response times by 50% and supporting 520+ active analysts. Developed interactive dashboards and reporting solutions in Tableau and
Data Integration Engineer
Liga Data
Dec 2021 - Jan 2023 (1 year 1 month)
Optimized large-scale distributed systems by supporting operations teams in capacity planning, performance tuning, and resource allocation, improving system efficiency by 30%. Designed and implemented scalable data architectures to store, process, and retrieve high-volume CDR event streams, handling billions of records daily with Apache Kafka, Spark, and HDFS. Developed ETL pipelines and real-time
Data Engineer
Accenture PLC
Jun 2017 - Dec 2021 (4 years 6 months)
Designed and implemented scalable data management platforms for enterprise data warehousing and advanced analytics, leveraging AWS Redshift, Snowflake, and Google BigQuery. Developed real-time analytics pipelines using Apache Kafka, Spark Streaming, and Flink, enabling faster decision-making and reducing data latency by 60%. Built and optimized ETL workflows using SQL, Apache Spark, and Python, im
Data Scientist
Innowatts
Oct 2015 - Jun 2017 (1 year 8 months)
Automated data pipelines using Python, Apache Airflow, and SQL, reducing manual data processing time by 70% and ensuring seamless integration with cloud-based data warehouses (AWS Redshift, Snowflake). Designed and optimized machine learning models for customer segmentation and predictive analytics, leveraging scikit-learn, XGBoost, and TensorFlow, improving model accuracy by 15%. Developed intera
Education
Degrees, certifications, and relevant coursework
University of Texas - Austin
Bachelor of Science, Computer Science
Tech stack
Software and tools used professionally
Splunk
Apache Spark
AWS Glue
Talend
Data Studio
AWS IAM
Google Cloud Platform
Amazon S3
Google Cloud Storage
AWS Step Functions
GitHub
Kubernetes
Jenkins
GitHub Actions
Pandas
PostgreSQL
MongoDB
Cassandra
Hadoop
Databricks
Terraform
Java
TensorFlow
PyTorch
scikit-learn
Kafka
Apache NiFi
Grafana
Prometheus
Datadog
Google Cloud Dataflow
Elasticsearch
Azure Security Center
AWS Lambda
Airflow
Time Analytics
Google BigQuery
SQL
Azure Blob Storage
XGBoost
Availability
Location
Authorized to work in
Skills
Interested in hiring Julian?
You can contact Julian and 90k+ other talented remote workers on Himalayas.
Message JulianFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
