Jordan Wright
@jordanwright
Senior Data Engineer with expertise in AI and data infrastructure.
What I'm looking for
I am a Senior Data Engineer and Data Scientist with over 10 years of experience in designing high-performance data infrastructure and AI/ML systems across various domains, including healthcare analytics and financial data processing. My expertise lies in distributed computing, data pipeline design, and workflow orchestration, utilizing tools such as Apache Spark, Kafka, and Airflow to implement scalable data processing frameworks.
Throughout my career, I have developed a strong proficiency in Python, R, Scala, and SQL, alongside modern cloud platforms like AWS and GCP. I specialize in machine learning system design, including NLP pipelines and model serving, and have successfully engineered modular ML pipelines and ETL workflows that support real-time and batch processing systems. My commitment to data modeling, feature extraction, and reproducibility ensures that I deliver high-quality, reliable data solutions.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer (AI & ML)
Flatiron Health, Inc.
Oct 2021 - Present (3 years 10 months)
Designed and maintained ETL/ELT pipelines using Apache Spark (Scala/Python) on Databricks, standardizing oncology EHR and lab data from systems. Engineered modular ML pipelines using PyTorch Lightning, TensorFlow/Keras, and Hugging Face Transformers for extracting high-dimensional oncology phenotypes from unstructured clinical text.
Data Scientist (Healthcare & Financial AI)
Bain & Company
Nov 2017 - Present (7 years 9 months)
Designed and deployed ML pipelines using Python, scikit-learn, and TensorFlow for healthcare and financial clients to forecast risk scores, patient churn, and fraud likelihood across multimodal datasets. Engineered NLP models to extract features from unstructured clinical notes and financial documents using spaCy, Transformers, and custom entity recognition pipelines.
Data Engineer (Big Data Systems)
Verisk Analytics
Feb 2015 - Present (10 years 6 months)
Migrated legacy ETL frameworks to run on distributed compute infrastructure, improving scalability, fault tolerance, and reducing job latency. Developed and deployed Python-based microservices using Flask, exposing fraud detection and underwriting risk scores via RESTful APIs.
Education
Degrees, certifications, and relevant coursework
Texas Tech University
Master of Science, Computer Software Engineering
Completed a Master of Science in Computer Software Engineering, focusing on advanced topics in software development and engineering principles. Gained expertise in designing and implementing complex software systems.
Texas Tech University
Bachelor of Science, Computer Science
Obtained a Bachelor of Science in Computer Science, building a strong foundation in computer science fundamentals. Developed skills in programming, algorithms, and data structures.
Tech stack
Software and tools used professionally
Azure Synapse
Apache Spark
AWS Glue
Apache Flink
Dialogflow
AWS Step Functions
GitLab
Kubernetes
Jenkins
PySpark
dbt
MySQL
PostgreSQL
Hadoop
Gmail
Rollout
Django
Databricks
Neo4j
OpenCV
Terraform
JavaScript
Java
TensorFlow
PyTorch
MLflow
scikit-learn
Keras
Kubeflow
Kafka
GraphQL
Google Cloud Dataflow
AWS Lambda
Serverless
Azure Functions
Airflow
SQL
Amazon SageMaker
XGBoost
Hugging Face
Apache Iceberg
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Jordan?
You can contact Jordan and 90k+ other talented remote workers on Himalayas.
Message JordanFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
