Allen M
@allenm1
Senior Data Engineer specializing in cloud-native, real-time data platforms and scalable AI/ML pipelines.
What I'm looking for
I am a Senior Data Engineer with 9+ years of experience building scalable, cloud-native data solutions across AWS, Azure, and GCP, specializing in ETL/ELT, real-time streaming, and modern data architectures.
I have designed and delivered large-scale pipelines and data warehouses (processing terabytes daily), implemented observability and data quality frameworks (Great Expectations, Deequ), and integrated MLOps for predictive analytics and RAG workflows.
I prioritize secure, compliant platforms (HIPAA, SOC 2, GDPR) and cost-optimized infrastructure, using Terraform, Kubernetes, and CI/CD to drive measurable business outcomes and operational efficiency.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Labelbox
Dec 2022 - Present (2 years 10 months)
Designed and implemented ETL/ELT pipelines processing 5TB daily and built real-time streaming applications that reduced incident response time by 60%, while ensuring 99.99% uptime for production data systems.
Cloud Data Engineer
BlueLight
Oct 2019 - Nov 2022 (3 years 1 month)
Architected hybrid cloud data platforms unifying operational and patient data, built high-throughput pipelines enabling near-real-time patient alerting, and implemented data quality frameworks to maintain HIPAA and SOC 2 compliance.
Data Infrastructure Engineer
Teza Technologies
Jul 2016 - Aug 2019 (3 years 1 month)
Built and maintained CI/CD pipelines and deployed trading data pipelines on cloud platforms to ensure high availability and low latency, automating infrastructure provisioning and improving delivery speed for financial data systems.
Education
Degrees, certifications, and relevant coursework
Unknown Institution
Bachelor of Science, Computer Science
Bachelor's degree in Computer Science; coursework and training focused on software engineering, data systems, and algorithms.
Tech stack
Software and tools used professionally
OpenAPI
Matillion
Splunk
Azure Synapse
Apache Spark
AWS Glue
Apache Flink
Druid
Talend
Superset
Amazon QuickSight
D3.js
Chart.js
Highcharts
GitHub
GitLab
Bitbucket
Kubernetes
Jenkins
GitHub Actions
GitLab CI
dbt
DB
MySQL
PostgreSQL
MongoDB
Cassandra
Gmail
Node.js
Django
Spring Boot
Next.js
Mixpanel
Neo4j
Redis
Terraform
Azure DevOps
JavaScript
HTML5
Java
JSON
PowerShell
XML
TensorFlow
PyTorch
MLflow
scikit-learn
Kubeflow
Neptune
Kafka
FastAPI
Grafana
Prometheus
OpenTelemetry
Azure Monitor
Datadog
GraphQL
gRPC
Milvus
Avro
Ansible
AWS Lambda
Serverless
Vercel
Kafka Streams
Redpanda
OAuth2
Airflow
Time Analytics
s3-lambda
SQL
Hugging Face
AWS KMS
ClickHouse
Dagster
Apache Iceberg
Qdrant
Weaviate
Meltano
Evidently AI
Pinecone
WhyLabs
Monte Carlo
Feast
Delta Lake
Great Expectations
ArgoCD
Privacera
Collibra
Availability
Location
Authorized to work in
Website
allen-sigma.vercel.app
Job categories
Skills
