Hudson Silva
@hudsonsilva
Senior AI Data Engineer building AI-ready Snowflake/Databricks data platforms.
What I'm looking for
Senior Data Engineer with 9+ years architecting AI-ready data platforms on Snowflake and Databricks. Built Lakehouse/DW from zero serving 600M+ users and powering AI agents at billions-of-events scale on AWS/Azure/GCP, with consistent 25-50% cost cuts via Spark optimization and Medallion. Currently building internal LLM chatbots (Databricks Genie + Snowflake Cortex), bridging data engineering with the GenAI stack — production RAG (hybrid search, reranking, RAGAS) and multi-agent orchestration (LangGraph, CrewAI, MCP). Technical leader: mentored 6+ engineers (2 promoted), led 5+ member teams, presented architecture to C-level.
Experience
Work history, roles, and key accomplishments
Senior AI Data Engineer
Enterprise Data Platform Consulting
Sep 2025 - Present (8 months)
Migrated 50+ Oracle DW tables (100+ TB) to Snowflake, refactoring pipelines to cut critical runtime from 8 hours to <10 minutes (48x faster). Architected RAG and agentic workflows over the Lakehouse (LangGraph, CrewAI) with RAGAS evaluation and Langfuse tracing, and built internal Databricks Genie LLM chatbots for operational data access.
Senior AI Data Engineer (Consultant)
Databool Consulting
Oct 2020 - Present (5 years 7 months)
Provided senior AI/data engineering placements across enterprise clients in energy, telecom, mining, semiconductors, streaming, and fintech, delivering Snowflake/Databricks platform work as part of consulting engagements.
Lead AI Data Engineer
Confidential Client
Mar 2025 - Aug 2025 (5 months)
Built a real-time streaming platform with Spark Structured Streaming in Databricks to feed AI recommendation agents for a 600M+ user social/video platform. Reduced incident resolution time by 40% using custom observability and cut compute costs by 25% via Spark query optimization, partitioning, and caching.
Improved legal-department query performance by 40% through Snowflake/Databricks tuning, enabling faster compliance workflows. Built and maintained 15+ production pipelines on Azure Data Factory + Databricks, cutting incident response time by 50%, and led 5+ engineers to align governance, data masking, and multi-jurisdiction compliance.
DataOps Engineer
BEES
Oct 2023 - May 2024 (7 months)
Optimized Databricks/Snowflake data pipelines for +40% efficiency and -15% cost, resolving Azure/Databricks and MongoDB bottlenecks. Reduced response time by 30% and processing time by 20% through targeted performance fixes.
Senior Data Engineer
Vale S.A.
May 2023 - Oct 2023 (5 months)
Designed a Snowflake data warehouse with dbt and CI/CD on Azure DevOps, replacing the legacy system and processing 150+ TB/day for global mining reporting and compliance. Implemented automated data quality validation to ensure cross-system consistency for critical financial datasets.
Senior Data Engineer (GCP)
Semantix Corp
Oct 2022 - Apr 2023 (6 months)
Optimized GCP queries using Hive/Trino for a 25% performance increase. Introduced Kubernetes orchestration to cut deployment time by 40% and led adoption of new data technologies, improving system performance by 40%.
Senior Cloud Data Engineer
Multiple Clients
Oct 2020 - Sep 2022 (1 year 11 months)
Led Azure-first cloud platform architecture supporting 200% data growth and reduced storage costs by 30% via data-lake optimization. Migrated Pandas pipelines to PySpark (-40% processing time), reduced data latency by 50%, and implemented governance/security (GDPR/LGPD) with +25% accuracy and -40% integration errors.
Education
Degrees, certifications, and relevant coursework
ENEB
Master's degree, Big Data & Business Intelligence
2025 - 2027
Master's program in Big Data & Business Intelligence at ENEB (2025–2027, expected).
Descomplica
Postgraduate, Data Science
2023 - 2024
Postgraduate program in Data Science at Descomplica (2023–2024).
XP Educação
MBA, Data Engineering
2022 - 2023
MBA in Data Engineering at XP Educação (2022–2023).
UFES
Bachelor's degree, Computer Engineering
2008 - 2016
Bachelor's degree in Computer Engineering at UFES (2008–2016).
Tech stack
Software and tools used professionally
Apache Spark
GitHub
Kubernetes
Pandas
PySpark
dbt
MySQL
PostgreSQL
MongoDB
Cassandra
Hadoop
Gmail
Databricks
Redis
Terraform
Azure DevOps
Python
Kafka
FastAPI
Gemini
pytest
Airflow
Hudson
GuardRails
SQL
Clickhouse
Apache Iceberg
Qdrant
LangChain
LlamaIndex
Ollama
Pydantic
Pinecone
CrewAI
DataHub
Delta Lake
OpenAI API
Anthropic Claude API
Google Gemini API
OpenMetadata
DeepEval
Trino
Langfuse
Ragas
Agentic
Vale
LangGraph
LangSmith
Deequ
uv
Docling
Unity Catalog
Factory
Remote
Availability
Location
Authorized to work in
Portfolio
github.com/silvahudsonSalary expectations
Social media
Job categories
Skills
Interested in hiring Hudson?
You can contact Hudson and 90k+ other talented remote workers on Himalayas.
Message HudsonFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
