Rajkumar Vasupari
@rajkumarvasupari
Senior Data Engineer specializing in cloud-native, streaming-first architectures and scalable lakehouse solutions.
What I'm looking for
I am a Senior Data Engineer with 7+ years of experience building cloud-native, large-scale, real-time data platforms across AWS, Azure, and GCP. I design streaming-first, low-latency architectures that enable Customer 360, IoT telemetry, and analytics at scale.
I've implemented high-throughput streaming pipelines using Kafka, Spark, Flink, and PySpark, and built ELT workflows with Databricks, dbt, and Delta Lake to support analytics and ML/GenAI use cases. I also develop microservices with Spring Boot and automate pipelines using Airflow, CI/CD, Docker, and Kubernetes.
I lead data governance, quality, and observability initiatives using Great Expectations, Apache Atlas, Prometheus, Grafana, and CloudWatch to ensure reliable, governed datasets for analytics and model consumption. I collaborate closely with analytics, CRM, and product teams to deliver real-time personalization and operational dashboards.
I mentor junior engineers and drive enterprise modernization by promoting DataOps practices, secure cloud adoption, and domain-oriented data mesh designs that enable distributed ownership and scalable lakehouse implementations.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Santander Bank
Sep 2023 - Present (2 years 6 months)
Designed and implemented a real-time lakehouse and streaming architecture unifying customer data across channels, enabling personalized insights and APIs while handling millions of events per day with low latency.
Data Engineer
The Williams Companies
Jun 2020 - Aug 2023 (3 years 2 months)
Built cloud-native telemetry ingestion and analytics platform for SCADA/IoT data using streaming-first architectures, enabling real-time monitoring and predictive maintenance with improved deployment consistency.
Data Engineer
Elevance Health, Inc
Jul 2018 - May 2020 (1 year 10 months)
Developed real-time event streaming platform ingesting member interactions and claims to enable operational workflows and analytics, integrating streams into Snowflake/BigQuery and ensuring HIPAA-aligned data quality.
Education
Degrees, certifications, and relevant coursework
University of North Texas
Master of Science, Advanced Data Analytics
Completed a Masters in Advanced Data Analytics focused on applied data engineering and analytics principles, graduating in May 2025.
Tech stack
Software and tools used professionally
Airbyte
Matillion
Azure Synapse
Apache Spark
AWS Glue
Talend
QlikView
Google Cloud Platform
AWS Step Functions
GitHub
Bitbucket
Kubernetes
AWS CodePipeline
Jenkins
GitHub Actions
NumPy
Pandas
PySpark
dbt
DB
Sqoop
MySQL
PostgreSQL
MongoDB
Cassandra
Hadoop
HBase
Gmail
Spring Boot
Databricks
Redis
Terraform
Jira
Java
MLflow
Kafka
Grafana
Prometheus
Zookeeper
Ubuntu
CentOS
Linux
macOS
Windows
AWS Lambda
Serverless
Kafka Streams
Airflow
Time Analytics
Google BigQuery
SQL
Qubole
Apache Iceberg
Playwright
Delta Lake
Great Expectations
Apache Hudi
dbt Cloud
Cosmos
Factory
Unify
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Rajkumar?
You can contact Rajkumar and 90k+ other talented remote workers on Himalayas.
Message RajkumarFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
