Vijay Shankar
@vijayshankar1
I’m a Data Engineer building scalable batch/real-time pipelines across Azure and GCP.
What I'm looking for
I’m a Big Data Engineer with 5+ years of experience across fraud detection, insurance, and financial services domains. I build and optimize batch and real-time data pipelines at scale, with hands-on expertise in streaming architectures, CDC-based ingestion, and Medallion Lakehouse design. I take ownership across the full data engineering lifecycle—from raw ingestion and transformation through performance tuning, data quality enforcement, and CI/CD deployment across GCP and Azure cloud environments.
In my current Stripe-focused Real-Time Fraud Detection work at Infosys, I engineered high-throughput enterprise CDC pipelines using Debezium into a Delta Lake Medallion architecture on GCS, reducing fraud audit query latency by 40% with Z-Order clustering. I also delivered a high-availability Lambda architecture on GCP Dataflow consuming production events from Kafka and Pub/Sub, streamlined feature delivery into BigQuery, and operationalized batch AI/ML feature ingestion via Vertex AI Pipelines. I’ve enforced enterprise governance using Google Cloud Dataplex, built CI/CD schema-contract validation to prevent breaking changes reaching production, and provisioned secure multi-environment infrastructure with Terraform, VPC Service Controls, and CMEK via Cloud KMS—while optimizing autoscaling and Spark execution to decrease overnight batch runtimes by 35%.
Experience
Work history, roles, and key accomplishments
Engineered high-throughput CDC ingestion using Debezium into a GCS-hosted Delta Lake Medallion architecture, reducing fraud audit query latency by 40% via Z-Order clustering. Built real-time fraud metrics with GCP Dataflow consuming Kafka/Pub-Sub events and streamlined feature delivery to BigQuery for downstream model training, while automating ingestion for an LLM-assisted Fraud Operations Engine
Built Azure Databricks PySpark ingestion pipelines for an insurance claims platform, reducing CRM/policy replication latency from 4 hours to under 30 minutes using Fivetran and achieving sub-5-minute end-to-end latency with Event Hub streaming. Delivered regulatory-ready historical tracking with Delta SCD Type 2 (7 years), optimized Spark jobs to cut average runtimes by 30%, and implemented Hadoop
Education
Degrees, certifications, and relevant coursework
S V College of Engineering (SVCE)
Bachelor of Technology (B.Tech), Electronics and Communication Engineering
2017 - 2021
Completed a B.Tech in Electronics and Communication Engineering, building core knowledge in engineering fundamentals during 2017–2021.
JCR’s Chaitanya Junior College
Intermediate, Intermediate (MPC)
2015 - 2017
Completed Intermediate (MPC) coursework with a focus on science subjects during 2015–2017.
Prashanth English Medium High School
Secondary School Certificate (SSC), Secondary Education
2014 - 2015
Earned the Secondary School Certificate (SSC) during 2014–2015.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Vijay?
You can contact Vijay and 90k+ other talented remote workers on Himalayas.
Message VijayFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
