Cary Fehler
@caryfehler
Senior software engineer architecting production generative AI and full-stack systems at scale.
What I'm looking for
I’m a senior software engineer architecting cloud-native generative AI platforms and full-stack experiences that ship reliably at scale. My work blends system design, model integration, and product-minded engineering to turn ambiguous requirements into production outcomes.
At Adobe (04/2022–Present), I architected and scaled Adobe Firefly, a cloud-native generative AI platform supporting 16B+ content generations. I designed end-to-end multimodal systems for text/image/video generation and collaborative editing (Firefly Boards), built distributed backend pipelines (Node.js, Python), and reduced P95 latency by ~40% through batching, caching (Redis), and optimized request flows.
I also drive scalable infrastructure and deployment practices, building and operating GPU-backed workloads on AWS and Kubernetes with autoscaling and workload scheduling to support millions of requests/month with 99.9% uptime. I integrate LLMs and RAG-style pipelines into production (including partner model integrations and custom fine-tuning), reducing generation latency by ~25% while enabling enterprise-scale, brand-safe AI content generation.
Earlier, I helped Stripe Atlas (06/2020–04/2022) enable 100K+ founders across 140+ countries to form U.S. companies, open bank accounts, and start payments in ~2 days. I implemented distributed onboarding workflows (Ruby on Rails / Node.js, Kafka messaging), improved end-to-end latency by ~40%, and achieved 99.9% uptime while maintaining MTTR < 1 hour through CI/CD, testing automation, and operational reliability practices.
Experience
Work history, roles, and key accomplishments
Architected and scaled Adobe Firefly, a cloud-native generative AI platform supporting 16B+ content generations, including multimodal production pipelines and collaborative editing. Reduced P95 latency by ~40% and generation latency by ~25% while enabling millions of requests/month with 99.9% uptime through AWS Batch/EC2, Kubernetes, autoscaling, and Redis-based optimizations.
Architected and delivered Stripe Atlas, enabling 100K+ founders across 140+ countries to incorporate and start payments in ~2 days. Reduced end-to-end latency by ~40%, maintained 99.9% uptime, and kept MTTR under 1 hour using AWS/Kubernetes, Redis caching, retry/idempotent processing, and Kafka-based distributed workflows.
Contributed to Twitter’s unified Progressive Web App (PWA), enabling a single codebase across desktop, mobile web, and lightweight clients. Reduced page load times by ~30% via React/Redux optimizations and service workers, while improving scalability with caching (Redis/Memcached) to support billions of requests/day with >99.9% uptime.
Developed and maintained enterprise-grade applications for healthcare, financial services, and retail with strict high-availability and reliability requirements. Built full-stack features using Java and JavaScript with REST APIs, improving system performance and data processing efficiency while collaborating with cross-functional teams to resolve production issues.
Education
Degrees, certifications, and relevant coursework
Texas Tech University
Bachelor of Science, Computer Science
2011 - 2015
Earned a Bachelor of Science in Computer Science from Texas Tech University (2011–2015).
