RS
Ravi Shukla
HomeBlogToolsAbout
Resume
RS
Ravi Shukla

Senior Java + AI engineer. Kafka, RAG, distributed systems.

Content

  • Blog
  • System Design
  • AI & ML
  • DevOps

Explore

  • About Ravi
  • Open Stats
  • Thank You

© 2026 Ravi Kant Shukla. All rights reserved.

Deployed on Vercel · Mumbai region

All Writing

AI & Machine Learning

3 posts in this category.

ai-ml

Serving ML Models in Production with FastAPI: Async Inference, Streaming, and Deployment

FastAPI has become the go-to Python framework for serving ML models in production. Here's how to build async inference endpoints, stream LLM responses, and deploy them reliably on AWS.

FastAPIMachine LearningPython
May 25, 202620 min read
ai-ml

Building a Production RAG Pipeline with LangChain4j + Spring Boot

A complete guide to building retrieval-augmented generation (RAG) systems in Java using LangChain4j. Learn chunking strategies, embedding pipelines, vector store integration, and how to ship RAG to production.

LangChain4jRAGSpring Boot
May 5, 202618 min read
ai-ml

Deploy Your ML Model on AWS Lambda: The Complete Production Guide

Step-by-step guide to packaging a scikit-learn or PyTorch model as a Lambda function — covering cold starts, container images, model versioning, and A/B testing on AWS.

AWS LambdaML DeploymentDocker
Feb 20, 202414 min read