🎯 Published Posts
HydraServe: Solving Cold Starts for Serverless LLMs in the Cloud — A Technical Deep Dive
Next-Gen Event-Driven Architectures: Performance, Scalability, and Intelligent Orchestration
TrioXpert: Automating Incident Management in Microservice Systems with LLMs
MicroRCA-Agent: Using Large Language Models to Find Root Causes in Microservices
Simplifying Root Cause Analysis in Kubernetes with StateGraph and LLM
Llama-Nemotron: Engineering Efficient Reasoning Models for the Next Generation of LLM Systems
Evaluation Techniques for Foundation Models
Enterprise Version—Train a Large Language Model
A journey from LoRA to Text-to-LoRA — Fine-tuning LLM using prompt
Evo-Memory: DeepMind’s New Benchmark for Agents That Learn From Experience
Making Large Language Models Efficient in Production
Semantic Caching of AI agents using Redis Database
Multi Agent Orchestration with MCP Server using AWS BlueRock AMI
LangChain — In Depth Guide from Document parsing to AI Agents
An In-Depth Guide to Evaluate LLM and
How do you chunk your data in RAG — All about chunking strategies
Deploy Production Ready LLM in local using Kubernetes with KServe
KubeIntellect — LLM Orchestrated Agent Framework for End to End Kubernetes Management
AI in Finance Industry
Exploring research paper on Financial Knowledge Large Language Model
Is Google DeepMind — Mixture of Recursions replacing Transformers Architecture?
Understanding Knowledge graphs for Agentic AI
Agentic Knowledge Graph Construction with Neo4j
GPU Aware Kubernetes Inference Simulator with Reinforcement Learning based Auto Scaling
Do you know architecture of Recommendation System at Netflix?
Do you know about Guardrails AI — Safety Mechanism and Validation Tool for LLMs
Building Database Agent — using Hugging Face and LangChain
Real-Time Air Quality Monitoring Made Simple: Spark and Hugging Face (gpt-oss)
Vibe Coding in 2025: Hype, Hope, and What It Really Means for Developers
LlamaIndex — Did you try End to End document workflows
Data Intensive System Design — Effective Designing data models
What if your LLM had RAM, Disk, and a Memory Manager?
GenAI Series — A way to optimize your Agentic Apps
GenAI Series — Orchestrating using Airflow
6 Everyday Ways I Used Vector Databases (With Pinecone + AI Magic)
Research papers every AI enthusiast must read
Data Intensive System Design — How important is Reliability, Scalability and Maintainability
Hack Your Kid’s YouTube Feed with Python — Smarter Screen Time Starts Here!
Model Context Protocol — Interaction of AI Models with external tools
Built My Own AI Study Partner Using RAG — Now I Understand Research Papers 3x Faster!
Built My Own Chat Summarizer with Gemini Flash — Because Who Has Time for 150+ Messages?
Built My Own AI Wellness Planner with LLM Agents — No GPU, No UI, Just Pure Brainpower
Integrated MCP into My RAG Project — Now My AI Assistant Can Talk to Any LLM Seamlessly!
AWS Series — Machine Learning tools in AWS that you might have to know
AWS Tools for Big Data Analytics you might want to learn
AWS Series — Database services AWS offers you might not be aware of
Distributed Architecture — The key attributes that decide the performance of system design
Distributed Transactions — A way to have strong consistency in distributed system
Distributed Systems — Why is it important?
Back To Basics — Disk Driver and Storage
Back To Basics — How will you design Wide Area Network From Scratch?
Back to Basics — How would you design Local Area Network from scratch?
Back to Basics — TCP/IP Tools and Commands
Back to the Basics — Do you know what happens when you type the address of a website?
Back to Basics — Networking — Auto configuration of IP address-DHCP
Back to Basics — Networking — Routers : The post offices of Internet
Back To Basics — Networking — Dig deeper into IP Addresses
Back to Basics — Servers and Virtualization
Back to the Basics — Networking: TCP/IP and Internet — Where did it start?
Back to Basics — Network Protocols: Can you understand software architecture without these concepts?
Back to the Basics — Networking — What is Cyber Security
Back to Basics — Networking: All About Cloud Computing
Back to Basics — Networking — the better understanding with basics
Linux Series — What happens when you press power button on Linux?
ML Series — Machine Learning based auto scaling approach in Microservice Architecture
AWS Series — All about DynamoDB
AWS Series — Why is Amazon Aurora so popular?
AWS Series — Optimize your database with RDS Read Replicas
ML Series — How can regularization reduce Overfitting of a model?
ML Series — Classification with Logistic Regression —A Supervised Machine Learning Algorithm
ML Series — What if you have multiple features to predict the best model: Multiple Linear…
ML Series — Linear Regression-Most used Machine Learning Algorithm: In simple Terms
Machine Learning — The Basic understanding
AWS Series — Relational Database in AWS
Hypotheses Testing — How is it useful in machine learning?
Confidence Interval — Helps better prediction in Machine Learning
MLE— Most popular in machine learning
AWS Series — S3 Cheat Sheet
AWS Series — S3 Technical Essentials
AWS Series — Do you know how to optimize cost of S3 Storage?
AWS Series — How to backup our data in S3 Storage?
AWS Series — Hosting a Static Website Using S3
AWS Series — How do we secure S3 data?
AWS Series — Simple Storage Service : The most used storage in cloud
AWS Series — EBS Cheat Sheet
AWS Series — Do you know these storages exist in cloud?
AWS Series — AMI: The blue print of your virtual machine
Microservice Architect Series — Do you know what Service discovery Do?
AWS Series — What is EC2 Hibernation?
AWS Series — How to Encrypt my data in EBS?
AWS Series — Secret for creating 100’s of VM with same applications and settings
AWS Series — What is Elastic Block Store?
AWS Series — EC2 Instance Bootstrapping Hands on Demo
AWS Series — EC2 Cheat Sheet
AWS Series —Expansions of your AWS Cloud
AWS Series — Do you know how to get EC2 instance for less price?
AWS Series — EC2 Technical Essentials
AWS Series — Why IAM roles are important for interacting between resources
AWS Series — EC2: More about the most popular VM
AWS Series — EC2: An AWS Virtual Machine
AWS Series —Networking cheat sheet
AWS Series — Best Secured Ways of Networking
AWS Series — Handling multiple VPCs
Back to the Basic Statistics — Population and Sample
How Probability Distribution works for multi dimensional?
AWS Series — What do VPC Endpoints do?
AWS Series — Guards of Subnets : Security Group, Network ACLs
AWS Series — Virtual Private Cloud: An Intranet of AWS
Advanced Probability Concepts for Machine Learning
Probability — How is it related to Machine learning?
Newton’s Method — Best Optimization technique in Neural networks and Why?
Neural Networks — How do we train them with basic 3 layer?
Perceptron — The brain behind Machine Learning
Tangent Plane — An important concept for deriving Gradient Descent
AWS Series — How to set permissions with IAM
AWS Series — IAM-Securing Root Account
AWS Series — Best Practices in IAM
Calculus — Derivatives that do the magic
Principal Component Analysis — A way to Reduce the features
Linear Algebra made simple — Eigen Vectors and Eigen Values
Linking Matrices to the Neural Networks
Basic Linear Algebra — System of Equations
AWS Series — Basic Services AWS offer
AI Terminologies — Easy to understand
AWS Series — The Shared Responsibility Model
AWS Series — What are Availability Zones and Regions
Linear Regression — In simple terms for Machine Learning
Simple Linear Regression — Back to the basics
Outliers — The major culprits for deviating the predictions
Regression with Categorical Explanatory Variables
Did you hear about R-Squared?
Conditions for Linear Regression
What is Prediction and Extrapolation
The best explanation for Least Squares line
What do you know about Residuals?
All you need to know about Correlation Coefficient
Java — Guess what happens if we remove “static” from main() method?
What happens if I run thread.run() instead of thread.start()?
Java — Ways to avoid deadlock situation in your code
Gradle Architecture — The arrangement to handle single and multiple projects
Understanding Gradle — Building the Java application in efficient way
Java Basics — All about Immutable class with examples
Execution Context
Java Design Patterns — Factory Design Pattern- One of the Most Popular
Sub Topic in Threads — ThreadLocal
SOLID Principles — An approach to code in better way
Java — What happens if we change main(String[] args) to main(Integer[] args)?