Belgavi's AI Lab — Deep Tech, Interactive

Articles by topic

Hand-written technical articles,
organized by domain

Every article is self-contained, uses concrete examples and numbers, and avoids marketing fluff. Click any category to browse its index. 5,619 articles across 39 categories.

✨ Newly added

Updated 2026-07-11

ADK JAVA · RUNTIME

ADK Java Runtime — Anatomy of the Execution Loop

Every phase of the ADK Java runtime's execution loop, from message-in to response-out.

ADK JAVA · ORCHESTRATION

Hierarchical Agents — Supervisor Pattern

One coordinator delegating to specialist workers — classic hierarchical orchestration.

ADK JAVA · ORCHESTRATION

Parallel Agents — Fan-Out Pattern

Send the same input to multiple agents concurrently and gather all their results.

ADK JAVA · ORCHESTRATION

Sequential Agent Chains in ADK Java

The workhorse pipeline pattern — parse, validate, transform, summarize.

ADK JAVA · GUIDE

Choosing an Orchestration Pattern — Decision Guide

Sequential vs parallel vs loop vs hierarchical — the right pattern for your system.

→

See all 565 ADK Java articles

AI & Language Models

724 articles · 7 categories

★ FEATURED

Transformer Math & CPU SLM

394

Linear algebra, attention math, training, CPU inference, weight storage, SLM lifecycle.

Read series → AI · LLM

Large Language Models

LLM architectures, inference servers, sampling, prompt caching, observability.

Browse → AI · MATH

Transformer Math

383

Attention arithmetic, positional encodings, KV cache sizing, FLOP and memory budgets.

Browse → AI · ARCH

Transformers

Attention variants, RoPE, MoE, RMSNorm, MTP, sparse attention, FlashAttention.

Browse → AI · GENERAL

General AI

RAG, hallucination mitigation, embeddings, evals, prompt engineering, agentic safety.

Browse → AI · AGENTS

AI Agents

Tool use, agentic workflows, memory systems, observability, human handoff.

Browse → AI · SLM

Small Language Models

Phi, Qwen, Gemma; on-device inference; distillation; tool-call fine-tunes.

Browse → AI · OPT

Quantization

INT8/INT4, GGUF, AWQ, GPTQ, SmoothQuant, FP8 KV cache.

Browse →

A2A

Agent Protocols

2,145 articles · 8 categories

PROTOCOL

MCP Protocol

Model Context Protocol — server design, OAuth 2.1, transports, testing patterns.

Browse → FRAMEWORK

ADK Framework

Agent Development Kit — lifecycle, tools, evals, guardrails, sessions, skills.

Browse → FRAMEWORK · NEW

ADK for Java

566

Complete coverage: runtime, model package (BaseLLM + Gemini), persistence, memory, tools, orchestration, A2A transport, observability, evaluation, safety, deployment. All with Java code.

Browse → ALGORITHMS · NEW

Algorithms & DP

753

DP, trees, graphs, strings, geometry, network flow, number theory, plus cryptography (AES/RSA/ECC/Ed25519/SHA/HMAC/Argon2/ChaCha/Kyber/ZK-SNARKs/MPC/FHE/JWT/OAuth) and ML (regression/RF/XGBoost/SVM/k-means/PCA/attention/transformer/CNN/RNN/BERT/GPT/Q-learning/PPO/RLHF) — all with animated visualizations.

Browse → PROMPT ENG · NEW

Prompt Engineering

Zero/few-shot, CoT, self-consistency, ToT, ReAct, self-refine, Reflexion, Constitutional AI, DSPy, meta-prompting, RAG (HyDE/GraphRAG/contextual), agentic prompting, multi-agent orchestration, function calling, structured outputs, prompt caching, cost optimization, A/B testing, eval frameworks.

Browse → SECURITY · NEW

LLM Security & Guardrails

548

OWASP LLM Top 10, prompt injection (direct/indirect/multi-turn/GCG), jailbreaks, data exfil, model stealing, MITRE ATLAS, NeMo/Guardrails AI/Llama Guard, PII redaction, hallucination detection, sandboxing, red team (PyRIT/Garak), NIST RMF, EU AI Act, mechanistic interpretability, SAE, watermarking, PQ migration, agent security.

Browse → PROTOCOL

A2A Protocol

Full protocol coverage: JSON-RPC flow, task lifecycle, streaming, push notifications, artifacts, auth, tracing, MCP interop.

Browse → PAYMENTS

AP2 Payments

Agent Payment Protocol — subscriptions, settlement, chargebacks, compliance.

Browse →

SYS

Distributed Systems & Infrastructure

665 articles · 9 categories

DATABASE

Apache Cassandra

Data modeling, consistency, compaction, repair, vector search 5, multi-DC.

Browse → DISTRIBUTED

Distributed Systems

Raft, consensus, CRDTs, vector clocks, BFT, quorum systems, 2PC.

Browse → DESIGN

System Design

150

LinkedIn, YouTube, TikTok, Airbnb, Amazon, Reddit, Spotify, DoorDash, Grab plus WhatsApp/Signal/Stripe/S3/Google Search/Zoom + core patterns — all with architecture diagrams.

Browse → DATABASE

Databases

Postgres internals, isolation levels, replication, pgvector, DuckDB, CDC.

Browse → STREAMING

Streaming Systems

Kafka, Flink, Pulsar, exactly-once, CDC, tiered storage, backpressure.

Browse → CONCURRENCY

Concurrency

Virtual threads, executor pools, fork/join, actor model, work-stealing, coroutines — each with an architecture diagram.

Browse → OPS

Observability

OpenTelemetry, Prometheus at scale, SLOs, eBPF, alert fatigue.

Browse → SECURITY

Security

mTLS, OAuth+PKCE, passkeys, SBOM, zero trust, K8s pod security.

Browse → NETWORK

Networking

TLS 1.3, DNS, BBR, anycast, mTLS rotation, service mesh patterns.

Browse →

Big Data, JVM & Cloud Platforms

1,824 articles · 10 categories

BIG DATA

Hadoop & HDFS

124

HDFS internals, YARN schedulers, MapReduce, Kerberos, upgrades, capacity planning.

Browse → BIG DATA

Apache HBase

150

Regions, WAL, compactions, block cache, bloom filters, Phoenix, schema design.

Browse → BIG DATA

Hive & Impala

150

Tez, LLAP, ACID tables, ORC/Parquet, metastore HA, CBO, Ranger authorization.

Browse → BIG DATA

Apache Spark

139

AQE, shuffle internals, Delta/Iceberg/Hudi, structured streaming, tuning.

Browse → JVM

Scala

141

Type system, cats-effect, ZIO, Akka, fs2/http4s, Scala 3, metaprogramming.

Browse → JVM

Java & JVM

133

GC algorithms, virtual threads, Loom/Panama/Valhalla, concurrency, JFR.

Browse → CLOUD

AWS

129

IAM, VPC, S3, Lambda, EKS, Kinesis, Bedrock, Redshift, Step Functions.

Browse → CLOUD

Google Cloud

134

BigQuery, Spanner, Dataflow, GKE, Pub/Sub, Cloud Run, IAM deep dives.

Browse → CLOUD

Cloud & Multi-Cloud

142

OCI, multi-cloud strategy, FinOps, migration patterns, zero trust, DR.

Browse → GPU · LLM INFRA

GPU & LLM Infrastructure

582

CUDA, NCCL, vLLM/TensorRT-LLM, KV-cache math, parallelism, fine-tuning, serving SLOs, cost.

Browse →

Realtime, Media & Engineering Guides

261 articles · 5 categories

REALTIME

Bidirectional Streaming

gRPC, WebSocket, HTTP/2, HTTP/3, SSE, reconnect strategies, HOL blocking.

Browse → MEDIA

Audio Engineering

Opus, jitter buffers, VAD, AEC, LUFS targets, WebRTC pipeline.

Browse → MEDIA

Video Streaming

HLS, DASH, AV1, ABR strategies, bitrate ladders, DRM, VMAF.

Browse → GUIDE

Engineering Guides

How to design APIs, run postmortems, estimate, pick brokers, review PRs.

Browse → MISC

Built by an engineer
who learns by building

Belgavi's AI Lab is a personal initiative by Sandeep Belgavi Ashok Kumar, a Senior Engineering Manager and Architect working at the intersection of scalable distributed systems and modern AI.

The site captures deep technical knowledge in two complementary forms: focused written explanations and interactive single-file labs that make the underlying mechanics visible. Topics span LLM internals, agent protocols (MCP, A2A, ADK, AP2), Apache Cassandra, distributed consensus, system design, networking, security, observability, and the realtime media stack (WebRTC, HLS, DASH).

Every article aims for concrete numbers and working examples over marketing claims. Every lab runs in a browser — no install, no build step, no JavaScript framework dependency.

Hand-written technical articles,organized by domain

ADK Java Runtime — Anatomy of the Execution Loop

Hierarchical Agents — Supervisor Pattern

Parallel Agents — Fan-Out Pattern

Sequential Agent Chains in ADK Java

Choosing an Orchestration Pattern — Decision Guide

AI & Language Models

Transformer Math & CPU SLM

Large Language Models

Transformer Math

Transformers

General AI

AI Agents

Small Language Models

Quantization

Agent Protocols

MCP Protocol

ADK Framework

ADK for Java

Algorithms & DP

Prompt Engineering

LLM Security & Guardrails

A2A Protocol

AP2 Payments

Distributed Systems & Infrastructure

Apache Cassandra

Distributed Systems

System Design

Databases

Streaming Systems

Concurrency

Observability

Security

Networking

Big Data, JVM & Cloud Platforms

Hadoop & HDFS

Apache HBase

Hive & Impala

Apache Spark

Scala

Java & JVM

AWS

Google Cloud

Cloud & Multi-Cloud

GPU & LLM Infrastructure

Realtime, Media & Engineering Guides

Bidirectional Streaming

Audio Engineering

Video Streaming

Engineering Guides

Other Topics

No install. No build step.146 labs ready to play.

50 labs that visualize transformer math, end-to-end

Transformer Math & CPU SLM

LLM Visualizations

Large Language Models

Transformer Internals

General AI

Quantization

MCP Protocol

Distributed Systems

Apache Cassandra

System Design

Databases

Streaming

Concurrency

Observability

System Simulations

Networking

Security

Bidirectional Streaming

Audio Engineering

Video Streaming

Start here if you're curious where to dig in

Linear Algebra for Transformers

Cassandra Data Modeling Basics

Raft Consensus Intuition

Built by an engineerwho learns by building

Hand-written technical articles,
organized by domain

No install. No build step.
146 labs ready to play.

Built by an engineer
who learns by building