Belgavi's AI Lab — Deep Tech, Interactive

Articles by topic

Hand-written technical articles,
organized by domain

Every article is self-contained, uses concrete examples and numbers, and avoids marketing fluff. Click any category to browse its index. 469 articles across 25 categories.

AI & Language Models

147 articles · 7 categories

★ FEATURED

Transformer Math & CPU SLM

Linear algebra, attention math, training, CPU inference, weight storage, SLM lifecycle.

Read series → AI · LLM

Large Language Models

LLM architectures, inference servers, sampling, prompt caching, observability.

Browse → AI · ARCH

Transformers

Attention variants, RoPE, MoE, RMSNorm, MTP, sparse attention, FlashAttention.

Browse → AI · GENERAL

General AI

RAG, hallucination mitigation, embeddings, evals, prompt engineering, agentic safety.

Browse → AI · AGENTS

AI Agents

Tool use, agentic workflows, memory systems, observability, human handoff.

Browse → AI · SLM

Small Language Models

Phi, Qwen, Gemma; on-device inference; distillation; tool-call fine-tunes.

Browse → AI · OPT

Quantization

INT8/INT4, GGUF, AWQ, GPTQ, SmoothQuant, FP8 KV cache.

Browse →

A2A

Agent Protocols

70 articles · 4 categories

PROTOCOL

MCP Protocol

Model Context Protocol — server design, OAuth 2.1, transports, testing patterns.

Browse → FRAMEWORK

ADK Framework

Agent Development Kit — lifecycle, tools, evals, guardrails, sessions.

Browse → PROTOCOL

A2A Protocol

Agent-to-agent — discovery, trust models, orchestration, error recovery.

Browse → PAYMENTS

AP2 Payments

Agent Payment Protocol — subscriptions, settlement, chargebacks, compliance.

Browse →

SYS

Distributed Systems & Infrastructure

137 articles · 9 categories

DATABASE

Apache Cassandra

Data modeling, consistency, compaction, repair, vector search 5, multi-DC.

Browse → DISTRIBUTED

Distributed Systems

Raft, consensus, CRDTs, vector clocks, BFT, quorum systems, 2PC.

Browse → DESIGN

System Design

Rate limiters, circuit breakers, sharding, real-system blueprints.

Browse → DATABASE

Databases

Postgres internals, isolation levels, replication, pgvector, DuckDB, CDC.

Browse → STREAMING

Streaming Systems

Kafka, Flink, Pulsar, exactly-once, CDC, tiered storage, backpressure.

Browse → CONCURRENCY

Concurrency

Virtual threads, async/await, work-stealing, lock contention, memory models.

Browse → OPS

Observability

OpenTelemetry, Prometheus at scale, SLOs, eBPF, alert fatigue.

Browse → SECURITY

Security

mTLS, OAuth+PKCE, passkeys, SBOM, zero trust, K8s pod security.

Browse → NETWORK

Networking

TLS 1.3, DNS, BBR, anycast, mTLS rotation, service mesh patterns.

Browse →

Realtime, Media & Engineering Guides

80 articles · 5 categories

REALTIME

Bidirectional Streaming

gRPC, WebSocket, HTTP/2, HTTP/3, SSE, reconnect strategies, HOL blocking.

Browse → MEDIA

Audio Engineering

Opus, jitter buffers, VAD, AEC, LUFS targets, WebRTC pipeline.

Browse → MEDIA

Video Streaming

HLS, DASH, AV1, ABR strategies, bitrate ladders, DRM, VMAF.

Browse → GUIDE

Engineering Guides

How to design APIs, run postmortems, estimate, pick brokers, review PRs.

Browse → MISC

Built by an engineer
who learns by building

Belgavi's AI Lab is a personal initiative by Sandeep Belgavi Ashok Kumar, a Senior Engineering Manager and Architect working at the intersection of scalable distributed systems and modern AI.

The site captures deep technical knowledge in two complementary forms: focused written explanations and interactive single-file labs that make the underlying mechanics visible. Topics span LLM internals, agent protocols (MCP, A2A, ADK, AP2), Apache Cassandra, distributed consensus, system design, networking, security, observability, and the realtime media stack (WebRTC, HLS, DASH).

Every article aims for concrete numbers and working examples over marketing claims. Every lab runs in a browser — no install, no build step, no JavaScript framework dependency.

Hand-written technical articles,organized by domain

AI & Language Models

Transformer Math & CPU SLM

Large Language Models

Transformers

General AI

AI Agents

Small Language Models

Quantization

Agent Protocols

MCP Protocol

ADK Framework

A2A Protocol

AP2 Payments

Distributed Systems & Infrastructure

Apache Cassandra

Distributed Systems

System Design

Databases

Streaming Systems

Concurrency

Observability

Security

Networking

Realtime, Media & Engineering Guides

Bidirectional Streaming

Audio Engineering

Video Streaming

Engineering Guides

Other Topics

No install. No build step.135 labs ready to play.

50 labs that visualize transformer math, end-to-end

Transformer Math & CPU SLM

LLM Visualizations

Large Language Models

Transformer Internals

General AI

Quantization

MCP Protocol

Distributed Systems

Apache Cassandra

System Design

Databases

Streaming

Concurrency

Observability

System Simulations

Networking

Security

Bidirectional Streaming

Audio Engineering

Video Streaming

Start here if you're curious where to dig in

Linear Algebra for Transformers

Cassandra Data Modeling Basics

Raft Consensus Intuition

Built by an engineerwho learns by building

Hand-written technical articles,
organized by domain

No install. No build step.
135 labs ready to play.

Built by an engineer
who learns by building