Sampling Strategies Compared

Advertisement

Top-k Top-p 0.90 T 1.0

Truncate distribution by k (fixed count), p (cumulative mass), or T (sharpness).

Greedy = top-1 = T→0. Top-p adapts to distribution; top-k is fixed. Production: top-p with T=0.7-0.9.

★ KEY TAKEAWAY

Top-k truncates to a fixed count; top-p adapts to distribution shape. Combine with temperature for production sampling.

▶ WHAT TO TRY