Logits to Token (Argmax vs Sample)

Advertisement

Strategy Temperature 1.0

logits → softmax(/T) → distribution → pick (argmax or sample).

Final hidden state × W_out → logits ∈ ℝ^V. Apply temperature, softmax, then pick.

Greedy: deterministic, can repeat. Sampling: diverse, can be incoherent at high T.

★ KEY TAKEAWAY

logits → softmax(/T) → distribution → pick (argmax or sample). Temperature is the main creativity knob.

▶ WHAT TO TRY