Maximum number of tokens a model can attend to in one request, constraining prompt size, tool outputs, and in‑context examples. ← Fair Launch