Corpus used to fit model parameters; quality, diversity, and licensing shape capabilities and risks of memorization or bias. ← Fair Launch