Model Settings

These settings help control how the model generates the character’s responses.

  1. Model — choose the appropriate model for the character’s operation.

  2. Temperature — controls the creativity level of responses; recommended range is 0.5 to 1.1.

  3. Top-P — limits word selection to the most probable set covering probability P. For example, at P=0.9, the model chooses words covering 90% probability.

  4. Top-K — limits word selection to the K most likely words. For example, at K=50, the model chooses only from the top 50 words.

  5. Total Context Tokens — the maximum context size the model considers when generating a response.

  6. Max Response Tokens — maximum response length. On average, 100 tokens equal about 70 English words.

  7. Presence Penalty — reduces the chance of repeating words already used, as if saying to the model: “You’ve already said this, don’t repeat.” The higher the value, the stronger the effect.

  8. Frequency Penalty — penalizes frequent repetition of the same words, like saying: “You said this three times — stop.” The higher the value, the stronger the influence.

Last updated