settings

configure model bench

openrouter api key

get a key

judge model

model used for auto-evaluation (LLM-as-judge)

max tokens

1024
no limit4096

lower values let you compare more models on a tight budget