Multiple Choice Normalization in LM Evaluation
There are several ways to evaluate multiple-choice tasks on autoregressive LMs such as GPT-3, GPT-Neo, and GPT-J. This post lays out the normalization methods currently in common use.
Source: Blog on EleutherAI Blog
Word count: 141 words
Published on 2021-10-11 23:00