Security Research
News
My benchmark for large language models
A benchmark of ~100 tests for language models, collected from actual questions I've asked of language models in the last year....
A benchmark of ~100 tests for language models, collected from actual questions I've asked of language models in the last year.
Source: Nicholas Carlini
Word count: 106 words
Published on 2024-02-19 08:00