ML Products
News
Task-Specific LLM Evals that Do & Don't Work
Evals for classification, summarization, translation, copyright regurgitation, and toxicity....
Evals for classification, summarization, translation, copyright regurgitation, and toxicity.
Source: Eugene Yan
Word count: 84 words
Published on 2024-03-31 08:00