Home / GPU/AI Computing / Article
GPU/AI Computing News

Benchmarking LLMs on AI-Generated CUDA Code with ComputeEval 2025.2

Daniel Rodrigu…
2025-11-08 4 min read
Benchmarking LLMs on AI-Generated CUDA Code with ComputeEval 2025.2
Benchmarking LLMs on AI-Generated CUDA Code with ComputeEval 2025.2

<img alt="" class="webfeedsFeaturedVisual wp-post-image" height="432" src="https://developer-blogs.nvidia.com/wp-content/uploads/2025/04/blackwell-cuda-12-9-family-specific-featured-768x432.png" style...

Can AI coding assistants write efficient CUDA code? To help measure and improve their capabilities, we created ComputeEval, a robust, open source benchmark for...

Can AI coding assistants write efficient CUDA code? To help measure and improve their capabilities, we created ComputeEval, a robust, open source benchmark for evaluating AI models and agents on CUDA programming tasks. A few months ago, we announced the first release of ComputeEval and today, we’re introducing its first major expansion by adding more than 100 new CUDA challenges.

Source

Source: NVIDIA Technical Blog Word count: 1223 words
Published on 2025-11-08 00:30