Article List

Explore latest news, discover interesting content, and dive deep into topics that interest you

Clear Filters
RL without TD learning Berkeley AI Research

RL without TD learning

In this post, I’ll introduce a reinforcement learning (RL) algorithm based on an “alternative” paradigm: divide and conquer. Unlike traditional method...

2 months, 1 week ago The Berkele…
12892 words 42 min
What exactly does word2vec learn? Berkeley AI Research

What exactly does word2vec learn?

What exactly does word2vec learn, and how? Answering this question amounts to understanding representation lea...

4 months, 1 week ago The Berkele…
8845 words 29 min
Berkeley AI Research

Whole-Body Conditioned Egocentric Video Prediction

× ...

6 months, 1 week ago The Berkele…
16908 words 56 min
Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign) Berkeley AI Research

Defending against Prompt Injection with Structured Queries …

Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications. However, as LLMs have improved, so have the attacks again...

9 months ago The Berkele…
8563 words 28 min
Repurposing Protein Folding Models for Generation with Latent Diffusion Berkeley AI Research

Repurposing Protein Folding Models for Generation with Late…

<!-- The actual text for the post content appears below. Text will appear on the homepage, i.e., https://bair.berkeley.edu/blog/ but we only show part...

9 months ago The Berkele…
10620 words 35 min
Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment Berkeley AI Research

Scaling Up Reinforcement Learning for Traffic Smoothing: A …

Training Diffusion Models with Reinforcement Learning <video loop="" style="width: 100%; margin: 0; padding: 0; border: none; background: transparent;...

9 months, 2 weeks ago The Berkele…
14068 words 46 min
Virtual Personas for Language Models via an Anthology of Backstories Berkeley AI Research

Virtual Personas for Language Models via an Anthology of Ba…

<!-- These are comments in HTML. The above header text is needed to format the title, authors, etc. The "example_post" is an example representative im...

1 year, 2 months ago The Berkele…
8718 words 29 min
Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination Berkeley AI Research

Linguistic Bias in ChatGPT: Language Models Reinforce Diale…

<!-- These are comments in HTML. The above header text is needed to format the title, authors, etc. The "example_post" is an example representative im...

1 year, 3 months ago The Berkele…
6939 words 23 min
How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark Berkeley AI Research

How to Evaluate Jailbreak Methods: A Case Study with the St…

<!-- These are comments in HTML. The above header text is needed to format the title, authors, etc. The "example_post" is an example representative im...

1 year, 4 months ago The Berkele…
20372 words 67 min
Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark! Berkeley AI Research

Are We Ready for Multi-Image Reasoning? Launching VHs: The …

<!-- These are comments in HTML. The above header text is needed to format the title, authors, etc. The "example_post" is an example representative im...

1 year, 5 months ago The Berkele…
14255 words 47 min