Article List - NewsStore

Open Source AI Research

Reward Hacking Resarch Update

Interim report on ongoing work on reward hacking...

6 months ago • Blog on Ele…

41 words 1 min

Open Source AI Research

Pretraining Data Filtering for Open-Weight AI Safety

Announcing Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs...

8 months ago • Blog on Ele…

99 words 1 min

Open Source AI Research

Attention Probes

Adding attention to linear probes...

8 months, 1 week ago • Blog on Ele…

29 words 1 min

Open Source AI Research

Research Update: Applications of Local Volume Measurement

Research update on on applying local volume measurement to downstream tasks...

9 months, 3 weeks ago • Blog on Ele…

65 words 1 min

Open Source AI Research

Studying inductive biases of random networks via local volu…

In this post, we will study inductive biases of the parameter-function map of random neural networks using star domain volume estimates. This builds o...

10 months ago • Blog on Ele…

495 words 1 min

Open Source AI Research

The Common Pile v0.1

Announcing the Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text...

10 months, 1 week ago • Blog on Ele…

75 words 1 min

Open Source AI Research

Product Key Memory Sparse Coders

Using Product Key Memories to encode sparse coder features...

10 months, 1 week ago • Blog on Ele…

50 words 1 min

Open Source AI Research

SAEs trained on the same data don’t learn the same features

In this post, we show that when two TopK SAEs are trained on the same data, with the same batch order but with different random initializations, there...

1 year, 4 months ago • Blog on Ele…

429 words 1 min

Open Source AI Research