Article List

Explore latest news, discover interesting content, and dive deep into topics that interest you

Clear Filters
Can "Sure" be enough to backdoor a large language model into saying anything? AI Research Digest

Can "Sure" be enough to backdoor a large language model int…

The 'Sure' Trap: Multi-Scale Poisoning Analysis of Stealthy Compliance-Only Backdoors in Fine-Tuned Large Language Models...

1 month, 3 weeks ago AIModels.fyi
1417 words 4 min
After text and images, is video how AI truly learns to think dynamically? AI Research Digest

After text and images, is video how AI truly learns to thin…

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm...

2 months ago AIModels.fyi
2660 words 8 min
ChatGPT Atlas can browse, but can it *really* master web games? AI Research Digest

ChatGPT Atlas can browse, but can it *really* master web ga…

Can Agent Conquer Web? Exploring the Frontiers of ChatGPT Atlas Agent in Web Games...

2 months, 1 week ago AIModels.fyi
2021 words 6 min
Can AI finally generate entire, consistent, multi-shot video narratives? AI Research Digest

Can AI finally generate entire, consistent, multi-shot vide…

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives...

2 months, 2 weeks ago AIModels.fyi
3649 words 12 min
Does a brain-inspired network finally connect Transformers to true reasoning? AI Research Digest

Does a brain-inspired network finally connect Transformers …

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain...

3 months ago AIModels.fyi
24219 words 80 min
Do protein folding models truly need that much domain-specific complexity? AI Research Digest

Do protein folding models truly need that much domain-speci…

SimpleFold: Folding Proteins is Simpler than You Think...

3 months, 2 weeks ago AIModels.fyi
7399 words 24 min
Can unified multimodal models align understanding and generation, without *any* captions? AI Research Digest

Can unified multimodal models align understanding and gener…

Reconstruction alignment improves unified multimodal models...

3 months, 3 weeks ago AIModels.fyi
5694 words 18 min
What if LMs could collectively train, slashing RL post-training costs? AI Research Digest

What if LMs could collectively train, slashing RL post-trai…

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing...

3 months, 4 weeks ago AIModels.fyi
4101 words 13 min
Are we training LLMs to confidently guess instead of admitting uncertainty? AI Research Digest

Are we training LLMs to confidently guess instead of admitt…

Why Language Models Hallucinate...

4 months ago AIModels.fyi
1733 words 5 min
Can you pick the perfect LLM without breaking the bank? AI Research Digest

Can you pick the perfect LLM without breaking the bank?

Adaptive LLM Routing under Budget Constraints...

4 months, 1 week ago AIModels.fyi
1289 words 4 min
Can AI learn to prove theorems by thinking step-by-step like a human mathematician, even without perfect instructions? AI Research Digest

Can AI learn to prove theorems by thinking step-by-step lik…

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving...

4 months, 2 weeks ago AIModels.fyi
3558 words 11 min
Can reinforcement learning fix the glaring visual flaws in AI-generated images? AI Research Digest

Can reinforcement learning fix the glaring visual flaws in …

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again...

4 months, 3 weeks ago AIModels.fyi
737 words 2 min
Can doctors trust AI diagnostic tools enough to delegate tasks? AI Research Digest

Can doctors trust AI diagnostic tools enough to delegate ta…

Towards physician-centered oversight of conversational diagnostic AI...

5 months, 2 weeks ago AIModels.fyi
4291 words 14 min
Can seeing the document like a human dramatically boost a RAG system's IQ? AI Research Digest

Can seeing the document like a human dramatically boost a R…

Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding...

5 months, 3 weeks ago AIModels.fyi
3113 words 10 min
Can AI reconstruct super-slow-motion 4D models from regular speed multi-camera video? AI Research Digest

Can AI reconstruct super-slow-motion 4D models from regular…

4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture...

6 months ago AIModels.fyi
9456 words 31 min
An embarrassingly simple defense against LLM abliteration attacks AI Research Digest

An embarrassingly simple defense against LLM abliteration a…

Defending AI systems against a new form of attack...

7 months, 1 week ago AIModels.fyi
8645 words 28 min
Questioning the role of "chains of thought" AI Research Digest

Questioning the role of "chains of thought"

Beyond semantics: The unreasonable effectiveness of reasonless intermediate tokens...

7 months, 2 weeks ago AIModels.fyi
1594 words 5 min
Zero-shot voice cloning without transcription AI Research Digest

Zero-shot voice cloning without transcription

MiniMax-Speech: Intrinsic zero-shot text-to-speech with a learnable speaker encoder...

7 months, 3 weeks ago AIModels.fyi
4434 words 14 min
1 / 2