Article List

AI Research Digest

Can "Sure" be enough to backdoor a large language model int…

The 'Sure' Trap: Multi-Scale Poisoning Analysis of Stealthy Compliance-Only Backdoors in Fine-Tuned Large Language Models...

4 months, 4 weeks ago • AIModels.fyi

1417 words 4 min

AI Research Digest

After text and images, is video how AI truly learns to thin…

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm...

5 months ago • AIModels.fyi

2660 words 8 min

AI Research Digest

ChatGPT Atlas can browse, but can it really master web ga…

Can Agent Conquer Web? Exploring the Frontiers of ChatGPT Atlas Agent in Web Games...

5 months, 2 weeks ago • AIModels.fyi

2021 words 6 min

AI Research Digest

Can AI finally generate entire, consistent, multi-shot vide…

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives...

5 months, 3 weeks ago • AIModels.fyi

3649 words 12 min

AI Research Digest

Does a brain-inspired network finally connect Transformers …

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain...

6 months, 1 week ago • AIModels.fyi

24219 words 80 min

AI Research Digest

Do protein folding models truly need that much domain-speci…

SimpleFold: Folding Proteins is Simpler than You Think...

6 months, 2 weeks ago • AIModels.fyi

7399 words 24 min

AI Research Digest

Can unified multimodal models align understanding and gener…

Reconstruction alignment improves unified multimodal models...

6 months, 4 weeks ago • AIModels.fyi

5694 words 18 min

AI Research Digest

What if LMs could collectively train, slashing RL post-trai…

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing...

7 months ago • AIModels.fyi

4101 words 13 min

AI Research Digest

Are we training LLMs to confidently guess instead of admitt…

Why Language Models Hallucinate...

7 months, 1 week ago • AIModels.fyi

1733 words 5 min

AI Research Digest

Can you pick the perfect LLM without breaking the bank?

Adaptive LLM Routing under Budget Constraints...

7 months, 2 weeks ago • AIModels.fyi

1289 words 4 min

AI Research Digest

Can AI learn to prove theorems by thinking step-by-step lik…

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving...

7 months, 3 weeks ago • AIModels.fyi

3558 words 11 min

AI Research Digest

Can reinforcement learning fix the glaring visual flaws in …

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again...

7 months, 4 weeks ago • AIModels.fyi

737 words 2 min

AI Research Digest

Can doctors trust AI diagnostic tools enough to delegate ta…

Towards physician-centered oversight of conversational diagnostic AI...

8 months, 3 weeks ago • AIModels.fyi

4291 words 14 min

AI Research Digest

Can seeing the document like a human dramatically boost a R…

Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding...

9 months ago • AIModels.fyi

3113 words 10 min

AI Research Digest

Can AI reconstruct super-slow-motion 4D models from regular…

4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture...

9 months, 1 week ago • AIModels.fyi

9456 words 31 min

AI Research Digest

An embarrassingly simple defense against LLM abliteration a…

Defending AI systems against a new form of attack...

10 months, 2 weeks ago • AIModels.fyi

8645 words 28 min

AI Research Digest

Questioning the role of "chains of thought"

Beyond semantics: The unreasonable effectiveness of reasonless intermediate tokens...

10 months, 3 weeks ago • AIModels.fyi

1594 words 5 min

AI Research Digest

Zero-shot voice cloning without transcription

MiniMax-Speech: Intrinsic zero-shot text-to-speech with a learnable speaker encoder...

10 months, 4 weeks ago • AIModels.fyi

4434 words 14 min
