Article List
Explore the latest news, discover interesting content, and dive deep into the topics that interest you
AI Research Digest
Can "Sure" be enough to backdoor a large language model int…
The 'Sure' Trap: Multi-Scale Poisoning Analysis of Stealthy Compliance-Only Backdoors in Fine-Tuned Large Language Models...
AI Research Digest
After text and images, is video how AI truly learns to thin…
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm...
AI Research Digest
ChatGPT Atlas can browse, but can it *really* master web ga…
Can Agent Conquer Web? Exploring the Frontiers of ChatGPT Atlas Agent in Web Games...
AI Research Digest
Can AI finally generate entire, consistent, multi-shot vide…
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives...
AI Research Digest
Does a brain-inspired network finally connect Transformers …
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain...
AI Research Digest
Do protein folding models truly need that much domain-speci…
SimpleFold: Folding Proteins is Simpler than You Think...
AI Research Digest
Can unified multimodal models align understanding and gener…
Reconstruction alignment improves unified multimodal models...
AI Research Digest
What if LMs could collectively train, slashing RL post-trai…
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing...
AI Research Digest
Are we training LLMs to confidently guess instead of admitt…
Why Language Models Hallucinate...
AI Research Digest
Can you pick the perfect LLM without breaking the bank?
Adaptive LLM Routing under Budget Constraints...
AI Research Digest
Can AI learn to prove theorems by thinking step-by-step lik…
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving...
AI Research Digest
Can reinforcement learning fix the glaring visual flaws in …
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again...
AI Research Digest
Can doctors trust AI diagnostic tools enough to delegate ta…
Towards physician-centered oversight of conversational diagnostic AI...
AI Research Digest
Can seeing the document like a human dramatically boost a R…
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding...
AI Research Digest
Can AI reconstruct super-slow-motion 4D models from regular…
4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture...
AI Research Digest
An embarrassingly simple defense against LLM abliteration a…
Defending AI systems against a new form of attack...
Questioning the role of "chains of thought"
Beyond semantics: The unreasonable effectiveness of reasonless intermediate tokens...
AI Research Digest
Zero-shot voice cloning without transcription
MiniMax-Speech: Intrinsic zero-shot text-to-speech with a learnable speaker encoder...