Article List
Explore latest news, discover interesting content, and dive deep into topics that interest you
Which Agent Causes Task Failures and When?Researchers from …
In recent years, LLM Multi-Agent systems have garnered widespread attention for their collaborative approach to solving complex problems. However, it'...
AI Technology Review
ByteDance Introduces Astra: A Dual-Model Architecture for A…
ByteDance introduces Astra, an innovative dual-model architecture revolutionizing robot navigation in complex indoor environments. The post <a href="h...
MIT Researchers Unveil “SEAL”: A New Step Towards Self-Impr…
MIT introduces SEAL, a framework enabling large language models to self-edit and update their weights via reinforcement learning. The post <a href="ht...
Researchers from PSU and Duke introduce “Multi-Agent System…
"Automated failure attribution" is a crucial component in the development lifecycle of Multi-Agent systems. It has the potential to transform the chal...
AI Technology Review
Adobe Research Unlocking Long-Term Memory in Video World Mo…
By combining State-Space Models (SSMs) for efficient long-range dependency modeling with dense local attention for coherence, and using training strat...
AI Technology Review
DeepSeek-V3 New Paper is coming! Unveiling the Secrets of L…
A newly released 14-page technical paper from the team behind DeepSeek-V3, with DeepSeek CEO Wenfeng Liang as a co-author, sheds light on the “Scaling...
AI Technology Review
DeepSeek Unveils DeepSeek-Prover-V2: Advancing Neural Theor…
DeepSeek AI releases DeepSeek-Prover-V2, an open-source LLM for Lean 4 theorem proving. It uses recursive proof search with DeepSeek-V3 for training d...
AI Technology Review
Can GRPO be 10x Efficient? Kwai AI’s SRPO Suggests Yes with…
Kwai AI's SRPO framework slashes LLM RL post-training steps by 90% while matching DeepSeek-R1 performance in math and code. This two-stage RL approach...
Zhipu.AI’s Open-Source Power Play: Blazing-Fast GLM Models …
Zhipu.AI open-sources faster GLM models (8x speedup), launches Z.ai, aiming for global expansion, potentially ahead of IPO. The post <a href="https://...
DeepSeek Signals Next-Gen R2 Model, Unveils Novel Approach …
DeepSeek AI, a prominent player in the large language model arena, has recently published a research paper detailing a new technique aimed at enhancin...