Article List

Explore latest news, discover interesting content, and dive deep into topics that interest you

Clear Filters
Which Agent Causes Task Failures and When?Researchers from PSU and Duke explores automated failure attribution of LLM Multi-Agent Systems AI Technology Review

Which Agent Causes Task Failures and When?Researchers from …

In recent years, LLM Multi-Agent systems have garnered widespread attention for their collaborative approach to solving complex problems. However, it'...

5 months ago Synced
357217 words 1190 min
ByteDance Introduces Astra: A Dual-Model Architecture for Autonomous Robot Navigation AI Technology Review

ByteDance Introduces Astra: A Dual-Model Architecture for A…

ByteDance introduces Astra, an innovative dual-model architecture revolutionizing robot navigation in complex indoor environments. The post <a href="h...

6 months, 3 weeks ago Synced
12075 words 40 min
MIT Researchers Unveil “SEAL”: A New Step Towards Self-Improving AI AI Technology Review

MIT Researchers Unveil “SEAL”: A New Step Towards Self-Impr…

MIT introduces SEAL, a framework enabling large language models to self-edit and update their weights via reinforcement learning. The post <a href="ht...

6 months, 4 weeks ago Synced
7739 words 25 min
Researchers from PSU and Duke introduce “Multi-Agent Systems Automated Failure Attribution AI Technology Review

Researchers from PSU and Duke introduce “Multi-Agent System…

"Automated failure attribution" is a crucial component in the development lifecycle of Multi-Agent systems. It has the potential to transform the chal...

6 months, 4 weeks ago Synced
66698 words 222 min
Adobe Research Unlocking Long-Term Memory in Video World Models with State-Space Models AI Technology Review

Adobe Research Unlocking Long-Term Memory in Video World Mo…

By combining State-Space Models (SSMs) for efficient long-range dependency modeling with dense local attention for coherence, and using training strat...

7 months, 2 weeks ago Synced
5631 words 18 min
DeepSeek-V3 New Paper is coming! Unveiling the Secrets of Low-Cost Large Model Training through Hardware-Aware Co-design AI Technology Review

DeepSeek-V3 New Paper is coming! Unveiling the Secrets of L…

A newly released 14-page technical paper from the team behind DeepSeek-V3, with DeepSeek CEO Wenfeng Liang as a co-author, sheds light on the “Scaling...

7 months, 4 weeks ago Synced
20269 words 67 min
DeepSeek Unveils DeepSeek-Prover-V2: Advancing Neural Theorem Proving with Recursive Proof Search and a New Benchmark AI Technology Review

DeepSeek Unveils DeepSeek-Prover-V2: Advancing Neural Theor…

DeepSeek AI releases DeepSeek-Prover-V2, an open-source LLM for Lean 4 theorem proving. It uses recursive proof search with DeepSeek-V3 for training d...

8 months, 2 weeks ago Synced
5483 words 18 min
Can GRPO be 10x Efficient? Kwai AI’s SRPO Suggests Yes with SRPO AI Technology Review

Can GRPO be 10x Efficient? Kwai AI’s SRPO Suggests Yes with…

Kwai AI's SRPO framework slashes LLM RL post-training steps by 90% while matching DeepSeek-R1 performance in math and code. This two-stage RL approach...

8 months, 3 weeks ago Synced
13391 words 44 min
AI Technology Review Synced

Zhipu.AI’s Open-Source Power Play: Blazing-Fast GLM Models …

Zhipu.AI open-sources faster GLM models (8x speedup), launches Z.ai, aiming for global expansion, potentially ahead of IPO. The post <a href="https://...

8 months, 4 weeks ago Synced
3820 words 12 min
AI Technology Review Synced

DeepSeek Signals Next-Gen R2 Model, Unveils Novel Approach …

DeepSeek AI, a prominent player in the large language model arena, has recently published a research paper detailing a new technique aimed at enhancin...

9 months ago Synced
6894 words 22 min