Category: 2026-05-29

五月 29, 2026 When Cloud Agents Meet Device Agents: Lessons from Hybrid Multi-Agent Systems
五月 29, 2026 SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search
五月 29, 2026 RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models
五月 29, 2026 ToolSpec: Accelerating Tool Calling via Schema-Aware and Retrieval-Augmented Speculative Decoding
五月 29, 2026 SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding
五月 29, 2026 GrepSeek: Training Search Agents for Direct Corpus Interaction
五月 29, 2026 RTP-LLM: High-Performance Alibaba LLM Inference Engine
五月 29, 2026 Tiny Brains, Giant Impact: Uncovering the Keystone Neurons of LLM with Just a Few Prompts
五月 29, 2026 The Curse of Helpfulness: Inverse Scaling Law in Robustness to Distractor Instructions via DistractionIF
五月 29, 2026 Reasoning and Tool-use Compete in Agentic RL:From Quantifying Interference to Disentangled Tuning