-
April 28, 2026
AgenticCache: Cache-Driven Asynchronous Planning for Embodied AI Agents
-
April 28, 2026
BitRL: Reinforcement Learning with 1-bit Quantized Language Models for Resource-Constrained Edge Deployment
-
April 28, 2026
DepthKV: Layer-Dependent KV Cache Pruning for Long-Context LLM Inference
-
April 28, 2026
Beyond the Attention Stability Boundary: Agentic Self-Synthesizing Reasoning Protocols
-
April 28, 2026
Grounding Before Generalizing: How AI Differs from Humans in Causal Transfer
-
April 28, 2026
PhysNote: Self-Knowledge Notes for Evolvable Physical Reasoning in Vision-Language Model
-
April 28, 2026
Stabilizing Efficient Reasoning with Step-Level Advantage Selection
-
April 28, 2026
The Chameleon's Limit: Investigating Persona Collapse and Homogenization in Large Language Models
-
April 28, 2026
Long-Context Aware Upcycling: A New Frontier for Hybrid LLM Scaling
-
April 28, 2026
FlashOverlap: Minimizing Tail Latency in Communication Overlap for Distributed LLM Training