-
April 27, 2026
ARES: Adaptive Red-Teaming and End-to-End Repair of Policy-Reward System
-
April 27, 2026
Copy-as-Decode: Grammar-Constrained Parallel Prefill for LLM Editing
-
April 27, 2026
River-LLM: Large Language Model Seamless Exit Based on KV Share
-
April 27, 2026
Unlocking the Edge deployment and ondevice acceleration of multi-LoRA enabled one-for-all foundational LLM
-
April 27, 2026
HybridGen: Efficient LLM Generative Inference via CPU-GPU Hybrid Computing
-
April 27, 2026
Training and Agentic Inference Strategies for LLM-based Manim Animation Generation
-
April 27, 2026
AQPIM: Breaking the PIM Capacity Wall for LLMs with In-Memory Activation Quantization
-
April 27, 2026
StepPO: Step-Aligned Policy Optimization for Agentic Reinforcement Learning
-
April 27, 2026
MASS-RAG: Multi-Agent Synthesis Retrieval-Augmented Generation
-
April 27, 2026
First, Do No Harm (With LLMs): Mitigating Racial Bias via Agentic Workflows
-
April 27, 2026
Cyber Defense Benchmark: Agentic Threat Hunting Evaluation for LLMs in SecOps
-
April 27, 2026
TRN-R1-Zero: Text-rich Network Reasoning via LLMs with Reinforcement Learning Only
-
April 27, 2026
Detoxification for LLM: From Dataset Itself
-
April 27, 2026
SAW-INT4: System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving
-
April 27, 2026
If you're waiting for a sign... that might not be it! Mitigating Trust Boundary Confusion from Visual Injections on Vision-Language Agentic Systems
-
April 27, 2026
Statistics, Not Scale: Modular Medical Dialogue with Bayesian Belief Engine
-
April 27, 2026
A-MAR: Agent-based Multimodal Art Retrieval for Fine-Grained Artwork Understanding
-
April 27, 2026
Rethinking Scale: Deployment Trade-offs of Small Language Models under Agent Paradigms
-
April 27, 2026
GRASPrune: Global Gating for Budgeted Structured Pruning of Large Language Models
-
April 27, 2026
ChipCraftBrain: Validation-First RTL Generation via Multi-Agent Orchestration