-
April 27, 2026
LayerBoost: Layer-Aware Attention Reduction for Efficient LLMs
-
April 27, 2026
Lightweight Retrieval-Augmented Generation and Large Language Model-Based Modeling for Scalable Patient-Trial Matching
-
April 27, 2026
Emergent Strategic Reasoning Risks in AI: A Taxonomy-Driven Evaluation Framework
-
April 27, 2026
Trust but Verify: Introducing DAVinCI -- A Framework for Dual Attribution and Verification in Claim Inference for Language Models
-
April 27, 2026
MambaCSP: Hybrid-Attention State Space Models for Hardware-Efficient Channel State Prediction
-
April 27, 2026
Pre-trained LLMs Meet Sequential Recommenders: Efficient User-Centric Knowledge Distillation
-
April 27, 2026
Memanto: Typed Semantic Memory with Information-Theoretic Retrieval for Long-Horizon Agents
-
April 27, 2026
Tool Attention Is All You Need: Dynamic Tool Gating and Lazy Schema Loading for Eliminating the MCP/Tools Tax in Scalable Agentic Workflows
-
April 27, 2026
Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models
-
April 27, 2026
Guess-Verify-Refine: Data-Aware Top-K for Sparse-Attention Decoding on Blackwell via Temporal Correlation
-
April 27, 2026
Behavioral Canaries: Auditing Private Retrieved Context Usage in RL Fine-Tuning
-
April 27, 2026
GR-Evolve: Design-Adaptive Global Routing via LLM-Driven Algorithm Evolution