2026-04-27 Paper Digest

307 arXiv papers matched the agent / LLM / AI infrastructure topics on this day. Claude Code generated detailed analyses for 10 of them; the remaining 297 are listed at the end.

1. Preference Heads in Large Language Models: A Mechanistic Framework for Interpretable Personalization

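arXiv: 2604.22345 · cs.CL · relevance score 22

The paper proposes the Preference Heads hypothesis, which holds that a sparse set of attention heads in LLMs encodes user preferences, and builds on it a training-free Differential Preference Steering (DPS) framework for interpretable personalized inference.

Read the full analysis →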


2. Sovereign Agentic Loops: Decoupling AI Reasoning from Execution in Real-World Systems

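arXiv: 2604.22136 · cs.CR · relevance score 21

Proposes Sovereign Agentic Loops (SAL): an LLM agent's reasoning is decoupled from execution, and a control plane validates the structured intents emitted by the model against policy and state, giving auditable, replayable, safe execution.

Read the full analysis →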
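
The decoupling described in this summary can be illustrated with a minimal sketch. It is not the paper's design: the `Intent` fields, the `ControlPlane` class, the lifecycle states, and the `replay` helper are all hypothetical names chosen for illustration. The model only emits a structured intent; a separate control plane checks it against a policy and the current state, records every decision, and executes only what passes.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass(frozen=True)
class Intent:
    """Structured request emitted by the model; it executes nothing itself."""
    action: str
    target: str
    params: dict[str, Any] = field(default_factory=dict)


@dataclass
class ControlPlane:
    allowed_actions: set[str]          # hypothetical policy: an allow-list
    state: dict[str, str]              # hypothetical state: target -> status
    audit_log: list[tuple[Intent, bool, str]] = field(default_factory=list)

    def check(self, intent: Intent) -> tuple[bool, str]:
        # Policy check: is this kind of action permitted at all?
        if intent.action not in self.allowed_actions:
            return False, "policy: action not allowed"
        # State checks: does the action make sense right now?
        if intent.target not in self.state:
            return False, "state: unknown target"
        if intent.action == "restart" and self.state[intent.target] != "running":
            return False, "state: target is not running"
        return True, "ok"

    def submit(self, intent: Intent) -> bool:
        ok, reason = self.check(intent)
        self.audit_log.append((intent, ok, reason))  # every decision is recorded
        if ok:
            self._execute(intent)
        return ok

    def _execute(self, intent: Intent) -> None:
        # Stand-in for real side effects; only the control plane reaches this.
        if intent.action == "stop":
            self.state[intent.target] = "stopped"
        elif intent.action == "restart":
            self.state[intent.target] = "running"


def replay(log, allowed_actions, initial_state) -> list[bool]:
    """Re-run logged intents against a fresh control plane."""
    cp = ControlPlane(set(allowed_actions), dict(initial_state))
    return [cp.submit(intent) for intent, _, _ in log]
```

Because decisions depend only on the logged intents, the policy, and the initial state, replaying the audit log reproduces the same accept/reject sequence, which is the property the summary calls replayable.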


3. Memanto: Typed Semantic Memory with Information-Theoretic Retrieval for Long-Horizon Agents

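arXiv: 2604.22085 · cs.AI · relevance score 20

Memanto replaces hybrid knowledge graphs with a typed semantic memory schema plus the Moorcheh information-theoretic retrieval engine. It reaches SOTA results of 89.8% on LongMemEval and 87.1% on LoCoMo, with single-pass retrieval and zero ingestion latency.

Read the full analysis →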


4. GR-Evolve: Design-Adaptive Global Routing via LLM-Driven Algorithm Evolution

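arXiv: 2604.22234 · cs.AR · relevance score 19

GR-Evolve uses an agentic LLM to iteratively rewrite global routing source code, tailoring the EDA algorithm to each design based on QoR feedback, and reduces wirelength by up to 8.72% across seven benchmarks.

Read the full analysis →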


5. Behavioral Canaries: Auditing Private Retrieved Context Usage in RL Fine-Tuning

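arXiv: 2604.22191 · cs.CR · relevance score 19

Proposes Behavioral Canaries: behavioral markers are planted in an RL fine-tuning (RLFT) pipeline through "document triggers + stylized feedback" to audit whether private retrieved context was improperly used for training.

Read the full analysis →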


6. Emergent Strategic Reasoning Risks in AI: A Taxonomy-Driven Evaluation Framework

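arXiv: 2604.22119 · cs.AI · relevance score 19

Proposes the ESRR risk taxonomy plus the ESRRSim automated evaluation framework, which systematically measures emergent strategic reasoning risks in LLMs (deception, evaluation gaming, reward hacking, and others). It covers 11 reasoning models, with detection rates of 14.45%–72.72%.

Read the full analysis →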


7. Lightweight Retrieval-Augmented Generation and Large Language Model-Based Modeling for Scalable Patient-Trial Matching

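arXiv: 2604.22061 · cs.CL · relevance score 19

Proposes a lightweight framework that decouples RAG from LLM encoding for patient-trial matching. On several public datasets and real-world Mayo Clinic data, it performs on par with end-to-end LLMs at significantly lower compute cost.

Read the full analysis →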


8. LayerBoost: Layer-Aware Attention Reduction for Efficient LLMs

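arXiv: 2604.22050 · cs.LG · relevance score 19

Based on a per-layer sensitivity analysis, LayerBoost treats each transformer layer differently: it keeps softmax attention, replaces it with linear sliding-window attention, or removes attention entirely. Recovery needs only 10M tokens of distillation, and high-concurrency throughput improves by up to 68%.

Read the full analysis →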


9. Guess-Verify-Refine: Data-Aware Top-K for Sparse-Attention Decoding on Blackwell via Temporal Correlation

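arXiv: 2604.22312 · cs.DC · relevance score 18

GVR exploits the temporal correlation of Top-K results between adjacent decode steps to implement data-aware exact Top-K on Blackwell. It reports an average single-operator speedup of 1.88× and up to a 7.52% end-to-end TPOT improvement for DeepSeek-V3.2 at 100K context.

Read the full analysis →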


10. QuantClaw: Precision Where It Matters for OpenClaw

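arXiv: 2604.22577 · cs.AI · relevance score 17

QuantClaw is a plug-and-play precision-routing plugin for OpenClaw that dynamically assigns quantization precision according to task complexity. On GLM-5 it achieves up to 21.4% lower cost and 15.7% lower latency than an FP8 baseline.

Read the full analysis →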


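Other papers matched that day

These papers hit the same topic keywords but were not selected for the top-N in-depth analysis. They are sorted by relevance score in descending order.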

  1. Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models · cs.AI · arXiv 2604.21896 · score 23large language model, llm, agent, agentic, rag, reasoning
  2. Tool Attention Is All You Need: Dynamic Tool Gating and Lazy Schema Loading for Eliminating the MCP/Tools Tax in Scalable Agentic Workflows · cs.AI · arXiv 2604.21816 · score 23large language model, llm, agent, agentic, reasoning, attention
  3. Pre-trained LLMs Meet Sequential Recommenders: Efficient User-Centric Knowledge Distillation · cs.IR · arXiv 2604.21536 · score 20large language model, llm, reasoning, inference, serving, fine-tun
  4. MambaCSP: Hybrid-Attention State Space Models for Hardware-Efficient Channel State Prediction · cs.IT · arXiv 2604.21957 · score 20large language model, llm, inference, attention, transformer, throughput
  5. Enhancing Online Recruitment with Category-Aware MoE and LLM-based Data Augmentation · cs.AI · arXiv 2604.21264 · score 19large language model, llm, rag, chain-of-thought, mixture of experts, moe
  6. Learning to Communicate: Toward End-to-End Optimization of Multi-Agent Language Systems · cs.AI · arXiv 2604.21794 · score 18large language model, agent, multi-agent, reasoning, inference
  7. A Task Decomposition and Planning Framework for Efficient LLM Inference in AI-Enabled WiFi-Offload Networks · cs.DC · arXiv 2604.21399 · score 18large language model, llm, rag, reasoning, inference, latency
  8. Focus Session: Hardware and Software Techniques for Accelerating Multimodal Foundation Models · cs.LG · arXiv 2604.21952 · score 18llm, quantization, speculative decoding, attention, transformer, latency
  9. SparKV: Overhead-Aware KV Cache Loading for Efficient On-Device LLM Inference · cs.NI · arXiv 2604.21231 · score 18large language model, llm, inference, kv cache, latency
  10. Bridging the Long-Tail Gap: Robust Retrieval-Augmented Relation Completion via Multi-Stage Paraphrase Infusion · cs.CL · arXiv 2604.22261 · score 17large language model, llm, retrieval, rag, reasoning, fine-tun
  11. Stealthy Backdoor Attacks against LLMs Based on Natural Style Triggers · cs.CR · arXiv 2604.21700 · score 17large language model, llm, rag, serving, fine-tun
  12. Reasoning Primitives in Hybrid and Non-Hybrid LLMs · cs.CL · arXiv 2604.21454 · score 17large language model, llm, retrieval, reasoning, attention, transformer
  13. How Do AI Agents Spend Your Money? Analyzing and Predicting Token Consumption in Agentic Coding Tasks · cs.CL · arXiv 2604.22750 · score 16llm, agent, agentic, rag, reasoning
  14. Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond · cs.AI · arXiv 2604.22748 · score 16agent, agentic, multi-agent, ai system
  15. Large Language Models Decide Early and Explain Later · cs.CL · arXiv 2604.22266 · score 16large language model, rag, reasoning, chain-of-thought, inference, latency
  16. ResRank: Unifying Retrieval and Listwise Reranking via End-to-End Joint Training with Residual Passage Compression · cs.IR · arXiv 2604.22180 · score 16large language model, llm, retrieval, inference, latency
  17. Reliable Self-Harm Risk Screening via Adaptive Multi-Agent LLM Systems · cs.LG · arXiv 2604.22154 · score 16llm, agent, multi-agent, ai system
  18. HiCrew: Hierarchical Reasoning for Long-Form Video Understanding via Question-Aware Multi-Agent Collaboration · cs.AI · arXiv 2604.21444 · score 16agent, multi-agent, rag, reasoning, serving
  19. Spatial Metaphors for LLM Memory: A Critical Analysis of the MemPalace Architecture · cs.AI · arXiv 2604.21284 · score 16large language model, llm, retrieval, rag, inference
  20. Aligning Dense Retrievers with LLM Utility via DistillationAligning Dense Retrievers with LLM Utility via Distillation · cs.IR · arXiv 2604.22722 · score 15llm, retrieval, rag, inference, serving
  21. Can QPP Choose the Right Query Variant? Evaluating Query Variant Selection for RAG Pipelines · cs.IR · arXiv 2604.22661 · score 15large language model, llm, retrieval, rag, latency
  22. SpikingBrain2.0: Brain-Inspired Foundation Models for Efficient Long-Context and Cross-Platform Inference · cs.LG · arXiv 2604.22575 · score 15llm, inference, quantization, attention, transformer, gpu
  23. CGC: Compositional Grounded Contrast for Fine-Grained Multi-Image Understanding · cs.CV · arXiv 2604.22498 · score 15large language model, llm, reasoning, chain-of-thought, attention
  24. Evaluating LLM-Based Goal Extraction in Requirements Engineering: Prompting Strategies and Their Limitations · cs.SE · arXiv 2604.22207 · score 15large language model, llm, retrieval, rag, chain-of-thought
  25. How Large Language Models Balance Internal Knowledge with User and Document Assertions · cs.CL · arXiv 2604.22193 · score 15large language model, llm, rag, fine-tun, post-train
  26. Voice Under Revision: Large Language Models and the Normalization of Personal Narrative · cs.CL · arXiv 2604.22142 · score 15large language model, llm, reasoning, serving
  27. PrivUn: Unveiling Latent Ripple Effects and Shallow Forgetting in Privacy Unlearning · cs.LG · arXiv 2604.22076 · score 15large language model, llm, retrieval, rag, fine-tun
  28. Transient Turn Injection: Exposing Stateless Multi-Turn Vulnerabilities in Large Language Models · cs.CR · arXiv 2604.21860 · score 15large language model, llm, agent, rag
  29. Language as a Latent Variable for Reasoning Optimization · cs.CL · arXiv 2604.21593 · score 15llm, reasoning, chain-of-thought, inference, serving
  30. AgenticQwen: Training Small Agentic Language Models with Dual Data Flywheels for Industrial-Scale Tool Use · cs.CL · arXiv 2604.21590 · score 15agent, agentic, tool use, reasoning, latency
  31. OptiVerse: A Comprehensive Benchmark towards Optimization Problem Solving · cs.CL · arXiv 2604.21510 · score 15large language model, llm, agent, reasoning
  32. Efficient Agent Evaluation via Diversity-Guided User Simulation · cs.AI · arXiv 2604.21480 · score 15large language model, llm, agent, rag
  33. ReaGeo: Reasoning-Enhanced End-to-End Geocoding with LLMs · cs.AI · arXiv 2604.21357 · score 15large language model, llm, retrieval, reasoning, chain-of-thought
  34. CARE: Counselor-Aligned Response Engine for Online Mental-Health Support · cs.CL · arXiv 2604.21352 · score 15large language model, llm, agent, fine-tun
  35. From Research Question to Scientific Workflow: Leveraging Agentic AI for Science Automation · cs.AI · arXiv 2604.21910 · score 14llm, agent, agentic, rag
  36. Process Supervision via Verbal Critique Improves Reasoning in Large Language Models · cs.CL · arXiv 2604.21611 · score 14large language model, llm, reasoning, inference
  37. Thinking Without Words: Efficient Latent Reasoning with Abstract Chain-of-Thought · cs.CL · arXiv 2604.22709 · score 13rag, reasoning, chain-of-thought, inference, fine-tun, post-train
  38. SOLAR-RL: Semi-Online Long-horizon Assignment Reinforcement Learning · cs.LG · arXiv 2604.22558 · score 13large language model, llm, agent
  39. RouteLMT: Learned Sample Routing for Hybrid LLM Translation Deployment · cs.CL · arXiv 2604.22520 · score 13large language model, llm, serving
  40. CNSL-bench: Benchmarking the Sign Language Understanding Capabilities of MLLMs on Chinese National Sign Language · cs.CL · arXiv 2604.22367 · score 13large language model, llm, rag, reasoning
  41. CAP: Controllable Alignment Prompting for Unlearning in LLMs · cs.LG · arXiv 2604.21251 · score 13large language model, llm, serving
  42. Ethics Testing: Proactive Identification of Generative AI System Harms · cs.SE · arXiv 2604.22089 · score 13large language model, llm, ai system
  43. TingIS: Real-time Risk Event Discovery from Noisy Customer Incidents at Enterprise Scale · cs.CL · arXiv 2604.21889 · score 13large language model, llm, throughput, latency
  44. A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents · cs.CL · arXiv 2604.21885 · score 13large language model, llm, reasoning, attention
  45. GS-Quant: Granular Semantic and Generative Structural Quantization for Knowledge Graph Completion · cs.AI · arXiv 2604.21649 · score 13large language model, llm, reasoning, quantization
  46. DryRUN: On the Role of Public Tests in LLM-Driven Code Generation · cs.SE · arXiv 2604.21598 · score 13large language model, llm, multi-agent
  47. CoFEE: Reasoning Control for LLM-Based Feature Discovery · cs.AI · arXiv 2604.21584 · score 13large language model, llm, rag, reasoning
  48. A Metamorphic Testing Approach to Diagnosing Memorization in LLM-Based Program Repair · cs.SE · arXiv 2604.21579 · score 13large language model, llm, serving
  49. Measuring Opinion Bias and Sycophancy via LLM-based Coercion · cs.CL · arXiv 2604.21564 · score 13large language model, llm, agent
  50. Job Skill Extraction via LLM-Centric Multi-Module Framework · cs.CL · arXiv 2604.21525 · score 13large language model, llm, retrieval, fine-tun
  51. When Agents Look the Same: Quantifying Distillation-Induced Similarity in Tool-Use Behaviors · cs.CL · arXiv 2604.21255 · score 13llm, agent, tool-use, reasoning
  52. VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation · cs.CL · arXiv 2604.21375 · score 12llm, agent, agentic
  53. Introducing Background Temperature to Characterise Hidden Randomness in Large Language Models · cs.AI · arXiv 2604.22411 · score 12large language model, llm, inference
  54. When Does LLM Self-Correction Help? A Control-Theoretic Markov Diagnostic and Verify-First Intervention · cs.AI · arXiv 2604.22273 · score 12llm, agent, agentic
  55. A Co-Evolutionary Theory of Human-AI Coexistence: Mutualism, Governance, and Dynamics in Complex Societies · cs.CY · arXiv 2604.22227 · score 12rag, serving, transformer, ai system
  56. Sound Agentic Science Requires Adversarial Experiments · cs.AI · arXiv 2604.22080 · score 12llm, agent, agentic
  57. Read the Paper, Write the Code: Agentic Reproduction of Social-Science Results · cs.AI · arXiv 2604.21965 · score 12llm, agent, agentic
  58. StructMem: Structured Memory for Long-Horizon Behavior in LLMs · cs.CL · arXiv 2604.21748 · score 12llm, agent, rag, reasoning
  59. AI-Gram: When Visual Agents Interact in a Social Network · cs.AI · arXiv 2604.21446 · score 12llm, agent, multi-agent
  60. Understanding and Mitigating Spurious Signal Amplification in Test-Time Reinforcement Learning for Math Reasoning · cs.LG · arXiv 2604.21327 · score 12large language model, rag, reasoning, inference
  61. CI-Work: Benchmarking Contextual Integrity in Enterprise LLM Agents · cs.CR · arXiv 2604.21308 · score 12llm, agent, retrieval, reasoning
  62. GraphLeap: Decoupling Graph Construction and Convolution for Vision GNN Acceleration on FPGA · cs.CV · arXiv 2604.21290 · score 12inference, parallelism, transformer, gpu, fine-tun
  63. Strategic Heterogeneous Multi-Agent Architecture for Cost-Effective Code Vulnerability Detection · cs.CR · arXiv 2604.21282 · score 12llm, agent, multi-agent
  64. Do LLM Decoders Listen Fairly? Benchmarking How Language Model Priors Shape Bias in Speech Recognition · cs.CL · arXiv 2604.21276 · score 12large language model, llm, inference
  65. Hyperloop Transformers · cs.LG · arXiv 2604.21254 · score 12llm, quantization, transformer, latency, post-train
  66. Rethinking Math Reasoning Evaluation: A Robust LLM-as-a-Judge Framework Beyond Symbolic Rigidity · cs.AI · arXiv 2604.22597 · score 11large language model, llm, reasoning
  67. Learning Evidence Highlighting for Frozen LLMs · cs.CL · arXiv 2604.22565 · score 11large language model, llm, reasoning
  68. FeatEHR-LLM: Leveraging Large Language Models for Feature Engineering in Electronic Health Records · cs.LG · arXiv 2604.22534 · score 11large language model, llm, rag
  69. Superminds Test: Actively Evaluating Collective Intelligence of Agent Society via Probing Agents · cs.AI · arXiv 2604.22452 · score 11large language model, agent, reasoning
  70. SSG: Logit-Balanced Vocabulary Partitioning for LLM Watermarking · cs.CR · arXiv 2604.22438 · score 11large language model, llm, reasoning
  71. Context-Fidelity Boosting: Enhancing Faithful Generation through Watermark-Inspired Decoding · cs.CL · arXiv 2604.22335 · score 11large language model, llm, attention
  72. Dynamically Acquiring Text Content to Enable the Classification of Lesser-known Entities for Real-world Tasks · cs.CL · arXiv 2604.22325 · score 11large language model, llm, rag
  73. BLAST: Benchmarking LLMs with ASP-based Structured Testing · cs.LO · arXiv 2604.22306 · score 11large language model, llm, attention
  74. Tell Me Why: Designing an Explainable LLM-based Dialogue System for Student Problem Behavior Diagnosis · cs.CL · arXiv 2604.22237 · score 11large language model, llm, fine-tun
  75. Recognition Without Authorization: LLMs and the Moral Order of Online Advice · cs.CY · arXiv 2604.22143 · score 11large language model, llm, rag
  76. Reliability Auditing for Downstream LLM tasks in Psychiatry: LLM-Generated Hospitalization Risk Scores · cs.LG · arXiv 2604.22063 · score 11large language model, llm, reasoning
  77. Call-Chain-Aware LLM-Based Test Generation for Java Projects · cs.SE · arXiv 2604.22046 · score 11large language model, llm, rag
  78. Shared Lexical Task Representations Explain Behavioral Variability In LLMs · cs.CL · arXiv 2604.22027 · score 11large language model, llm, attention
  79. Machine Behavior in Relational Moral Dilemmas: Moral Rightness, Predicted Human Behavior, and Model Decisions · cs.CL · arXiv 2604.21871 · score 11large language model, llm, reasoning
  80. Thinking with Reasoning Skills: Fewer Tokens, More Accuracy · cs.AI · arXiv 2604.21764 · score 11llm, reasoning, chain-of-thought, inference
  81. Promoting Simple Agents: Ensemble Methods for Event-Log Prediction · cs.LG · arXiv 2604.21629 · score 11agent, inference, transformer, latency
  82. Separable Expert Architecture: Toward Privacy-Preserving LLM Personalization via Composable Adapters and Deletable User Proxies · cs.AI · arXiv 2604.21571 · score 11llm, inference, serving
  83. Unbiased Prevalence Estimation with Multicalibrated LLMs · cs.AI · arXiv 2604.21549 · score 11large language model, llm, rag
  84. VARestorer: One-Step VAR Distillation for Real-World Image Super-Resolution · cs.CV · arXiv 2604.21450 · score 11rag, inference, attention, transformer, fine-tun
  85. Decoupled Travel Planning with Behavior Forest · cs.LG · arXiv 2604.21354 · score 11large language model, llm, reasoning
  86. Symbolic Grounding Reveals Representational Bottlenecks in Abstract Visual Reasoning · cs.AI · arXiv 2604.21346 · score 11large language model, llm, reasoning
  87. Ideological Bias in LLMs’ Economic Causal Reasoning · cs.AI · arXiv 2604.21334 · score 11large language model, llm, reasoning
  88. Can MLLMs “Read” What is Missing? · cs.AI · arXiv 2604.21277 · score 11large language model, llm, rag
  89. Unlocking the Power of Large Language Models for Multi-table Entity Matching · cs.CL · arXiv 2604.21238 · score 11large language model, llm, rag
  90. Beyond N-gram: Data-Aware X-GRAM Extraction for Efficient Embedding Parameter Scaling · cs.CL · arXiv 2604.21724 · score 10retrieval, rag, serving, attention
  91. Navigating Large-Scale Document Collections: MuDABench for Multi-Document Analytical QA · cs.CL · arXiv 2604.22239 · score 10multi-agent, retrieval, rag, reasoning
  92. Sum-of-Checks: Structured Reasoning for Surgical Safety with Large Vision-Language Models · cs.LG · arXiv 2604.22156 · score 10rag, reasoning, chain-of-thought, ai system
  93. Removing Sandbagging in LLMs by Training with Weak Supervision · cs.LG · arXiv 2604.22082 · score 10llm, ai system, fine-tun
  94. Source-Modality Monitoring in Vision-Language Models · cs.CL · arXiv 2604.22038 · score 10agent, agentic, retrieval
  95. AEL: Agent Evolving Learning for Open-Ended Environments · cs.CL · arXiv 2604.21725 · score 10llm, agent, retrieval
  96. GeoMind: An Agentic Workflow for Lithology Classification with Reasoned Tool Invocation · cs.AI · arXiv 2604.21501 · score 10agent, agentic, reasoning
  97. FairQE: Multi-Agent Framework for Mitigating Gender Bias in Translation Quality Estimation · cs.AI · arXiv 2604.21420 · score 10llm, multi-agent, reasoning
  98. Representational Harms in LLM-Generated Narratives Against Global Majority Nationalities · cs.CL · arXiv 2604.22749 · score 9large language model, llm
  99. Dharma, Data and Deception: An LLM-Powered Rhetorical Analysis of Cow-Urine Health Claims on YouTube · cs.CL · arXiv 2604.22606 · score 9large language model, llm
  100. From Natural Language to Verified Code: Toward AI Assisted Problem-to-Code Generation with Dafny-Based Formal Verification · cs.SE · arXiv 2604.22601 · score 9large language model, llm
  101. Controllable Spoken Dialogue Generation: An LLM-Driven Grading System for K-12 Non-Native English Learners · cs.CL · arXiv 2604.22542 · score 9large language model, llm
  102. HGQ-LUT: Fast LUT-Aware Training and Efficient Architectures for DNN Inference · cs.AR · arXiv 2604.22293 · score 9inference, quantization, gpu, latency
  103. How LLMs Detect and Correct Their Own Errors: The Role of Internal Confidence Signals · cs.LG · arXiv 2604.22271 · score 9large language model, llm
  104. A Probabilistic Framework for Hierarchical Goal Recognition · cs.SC · arXiv 2604.22256 · score 9agent, reasoning, inference
  105. When AI Speaks, Whose Values Does It Express? A Cross-Cultural Audit of Individualism-Collectivism Bias in Large Language Models · cs.CL · arXiv 2604.22153 · score 9large language model, ai system
  106. SHAPE: Unifying Safety, Helpfulness and Pedagogy for Educational LLMs · cs.CL · arXiv 2604.22134 · score 9large language model, llm
  107. PermaFrost-Attack: Stealth Pretraining Seeding(SPS) for planting Logic Landmines During LLM Training · cs.LG · arXiv 2604.22117 · score 9large language model, llm
  108. Spontaneous Persuasion: An Audit of Model Persuasiveness in Everyday Conversations · cs.HC · arXiv 2604.22109 · score 9large language model, llm
  109. When Cow Urine Cures Constipation on YouTube: Limits of LLMs in Detecting Culture-specific Health Misinformation · cs.CL · arXiv 2604.22002 · score 9large language model, llm
  110. Evaluation of Automatic Speech Recognition Using Generative Large Language Models · cs.CL · arXiv 2604.21928 · score 9large language model, llm
  111. Revisiting Non-Verbatim Memorization in Large Language Models: The Role of Entity Surface Forms · cs.CL · arXiv 2604.21882 · score 9large language model, llm
  112. Leveraging SIMD for Accelerating Large-number Arithmetic · cs.DC · arXiv 2604.21566 · score 9rag, parallelism, throughput, latency
  113. MISTY: High-Throughput Motion Planning via Mixer-based Single-step Drifting · cs.RO · arXiv 2604.21489 · score 9inference, attention, throughput, latency
  114. Differentially Private De-identification of Dutch Clinical Notes: A Comparative Evaluation · cs.CR · arXiv 2604.21421 · score 9large language model, llm
  115. Time, Causality, and Observability Failures in Distributed AI Inference Systems · cs.AI · arXiv 2604.21361 · score 9inference, ai system, throughput
  116. When Bigger Isn’t Better: A Comprehensive Fairness Evaluation of Political Bias in Multi-News Summarisation · cs.CL · arXiv 2604.21309 · score 9large language model, llm
  117. ReCAPA: Hierarchical Predictive Correction to Mitigate Cascading Failures · cs.AI · arXiv 2604.21232 · score 9large language model, agent
  118. EngramaBench: Evaluating Long-Term Conversational Memory with Structured Graph Retrieval · cs.CL · arXiv 2604.21229 · score 9large language model, retrieval, reasoning
  119. Aggregate vs. Personalized Judges in Business Idea Evaluation: Evidence from Expert Disagreement · cs.CL · arXiv 2604.22517 · score 8llm, rag, reasoning
  120. From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company · cs.AI · arXiv 2604.22446 · score 8agent, multi-agent
  121. Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets · cs.CL · arXiv 2604.22294 · score 8llm, rag, reasoning
  122. ReLeVAnT: Relevance Lexical Vectors for Accurate Legal Text Classification · cs.CL · arXiv 2604.22292 · score 8llm, retrieval, rag
  123. An LLM-Driven Closed-Loop Autonomous Learning Framework for Robots Facing Uncovered Tasks in Open Environments · cs.RO · arXiv 2604.22199 · score 8llm, rag, reasoning
  124. Hardware-Software Co-Design for Event-Driven SNN Deployment on Low-Cost Neuromorphic FPGAs · cs.AR · arXiv 2604.22179 · score 8serving, gpu, latency
  125. FlashSpread: IO-Aware GPU Simulation of Non-Markovian Epidemic Dynamics via Kernel Fusion · cs.DC · arXiv 2604.22092 · score 8rag, gpu, cuda, throughput
  126. When Quotes Crumble: Detecting Transient Mechanical Liquidity Erosion in Limit Order Books · cs.LG · arXiv 2604.21993 · score 8agent, multi-agent
  127. When Prompts Override Vision: Prompt-Induced Hallucinations in LVLMs · cs.CV · arXiv 2604.21911 · score 8rag, serving, fine-tun
  128. Low-Rank Adaptation Redux for Large Models · cs.LG · arXiv 2604.21905 · score 8serving, fine-tun, post-train
  129. TraceScope: Interactive URL Triage via Decoupled Checklist Adjudication · cs.CR · arXiv 2604.21840 · score 8llm, agent
  130. Why are all LLMs Obsessed with Japanese Culture? On the Hidden Cultural and Regional Biases of LLMs · cs.CL · arXiv 2604.21751 · score 8llm, rag, fine-tun
  131. Agentic AI-assisted coding offers a unique opportunity to instill epistemic grounding during software development · cs.SE · arXiv 2604.21744 · score 8agent, agentic
  132. Risk-Aware and Stable Edge Server Selection Under Network Latency SLOs · cs.DC · arXiv 2604.21483 · score 8rag, serving, latency
  133. CSC: Turning the Adversary’s Poison against Itself · cs.CR · arXiv 2604.21416 · score 8rag, serving, fine-tun
  134. Adaptive Head Budgeting for Efficient Multi-Head Attention · cs.LG · arXiv 2604.22583 · score 7inference, attention, transformer
  135. From Local to Cluster: A Unified Framework for Causal Discovery with Latent Variables · cs.LG · arXiv 2604.22416 · score 7rag, reasoning, inference
  136. Fast Neural-Network Approximation of Active Target Search Under Uncertainty · cs.LG · arXiv 2604.22254 · score 7agent, inference
  137. GenMatter: Perceiving Physical Objects with Generative Matter Models · cs.CV · arXiv 2604.22160 · score 7inference, serving
  138. Who Audits the Auditor? Tamper-Proof Fraud Detection with Blockchain-Anchored Explainable ML · cs.CR · arXiv 2604.22096 · score 7rag, inference, latency
  139. Incentivizing Neuro-symbolic Language-based Reasoning in VLMs via Reinforcement Learning · cs.CL · arXiv 2604.22062 · score 7reasoning, inference, gpu
  140. Kernel Contracts: A Specification Language for ML Kernel Correctness Across Heterogeneous Silicon · cs.LG · arXiv 2604.22032 · score 7attention, cuda, compiler
  141. EVENT5Ws: A Large Dataset for Open-Domain Event Extraction from Documents · cs.CL · arXiv 2604.21890 · score 7large language model, rag
  142. From If-Statements to ML Pipelines: Revisiting Bias in Code-Generation · cs.CL · arXiv 2604.21716 · score 7large language model, rag
  143. BioMiner: A Multi-modal System for Automated Mining of Protein-Ligand Bioactivity Data from Literature · cs.AI · arXiv 2604.21508 · score 7large language model, reasoning
  144. How English Print Media Frames Human-Elephant Conflicts in India · cs.AI · arXiv 2604.21496 · score 7large language model, transformer
  145. How Supply Chain Dependencies Complicate Bias Measurement and Accountability Attribution in AI Hiring Applications · cs.CY · arXiv 2604.22679 · score 6rag, ai system
  146. BERAG: Bayesian Ensemble Retrieval-Augmented Generation for Knowledge-based Visual Question Answering · cs.CL · arXiv 2604.22678 · score 6retrieval, rag, fine-tun
  147. Chamelio: A Fast Shared Cloud Network Stack for Isolated Tenant-Defined Protocols · cs.NI · arXiv 2604.22603 · score 6serving, latency
  148. HubRouter: A Pluggable Sub-Quadratic Routing Primitive for Hybrid Sequence Models · cs.LG · arXiv 2604.22442 · score 6attention, transformer, throughput
  149. AgentSearchBench: A Benchmark for AI Agent Search in the Wild · cs.AI · arXiv 2604.22436 · score 6agent, retrieval
  150. STEM: Structure-Tracing Evidence Mining for Knowledge Graphs-Driven Retrieval-Augmented Generation · cs.CL · arXiv 2604.22282 · score 6retrieval, rag, reasoning
  151. Accelerating Intra-Node GPU-to-GPU Communication Through Multi-Path Transfers with CUDA Graphs · cs.DC · arXiv 2604.22228 · score 6rag, gpu, cuda
  152. Verbal Confidence Saturation in 3-9B Open-Weight Instruction-Tuned LLMs: A Pre-Registered Psychometric Validity Screen · cs.CL · arXiv 2604.22215 · score 6llm, reasoning
  153. Where Should LoRA Go? Component-Type Placement in Hybrid Language Models · cs.CL · arXiv 2604.22127 · score 6attention, transformer, fine-tun
  154. An End-to-End Ukrainian RAG for Local Deployment. Optimized Hybrid Search and Lightweight Generation · cs.CL · arXiv 2604.22095 · score 6retrieval, rag, fine-tun
  155. Outcome Rewards Do Not Guarantee Verifiable or Causally Important Reasoning · cs.CL · arXiv 2604.22074 · score 6reasoning, chain-of-thought, post-train
  156. Universal Transformers Need Memory: Depth-State Trade-offs in Adaptive Recursive Reasoning · cs.LG · arXiv 2604.21999 · score 6reasoning, attention, transformer
  157. A Multi-Stage Warm-Start Deep Learning Framework for Unit Commitment · eess.SY · arXiv 2604.21891 · score 6rag, attention, transformer
  158. SemEval-2026 Task 4: Narrative Story Similarity and Narrative Representation Learning · cs.CL · arXiv 2604.21782 · score 6llm, fine-tun
  159. PrismaDV: Automated Task-Aware Data Unit Test Generation · cs.LG · arXiv 2604.21765 · score 6rag, ai system
  160. A-IC3: Learning-Guided Adaptive Inductive Generalization for Hardware Model Checking · cs.LO · arXiv 2604.21688 · score 6agent, attention
  161. Task-specific Subnetwork Discovery in Reinforcement Learning for Autonomous Underwater Navigation · cs.LG · arXiv 2604.21640 · score 6agent, rag
  162. A-THENA: Early Intrusion Detection for IoT with Time-Aware Hybrid Encoding and Network-Specific Augmentation · cs.CR · arXiv 2604.21623 · score 6rag, transformer, latency
  163. A Kernel Nonconformity Score for Multivariate Conformal Prediction · stat.ML · arXiv 2604.21595 · score 6rag, serving
  164. Attention-based multiple instance learning for predominant growth pattern prediction in lung adenocarcinoma wsi using foundation models · cs.CV · arXiv 2604.21530 · score 6rag, attention, fine-tun
  165. Sub-Token Routing in LoRA for Adaptation and Query-Aware KV Compression · cs.LG · arXiv 2604.21335 · score 6serving, transformer
  166. Planning Beyond Text: Graph-based Reasoning for Complex Narrative Generation · cs.CL · arXiv 2604.21253 · score 6llm, reasoning
  167. Microarchitectural Co-Optimization for Sustained Throughput of RISC-V Multi-Lane Chaining Vector Processors · cs.AR · arXiv 2604.22314 · score 5parallelism, throughput
  168. Exploiting pre-optimized kernels with polyhedral transformations for CGRA compilation · cs.AR · arXiv 2604.22297 · score 5rag, parallelism
  169. Anatomy-Aware Unsupervised Detection and Localization of Retinal Abnormalities in Optical Coherence Tomography · cs.CV · arXiv 2604.22139 · score 5rag, inference
  170. Learning Coverage- and Power-Optimal Transmitter Placement from Building Maps: A Comparative Study of Direct and Indirect Neural Approaches · cs.LG · arXiv 2604.22056 · score 5rag, inference
  171. Null-Space Flow Matching for MIMO Channel Estimation in Latency-Constrained Systems · cs.IT · arXiv 2604.22005 · score 5inference, latency
  172. Efficient Logic Gate Networks for Video Copy Detection · cs.CV · arXiv 2604.21694 · score 5inference, throughput
  173. Fine-Grained Perspectives: Modeling Explanations with Annotator-Specific Rationales · cs.CL · arXiv 2604.21667 · score 5inference, fine-tun
  174. Cross-Domain Data Selection and Augmentation for Automatic Compliance Detection · cs.CL · arXiv 2604.21469 · score 5retrieval, inference
  175. Channel-Free Human Activity Recognition via Inductive-Bias-Aware Fusion Design for Heterogeneous IoT Sensor Environments · cs.LG · arXiv 2604.21369 · score 5rag, inference
  176. Rethinking XAI Evaluation: A Human-Centered Audit of Shapley Benchmarks in High-Stakes Settings · cs.LG · arXiv 2604.22662 · score 4 · rag, latency
  177. Quality-Driven Selective Mutation for Deep Learning · cs.SE · arXiv 2604.22640 · score 4 · serving
  178. Adversarial Malware Generation in Linux ELF Binaries via Semantic-Preserving Transformations · cs.CR · arXiv 2604.22639 · score 4 · serving
  179. Explanation of Dynamic Physical Field Predictions using WassersteinGrad: Application to Autoregressive Weather Forecasting · stat.ML · arXiv 2604.22580 · score 4 · rag, reasoning
  180. ArmSSL: Adversarial Robust Black-Box Watermarking for Self-Supervised Learning Pre-trained Encoders · cs.CR · arXiv 2604.22550 · score 4 · serving
  181. Different Strokes for Different Folks: Writer Identification for Historical Arabic Manuscripts · cs.CV · arXiv 2604.22515 · score 4 · attention, fine-tun
  182. Hidden Failure Modes of Gradient Modification under Adam in Continual Learning, and Adaptive Decoupled Moment Routing as a Repair · cs.LG · arXiv 2604.22407 · score 4 · serving
  183. SOC-ICNN: From Polyhedral to Conic Geometry for Learning Convex Surrogate Functions · cs.LG · arXiv 2604.22355 · score 4 · serving
  184. A Nationwide Japanese Medical Claims Foundation Model: Balancing Model Scaling and Task-Specific Computational Efficiency · cs.LG · arXiv 2604.22348 · score 4 · rag, transformer
  185. TabSCM: A practical Framework for Generating Realistic Tabular Data · cs.LG · arXiv 2604.22337 · score 4 · llm
  186. CLARITY: A Framework and Benchmark for Conversational Language Ambiguity and Unanswerability in Interactive NL2SQL Systems · cs.CL · arXiv 2604.22313 · score 4 · llm
  187. Semantic Error Correction and Decoding for Short Block Channel Codes · cs.IT · arXiv 2604.22269 · score 4 · transformer, latency
  188. Towards Safe Mobility: A Unified Transportation Foundation Model enabled by Open-Ended Vision-Language Dataset · cs.CV · arXiv 2604.22260 · score 4 · reasoning, attention
  189. Algorithmic Feature Highlighting for Human-AI Decision-Making · cs.GT · arXiv 2604.22236 · score 4 · agent
  190. UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions · eess.AS · arXiv 2604.22209 · score 4 · rag, transformer
  191. Optimal sequential decision-making for error propagation mitigation in digital twins · cs.LG · arXiv 2604.22168 · score 4 · serving
  192. Logistic Bandits with $\tilde{O}(\sqrt{dT})$ Regret without Context Diversity Assumptions · cs.LG · arXiv 2604.22161 · score 4 · agent
  193. Dissociating Decodability and Causal Use in Bracket-Sequence Transformers · cs.CL · arXiv 2604.22128 · score 4 · attention, transformer
  194. GICC: A High-Performance Runtime for GPU-Initiated Communication and Coordination in Modern HPC Systems · cs.DC · arXiv 2604.22126 · score 4 · gpu, latency
  195. Do Not Imitate, Reinforce: Iterative Classification via Belief Refinement · cs.LG · arXiv 2604.22110 · score 4 · agent
  196. Knowledge-driven Augmentation and Retrieval for Integrative Temporal Adaptation · cs.CL · arXiv 2604.22098 · score 4 · retrieval, rag
  197. Optimal Question Selection from a Large Question Bank for Clinical Field Recovery in Conversational Psychiatric Intake · cs.CL · arXiv 2604.22067 · score 4 · llm
  198. Foundation models for discovering robust biomarkers of neurological disorders from dynamic functional connectivity · q-bio.NC · arXiv 2604.22018 · score 4 · attention, fine-tun
  199. Seeing Fast and Slow: Learning the Flow of Time in Videos · cs.CV · arXiv 2604.21931 · score 4 · reasoning, attention
  200. MathDuels: Evaluating LLMs as Problem Posers and Solvers · cs.CL · arXiv 2604.21916 · score 4 · llm
  201. SPAC: Automating FPGA-based Network Switches with Protocol Adaptive Customization · cs.NI · arXiv 2604.21881 · score 4 · throughput, latency
  202. Locating acts of mechanistic reasoning in student team conversations with mechanistic machine learning · physics.ed-ph · arXiv 2604.21870 · score 4 · rag, reasoning
  203. Alignment has a Fantasia Problem · cs.AI · arXiv 2604.21827 · score 4 · ai system
  204. Who Defines “Best”? Towards Interactive, User-Defined Evaluation of LLM Leaderboards · cs.AI · arXiv 2604.21769 · score 4 · llm
  205. AUDITA: A New Dataset to Audit Humans vs. AI Skill at Audio QA · cs.CL · arXiv 2604.21766 · score 4 · rag, reasoning
  206. Bridging the Training-Deployment Gap: Gated Encoding and Multi-Scale Refinement for Efficient Quantization-Aware Image Enhancement · cs.AI · arXiv 2604.21743 · score 4 · quantization, post-train
  207. Fairness under uncertainty in sequential decisions · cs.LG · arXiv 2604.21711 · score 4 · serving
  208. Evaluating Post-hoc Explanations of the Transformer-based Genome Language Model DNABERT-2 · cs.LG · arXiv 2604.21690 · score 4 · attention, transformer
  209. To See the Unseen: on the Generalization Ability of Transformers in Symbolic Reasoning · cs.AI · arXiv 2604.21632 · score 4 · reasoning, transformer
  210. On the Role of Preprocessing and Memristor Dynamics in Reservoir Computing for Image Classification · cs.NE · arXiv 2604.21602 · score 4 · quantization, attention
  211. A systematic review of generative AI usage for IT project management · cs.SE · arXiv 2604.21958 · score 4 · agent
  212. UKP_Psycontrol at SemEval-2026 Task 2: Modeling Valence and Arousal Dynamics from Text · cs.CL · arXiv 2604.21534 · score 4 · llm
  213. Architectures for Robust Self-Organizing Energy Systems under Information and Control Constraints · cs.MA · arXiv 2604.21529 · score 4 · agent
  214. From Tokens to Concepts: Leveraging SAE for SPLADE · cs.IR · arXiv 2604.21511 · score 4 · retrieval, rag
  215. Generalizing Numerical Reasoning in Table Data through Operation Sketches and Self-Supervised Learning · cs.LG · arXiv 2604.21495 · score 4 · reasoning, fine-tun
  216. Dynamical Priors as a Training Objective in Reinforcement Learning · cs.LG · arXiv 2604.21464 · score 4 · agent
  217. Brief chatbot interactions produce lasting changes in human moral values · cs.AI · arXiv 2604.21430 · score 4 · agent
  218. SemanticAgent: A Semantics-Aware Framework for Text-to-SQL Data Synthesis · cs.AI · arXiv 2604.21414 · score 4 · reasoning, fine-tun
  219. VG-CoT: Towards Trustworthy Visual Reasoning via Grounded Chain-of-Thought · cs.CV · arXiv 2604.21396 · score 4 · reasoning, chain-of-thought
  220. Supervised Learning Has a Necessary Geometric Blind Spot: Theory, Consequences, and Minimal Repair · cs.LG · arXiv 2604.21395 · score 4 · rag, fine-tun
  221. mcdok at SemEval-2026 Task 13: Finetuning LLMs for Detection of Machine-Generated Code · cs.LG · arXiv 2604.21365 · score 4 · llm
  222. Beyond Single Plots: A Benchmark for Question Answering on Multi-Charts · cs.CL · arXiv 2604.21344 · score 4 · llm
  223. Exploring the Role of Synthetic Data Augmentation in Controllable Human-Centric Video Generation · cs.CV · arXiv 2604.21291 · score 4 · serving
  224. Optimizing High-Throughput Distributed Data Pipelines for Reproducible Deep Learning at Scale · cs.DC · arXiv 2604.21275 · score 4 · gpu, throughput
  225. Trustworthy Clinical Decision Support Using Meta-Predicates and Domain-Specific Languages · cs.AI · arXiv 2604.21263 · score 4 · serving
  226. Towards Adaptive Continual Model Merging via Manifold-Aware Expert Evolution · cs.LG · arXiv 2604.22464 · score 3 · moe
  227. Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning · cs.LG · arXiv 2604.22229 · score 3 · inference
  228. Multimodal Diffusion to Mutually Enhance Polarized Light and Low Resolution EBSD Data · eess.IV · arXiv 2604.22212 · score 3 · inference
  229. Mochi: Aligning Pre-training and Inference for Efficient Graph Foundation Models via Meta-Learning · cs.LG · arXiv 2604.22031 · score 3 · inference
  230. Bounding the Black Box: A Statistical Certification Framework for AI Risk Regulation · cs.AI · arXiv 2604.21854 · score 3 · inference
  231. Ramen: Robust Test-Time Adaptation of Vision-Language Models with Active Sample Selection · cs.CV · arXiv 2604.21728 · score 3 · inference
  232. Causal Disentanglement for Full-Reference Image Quality Assessment · cs.CV · arXiv 2604.21654 · score 3 · inference
  233. Suppressing the Erasure Error of Fusion Operation in Photonic Quantum Computing · quant-ph · arXiv 2604.21475 · score 3 · compiler
  234. Tempered Sequential Monte Carlo for Trajectory and Policy Optimization with Differentiable Dynamics · cs.LG · arXiv 2604.21456 · score 3 · inference
  235. Even More Guarantees for Variational Inference in the Presence of Symmetries · cs.LG · arXiv 2604.21407 · score 3 · inference
  236. Cross-Entropy Is Load-Bearing: A Pre-Registered Scope Test of the K-Way Energy Probe on Bidirectional Predictive Coding · cs.CL · arXiv 2604.21286 · score 3 · inference
  237. Calibeating Prediction-Powered Inference · stat.ML · arXiv 2604.21260 · score 3 · inference
  238. Neural Recovery of Historical Lexical Structure in Bantu Languages from Modern Data · cs.LG · arXiv 2604.22730 · score 2 · transformer
  239. Zero-Shot Morphological Discovery in Low-Resource Bantu Languages via Cross-Lingual Transfer and Unsupervised Clustering · cs.LG · arXiv 2604.22723 · score 2 · rag
  240. CRAFT: Clustered Regression for Adaptive Filtering of Training data · cs.CL · arXiv 2604.22693 · score 2 · fine-tun
  241. Operational Feature Fingerprints of Graph Datasets via a White-Box Signal-Subspace Probe · cs.LG · arXiv 2604.22676 · score 2 · rag
  242. Detecting Concept Drift in Evolving Malware Families Using Rule-Based Classifier Representations · cs.CR · arXiv 2604.22629 · score 2 · rag
  243. Adversarial Co-Evolution of Malware and Detection Models: A Bilevel Optimization Perspective · cs.CR · arXiv 2604.22569 · score 2 · rag
  244. Cross-Stage Coherence in Hierarchical Driving VQA: Explicit Baselines and Learned Gated Context Projectors · cs.CV · arXiv 2604.22560 · score 2 · reasoning
  245. QDTraj: Exploration of Diverse Trajectory Primitives for Articulated Objects Robotic Manipulation · cs.RO · arXiv 2604.22551 · score 2 · rag
  246. Multi-output Extreme Spatial Model for Complex Aircraft Production Systems · stat.AP · arXiv 2604.22548 · score 2 · rag
  247. Measuring and Mitigating Persona Distortions from AI Writing Assistance · cs.CL · arXiv 2604.22503 · score 2 · rag
  248. Decoding High-Dimensional Finger Motion from EMG Using Riemannian Features and RNNs · cs.LG · arXiv 2604.22499 · score 2 · rag
  249. FedSPDnet: Geometry-Aware Federated Deep Learning with SPDnet · stat.ML · arXiv 2604.22494 · score 2 · rag
  250. On the Hybrid Nature of ABPMS Process Frames and its Implications on Automated Process Discovery · cs.AI · arXiv 2604.22455 · score 2 · rag
  251. Beyond Land Surface Temperature: Explainable Spatial Machine Learning Reveals Urban Morphology Effects on Human-Centric Heat Stress · cs.LG · arXiv 2604.22433 · score 2 · gpu
  252. A comprehensive evaluation of spatial co-execution on GPUs using MPS and MIG technologies · cs.DC · arXiv 2604.22430 · score 2 · gpu
  253. CognitiveTwin: Robust Multi-Modal Digital Twins for Predicting Cognitive Decline in Alzheimer’s Disease · cs.AI · arXiv 2604.22428 · score 2 · transformer
  254. Distance-Misaligned Training in Graph Transformers and Adaptive Graph-Aware Control · cs.LG · arXiv 2604.22413 · score 2 · transformer
  255. Conformalized Super Learner · stat.ML · arXiv 2604.22391 · score 2 · rag
  256. Pack only the essentials: Adaptive dictionary learning for kernel ridge regression · stat.ML · arXiv 2604.22386 · score 2 · rag
  257. Revisiting Neural Activation Coverage for Uncertainty Estimation · cs.LG · arXiv 2604.22360 · score 2 · rag
  258. ChangeQuery: Advancing Remote Sensing Change Analysis for Natural and Human-Induced Disasters from Visual Detection to Semantic Understanding · cs.CV · arXiv 2604.22333 · score 2 · reasoning
  259. AutoINV: Automated Invariant Generation Framework for Formal Verification on High-Level Synthesis Designs · cs.AR · arXiv 2604.22285 · score 2 · rag
  260. TTS-PRISM: A Perceptual Reasoning and Interpretable Speech Model for Fine-Grained Diagnosis · cs.CL · arXiv 2604.22225 · score 2 · reasoning
  261. From Global to Local: Rethinking CLIP Feature Aggregation for Person Re-Identification · cs.CV · arXiv 2604.22190 · score 2 · rag
  262. FixV2W: Correcting Invalid CVE-CWE Mappings with Knowledge Graph Embeddings · cs.CR · arXiv 2604.22176 · score 2 · rag
  263. Fine-Grained Analysis of Shared Syntactic Mechanisms in Language Models · cs.CL · arXiv 2604.22166 · score 2 · attention
  264. Wiggle and Go! System Identification for Zero-Shot Dynamic Rope Manipulation · cs.RO · arXiv 2604.22102 · score 2 · rag
  265. Generating Synthetic Malware Samples Using Generative AI · cs.LG · arXiv 2604.22084 · score 2 · rag
  266. Shard the Gradient, Scale the Model: Serverless Federated Aggregation via Gradient Partitioning · cs.DC · arXiv 2604.22072 · score 2 · rag
  267. EgoMAGIC- An Egocentric Video Field Medicine Dataset for Training Perception Algorithms · cs.CV · arXiv 2604.22036 · score 2 · rag
  268. Fine-Tuning Regimes Define Distinct Continual Learning Problems · cs.LG · arXiv 2604.21927 · score 2 · fine-tun
  269. A Scale-Adaptive Framework for Joint Spatiotemporal Super-Resolution with Diffusion Models · cs.LG · arXiv 2604.21903 · score 2 · attention
  270. GiVA: Gradient-Informed Bases for Vector-Based Adaptation · cs.CL · arXiv 2604.21901 · score 2 · fine-tun
  271. Revealing Geography-Driven Signals in Zone-Level Claim Frequency Models: An Empirical Study using Environmental and Visual Predictors · stat.ML · arXiv 2604.21893 · score 2 · transformer
  272. Addressing Image Authenticity When Cameras Use Generative AI · cs.CV · arXiv 2604.21879 · score 2 · rag
  273. Replay-buffer engineering for noise-robust quantum circuit optimization · quant-ph · arXiv 2604.21863 · score 2 · rag
  274. On the algebra of Koopman eigenfunctions and on some of their infinities · math.DS · arXiv 2604.21825 · score 2 · rag
  275. Divide-then-Diagnose: Weaving Clinician-Inspired Contexts for Ultra-Long Capsule Endoscopy Videos · cs.CV · arXiv 2604.21814 · score 2 · reasoning
  276. Inferring High-Level Events from Timestamped Data: Complexity and Medical Applications · cs.AI · arXiv 2604.21793 · score 2 · reasoning
  277. Compliance Moral Hazard and the Backfiring Mandate · cs.GT · arXiv 2604.21789 · score 2 · rag
  278. Enabling and Inhibitory Pathways of University Students’ Willingness to Disclose AI Use: A Cognition-Affect-Conation Perspective · cs.AI · arXiv 2604.21733 · score 2 · rag
  279. Towards Universal Tabular Embeddings: A Benchmark Across Data Tasks · cs.LG · arXiv 2604.21696 · score 2 · retrieval
  280. Geometric Monomial (GEM): a family of rational 2N-differentiable activation functions · cs.LG · arXiv 2604.21677 · score 2 · transformer
  281. Large-Scale Data Parallelization of Product Quantization and Inverted Indexing Using Dask · cs.LG · arXiv 2604.21645 · score 2 · quantization
  282. Geometric Characterisation and Structured Trajectory Surrogates for Clinical Dataset Condensation · cs.LG · arXiv 2604.21638 · score 2 · rag
  283. Finding Meaning in Embeddings: Concept Separation Curves · cs.CL · arXiv 2604.21555 · score 2 · rag
  284. The CriticalSet problem: Identifying Critical Contributors in Bipartite Dependency Networks · cs.AI · arXiv 2604.21537 · score 2 · rag
  285. Seeing Isn’t Believing: Uncovering Blind Spots in Evaluator Vision-Language Models · cs.CV · arXiv 2604.21523 · score 2 · reasoning
  286. Satisfying Rationality Postulates of Structured Argumentation Through Deductive Support – Technical Report · cs.AI · arXiv 2604.21515 · score 2 · reasoning
  287. Drug Synergy Prediction via Residual Graph Isomorphism Networks and Attention Mechanisms · cs.LG · arXiv 2604.21473 · score 2 · attention
  288. Research on the efficiency of data loading and storage in Data Lakehouse architectures for the formation of analytical data systems · cs.DC · arXiv 2604.21449 · score 2 · rag
  289. Decoupled DiLoCo for Resilient Distributed Pre-training · cs.CL · arXiv 2604.21428 · score 2 · rag
  290. A Green-Integral-Constrained Neural Solver with Stochastic Physics-Informed Regularization · cs.LG · arXiv 2604.21411 · score 2 · gpu
  291. Conjecture and Inquiry: Quantifying Software Performance Requirements via Interactive Retrieval-Augmented Preference Elicitation · cs.SE · arXiv 2604.21380 · score 2 · retrieval
  292. MKJ at SemEval-2026 Task 9: A Comparative Study of Generalist, Specialist, and Ensemble Strategies for Multilingual Polarization · cs.CL · arXiv 2604.21370 · score 2 · rag
  293. Evaluating AI Meeting Summaries with a Reusable Cross-Domain Pipeline · cs.AI · arXiv 2604.21345 · score 2 · rag
  294. MiMIC: Mitigating Visual Modality Collapse in Universal Multimodal Retrieval While Avoiding Semantic Misalignment · cs.CV · arXiv 2604.21326 · score 2 · retrieval
  295. Listen and Chant Before You Read: The Ladder of Beauty in LM Pre-Training · cs.CL · arXiv 2604.21265 · score 2 · transformer
  296. Improving Performance in Classification Tasks with LCEN and the Weighted Focal Differentiable MCC Loss · cs.LG · arXiv 2604.21252 · score 2 · rag
  297. Learning Dynamic Representations and Policies from Multimodal Clinical Time-Series with Informative Missingness · cs.LG · arXiv 2604.21235 · score 2 · rag