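arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems science, and related areas.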

2605.00835 2026-05-05 cs.LG

Sparse Regression under Correlation and Weak Signals: A Reproducible Benchmark of Classical and Bayesian Methods

Hao Xiao

Comments 14 pages, 8 figures, 6 tables. Code: https://github.com/xiao98/sparse-bayesian-regression-bench

English abstract

Choosing between classical and Bayesian sparse regression methods involves a real trade-off: penalized estimators like Lasso run in milliseconds but give no uncertainty estimates, while Horseshoe and Spike-and-Slab priors produce full posteriors but need MCMC chains that take minutes per fit. Surprisingly few studies compare these two families head-to-head under the conditions that actually make sparse regression hard -- correlated features, weak signals, and growing dimensionality. We benchmark six methods (OLS, Ridge, Lasso, Elastic Net, Horseshoe, Spike-and-Slab) on synthetic data with three covariance structures (rho up to 0.9), four SNR levels, and p in {20, 50, 100}, plus the Diabetes dataset, totalling over 2,600 experiments. The results are clear on some points and nuanced on others. Bayesian methods win on prediction error (MSE 72 vs. 108-267), and the Horseshoe delivers near-nominal 95% coverage (94.8%). But Spike-and-Slab, despite narrower intervals, under-covers at 91.9% -- its continuous relaxation likely plays a role. For variable selection, Lasso and Spike-and-Slab tie at F1 ~ 0.47, making Lasso the practical default when posteriors are not needed. Code and data are available at https://github.com/xiao98/sparse-bayesian-regression-bench.
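The synthetic setup described in this abstract (correlated features, a chosen SNR, and an F1 score for variable selection) can be illustrated with a minimal sketch. This is not the paper's code. The AR(1) covariance, the definition SNR = Var(X beta) / noise variance, the unit-magnitude coefficients, and all function names below are assumptions made for illustration; the abstract does not specify which three covariance structures or which SNR definition the benchmark uses.

```python
import numpy as np


def ar1_covariance(p, rho):
    """AR(1) covariance, Sigma[i, j] = rho ** |i - j| (an assumed structure)."""
    idx = np.arange(p)
    return rho ** np.abs(idx[:, None] - idx[None, :])


def make_sparse_data(n, p, k, rho, snr, rng):
    """Draw correlated features, a k-sparse coefficient vector, and noisy responses."""
    sigma = ar1_covariance(p, rho)
    X = rng.multivariate_normal(np.zeros(p), sigma, size=n)
    beta = np.zeros(p)
    support = rng.choice(p, size=k, replace=False)
    beta[support] = 1.0  # hypothetical signal magnitude
    signal_var = beta @ sigma @ beta      # population variance of X @ beta
    noise_sd = np.sqrt(signal_var / snr)  # assumed: SNR = Var(X beta) / noise variance
    y = X @ beta + rng.normal(0.0, noise_sd, size=n)
    return X, y, beta


def selection_f1(beta_true, beta_hat, tol=1e-8):
    """F1 score of the estimated support against the true support."""
    true_set = np.abs(beta_true) > tol
    est_set = np.abs(beta_hat) > tol
    tp = int(np.sum(true_set & est_set))
    if tp == 0:
        return 0.0
    precision = tp / est_set.sum()
    recall = tp / true_set.sum()
    return float(2 * precision * recall / (precision + recall))


rng = np.random.default_rng(0)
X, y, beta = make_sparse_data(n=200, p=50, k=5, rho=0.9, snr=1.0, rng=rng)
```

Any estimator that returns a coefficient vector (for example a Lasso fit) could then be scored with `selection_f1(beta, beta_hat)`; for posterior-based methods a thresholding rule on inclusion probabilities or credible intervals would be needed first, and that rule is a further modelling choice.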

2605.00833 2026-05-05 cs.LG cs.AI

Agentopic: A Generative AI Agent Workflow for Explainable Topic Modeling

Brice Valentin Kok-Shun, Johnny Chan, Gabrielle Peko, David Sundaram

Comments 16 pages, 2 figures

2605.00832 2026-05-05 cs.CV cs.LG

Synthetic Designed Experiments for Diagnosing Vision Model Failure

Krisanu Sarkar

Comments Under review at CVPR SynData4CV 2026

2604.28123 2026-05-05 cs.CV cs.AI cs.CL

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Sudong Wang, Weiquan Huang, Xiaomin Yu, Zuhao Yang, Hehai Lin, Keming Wu, Chaojun Xiao, Chen Chen, Wenxuan Wang, Beier Zhu, Yunjian Zhang, Chengwei Qin

2604.27924 2026-05-05 cs.CL cs.AI

Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future

Sihong Wu, Owen Jiang, Yilun Zhao, Tiansheng Hu, Yiling Ma, Kaiyan Zhang, Manasi Patwardhan, Arman Cohan

Comments ACL 2026

2604.27033 2026-05-05 cs.LG eess.SP

Cross-Subject Generalization for EEG Decoding: A Survey of Deep Learning Methods

Taida Li, Yujun Yan, Fei Dou, Wenzhan Song, Xiang Zhang

Comments Accepted manuscript in Progress in Biomedical Engineering. Minor update: corrected author affiliation in comment

2604.25859 2026-05-05 cs.RO

Privileged Foresight Distillation: Zero-Cost Future Correction for World Action Models

Pengcheng Fang, Hongli Chen, Xiaohao Cai

2604.23878 2026-05-05 cs.AI cs.LG

ZenBrain: A Neuroscience-Inspired 7-Layer Memory Architecture for Autonomous AI Systems

Alexander Bering

Comments 47 pages, 31 tables, 3 figures. v3 incorporates extended defensive analyses (Bayesian calibration, power analysis, failure-mode taxonomy, cross-validation) and editorial polish over earlier versions. Earlier preprint versions on Zenodo (concept DOI: 10.5281/zenodo.19353663) and TDCommons (dpubs_series/9683); reproducibility artifacts: 10.5281/zenodo.19481262

2604.21446 2026-05-05 cs.AI cs.CL cs.MA cs.SI

AI-Gram: When Visual Agents Interact in a Social Network

Andrew Shin

2604.21003 2026-05-05 cs.AI

The Last Harness You'll Ever Build

Haebin Seong, Li Yin, Haoran Zhang, Zhan Shi

2604.19298 2026-05-05 cs.CL cs.AI cs.IR

IndiaFinBench: An Evaluation Benchmark for Large Language Model Performance on Indian Financial Regulatory Text

Rajveer Singh Pall

Comments 24 pages, 4 figures, 11 tables. Dataset and evaluation code at https://github.com/rajveerpall/IndiaFinBench

2604.19117 2026-05-05 cs.LG

LLMs Know They're Wrong and Agree Anyway: The Shared Sycophancy-Lying Circuit

Manav Pandey

2604.18058 2026-05-05 cs.LG

Sonata: A Hybrid World Model for Inertial Kinematics under Clinical Data Scarcity

Blaise Delaney, Salil Patel, Yuji Xing, Dominic Dootson, Karin Sevegnani, Chrystalina Antoniades

Comments 18 pages, 3 figures

2604.15174 2026-05-05 cs.LG cs.AI

MambaSL: Exploring Single-Layer Mamba for Time Series Classification

Yoo-Min Jung, Leekyung Kim

Comments accepted at ICLR 2026

2604.15037 2026-05-05 cs.AI cs.CL cs.SD

From Reactive to Proactive: Assessing the Proactivity of Voice Agents via ProVoice-Bench

Ke Xu, Yuhao Wang, Yu Wang

Comments Submitted to Interspeech 2026

2604.14607 2026-05-05 cs.AI

GDPR Auto-Formalization with AI Agents and Human Verification

Ha Thanh Nguyen, Wachara Fungwacharakorn, Sabine Wehnert, May Myo Zin, Yuntao Kong, Jieying Xue, Michał Araszkiewicz, Randy Goebel, Ken Satoh

Comments Accepted at ICAIL 2026

2604.14258 2026-05-05 cs.AI cs.LG

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

Wangjie Gan, Miao Pan, Linbo Xi, Wenqi Zhang, Jintao Chen, Jianwei Yin, Xuhong Zhang

2604.14240 2026-05-05 cs.AI cs.LG stat.ML

Interpretable and Explainable Surrogate Modeling for Simulations: A State-of-the-Art Survey and Perspectives on Explainable AI for Decision-Making

Pramudita Satria Palar, Paul Saves, Muhammad Daffa Robani, Nicolas Verstaevel, Moncef Garouani, Julien Aligon, Koji Shimoyama, Joseph Morlier, Benoit Gaudou

Comments Accepted for publication in Archives of Computational Methods in Engineering, 2026

DOI: 10.1007/s11831-026-10600-z

English abstract

The simulation of complex systems increasingly relies on sophisticated but fundamentally opaque computational black-box simulators. Surrogate models play a central role in reducing the computational cost of complex systems simulations across a wide range of scientific and engineering domains. Notwithstanding, they inevitably inherit and often exacerbate this black-box nature, obscuring how input variables drive physical responses. Conversely, Explainable Artificial Intelligence (XAI) offers powerful tools to unpack these models. Yet, XAI methods struggle with engineering-specific constraints, such as highly correlated inputs, dynamical systems, and rigorous reliability requirements. Consequently, surrogate modeling and XAI have largely evolved as distinct fields of research, despite their strong complementarity. To reconnect these approaches, this state-of-the-art survey provides a structured perspective that maps existing XAI techniques onto the various stages of surrogate modeling workflows for design and exploration. To ground this synthesis, we draw upon illustrative applications across both equation-based simulations and agent-based modeling. We survey a broad spectrum of techniques, highlighting their strengths for revealing interactions and supporting human comprehension. Finally, we identify pressing open challenges, including the explainability of dynamical systems and the handling of mixed-variable systems, and propose a research agenda to make explainability a core, embedded element of simulation-driven workflows from model construction through decision-making. By transforming opaque emulators into explainable tools, this agenda empowers practitioners to move beyond accelerating simulations to extracting actionable insights from complex system behaviors.

2604.13331 2026-05-05 cs.LG

Text-Attributed Knowledge Graph Enrichment with Large Language Models for Medical Concept Representation

Mohsen Nayebi Kerdabadi, Arya Hadizadeh Moghaddam, Chen Chen, Dongjie Wang, Zijun Yao

Comments This paper has been accepted at ACL 2026 main conference

2604.13076 2026-05-05 cs.CL cs.AI

Alignment midtraining for animals

Jasmine Brazilek, Miles Tidmarsh

Comments 34 pages

2604.10597 2026-05-05 cs.CV cs.AI

COREY: Entropy-Guided Runtime Chunk Scheduling for Selective Scan Kernels

Bo Ma, Jinsong Wu, Weiqi Yan

2604.09132 2026-05-05 cs.CV cs.CG cs.GR

Strips as Tokens: Artist Mesh Generation with Native UV Segmentation

Rui Xu, Dafei Qin, Kaichun Qiao, Qiujie Dong, Huaijin Pi, Qixuan Zhang, Longwen Zhang, Lan Xu, Jingyi Yu, Wenping Wang, Taku Komura

Comments ACM Transactions on Graphics. SIGGRAPH 2026

2604.06091 2026-05-05 cs.CL cs.AI cs.MA

Social Dynamics as Critical Vulnerabilities that Undermine Objective Decision-Making in LLM Collectives

Changgeon Ko, Jisu Shin, Hoyun Song, Huije Lee, Eui Jun Hwang, Jong C. Park

Comments ACL 2026

2604.05134 2026-05-05 cs.LG cs.AI

How Reasoning Evolves from Post-Training Data: An Empirical Study Using Chess

Lucas Dionisopoulos, Nicklas Majamaki, Prithviraj Ammanabrolu

Comments Accepted at ICML 2026. An earlier version appeared at the NeurIPS 2025 Foundations of Reasoning in Language Models (FoRLM) Workshop (Oral)

2604.05081 2026-05-05 cs.AI

MedGemma 1.5 Technical Report

Andrew Sellergren, Chufan Gao, Fereshteh Mahvar, Timo Kohlberger, Fayaz Jamil, Madeleine Traverse, Alberto Tono, Bashir Sadjad, Lin Yang, Charles Lau, Liron Yatziv, Tiffany Chen, Bram Sterling, Kenneth Philbrick, Richa Tiwari, Yun Liu, Madhuram Jajoo, Chandrashekar Sankarapu, Swapnil Vispute, Harshad Purandare, Abhishek Bijay Mishra, Sam Schmidgall, Tao Tu, Anil Palepu, Chunjong Park, Tim Strother, Rahul Thapa, Yong Cheng, Preeti Singh, Kat Black, Yossi Matias, Katherine Chou, Avinatan Hassidim, Kavi Goel, Joelle Barral, Tris Warkentin, Shravya Shetty, Dale Webster, Sunny Virmani, David F. Steiner, Can Kirmizibayrak, Daniel Golden

2604.04106 2026-05-05 cs.AI

InsTraj: Instructing Diffusion Models with Travel Intentions to Generate Real-world Trajectories

Yuanshao Zhu, Yuxuan Liang, Xiangyu Zhao, Liang Han, Xinwei Fang, Xun Zhou, Xuetao Wei, James Jianqiao Yu

2604.03641 2026-05-05 cs.LG cs.AI

Delayed homomorphic reinforcement learning for environments with delayed feedback

Jongsoo Lee, Jangwon Kim, Soohee Han

2604.03380 2026-05-05 cs.CL

Noise Steering for Controlled Text Generation: Improving Diversity and Reading-Level Fidelity in Arabic Educational Story Generation

Haziq Mohammad Khalid, Salsabeel Shapsough, Imran Zualkernan

Comments Accepted to BEA @ ACL '26

2603.27437 2026-05-05 cs.CV

SpatialStack: Layered Geometry-Language Fusion for 3D VLM Spatial Reasoning

Jian Zhang, Shijie Zhou, Bangya Liu, Achuta Kadambi, Zhiwen Fan

Comments CVPR 2026, Project Website: https://spatial-stack.github.io/

2603.27259 2026-05-05 cs.CV

Seeing the Scene Matters: Revealing Forgetting in Video Understanding Models with a Scene-Aware Long-Video Benchmark

Seng Nam Chen, Hao Chen, Chenglam Ho, Xinyu Mao, Jinping Wang, Yu Zhang, Chao Li

Comments Accepted to CVPR 2026 (Highlight)