arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.12781 2026-04-15 cs.CV

Fragile Reconstruction: Adversarial Vulnerability of Reconstruction-Based Detectors for Diffusion-Generated Images

Haoyang Jiang, Mingyang Yi, Shaolei Zhang, Junxian Cai, Qingbin Liu, Xi Chen, Ju Fan

详情

英文摘要

Recently, detecting AI-generated images produced by diffusion-based models has attracted increasing attention due to their potential threat to safety. Among existing approaches, reconstruction-based methods have emerged as a prominent paradigm for this task. However, we find that such methods exhibit severe security vulnerabilities to adversarial perturbations; that is, by adding imperceptible adversarial perturbations to input images, the detection accuracy of classifiers collapses to near zero. To verify this threat, we present a systematic evaluation of the adversarial robustness of three representative detectors across four diverse generative backbone models. First, we construct adversarial attacks in white-box scenarios, which degrade the performance of all well-trained detectors. Moreover, we find that these attacks demonstrate transferability; specifically, attacks crafted against one detector can be transferred to others, indicating that adversarial attacks on detectors can also be constructed in a black-box setting. Finally, we assess common countermeasures and find that standard defense methods against adversarial attacks provide limited mitigation. We attribute these failures to the low signal-to-noise ratio (SNR) of attacked samples as perceived by the detectors. Overall, our results reveal fundamental security limitations of reconstruction-based detectors and highlight the need to rethink existing detection strategies.

URL PDF HTML ☆

赞 0 踩 0

2604.12780 2026-04-15 cs.CV cs.AI

Efficient Adversarial Training via Criticality-Aware Fine-Tuning

Wenyun Li, Zheng Zhang, Dongmei Jiang, Yaowei Wang, Xiangyuan Lan

2604.12777 2026-04-15 cs.CV cs.AI

Cognition-Inspired Dual-Stream Semantic Enhancement for Vision-Based Dynamic Emotion Modeling

Huanzhen Wang, Ziheng Zhou, Zeng Tao, Aoxing Li, Yingkai Zhao, Yuxuan Lin, Yan Wang, Wenqiang Zhang

Comments Accepted by IEEE ICRA 2026

2604.12776 2026-04-15 cs.CL

EvoSpark: Endogenous Interactive Agent Societies for Unified Long-Horizon Narrative Evolution

Shiyu He, Minchi Kuang, Mengxian Wang, Bin Hu, Tingxiang Gu

Comments Accepted to the Main Conference of ACL 2026

2604.12770 2026-04-15 cs.CL

Teaching LLMs Human-Like Editing of Inappropriate Argumentation via Reinforcement Learning

Timon Ziegenbein, Maja Stahl, Henning Wachsmuth

2604.12768 2026-04-15 cs.LG

Rethinking the Personalized Relaxed Initialization in the Federated Learning: Consistency and Generalization

Li Shen, Yan Sun, Dacheng Tao

Comments arXiv admin note: substantial text overlap with arXiv:2306.05706

2604.12767 2026-04-15 cs.CV cs.AI

CLASP: Class-Adaptive Layer Fusion and Dual-Stage Pruning for Multimodal Large Language Models

Yunkai Dang, Yizhu Jiang, Yifan Jiang, Qi Fan, Yinghuan Shi, Wenbin Li, Yang Gao

2604.12765 2026-04-15 cs.CV cs.GR

A Dataset and Evaluation for Complex 4D Markerless Human Motion Capture

Yeeun Park, Miqdad Naduthodi, Suryansh Kumar

Comments 14 pages, 11 figures, 4 tables. Accepted for publication at CVPR 2026 4D World Models Workshop

2604.12762 2026-04-15 cs.CV cs.AI cs.MA

ARGOS: Who, Where, and When in Agentic Multi-Camera Person Search

Myungchul Kim, Kwanyong Park, Junmo Kim, In So Kweon

Comments Accepted to CVPR 2026 Workshop on Multimodal Spatial Intelligence (MUSI)

2604.12757 2026-04-15 cs.LG cs.AI

GF-Score: Certified Class-Conditional Robustness Evaluation with Fairness Guarantees

Arya Shah, Kaveri Visavadiya, Manisha Padala

Comments 16 pages, 5 tables, 9 figures

2604.12753 2026-04-15 cs.RO

Reliability-Guided Depth Fusion for Glare-Resilient Navigation Costmaps

Shang-En Tsai, Wei-Cheng Sun

2604.12748 2026-04-15 cs.CL

Generating Effective CoT Traces for Mitigating Causal Hallucination

Yiheng Zhao, Jun Yan

Comments 11 pages, 2 figures. Accepted at ACL 2026

2604.12746 2026-04-15 cs.LG eess.SP

Stress Detection Using Wearable Physiological and Sociometric Sensors

Oscar Martinez Mozos, Virginia Sandulescu, Sally Andrews, David Ellis, Nicola Bellotto, Radu Dobrescu, Jose Manuel Ferrandez

Comments This is the accepted manuscript of the article published in International Journal of Neural Systems, 27, 2, 2017. The Version of Record is available at DOI: 10.1142/S0129065716500416

2604.12744 2026-04-15 cs.CL

Universal NER v2: Towards a Massively Multilingual Named Entity Recognition Benchmark

Terra Blevins, Stephen Mayhew, Marek Šuppa, Hila Gonen, Shachar Mirkin, Vasile Pais, Kaja Dobrovoljc, Voula Giouli, Jun Kevin, Eugene Jang, Eungseo Kim, Jeongyeon Seo, Xenophon Gialis, Yuval Pinter

Comments LREC 2026

2604.12743 2026-04-15 cs.AI

Can AI Tools Transform Low-Demand Math Tasks? An Evaluation of Task Modification Capabilities

Danielle S. Fox, Brenda L. Robles, Elizabeth DiPietro Brovey, Christian D. Schunn

Comments 21 pages, 1 figure

详情

英文摘要

While recent research has explored AI tools' ability to classify the quality of mathematical tasks (arXiv:2603.03512), little is known about their capacity to increase the quality of existing tasks. This study investigated whether AI tools could successfully upgrade low-cognitive-demand mathematics tasks. Eleven tools were tested, including six broadly available, general-purpose AI tools (e.g., ChatGPT and Claude) and five tools specialized for mathematics teachers (e.g., Khanmigo, coteach.ai). Using the Task Analysis Guide framework (Stein & Smith, 1998), we prompted AI tools to modify two different types of low-demand mathematical tasks. The prompting strategy aimed to represent likely approaches taken by knowledgeable teachers, rather than extensive optimization to find a more effective prompt (i.e., an optimistic typical outcome). On average, AI tools were only moderately successful: tasks were accurately upgraded only 64% of the time, with different AI tool performance ranging from quite weak (33%) to broadly successful (88%). Specialized tools were only moderately more successful than general-purpose tools. Failure modes included both "undershooting" (maintaining low cognitive demand) and "overshooting" (elevating tasks to an overly ambitious target category that likely would be rejected by teachers). Interestingly, there was a small negative correlation (r = -.35) between whether a given AI tool was able to correctly classify the cognitive demand of tasks and whether the AI was able to upgrade tasks, showing that the ability to modify tasks (i.e., a generative task) represents a distinct capability from the ability to classify them (i.e., judgement using a rubric). These findings have important implications for understanding AI's potential role in curriculum adaptation and highlight the need for specialized approaches to support teachers in modifying instructional materials.

URL PDF HTML ☆

赞 0 踩 0

2604.12736 2026-04-15 cs.CL

Token-Level Policy Optimization: Linking Group-Level Rewards to Token-Level Aggregation via Sequence-Level Likelihood

Xingyu Lin, Yilin Wen, Du Su, Jinchang Hou, En Wang, Wenbin Liu, Chenfu Bao, Zhonghou Lv

2604.12735 2026-04-15 cs.CV

AffectAgent: Collaborative Multi-Agent Reasoning for Retrieval-Augmented Multimodal Emotion Recognition

Zeheng Wang, Zitong Yu, Yijie Zhu, Bo Zhao, Haochen Liang, Taorui Wang, Wei Xia, Jiayu Zhang, Zhishu Liu, Hui Ma, Fei Ma, Qi Tian

2604.12733 2026-04-15 cs.SD cs.LG

Transformer Based Machine Fault Detection From Audio Input

Kiran Voderhobli Holla

2604.12721 2026-04-15 cs.CL

InsightFlow: LLM-Driven Synthesis of Patient Narratives for Mental Health into Causal Models

Shreya Gupta, Prottay Kumar Adhikary, Bhavyaa Dave, Salam Michael Singh, Aniket Deroy, Tanmoy Chakraborty

2604.12719 2026-04-15 cs.LG stat.ML

Monte Carlo Stochastic Depth for Uncertainty Estimation in Deep Learning

Adam T. Müller, Tobias Rögelein, Nicolaj C. Stache

Comments Accepted to the 8th Safe Artificial Intelligence for All Domains (SAIAD) workshop at IEEE/CVF CVPR 2026

2604.12717 2026-04-15 cs.AI

Transferable Expertise for Autonomous Agents via Real-World Case-Based Learning

Zhenyu Ma, Yuyang Song, Chunyi Yang, Jingyi Zhu, Letian Yang, Xukai Jiang

2604.12709 2026-04-15 cs.LG cs.AI cs.CV

Information-Theoretic Optimization for Task-Adapted Compressed Sensing Magnetic Resonance Imaging

Xinyu Peng, Ziyang Zheng, Wenrui Dai, Duoduo Xue, Shaohui Li, Chenglin Li, Junni Zou, Hongkai Xiong

Comments 68 pages, 15 figures, accepted by IEEE TPAMI

详情

DOI: 10.1109/TPAMI.2026.3683201

英文摘要

Task-adapted compressed sensing magnetic resonance imaging (CS-MRI) is emerging to address the specific demands of downstream clinical tasks with significantly fewer k-space measurements than required by Nyquist sampling. However, existing task-adapted CS-MRI methods suffer from the uncertainty problem for medical diagnosis and cannot achieve adaptive sampling in end-to-end optimization with reconstruction or clinical tasks. To address these limitations, we propose the first task-adapted CS-MRI from the information-theoretic perspective to simultaneously achieve probabilistic inference for uncertainty prediction and adapt to arbitrary sampling ratios and versatile clinical applications. Specifically, we formalize the task-adapted CS-MRI optimization problem by maximizing the mutual information between undersampled k-space measurements and clinical tasks to enable probabilistic inference for addressing the uncertainty problem. We leverage amortized optimization and construct tractable variational bounds for mutual information to jointly optimize sampling, reconstruction, and task-inference models, which enables flexible sampling ratio control using a single end-to-end trained model. Furthermore, the proposed framework addresses two kinds of distinct clinical scenarios within a unified approach, i.e., i) joint task and reconstruction, where reconstruction serves as an auxiliary process to enhance task performance; and ii) task implementation with suppressed reconstruction, applicable for privacy protection. Extensive experiments on large-scale MRI datasets demonstrate that the proposed framework achieves highly competitive performance on standard metrics like Dice compared to deterministic counterpart but provides better distribution matching to the ground-truth posterior distribution as measured by the generalized energy distance (GED).

URL PDF HTML ☆

赞 0 踩 0

2604.12700 2026-04-15 cs.AI

MISID: A Multimodal Multi-turn Dataset for Complex Intent Recognition in Strategic Deception Games

Shufang Lin, Muyang Chen, Xiabing Zhou, Rongrong Zhang, Dayou Zhang, Fangxin Wang

Comments 8 pages, 4 figures

2604.12693 2026-04-15 cs.CV

Risk-Calibrated Learning: Minimizing Fatal Errors in Medical AI

Abolfazl Mohammadi-Seif, Ricardo Baeza-Yates

Comments This work has been accepted for publication in the Proceedings of the 2026 International Joint Conference on Neural Networks (IJCNN 2026). The final published version should be cited

2604.12686 2026-04-15 cs.LG cs.AI

BID-LoRA: A Parameter-Efficient Framework for Continual Learning and Unlearning

Jagadeesh Rachapudi, Ritali Vatsi, Praful Hambarde, Amit Shukla

2604.12683 2026-04-15 cs.CV q-bio.NC

Brain-DiT: A Universal Multi-state fMRI Foundation Model with Metadata-Conditioned Pretraining

Junfeng Xia, Wenhao Ye, Xuanye Pan, Xinke Shen, Mo Wang, Quanying Liu

2604.12668 2026-04-15 cs.CV

OFA-Diffusion Compression: Compressing Diffusion Model in One-Shot Manner

Haoyang Jiang, Zekun Wang, Mingyang Yi, Xiuyu Li, Lanqing Hu, Junxian Cai, Qingbin Liu, Xi Chen, Ju Fan

2604.12666 2026-04-15 cs.LG cs.CL cs.HC

From Imitation to Discrimination: Progressive Curriculum Learning for Robust Web Navigation

Chuang Peng, Wei Zhang, Renshuai Tao, Xinhao Zhang, Jian Yang

Comments 17 pages, 10 figures

2604.12665 2026-04-15 cs.CV

Hypergraph-State Collaborative Reasoning for Multi-Object Tracking

Zikai Song, Junqing Yu, Yi-Ping Phoebe Chen, Wei Yang, Xinchao Wang

2604.12663 2026-04-15 cs.AI

Human-Centric Topic Modeling with Goal-Prompted Contrastive Learning and Optimal Transport

Rui Wang, Yi Zheng, Dongxin Wang, Haiping Huang, Yuanzhi Yao, Yuxiang Zhou, Jialin Yu, Philip Torr

Comments 11 Pages, 6 Figures