arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.05663 2026-04-14 cs.AI

CuraLight: Debate-Guided Data Curation for LLM-Centered Traffic Signal Control

Qing Guo, Xinhang Li, Junyu Chen, Zheng Guo, Shengzhe Xu, Lin Zhang, Lei Li

Comments accepted at IJCNN 2026

详情

英文摘要

Traffic signal control (TSC) is a core component of intelligent transportation systems (ITS), aiming to reduce congestion, emissions, and travel time. Recent approaches based on reinforcement learning (RL) and large language models (LLMs) have improved adaptivity, but still suffer from limited interpretability, insufficient interaction data, and weak generalization to heterogeneous intersections. This paper proposes CuraLight, an LLM-centered framework where an RL agent assists the fine-tuning of an LLM-based traffic signal controller. The RL agent explores traffic environments and generates high-quality interaction trajectories, which are converted into prompt-response pairs for imitation fine-tuning. A multi-LLM ensemble deliberation system further evaluates candidate signal timing actions through structured debate, providing preference-aware supervision signals for training. Experiments conducted in SUMO across heterogeneous real-world networks from Jinan, Hangzhou, and Yizhuang demonstrate that CuraLight consistently outperforms state-of-the-art baselines, reducing average travel time by 5.34 percent, average queue length by 5.14 percent, and average waiting time by 7.02 percent. The results highlight the effectiveness of combining RL-assisted exploration with deliberation-based data curation for scalable and interpretable traffic signal control.

URL PDF HTML ☆

赞 0 踩 0

2604.05165 2026-04-14 cs.AI eess.SP

Learning to Focus: CSI-Free Hierarchical MARL for Reconfigurable Reflectors

Hieu Le, Mostafa Ibrahim, Oguz Bedir, Jian Tao, Sabit Ekin

2604.03765 2026-04-14 cs.CV

ITIScore: An Image-to-Text-to-Image Rating Framework for the Image Captioning Ability of MLLMs

Zitong Xu, Huiyu Duan, Shengyao Qin, Guangyu Yang, Guangji Ma, Xiongkuo Min, Ke Gu, Guangtao Zhai, Patrick Le Callet

2604.02927 2026-04-14 cs.LG cs.NI

Towards Near-Real-Time Telemetry-Aware Routing with Neural Routing Algorithms

Andreas Boltres, Niklas Freymuth, Benjamin Schichtholz, Michael König, Gerhard Neumann

2603.28287 2026-04-14 cs.CV

TerraSky3D: Multi-View Reconstructions of European Landmarks in 4K

Mattia D'Urso, Yuxi Hu, Christian Sormann, Mattia Rossi, Friedrich Fraundorfer

Comments Accepted at 3DMV (CVPR Workshop 2026)

2603.27494 2026-04-14 cs.CV cs.AI

Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs

Xuanpu Zhao, Zhentao Tan, Dianmo Sheng, Tianxiang Chen, Yao Liu, Yue Wu, Tao Gong, Qi Chu, Nenghai Yu

Comments Accepted by CVPR 2026

2603.26499 2026-04-14 cs.AI

AIRA_2: Overcoming Bottlenecks in AI Research Agents

Karen Hambardzumyan, Nicolas Baldwin, Edan Toledo, Rishi Hazra, Michael Kuchnik, Bassel Al Omari, Thomas Simon Foster, Anton Protopopov, Jean-Christophe Gagnon-Audet, Ishita Mediratta, Kelvin Niu, Michael Shvartsman, Alisia Lupidi, Alexis Audran-Reiss, Parth Pathak, Tatiana Shavrina, Despoina Magka, Hela Momand, Derek Dunfield, Nicola Cancedda, Pontus Stenetorp, Carole-Jean Wu, Jakob Nicolaus Foerster, Yoram Bachrach, Martin Josifoski

2603.25975 2026-04-14 cs.LG cs.AI cs.CL

Do Neurons Dream of Primitive Operators? Wake-Sleep Compression Rediscovers Schank's Event Semantics

Peter Balogh

2603.23964 2026-04-14 cs.AI

From Pixels to Digital Agents: An Empirical Study on the Taxonomy and Technological Trends of Reinforcement Learning Environments

Lijing Luo, Yiben Luo, Alexey Gorbatovski, Sergey Kovalchuk, Xiaodan Liang

Comments 32 pages main text, 18 figures

2603.22962 2026-04-14 cs.LG stat.ML

Asymptotic Learning Curves for Diffusion Models with Random Features Score and Manifold Data

Anand Jerry George, Nicolas Macris

Comments The proof of Lemma 1 in Appendix C is incorrect

2603.22241 2026-04-14 cs.CL

MemDLM: Memory-Enhanced DLM Training

Zehua Pei, Hui-Ling Zhen, Weizhe Lin, Sinno Jialin Pan, Yunhe Wang, Mingxuan Yuan, Bei Yu

2603.21831 2026-04-14 cs.RO math.DG

Directional Mollification for Knot-Preserving $C^{\infty}$ Smoothing of Polygonal Chains with Explicit Curvature Bounds

Alfredo González-Calvin, Juan F. Jiménez, Héctor García de Marina

2603.18806 2026-04-14 cs.AI

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Wenxuan Zhang, Lemeng Wu, Changsheng Zhao, Ernie Chang, Mingchen Zhuge, Zechun Liu, Andy Su, Hanxian Huang, Jun Chen, Chong Zhou, Raghuraman Krishnamoorthi, Vikas Chandra, Mohamed Elhoseiny, Wei Wen

2603.12639 2026-04-14 cs.CV

RoboStereo: Dual-Tower 4D Embodied World Models for Unified Policy Optimization

Ruicheng Zhang, Guangyu Chen, Zunnan Xu, Zihao Liu, Zhizhou Zhong, Mingyang Zhang, Jun Zhou, Xiu Li

2603.12221 2026-04-14 cs.CV

A Two-Stage Dual-Modality Model for Facial Emotional Expression Recognition

Jiajun Sun, Zhe Gao

Comments Camera-ready version. 14 pages, 5 figures in total: 8 pages main text with 4 figures, 3 pages references, and 3 pages appendix with 1 figure. Accepted at the 10th ABAW Workshop, CVPR 2026

2603.11974 2026-04-14 cs.AI

Normative Common Ground Replication (NormCoRe): Replication-by-Translation for Studying Norms in Multi-Agent AI

Luca Deck, Simeon Allmendinger, Lucas Müller, Niklas Kühl

Comments ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT '26)

详情

DOI: 10.1145/3805689.3806731

英文摘要

In the late 2010s, the fashion trend NormCore framed sameness as a signal of belonging, illustrating how norms emerge through collective coordination. Today, similar forms of normative coordination can be observed in systems based on Multi-agent Artificial Intelligence (MAAI), as AI-based agents deliberate, negotiate, and converge on shared decisions in fairness-sensitive domains. Yet, existing empirical approaches often treat norms as targets for alignment or replication, implicitly assuming equivalence between human subjects and AI agents and leaving collective normative dynamics insufficiently examined. To address this gap, we propose Normative Common Ground Replication (NormCoRe), a novel methodological framework to systematically translate the design of human subject experiments into MAAI environments. Building on behavioral science, replication research, and state-of-the-art MAAI architectures, NormCoRe maps the structural layers of human subject studies onto the design of AI agent studies, enabling systematic documentation of study design and analysis of norms in MAAI. We demonstrate the utility of NormCoRe by replicating a seminal experimental study on distributive justice, in which participants negotiate fairness principles under a "veil of ignorance". We show that normative judgments in AI agent studies can differ from human baselines and are sensitive to the choice of the foundation model and the language used to instantiate agent personas. Our work provides a principled pathway for analyzing norms in MAAI and helps to guide, reflect, and document design choices whenever AI agents are used to automate or support tasks formerly carried out by humans.

URL PDF HTML ☆

赞 0 踩 0

2603.10079 2026-04-14 cs.LG math.PR

Large Spikes in Stochastic Gradient Descent: A Large-Deviations View

Benjamin Gess, Daniel Heydecker

2602.14812 2026-04-14 cs.CL

Physical Commonsense Reasoning for Lower-Resourced Languages and Dialects: a Study on Basque

Jaione Bengoetxea, Itziar Gonzalez-Dios, Rodrigo Agerri

2602.13135 2026-04-14 cs.AI cs.LO

Constrained Assumption-Based Argumentation Frameworks

Emanuele De Angelis, Fabio Fioravanti, Maria Chiara Meo, Alberto Pettorossi, Maurizio Proietti, Francesca Toni

Comments Extended version with proofs and additional results of the full paper accepted at the 25th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2026). DOI: https://doi.org/10.65109/KRAP9309

2602.12748 2026-04-14 cs.AI cs.HC cs.SE

X-SYS: A Reference Architecture for Interactive Explanation Systems

Tobias Labarta, Nhi Hoang, Maximilian Dreyer, Jim Berend, Oleg Hein, Jackie Ma, Wojciech Samek, Sebastian Lapuschkin

Comments 18 pages, 8 figures

2602.10751 2026-04-14 cs.LG

Predicting integers from continuous parameters

Bas Maat, Peter Bloem

2602.10420 2026-04-14 cs.LG cs.IT eess.IV eess.SP math.IT

Binary Flow Matching: Prediction-Loss Space Alignment for Robust Learning

Jiadong Hong, Lei Liu, Xinyu Bian, Wenjie Wang, Zhaoyang Zhang

Comments 15 pages, 3 tables, 11 figures

2602.03402 2026-04-14 cs.AI cs.LG

Risk Awareness Injection: Calibrating Vision-Language Models for Safety without Compromising Utility

Mengxuan Wang, Yuxin Chen, Gang Xu, Tao He, Hongjie Jiang, Ming Li

2601.14706 2026-04-14 cs.CV

LookBench: A Live and Holistic Open Benchmark for Fashion Image Retrieval

Gensmo. ai, Chao Gao, Siqiao Xue, Jiwen Fu, Tingyi Gu, Shanshan Li, Fan Zhou

Comments The first two authors contributed equally to this work. Project site: https://serendipityoneinc.github.io/look-bench-page/

2601.14477 2026-04-14 cs.CV cs.AI eess.IV

XD-MAP: Cross-Modal Domain Adaptation via Semantic Parametric Maps for Scalable Training Data Generation

Frank Bieder, Hendrik Königshof, Haohao Hu, Fabian Immel, Yinzhe Shen, Jan-Hendrik Pauls, Christoph Stiller

Comments 10 pages, 7 figures, 3 tables, accepted at CVPRW

2601.14346 2026-04-14 cs.LG cs.AI

DiSPA: Differential Substructure-Pathway Attention for Drug Response Prediction

Yewon Han, Sunghyun Kim, Eunyi Jeong, Sungkyung Lee, Seokwoo Yun, Sangsoo Lim

2601.13844 2026-04-14 cs.LG

Optimal L2 Regularization in High-dimensional Continual Linear Regression

Gilad Karpel, Edward Moroshko, Ran Levinstein, Ron Meir, Daniel Soudry, Itay Evron

Comments Accepted to ALT 2026

2601.12104 2026-04-14 cs.CL cs.AI cs.CR

Powerful Training-Free Membership Inference Against Autoregressive Language Models

David Ilić, David Stanojević, Kostadin Cvejoski

Comments 9 pages, 2 figures; appendix with additional experiments and derivations

2601.12038 2026-04-14 cs.AI

Subargument Argumentation Frameworks: Separating Direct Conflict from Structural Dependency

Beishui Liao

Comments The original title, "Abstract Argumentation with Subargument Relations," has been replaced by "Subargument Argumentation Frameworks: Separating Direct Conflict from Structural Dependency"

2601.09270 2026-04-14 cs.CL

MCGA: A Multi-task Classical Chinese Literary Genre Audio Corpus

Yexing Du, Kaiyuan Liu, Bihe Zhang, Youcheng Pan, Bo Yang, Liangyu Huo, Xiyuan Zhang, Jian Xie, Daojing He, Yang Xiang, Ming Liu, Bing Qin

Comments Accepted in ACL 2026 (Findings)