arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 4033
专题追踪
2511.22565 2026-05-12 cs.AI cs.DB cs.LG

Counting Still Counts: Understanding Neural Complex Query Answering Through Query Relaxation

Yannick Brunink, Daniel Daza, Yunjie He, Michael Cochez

发表机构 * Translational AI Laboratory, Department of Laboratory Medicine Amsterdam University Medical Center, Vrije Universiteit Amsterdam(阿姆斯特丹大学医学中心转化人工智能实验室,实验室医学系,自由大学阿姆斯特丹) Vrije Universiteit Amsterdam(自由大学阿姆斯特丹) University of Stuttgart(斯图加特大学) ELLIS Institute Finland & Abo Akademi University, Turku, Finland(芬兰埃利斯研究所及图尔库芬兰阿博阿卡迪米大学) Elsevier discovery lab, Amsterdam(埃斯勒尔发现实验室,阿姆斯特丹)

AI总结 本文研究了神经网络在知识图谱上处理复杂查询(CQA)的能力,通过对比神经方法与一种无需训练的查询松弛策略,揭示了神经模型在推理模式上可能存在的局限性。研究发现,神经模型在多个数据集和查询结构上的表现并不一致优于查询松弛方法,且两者检索出的答案重叠较少,结合两者结果能提升性能。这一结果表明,当前神经CQA模型尚未完全涵盖查询松弛所捕捉的推理模式,强调了引入非神经基线和融合松弛原理对未来发展的重要性。

Comments Accepted in Transactions on Machine Learning Research (2026)

详情
英文摘要

Neural methods for Complex Query Answering (CQA) over knowledge graphs (KGs) are widely believed to learn patterns that generalize beyond explicit graph structure, allowing them to infer answers that are unreachable through symbolic query processing. In this work, we critically examine this assumption through a systematic analysis comparing neural CQA models with an alternative, training-free query relaxation strategy that retrieves possible answers by relaxing query constraints and counting resulting paths. Across multiple datasets and query structures, we find several cases where neural and relaxation-based approaches perform similarly, with no neural model consistently outperforming the latter. Moreover, a similarity analysis reveals that their retrieved answers exhibit little overlap, and that combining their outputs consistently improves performance. These results call for a re-evaluation of progress in neural query answering: despite their complexity, current models fail to subsume the reasoning patterns captured by query relaxation. Our findings highlight the importance of stronger non-neural baselines and suggest that future neural approaches could benefit from incorporating principles of query relaxation.

2511.07756 2026-05-12 cs.CV

Determinism of Randomness: Prompt-Residual Seed Shaping for Diffusion Generation

Song Yan, Wei Zhai, Chenfeng Wang, Xinliang Bi, Jian Yang, Yancheng Cai, Yusen Zhang, Yunwei Lan, Tao Zhang, GuanYe Xiong, Min Li, Zheng-Jun Zha

发表机构 * USTC(中国科学技术大学) Li Auto Inc.(利亚自动化公司) Xi’an High-tech Research Institute(西安高新技术研究院) Wechat Vision(微信视觉) Cambridge University(剑桥大学) HUST(华中科技大学)

AI总结 扩散模型从各向同性高斯潜在空间开始生成,但仅改变随机种子会导致生成结果在语义忠实度、构图和视觉质量上出现显著差异。本文通过分析从初始噪声到生成内容的语义映射,揭示了种子敏感性的几何原因:潜在空间中大多数方向对语义变化不敏感,而语义敏感的变化集中在较小的子空间内。基于这一发现,作者提出了一种无需训练的提示残差种子塑造方法,通过注入与语义变化相关的切向分量,将种子拉回到原始高斯分布的壳层,从而在保持先验兼容性的同时提升生成结果的对齐度和质量。

详情
英文摘要

Diffusion models start generation from an isotropic Gaussian latent, yet changing only the random seed can lead to large differences in prompt faithfulness, composition, and visual quality. We study this seed sensitivity through the semantic map from initial noise to generated meaning. Although the sampling flow is locally invertible, the subsequent semantic projection is many-to-one, inducing a degenerate pullback semi-metric on the latent space: most local directions are nearly semantic-invariant, while semantic-sensitive variation is concentrated in a much smaller horizontal subspace. This provides an explanatory geometric view of the seed lottery. Motivated by this view, we introduce a training-free prompt-residual seed-shaping procedure. Rather than claiming to recover the exact horizontal space, the method uses a single high-noise cold-start prompt residual as a model-coupled proxy, injects only its tangential component, and retracts the seed to the original Gaussian radius shell. This keeps the initialization prior-compatible while adding only one conditional/unconditional probe before standard sampling. Across multiple generation benchmarks, the method improves alignment and quality metrics over standard sampling, supporting both the practical value of the proxy and the explanatory relevance of semantic anisotropy.

2511.02623 2026-05-12 cs.CL

The Realignment Problem: When Right becomes Wrong in LLMs

Aakash Sen Sharma, Debdeep Sanyal, Manodeep Ray, Vivek Srivastava, Shirish Karande, Murari Mandal

发表机构 * Birla AI Labs(比拉人工智能实验室) TCS Research(塔塔咨询服务研究) Kalinga Institute of Industrial Technology, Bhubaneswar(比拉工业技术学院,巴布尔萨瓦尔)

AI总结 随着政策和价值观的变化,大型语言模型(LLMs)的对齐目标可能逐渐偏离现实需求,形成对齐-现实鸿沟。本文提出TRACE框架,通过分析现有数据中的对齐冲突,无需重新标注即可实现模型的再对齐。该方法利用一个更强的模型作为判断者,通过三阶段流程优化模型对齐效果,并在多个主流模型上验证了其有效性与通用性。

Comments ICML 2026

详情
英文摘要

Post-training alignment of large language models (LLMs) relies on large-scale human annotations guided by policy specifications that change over time. Cultural shifts, value reinterpretations, and regulatory or industrial updates make static alignment increasingly brittle. As policies evolve, deployed models can diverge from current alignment objectives, creating an Alignment-Reality Gap that is difficult to audit or correct. Existing remediation typically requires re-annotation under revised guidelines, which introduces systematic challenges, including guideline ambiguity, annotator interpretation drift, and reduced consistency at scale. We introduce TRACE (Triage and Re-align by Alignment Conflict Evaluation), a framework that transforms realignment into a structured optimization problem over existing data without requiring fresh human annotation. Leveraging a stronger model as a proxy judge, TRACE operates via a three-stage pipeline: (1) triaging preference pairs into inversion, suppression, or retention categories based on alignment conflicts; (2) computing an alignment impact score via bi-level optimization to prioritize high-leverage samples; and (3) executing updates using a hybrid objective that combines relational losses (e.g., IPO) for preference inversion and punitive losses (e.g., NPO) for response suppression. Experiments on Qwen2.5-7B, Gemma-2-9B, and Llama-3.1-8B demonstrate robust realignment on synthetic benchmarks and the PKU-SafeRLHF dataset without degrading general utility. This work provides a scalable approach for LLM realignment under evolving data annotation policies and alignment guidelines. We release our code: https://respailab.github.io/TRACE/

2511.01774 2026-05-12 cs.RO cs.SY eess.SY

MOBIUS: A Multi-Modal Bipedal Robot that can Walk, Crawl, Climb, and Roll

Alexander Schperberg, Yusuke Tanaka, Stefano Di Cairano, Dennis Hong

发表机构 * Mitsubishi Electric Research Laboratories(三菱电机研究实验室) Robotic Systems Lab(机器人系统实验室) Robotics and Mechanisms Laboratory(机器人与机构实验室) Department of Mechanical and Aerospace Engineering, University of California, Los Angeles(加州大学洛杉矶分校机械与航空航天工程系)

AI总结 本文介绍了MOBIUS平台,这是一种能够行走、爬行、攀爬和滚动的双足机器人。该机器人配备四条肢体,包括两只6自由度的机械臂和两只4自由度的腿,结合强化学习与力控制的混合架构,实现了多种运动模式的无缝切换和稳定操作。研究通过硬件实验验证了其在复杂地形中的适应性与操作能力,展示了形态设计、高层规划与控制紧密结合在移动操作与抓取任务中的重要性。

Comments Paper is accepted at the Robotics: Science and Systems conference, held in Sydney, Australia, July 13th-17th, 2026. Alexander Schperberg and Yusuke Tanaka are co-first authors. Both were at the Robotics and Mechanisms Laboratory (RoMeLa) at UCLA when the work started, and are now with Mitsubishi Electric Research Laboratories and ETH Zurich (RSL) respectively

详情
英文摘要

This paper presents the MOBIUS platform, a bipedal robot capable of walking, crawling, climbing, and rolling. MOBIUS features four limbs, two 6-DoF arms with two-finger grippers for manipulation and climbing, and two 4-DoF legs for locomotion--enabling smooth transitions across diverse terrains without reconfiguration. A hybrid control architecture combines reinforcement learning for locomotion and force control for compliant contact interactions during manipulation. A high-level MIQCP planner autonomously selects locomotion modes to balance stability and energy efficiency. Hardware experiments demonstrate robust gait transitions, dynamic climbing, and full-body load support via pinch grasp. Overall, MOBIUS demonstrates the importance of tight integration between morphology, high-level planning, and control to enable mobile loco-manipulation and grasping, substantially expanding its interaction capabilities, workspace, and traversability.

2510.27527 2026-05-12 cs.LG cs.AI

TetraJet-v2: Accurate NVFP4 Training for Large Language Models with Oscillation Suppression and Outlier Control

Yuxiang Chen, Yifan Liu, Xiaoming Xu, Pengle Zhang, Michael Beyer, Martin Rapp, Jun Zhu, Jianfei Chen

发表机构 * Dept. of Comp. Sci. and Tech., Institute for AI, BNRist Center, THBI Lab, Tsinghua-Bosch Joint ML Center, Tsinghua University(计算机科学与技术系,人工智能研究所,BNRist中心,THBI实验室,清华-博世联合机器学习中心,清华大学) Zhili College, Tsinghua University(紫荆学院,清华大学) Bosch AI Research, Renningen, Germany(博世人工智能研究,德国Renningen)

AI总结 大型语言模型(LLM)的训练成本极高,因此低精度全量化训练(FQT)受到广泛关注。本文提出 TetraJet-v2,一种基于 NVFP4 格式的端到端 4 位 FQT 方法,用于激活、权重和梯度的量化。针对低精度训练中的权重震荡和异常值问题,该方法引入了无偏双块量化、OsciReset 算法和 OutControl 算法,有效提升了训练稳定性和精度。实验表明,TetraJet-v2 在多个大规模模型上实现了接近 BF16 的性能,同时相比 FP8 方法提升了 1.67 倍的训练速度。

Journal ref Proceedings of the 43rd International Conference on Machine Learning, Seoul, South Korea. PMLR 306, 2026 (ICML 2026)

详情
英文摘要

Large Language Models (LLMs) training is prohibitively expensive, driving interest in low-precision fully-quantized training (FQT). While novel 4-bit formats like NVFP4 offer substantial efficiency gains, achieving near-lossless training at such low precision remains challenging. We introduce TetraJet-v2, an end-to-end 4-bit FQT method that leverages NVFP4 for activations, weights, and gradients in all linear layers. We identify two critical issues hindering low-precision LLM training: weight oscillation and outliers. To address these, we propose: 1) an unbiased double-block quantization method for NVFP4 linear layers with practically optimal convergence in LLM training, 2) OsciReset, the first effective algorithm to suppress LLMs' weight oscillation bottleneck, and 3) OutControl, a mix-precision algorithm to retain outlier accuracy. TetraJet-v2 outperforms prior methods on FP4 pre-training for LLMs across models up to 370M parameters trained up to 212B tokens, reducing the performance gap to BF16 by an average of 51.3% while enabling an 1.67x end-to-end speedup over FP8. The code is available at https://github.com/thu-ml/TetraJet-v2-NVFP4Training.

2510.25372 2026-05-12 cs.CV cs.LG

Prompt Estimation from Prototypes for Federated Prompt Tuning of Vision Transformers

M Yashwanth, Sharannya Ghosh, Aditay Tripathi, Anirban Chakraborty

发表机构 * Department of Computational and Data Sciences, Indian Institute of Science(计算与数据科学系,印度科学研究院) Accenture, Japan(日本Accenture公司) Google, India(印度Google公司) Indian Institute of Science(印度科学研究院)

AI总结 本文研究了如何在联邦学习环境下高效且通用地对视觉Transformer进行提示调优。为了解决全局提示调优泛化性差和个性化调优过拟合的问题,作者提出了PEP-FedPT框架,引入了一种基于类上下文混合提示(CCMP)的新方法,通过全局类原型和客户端类先验动态组合类特定提示,实现样本级提示个性化,而无需存储客户端参数。实验表明,该方法在多个数据集上优于现有方法,为联邦视觉Transformer调优提供了有效解决方案。

Comments Accepted to TMLR 2026

详情
英文摘要

Visual Prompt Tuning (VPT) of pre-trained Vision Transformers (ViTs) has proven highly effective as a parameter-efficient fine-tuning technique for adapting large models to downstream tasks with limited data. Its parameter efficiency makes it particularly suitable for Federated Learning (FL), where both communication and computation budgets are often constrained. However, global prompt tuning struggles to generalize across heterogeneous clients, while personalized tuning overfits to local data and lacks generalization. We propose PEP-FedPT (Prompt Estimation from Prototypes for Federated Prompt Tuning), a unified framework designed to achieve both generalization and personalization in federated prompt tuning of ViTs. Within this framework, we introduce the novel Class-Contextualized Mixed Prompt (CCMP) - based on class-specific prompts maintained alongside a globally shared prompt. For each input, CCMP adaptively combines class-specific prompts using weights derived from global class prototypes and client class priors. This approach enables per-sample prompt personalization without storing client-dependent trainable parameters. The prompts are collaboratively optimized via traditional federated averaging technique on the same. Comprehensive evaluations on CIFAR-100, TinyImageNet, DomainNet, and iNaturalist datasets demonstrate that PEP-FedPT consistently surpasses the state-of-the-art baselines under diverse data heterogeneity scenarios, establishing a strong foundation for efficient and generalizable federated prompt tuning of Vision Transformers.

2510.18184 2026-05-12 cs.LG cs.AI

ActivationReasoning: Logical Reasoning in Latent Activation Spaces

Lukas Helff, Ruben Härle, Wolfgang Stammer, Felix Friedrich, Manuel Brack, Antonia Wüst, Hikaru Shindo, Patrick Schramowski, Kristian Kersting

发表机构 * TU Darmstadt(图恩-达姆施塔特大学) Lab1141(Lab1141实验室) Aleph Alpha Research(Aleph Alpha研究) MPI-Inf, SIC(马克斯·普朗克研究所(MPI-Inf)) Meta FAIR Adobe Applied Research(Adobe应用研究) DFKI(DFKI研究所) CERTAIN, Germany(德国CERTAIN)

AI总结 大型语言模型(LLMs)在生成流畅文本方面表现出色,但其内部推理过程仍不透明且难以控制。为此,研究提出了一种名为ActivationReasoning(AR)的框架,通过在LLMs的潜在激活空间中嵌入显式的逻辑推理,使模型具备系统推理和行为引导的能力。该方法分三个阶段:首先通过稀疏自编码器(SAEs)识别并组织潜在概念表示,其次在推理时将激活的概念映射为逻辑命题,最后通过逻辑规则对这些命题进行推理,生成更高层次的结构、新概念并引导模型行为。实验表明,AR在多项推理任务中表现出良好的鲁棒性和泛化能力,为实现更透明、可控和可审计的AI提供了新路径。

Comments Proceedings of the 14th International Conference on Learning Representations (ICLR 2026)

详情
英文摘要

Large language models (LLMs) excel at generating fluent text, but their internal reasoning remains opaque and difficult to control. Sparse autoencoders (SAEs) make hidden activations more interpretable by exposing latent features that often align with human concepts. Yet, these features are fragile and passive, offering no mechanism for systematic reasoning or model control. To address this, we introduce ActivationReasoning (AR), a framework that embeds explicit logical reasoning into the latent space of LLMs. It proceeds in three stages: (1) Finding latent representations, first latent concept representations are identified (e.g., via SAEs) and organized into a dictionary; (2) Activating propositions, at inference time AR detects activating concepts and maps them to logical propositions; and (3)Logical reasoning, applying logical rules over these propositions to infer higher-order structures, compose new concepts, and steer model behavior. We evaluate AR on multi-hop reasoning (PrOntoQA), abstraction and robustness to indirect concept cues (Rail2Country), reasoning over natural and diverse language (ProverQA), and context-sensitive safety (BeaverTails). Across all tasks, AR scales robustly with reasoning complexity, generalizes to abstract and context-sensitive tasks, and transfers across model backbones. These results demonstrate that grounding logical structure in latent activations not only improves transparency but also enables structured reasoning, reliable control, and alignment with desired behaviors, providing a path toward more reliable and auditable AI.

2510.13397 2026-05-12 cs.LG stat.ML

Assessing the robustness of heterogeneous treatment effects in survival analysis under informative censoring

Yuxin Wang, Dennis Frauen, Jonas Schweisthal, Maresa Schröder, Stefan Feuerriegel

发表机构 * LMU Munich Munich Center of Machine Learning (MCML)(慕尼黑大学慕尼黑机器学习中心)

AI总结 在临床研究中,由于患者提前退出(dropout)现象普遍,且退出可能与生存时间相关(即信息性删失),导致治疗效果估计存在偏差。本文提出了一种假设较少的框架,用于在信息性删失下评估条件平均处理效应(CATE)估计的稳健性,通过部分识别方法推导出CATE的置信区间,从而识别出在存在信息性删失情况下治疗仍有效的患者子群。此外,作者还提出了一种新型的模型无关元学习方法SurvB-learner,能够与任意机器学习模型结合使用,具有双重稳健性和近似最优效率等良好理论性质,并通过仿真和真实数据实验验证了其有效性。

详情
英文摘要

Dropout is common in clinical studies, with up to half of patients leaving early due to side effects or other reasons. When dropout is informative (i.e., dependent on survival time), it introduces censoring bias, because of which treatment effect estimates are also biased. In this paper, we propose an assumption-lean framework to assess the robustness of conditional average treatment effect (CATE) estimates in survival analysis when facing censoring bias. Unlike existing works that rely on strong assumptions, such as non-informative censoring, to obtain point estimation, we use partial identification to derive informative bounds on the CATE. Thereby, our framework helps to identify patient subgroups where treatment is effective despite informative censoring. We further propose a novel model-agnostic meta-learner, called SurvB-learner, to estimate the bounds that can be used in combination with arbitrary machine-learning models, and that has favorable theoretical properties such as double-robustness and quasi-oracle efficiency. We finally demonstrate the effectiveness of our meta-learner across various experiments using both simulated and real-world data.

2510.11233 2026-05-12 cs.CL

CNSocialDepress: A Chinese Social Media Dataset for Depression Risk Detection and Structured Analysis

Jinyuan Xu, Tian Lan, Xintao Yu, Xue He, Hezhi Zhang, Ying Wang, Pierre Magistry, Mathieu Valette, Lei Li

发表机构 * Ertim Inalco Milkuya Studio Sorbonne Université(索邦大学) IRD Lab(IRD实验室) Faculty of Psychology, Peking University(北京大学心理学系) Faculty of Psychology and Cognitive Science, Beijing Normal University(北京师范大学心理学与认知科学系) Beijing Institute of Technology(北京理工大学)

AI总结 CNSocialDepress 是一个用于检测和结构化分析中文社交媒体中抑郁风险的基准数据集。该数据集包含233名用户的44,178条帖子,并由心理专家标注了10,306段与抑郁相关的内容,提供了二分类风险标签及多维心理属性信息,支持细粒度和可解释的抑郁信号分析。实验表明,该数据集在结构化心理画像和大语言模型微调等任务中具有良好的应用效果,为中文语境下的心理健康研究提供了重要资源。

详情
英文摘要

Depression is a pressing global public health issue, yet publicly available Chinese-language resources for depression risk detection remain scarce and largely focus on binary classification. To address this limitation, we release CNSocialDepress, a benchmark dataset for depression risk detection on Chinese social media. The dataset contains 44,178 posts from 233 users; psychological experts annotated 10,306 depression-related segments. CNSocialDepress provides binary risk labels along with structured, multidimensional psychological attributes, enabling interpretable and fine-grained analyses of depressive signals. Experimental results demonstrate the dataset's utility across a range of NLP tasks, including structured psychological profiling and fine-tuning large language models for depression detection. Comprehensive evaluations highlight the dataset's effectiveness and practical value for depression risk identification and psychological analysis, thereby providing insights for mental health applications tailored to Chinese-speaking populations.

2510.10730 2026-05-12 cs.LG cs.AI stat.ML

Provable Anytime Ensemble Sampling Algorithms in Nonlinear Contextual Bandits

Jiazheng Sun, Weixin Wang, Pan Xu

发表机构 * Duke University(杜克大学)

AI总结 本文提出了一种统一的算法框架,用于非线性上下文老虎机中的集成采样,并针对广义线性老虎机和神经网络上下文老虎机两种常见场景,分别给出了广义线性集成采样(GLM-ES)和神经网络集成采样(Neural-ES)方法,并证明了它们的高概率频繁主义遗憾界。研究通过在随机扰动数据上使用最大似然估计维护多个奖励模型参数估计器,解决了非线性模型中的理论挑战,并提供了无需固定时间步长的任意时间版本算法,具有较强的实用性和理论保证。实验结果表明,所提方法在实际中表现优异。

Comments 58 pages, 5 figures, 1 table

详情
英文摘要

We provide a unified algorithmic framework for ensemble sampling in nonlinear contextual bandits and develop corresponding regret bounds for two most common nonlinear contextual bandit settings: Generalized Linear Ensemble Sampling (GLM-ES) for generalized linear bandits and Neural Ensemble Sampling (Neural-ES) for neural contextual bandits. Both methods maintain multiple estimators for the reward model parameters via maximum likelihood estimation on randomly perturbed data. We prove high-probability frequentist regret bounds of $\widetilde{O}(d^{3/2} \sqrt{T} + d^{4})$ for GLM-ES and $\widetilde{O}(\widetilde{d}^{3/2} \sqrt{T})$ for Neural-ES, where $d$ is the dimension of feature vectors, $\widetilde{d}$ is the effective dimension of a neural tangent kernel (NTK) matrix and $T$ is the number of rounds. The regret bound of GLM-ES matches the state-of-the-art result of randomized exploration algorithms in generalized linear bandit setting. In the theoretical analysis, we introduce techniques that address challenges specific to nonlinear models. Practically, we remove fixed-time horizon assumption by developing anytime versions of our algorithms, suitable when $T$ is unknown. Finally, we empirically evaluate GLM-ES, Neural-ES and their anytime variants, demonstrating strong performance. Overall, our results establish ensemble sampling as a provable and practical randomized exploration approach for nonlinear contextual bandits.

2510.10606 2026-05-12 cs.CV

ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models

Yuqi Liu, Liangyu Chen, Jiazhen Liu, Mingkang Zhu, Zhisheng Zhong, Bei Yu, Jiaya Jia

发表机构 * The Chinese University of Hong Kong(香港中文大学) Renmin University of China(中国人民大学) The Hong Kong University of Science(香港科学大学)

AI总结 ViSurf 是一种统一的单阶段微调方法,旨在解决大型视觉-语言模型在知识注入与性能提升之间的矛盾。该方法结合了监督微调(SFT)和基于可验证奖励的强化学习(RLVR)的优势,通过将真实标签直接注入RLVR过程,实现外部监督与内部强化的同步优化。ViSurf 还引入了三种新的奖励控制策略以保障训练稳定性,实验表明其在多个基准测试中均优于单独使用SFT、RLVR或传统两阶段方法。

详情
英文摘要

Post-training Large Vision-and-Language Models (LVLMs) typically involves Supervised Fine-Tuning (SFT) for knowledge injection or Reinforcement Learning with Verifiable Rewards (RLVR) for performance enhancement. However, SFT often leads to sub-optimal performance, while RLVR remains constrained by the model's internal knowledge base. While a sequential SFT $\rightarrow$ RLVR pipeline can be used, it introduces significant computational overhead and suffers from catastrophic forgetting. To address these limitations, we propose ViSurf (\textbf{Vi}sual \textbf{Su}pervised-and-\textbf{R}einforcement \textbf{F}ine-Tuning), a unified, single-stage paradigm that integrates the strengths of both SFT and RLVR. By analyzing their training objectives, we establish a unified framework that injects ground-truth labels directly into RLVR rollouts, facilitating simultaneous external supervision and internal reinforcement. Furthermore, we introduce three novel reward control strategies to ensure training stability and optimization. Extensive experiments demonstrate that ViSurf consistently outperforms standalone SFT, RLVR, and the traditional two-stage pipeline across diverse benchmarks. In-depth analysis corroborates these findings, validating the derivation and design principles of ViSurf.

2510.07500 2026-05-12 cs.LG cs.IT math.IT

Black-Box Detection of LLM-Generated Text Using Generalized Jensen-Shannon Divergence

Shuangyi Chen, Ashish Khisti

发表机构 * Department of Electrical and Computer Engineering, University of Toronto, Toronto, Canada(电子与计算机工程系,多伦多大学,多伦多,加拿大)

AI总结 本文研究在实际约束下的黑盒检测问题,即在未知源模型与评分模型不匹配、且生成对比样本成本较高的情况下,如何检测机器生成的文本。提出了一种基于参考的检测方法 SurpMark,通过总结文本中 token 惊奇值的动态变化,利用离散化后的状态转移矩阵,并结合广义杰森-香农散度(GJS)与预设的人类和机器参考模型进行对比评分。实验表明,SurpMark 在多个数据集和生成模型上表现优异,具有良好的跨领域和跨生成器鲁棒性。

Comments ICML 2026

详情
英文摘要

We study black-box detection of machine-generated text under practical constraints: the scoring model (proxy LM) may mismatch the unknown source model, and per-input contrastive generation is costly. We propose SurpMark, a reference-based detector that summarizes a passage by the dynamics of its token surprisals. SurpMark discretizes surprisals into interpretable states, estimates a state-transition matrix for the test text, and scores it via a generalized Jensen-Shannon (GJS) gap between the test transitions and two fixed references (human vs. machine) built once from existing corpora. Theoretically, we derive design guidance for how the discretization bins should scale with data and provide a principled justification for our test statistic. Empirically, across multiple datasets, source models, and scenarios, SurpMark consistently matches or surpasses baselines, demonstrating strong robustness across domains and generators; our experiments on hyperparameter sensitivity exhibit trends that our theoretical results help to explain.

2510.04142 2026-05-12 cs.CV cs.AI cs.LG

Turning Drift into Constraint: Robust Reasoning Alignment in Non-Stationary Multi-Stream Environments

Xiaoyu Yang, En Yu, Wei Duan, Jie Lu

发表机构 * Australian Artificial Intelligence Institute (AAII)(澳大利亚人工智能研究所) Faulty of Engineering and Information Technology(工程与信息技术学院) University of Technology Sydney(悉尼技术大学) Australia(澳大利亚)

AI总结 本文研究了在非平稳多流环境中,如何从多个多模态大语言模型中实现鲁棒的推理对齐问题。针对源模型推理分布随时间演变带来的系统性偏差,作者提出了一种新的约束满足框架——自主偏好优化(APO),将模型间差异视为动态负约束,并通过两阶段策略实现对齐:先通过监督引导使目标模型具备源模型的联合能力,再通过约束感知优化生成一致的共识流形。实验表明,该方法在胸部X光解读任务中表现出优越的鲁棒性,并发布了包含七个多模态大模型推理轨迹的CXR-MAX基准数据集。

Comments ICML 2026

详情
英文摘要

This paper identifies a critical yet underexplored challenge in reasoning alignment from multiple multi-modal large language models (MLLMs): In non-stationary environments, the diverse reasoning distributions of source models often evolve unpredictably, transmitting systematic biases and drift to the target model. To address this, we formulate multi-source reasoning alignment as a constraint satisfaction problem under concept drift theory. We propose Autonomous Preference Optimization (APO), a novel framework that treats inter-model divergences not as noise, but as dynamic negative constraints. APO operates via a two-stage protocol: first, supervised bootstrapping projects the target model into the capability union of source models; second, constraint-aware optimization synthesizes a consistent consensus manifold by explicitly suppressing drifting trajectories via a multi-negative Plackett-Luce objective. Extensive experiments on chest X-ray interpretation demonstrate that our 7B model achieves superior robustness, outperforming even proprietary source models in average accuracy. Furthermore, we release CXR-MAX, a large-scale benchmark comprising 170,982 reasoning trajectories from seven large-scale MLLMs to facilitate research on reasoning alignment under drift. Code and data are available at: https://github.com/XiaoyuYoung/APO.

2510.03895 2026-05-12 cs.RO cs.CV

NoTVLA: Semantics-Preserving Robot Adaptation via Narrative Action Interfaces

Zheng Huang, Mingyu Liu, Xiaoyi Lin, Muzhi Zhu, Canyu Zhao, Zongze Du, Ye Lin, Xiaoman Li, Yiduo Jia, Hao Zhong, Hao Chen, Chunhua Shen

发表机构 * Zhejiang University(浙江大学)

AI总结 该研究提出了一种名为NoTVLA的语义保持型机器人自适应框架,旨在解决视觉-语言-动作(VLA)模型在实际部署中面临的灾难性遗忘问题。其核心方法是通过关注稀疏轨迹而非密集动作序列,结合时间压缩和空间推理剪枝策略,优化轨迹规划并降低计算需求。NoTVLA在多任务评估中表现出优于现有模型的性能,同时显著减少计算资源消耗,并无需依赖腕部摄像头,实现了跨平台部署与零样本泛化能力。

详情
英文摘要

Vision-Language-Action (VLA) models represent a pivotal advance in embodied intelligence, yet they confront critical barriers to real-world deployment, most notably catastrophic forgetting. This issue stems from their overreliance on continuous action sequences or action chunks, which inadvertently create isolated data silos that disrupt knowledge retention across tasks. To tackle these challenges, we propose the Narrowing of Trajectory VLA (NoTVLA) framework: a novel approach that narrows its focus to sparse trajectories, thereby avoiding the catastrophic forgetting associated with dense trajectory fine-tuning. A key innovation of NoTVLA lies in its trajectory planning strategy: instead of centering on the target object's trajectory, it leverages temporal compression and spatial reasoning pruning specifically for the robot end effector's trajectory. Furthermore, training is conducted using these sparse trajectories rather than dense action trajectories, an optimization that delivers remarkable practical advantages with better performance in zero-shot. In multi-task evaluation scenarios, NoTVLA achieves superior performance and generalization compared to pi0 while operating under two critical constraints: it uses over an order of magnitude less computing power than pi0 and requires no wrist-mounted camera. This design ensures that NoTVLA's operational accuracy closely approximates that of single-task expert models. Crucially, it also preserves the model's inherent language capabilities, enabling zero-shot generalization in specific scenarios, supporting unified model deployment across multiple robot platforms, and fostering a degree of generalization even when perceiving tasks from novel perspectives.

2510.00883 2026-05-12 cs.LG cs.AI

GLAI: GreenLightningAI for Accelerated Training through Knowledge Decoupling

Jose I. Mestre, Alberto Fernández-Hernández, Cristian Pérez-Corral, Manuel F. Dolz, Jose Duato, Enrique S. Quintana-Ortí

发表机构 * organization= Openchip \& Software Technologies , city= Barcelona , country= Spain

AI总结 本文提出了一种名为GreenLightningAI(GLAI)的新架构模块,旨在替代传统多层感知机(MLP),通过解耦训练过程中通常纠缠的结构知识和量化知识,实现更高效的训练。GLAI在结构稳定后固定其激活路径,仅优化数值参数,从而在保持MLP通用逼近能力的同时,显著提升了训练效率,平均减少约40%的训练时间。该模块具有通用性,可广泛应用于各类神经网络结构中,并在多种实验设置下表现出与MLP相当或更优的性能。

Comments 20 pages, 2 figures

详情
英文摘要

In this work we introduce GreenLightningAI (GLAI), a new architectural block designed as an alternative to conventional MLPs. The central idea is to separate two types of knowledge that are usually entangled during training: (i) *structural knowledge*, encoded by the stable activation patterns induced by ReLU activations; and (ii) *quantitative knowledge*, carried by the numerical weights and biases. By fixing the structure once stabilized, GLAI reformulates the MLP as a combination of paths, where only the quantitative component is optimized. This reformulation retains the universal approximation capabilities of MLPs, yet achieves a more efficient training process, reducing training time by ~40% on average across the cases examined in this study. Crucially, GLAI is not just another classifier, but a generic block that can replace MLPs wherever they are used, from supervised heads with frozen backbones to projection layers in self-supervised learning or few-shot classifiers. Across diverse experimental setups, GLAI consistently matches or exceeds the accuracy of MLPs with an equivalent number of parameters, while converging faster. Overall, GLAI establishes a new design principle that opens a direction for future integration into large-scale architectures such as Transformers, where MLP blocks dominate the computational footprint.

2509.25080 2026-05-12 cs.LG

Towards a Certificate of Trust: Task-Aware OOD Detection for Scientific AI

Bogdan Raonić, Siddhartha Mishra, Samuel Lanthaler

发表机构 * Seminar for Applied Mathematics, ETH Zurich(应用数学研究所,苏黎世联邦理工学院) ETH AI Center(苏黎世人工智能中心)

AI总结 在科学人工智能领域,数据驱动模型在天气预测和流体力学等关键任务中广泛应用,但其在面对分布外(OOD)数据时可能失效,如何检测此类失效仍是回归任务中的挑战。本文提出一种基于分数扩散模型的联合似然估计方法,结合输入数据与回归模型预测结果,生成任务感知的可靠性评分。实验表明,该方法在多个科学数据集上能有效反映预测误差,为构建可验证的“信任证书”提供了基础,有助于评估科学人工智能预测的可信度。

详情
英文摘要

Data-driven models are increasingly adopted in critical scientific fields like weather forecasting and fluid dynamics. These methods can fail on out-of-distribution (OOD) data, but detecting such failures in regression tasks is an open challenge. We propose a new OOD detection method based on estimating joint likelihoods using a score-based diffusion model. This approach considers not just the input but also the regression model's prediction, providing a task-aware reliability score. Across numerous scientific datasets, including PDE datasets, satellite imagery and brain tumor segmentation, we show that this likelihood strongly correlates with prediction error. Our work provides a foundational step towards building a verifiable 'certificate of trust', thereby offering a practical tool for assessing the trustworthiness of AI-based scientific predictions. Our code is publicly available at https://github.com/bogdanraonic3/OOD_Detection_ScientificML

2509.24244 2026-05-12 cs.AI

Model Merging Scaling Laws in Large Language Models

Yuanyi Wang, Yanggan Gu, Yiming Zhang, Qi Zhou, Zhaoyi Yan, Congkai Xie, Xinyao Wang, Jianbo Yuan, Hongxia Yang

发表机构 * The Hong Kong Polytechnic University (PolyU)(香港理工大学) Amazon(亚马逊) Innovation Research Institute(创新研究院)

AI总结 本文研究了大语言模型中模型合并的规模定律,通过交叉熵进行衡量。作者发现了一个简洁的幂律关系,揭示了模型规模与专家数量之间的联系,并指出随着模型容量增大,合并效果的下限降低,而专家数量带来的收益则呈现边际递减趋势。该定律适用于不同领域和多种合并方法,能够解释合并过程中收益快速衰减和波动减小的现象,并为模型合并提供了预测性规划的理论依据,为分布式生成式AI系统的发展提供了可预测的扩展原则。

Comments ICML 2026

详情
英文摘要

We study empirical scaling laws for language model merging measured by cross-entropy. Despite its wide practical use, merging lacks a quantitative rule that predicts returns as we add experts or scale the model size. We identify a compact power law that links model size and expert number: the size-dependent floor decreases with model capacity, while the merging tail exhibits clear diminishing returns in the number of experts. The law holds in-domain and cross-domain, tightly fits measured curves across diverse architectures and methods (Average, TA, TIES, DARE), and explains two robust regularities: most gains arrive early, and variability shrinks as more experts are included. Building on this, we present a simple theory that explains why gains fall roughly as 1/k and links the floor and tail to properties of the base model and the diversity across domains. This law enables predictive planning: estimate how many experts are needed to reach a target loss, decide when to stop adding experts, and trade off scaling the base model versus adding experts under a fixed budget--turning merging from heuristic practice into a computationally efficient, planable alternative to multitask training. This suggests a scaling principle for distributed generative AI: predictable gains can be achieved by composing specialists, offering a complementary path toward AGI-level systems.

2509.21892 2026-05-12 cs.CL cs.AI cs.LG

Elastic MoE: Unlocking the Inference-Time Scalability of Mixture-of-Experts

Naibin Gu, Zhenyu Zhang, Yuchen Feng, Yilong Chen, Peng Fu, Zheng Lin, Shuohuan Wang, Yu Sun, Hua Wu, Weiping Wang, Haifeng Wang

发表机构 * Institute of Information Engineering, Chinese Academy of Sciences(中国科学院信息工程研究所) School of Cyber Security, University of Chinese Academy of Sciences(中国科学院大学网络安全学院) Baidu Inc.(百度公司)

AI总结 本文研究了混合专家(MoE)模型在推理时动态调整激活专家数量以适应不同硬件和负载需求的问题。传统MoE模型在训练和推理时固定激活专家数,难以应对实际场景中的变化。作者提出了一种新的训练框架Elastic MoE(EMoE),通过同时训练专家在不同组合下的协作能力,并引导路由器做出高质量选择,从而在推理时弹性调整激活专家数量,显著提升了模型在不同预算下的性能表现。实验表明,EMoE在多个大规模MoE架构和基准测试中均取得了更广的扩展范围和更高的峰值性能。

详情
英文摘要

Mixture-of-Experts (MoE) models typically fix the number of activated experts $k$ at both training and inference. However, real-world deployments often face heterogeneous hardware, fluctuating workloads, and diverse quality-latency requirements, while training separate models for each scenario is costly. Considering that MoE models already operate with sparse activation, adjusting the number of activated experts offers a natural path to serving diverse budgets with a single model. Yet, we find that activating more experts $k'$ ($> k$) at inference does not yield the expected gains. Instead, performance degrades rapidly after only a slight increase, a phenomenon we term the \textit{inference-time scaling wall}. Further investigation reveals that this degradation stems from a lack of learned collaboration among experts. To address this, we introduce \textbf{Elastic Mixture-of-Experts (EMoE)}, a novel training framework that enables MoE models to elastically vary the number of activated experts at inference. By simultaneously training experts to collaborate in diverse combinations and encouraging the router to make high-quality selections, EMoE ensures robust performance across inference budgets. Extensive experiments across four MoE architectures (7B--21B) and nine benchmarks show that EMoE significantly expands the effective scaling range to 2-3$\times$ the training-time $k$, while also achieving higher peak performance.

2509.21000 2026-05-12 cs.LG math.OC

Feature Augmentation of GNNs for ILPs: Local Uniqueness Suffices

Qingyu Han, Qian Li, Linxin Yang, Qian Chen, Qingjiang Shi, Ruoyu Sun

发表机构 * School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, China(深圳大学理工学院,香港中文大学(深圳)) Shenzhen International Center for Industrial and Applied Mathematics, Shenzhen Research Institute of Big Data, China(深圳国际工业与应用数学中心,深圳大数据研究院) School of Data Science, The Chinese University of Hong Kong, Shenzhen, China(数据科学学院,香港中文大学(深圳)) School of Software Engineering, Tongji University, Shanghai, China(软件工程学院,同济大学)

AI总结 本文研究了如何提升图神经网络(GNN)在求解整数线性规划(ILP)问题中的表现。传统GNN因缺乏节点唯一标识而表达能力受限,而引入全局唯一标识(UID)又会导致泛化性能下降。为此,作者提出了一种局部唯一标识(Local-UID)方案,仅在每个节点的d-hop邻域内保证唯一性,并基于此设计了ColorGNN和ColorUID模型。实验表明,该方法在保持表达能力的同时显著提升了模型在ILP任务上的泛化性能。

Comments 19 pages, 9 Tables

详情
英文摘要

Integer Linear Programs (ILPs) are central to real-world optimizations but notoriously difficult to solve. Learning to Optimize (L2O) has emerged as a promising paradigm, with Graph Neural Networks (GNNs) serving as the standard backbone. However, standard anonymous GNNs are limited in expressiveness for ILPs, and the common enhancement of augmenting nodes with globally unique identifiers (UIDs) typically introduces spurious correlations that severely harm generalization. To address this tradeoff, we propose a parsimonious Local-UID scheme based on d-hop uniqueness coloring, which ensures identifiers are unique only within each node's d-hop neighborhood. Building on this scheme, we introduce ColorGNN, which incorporates color information via color-conditioned embeddings, and ColorUID, a lightweight feature-level variant. We prove that for d-layer networks, Local-UIDs achieve the expressive power of Global-UIDs while offering stronger generalization. Extensive experiments show that our approach yields substantial and robust gains across ILP benchmarks.

2509.20863 2026-05-12 cs.CL

GIFT: Guided Importance-Aware Fine-Tuning for Diffusion Language Models

Guowei Xu, Wenxin Xu, Jiawang Zhao, Kaisheng Ma

发表机构 * Institute for Interdisciplinary Information Sciences(交叉信息科学研究院)

AI总结 本文提出了一种针对扩散语言模型的指导性重要性感知微调方法GIFT,旨在解决其在监督微调过程中因缺乏精确概率估计而导致的生成不稳定问题。该方法通过基于词元熵值分配不同重要性权重,引导模型更关注关键生成步骤,从而提升生成一致性和准确性。实验表明,GIFT在多个主流数据集和不同微调设置下均优于传统微调方法,在四个广泛使用的推理基准测试中表现出显著性能提升。

Comments preprint

详情
英文摘要

Diffusion models have recently shown strong potential in language modeling, offering faster generation compared to traditional autoregressive approaches. However, applying supervised fine-tuning (SFT) to diffusion models remains challenging, as they lack precise probability estimates at each denoising step. While the diffusion mechanism enables the model to reason over entire sequences, it also makes the generation process less predictable and often inconsistent. This highlights the importance of controlling key tokens that guide the direction of generation. To address this issue, we propose GIFT, an importance-aware finetuning method for diffusion language models, where tokens are assigned different importance weights based on their entropy. Derived from diffusion theory, GIFT delivers substantial gains: across diverse settings including different mainstream training datasets ranging from 1k to 10k in size, utilizing LoRA or full parameter fine-tuning, and training on base or instruct models, GIFT consistently achieves superior overall performance compared to standard SFT on four widely used reasoning benchmarks (Sudoku, Countdown, GSM8K, and MATH-500).

2509.20294 2026-05-12 cs.LG math.ST stat.TH

Alignment-Sensitive Minimax Rates for Spectral Algorithms with Learned Kernels

Dongming Huang, Zhifan Li, Yicheng Li, Qian Lin

发表机构 * Department of Statistics and Data Science, National University of Singapore, Singapore(新加坡国立大学统计与数据科学系) School of Statistics and Mathematics, Zhongnan University of Economics and Law, Wuhan, China(中南财经政法大学统计与数学学院) Department of Statistics and Data Science, Tsinghua University, Beijing, China(清华大学统计与数据科学系)

AI总结 本文研究了在核函数从数据中学习的背景下谱算法的泛化性能,引入了一个新的复杂度度量——有效跨度维度(ESD),该度量考虑了信号、谱和噪声水平的联合影响,适用于任意核和信号,无需依赖特征值衰减条件。研究证明,当序列模型的ESD不超过$K$时,最小最大超额风险与$σ^2 K$成比例,并分析了过参数化梯度流如何降低ESD,从而提升谱算法的泛化能力。该框架拓展到了线性模型和再生核希尔伯特空间回归,并通过数值实验验证了理论结果,为理解自适应特征学习与泛化性能的关系提供了新视角。

详情
英文摘要

We study spectral algorithms in the setting where kernels are learned from data. We introduce the effective span dimension (ESD), an alignment-sensitive complexity measure that depends jointly on the signal, spectrum, and noise level $σ^2$. The ESD is well-defined for arbitrary kernels and signals without requiring eigen-decay conditions or source conditions. We prove that for sequence models whose ESD is at most $K$, the minimax excess risk scales as $σ^2 K$. Furthermore, we analyze over-parameterized gradient flow and prove that it can reduce the ESD. This finding establishes a connection between adaptive feature learning and provable improvements in generalization of spectral algorithms. We demonstrate the generality of the ESD framework by extending it to linear models and RKHS regression, and we support the theory with numerical experiments. This framework provides a novel perspective on generalization beyond traditional fixed-kernel theories.

2509.17815 2026-05-12 cs.LG math.OC

Global Optimization via Softmin Energy Minimization

Andrea Agazzi, Vittorio Carlei, Marco Romito, Samuele Saviozzi

发表机构 * Department of Mathematics(数学系) University of Pisa(比萨大学) Institute of Mathematical Statistics and Actuarial Science(数学统计与精算科学研究所) University of Bern(伯尔尼大学) Department of Buisness Economics(商业经济学系) University Gabreiele D’annunzio(加布里埃尔·达·安奇奥尼奥大学)

AI总结 本文研究了非凸函数的全局优化问题,针对传统梯度方法易陷入局部极小和元启发式方法缺乏理论保证的不足,提出了一种基于软最小能量函数的梯度粒子群优化方法。该方法通过引入平滑的软最小能量函数和布朗运动项,结合时间依赖参数控制平滑度,实现了粒子群在探索与收敛之间的有效平衡。理论分析表明,该方法在强凸函数下能保证至少一个粒子收敛到全局最优,且在逃离局部极小方面优于模拟退火方法,数值实验进一步验证了其有效性。

详情
英文摘要

Global optimization, particularly for non-convex functions with multiple local minima, poses significant challenges for traditional gradient-based methods. While metaheuristic approaches offer empirical effectiveness, they often lack theoretical convergence guarantees and may disregard available gradient information. This paper introduces a novel gradient-based swarm particle optimization method designed to efficiently escape local minima and locate global optima. Our approach leverages a "Soft-min Energy" interacting function, $J_β(\mathbf{x})$, which provides a smooth, differentiable approximation of the minimum function value within a particle swarm. We define a stochastic gradient flow in the particle space, incorporating a Brownian motion term for exploration and a time-dependent parameter $β$ to control smoothness, similar to temperature annealing. We theoretically demonstrate that for strongly convex functions, our dynamics converges to a stationary point where at least one particle reaches the global minimum, with other particles exhibiting exploratory behavior. Furthermore, we show that our method facilitates faster transitions between local minima by reducing effective potential barriers with respect to Simulated Annealing. More specifically, we estimate the hitting times of unexplored potential wells for our model in the small noise regime and show that they compare favorably with the ones of overdamped Langevin. Numerical experiments on benchmark functions, including double wells and the Ackley function, validate our theoretical findings and demonstrate better performance over the well-known Simulated Annealing method in terms of escaping local minima and achieving faster convergence.

2509.12982 2026-05-12 cs.RO cs.AI cs.SE

Out of Distribution Detection in Self-adaptive Robots with AI-powered Digital Twins

Erblin Isaku, Hassan Sartaj, Shaukat Ali, Beatriz Sanguino, Tongtong Wang, Guoyuan Li, Houxiang Zhang, Thomas Peyrucain

发表机构 * Simula Research Laboratory(Simula研究实验室) University of Oslo(奥斯陆大学) Norwegian University of Science and Technology(挪威科学技术大学) PAL Robotics(PAL机器人)

AI总结 本文研究了自适应机器人在复杂不确定环境中检测分布外(OOD)行为的问题,提出了一种基于数字孪生的解决方案ODiSAR。该方法利用基于Transformer的数字孪生模型预测机器人状态,并通过重构误差和蒙特卡洛dropout进行不确定性量化,从而有效检测未知条件下的OOD行为。实验表明,ODiSAR在工业机器人场景中实现了高达98%的AUROC和96%的TNR@TPR95等优异检测性能,同时提供了可解释的洞察以支持机器人的自适应能力。

Comments 15 pages, 4 figures, 3 tables

Journal ref 2025 40th IEEE/ACM International Conference on Automated Software Engineering (ASE)

详情
英文摘要

Self-adaptive robots (SARs) in complex, uncertain environments must proactively detect and address abnormal behaviors, including out-of-distribution (OOD) cases. To this end, digital twins offer a valuable solution for OOD detection. Thus, we present a digital twin-based approach for OOD detection (ODiSAR) in SARs. ODiSAR uses a Transformer-based digital twin to forecast SAR states and employs reconstruction error and Monte Carlo dropout for uncertainty quantification. By combining reconstruction error with predictive variance, the digital twin effectively detects OOD behaviors, even in previously unseen conditions. The digital twin also includes an explainability layer that links potential OOD to specific SAR states, offering insights for self-adaptation. We evaluated ODiSAR by creating digital twins of two industrial robots: one navigating an office environment, and another performing maritime ship navigation. In both cases, ODiSAR forecasts SAR behaviors (i.e., robot trajectories and vessel motion) and proactively detects OOD events. Our results showed that ODiSAR achieved high detection performance -- up to 98\% AUROC, 96\% TNR@TPR95, and 95\% F1-score -- while providing interpretable insights to support self-adaptation.

2509.10737 2026-05-12 cs.CL cs.LG

PolyTruth: Multilingual Disinformation Detection using Transformer-Based Language Models

Zaur Gouliev, Jennifer Waters, Chengqian Wang

发表机构 * School of Information & Communication Studies(信息与通信研究学院) University College Dublin(都柏林大学学院)

AI总结 本文提出 PolyTruth,一种基于 Transformer 的多语言虚假信息检测方法,旨在解决当前 AI 模型主要依赖英语数据而忽视多语言环境的问题。研究系统比较了五种多语言 Transformer 模型在统一的真假分类任务上的表现,并构建了一个包含 60,486 对多语言声明的 PolyTruth 数据集,涵盖五大语言系和多个主题领域。实验发现,如 RemBERT 等模型在低资源语言中表现更优,而 mBERT 和 XLM 在数据稀缺时存在明显局限,研究结果为多语言虚假信息检测的模型选择和实际应用提供了重要参考。

Comments 11 pages, 5 figures, 4 tables. Submitted to arXiv in Computation and Language

Journal ref Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2025, Communications in Computer and Information Science, vol. 2843, pp. 353-367, Springer, Cham (2026)

详情
英文摘要

Disinformation spreads rapidly across linguistic boundaries, yet most AI models are still benchmarked only on English. We address this gap with a systematic comparison of five multilingual transformer models: mBERT, XLM, XLM-RoBERTa, RemBERT, and mT5 on a common fake-vs-true machine learning classification task. While transformer-based language models have demonstrated notable success in detecting disinformation in English, their effectiveness in multilingual contexts still remains up for debate. To facilitate evaluation, we introduce PolyTruth Disinfo Corpus, a novel corpus of 60,486 statement pairs (false claim vs. factual correction) spanning over twenty five languages that collectively cover five language families and a broad topical range from politics, health, climate, finance, and conspiracy, half of which are fact-checked disinformation claims verified by an augmented MindBugs Discovery dataset. Our experiments revealed performance variations. Models such as RemBERT achieved better overall accuracy, particularly excelling in low-resource languages, whereas models like mBERT and XLM exhibit considerable limitations when training data is scarce. We provide a discussion of these performance patterns and implications for real-world deployment. The dataset is publicly available on our GitHub repository to encourage further experimentation and advancement. Our findings illuminate both the potential and the current limitations of AI systems for multilingual disinformation detection.

2509.08031 2026-05-12 cs.SD cs.AI cs.LG eess.AS

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Hoang Nguyen, Sidharth Surapaneni, Akshay Kalkunte, Jash Mehta, Aman Tiwari, Oluwanifemi Bamgbose, Khyati Mahajan, Jash Shah, Shruthan Radhakrishna, Sathwik Tejaswi Madhusudhan, Vikas Yadav, Sai Rajeswar

发表机构 * ServiceNow University of Texas at Austin(德克萨斯大学奥斯汀分校)

AI总结 随着大音频语言模型(LALMs)的快速发展,其评估工具仍面临效率低、标准化不足等问题,限制了模型的公平比较和系统评估。为此,本文提出AU-Harness,一个高效且全面的评估框架,通过优化的批量处理和并行执行,实现比现有工具快151%的评估速度,并提供标准化的提示协议和灵活配置,支持多轮对话分析,揭示LALMs的真实音频推理能力,推动模型的系统性发展。

详情
英文摘要

Large Audio Language Models (LALMs) are rapidly advancing, but evaluating them remains challenging due to inefficient and non-standardized toolkits that limit fair comparison and systematic assessment. Existing evaluation frameworks exhibit three critical limitations: (1) slow and inefficient processing pipeline that bottlenecks large-scale studies, (2) inadequate multi-turn dialogue support, leaving fundamental questions about cross-turn context integration and performance dynamics over extended conversations in LALMs unanswered; and (3) the absence of unified and scalable evaluation framework capable of keeping pace with the rapid growth of both LALMs and audio benchmarks. To address these issues, we introduce AU-Harness, an efficient and comprehensive evaluation framework for LALMs. Our system achieves a speedup of up to 151% over existing evaluation toolkits through optimized batch processing and parallel execution, enabling large-scale evaluations previously considered impractical. We provide standardized prompting protocols and flexible configurations for fair model comparison across diverse scenarios. AU-Harness unlocks a range of in-depth analyses difficult to conduct without a unified foundation, including multi-turn dialogue dynamics, enabling the study of true audio reasoning capabilities in existing LALMs. AU-Harness provides both practical evaluation tools and insights into model limitations, advancing systematic LALM development.

2508.20325 2026-05-12 cs.CL cs.AI cs.CV

GUARD: Guideline Upholding Test through Adaptive Role-play and Jailbreak Diagnostics for LLMs

Haibo Jin, Ruoxi Chen, Peiyan Zhang, Andy Zhou, Zelei Cheng, Haohan Wang

发表机构 * Hong Kong University of Science and Technology(香港理工大学) Lapis Labs(Lapis实验室) Capital One

AI总结 随着大型语言模型(LLMs)在各领域应用日益广泛,其生成有害内容的潜在风险引发了社会和监管方面的关注。为验证LLMs是否符合政府发布的伦理指南,本文提出GUARD方法,通过自动生成违反指南的问题并结合“越狱”检测技术,评估模型对指南的遵循程度。该方法不仅能够识别直接违反指南的响应,还能发现可能绕过安全机制的潜在违规场景,并已在多个主流LLMs上进行了实证验证,展示了其在提升模型可靠性方面的有效性。

Comments 56 pages

详情
英文摘要

As Large Language Models (LLMs) become increasingly integral to various domains, their potential to generate harmful responses has prompted significant societal and regulatory concerns. In response, governments have issued ethics guidelines to promote the development of trustworthy AI. However, these guidelines are typically high-level demands for developers and testers, leaving a gap in translating them into actionable testing questions to verify LLM compliance. To address this challenge, we introduce GUARD (Guideline Upholding Test through Adaptive Role-play and Jailbreak Diagnostics), a testing method designed to operationalize guidelines into specific guideline-violating questions that assess LLM adherence. To implement this, GUARD uses automated generation of guideline-violating questions based on government-issued guidelines, thereby testing whether responses comply with these guidelines. When responses directly violate guidelines, GUARD reports inconsistencies. Furthermore, for responses that do not directly violate guidelines, GUARD integrates the concept of ``jailbreaks'' to diagnostics, named GUARD-JD, which creates scenarios that provoke unethical or guideline-violating responses, effectively identifying potential scenarios that could bypass built-in safety mechanisms. Our method finally culminates in a compliance report, delineating the extent of adherence and highlighting any violations. We empirically validated the effectiveness of GUARD on eight LLMs, including Vicuna-13B, LongChat-7B, Llama2-7B, Llama-3-8B, GPT-3.5, GPT-4, GPT-4o, and Claude-3.7, by testing compliance under three government-issued guidelines and conducting jailbreak diagnostics. Additionally, GUARD-JD can transfer jailbreak diagnostics to vision-language models (MiniGPT-v2 and Gemini-1.5), demonstrating its usage in promoting reliable LLM-based applications.

2508.14137 2026-05-12 cs.LG

Learning to Learn the Macroscopic Fundamental Diagram using Physics-Informed and meta Machine Learning techniques

Amalie Roark, Serio Agriesti, Francisco Camara Pereira, Guido Cantelmo

发表机构 * Technical University of Denmark(丹麦技术大学)

AI总结 该研究旨在解决宏观基本图(MFD)估计中因检测器数量不足导致的数据稀缺问题,提出了一种结合元学习与物理信息神经网络的框架。通过从数据丰富的城市中学习可迁移的模式,并将其应用于数据有限的城市,该方法显著提升了MFD预测的准确性,平均将流量预测的平均绝对误差降低了约50%。实验表明,该元学习框架在不同城市和拓扑结构中具有良好的泛化能力,为在实际交通管理中应用提供了有效解决方案。

Comments Version accepted for publication in Transportation Research Part C (before proof-reading)

Journal ref Learning to learn the macroscopic fundamental diagram using physics-informed and model agnostic machine learning. Transportation Research Part C: Emerging Technologies, 2026, 189, 105707

详情
英文摘要

The Macroscopic Fundamental Diagram is a popular tool used to describe traffic dynamics in an aggregated way, with applications ranging from traffic control to incident analysis. However, estimating the MFD for a given network requires large numbers of loop detectors, which is not always available in practise. This article proposes a framework to alleviate the data scarcity challenge harnessing Meta-Learning, a subcategory of Machine Learning that trains models to understand and adapt to new tasks on their own. We use Meta-Learning to identify and exploit transferable patterns from data-rich cities to cities where not enough data is available to estimate the MFD. The developed model is trained and tested by leveraging data from multiple cities and exploiting it to model the MFD of other cities with different shares of detectors and topological structures. The proposed Meta-Learning framework is applied to an ad-hoc Multi-Task Physics-Informed Neural Network, specifically designed to estimate the MFD. Results show an average MAE improvement in flow prediction of around 50% across cities (depending on the subset of loop detectors tested). The Meta-Learning framework thus successfully generalises across diverse urban settings and improves performance on cities with limited data, demonstrating the potential of using Meta-Learning when a limited number of detectors is available. We directly test this assumption by applying the Meta-Learning outputs to unseen cities to simulate a real-life application scenario and the wide applicability of the proposed methodology. Finally, the proposed framework is validated against traditional Transfer Learning approaches and tested with FitFun, a model for FD estimation from the literature, to prove its transferability.

2508.13813 2026-05-12 cs.LG cs.AI

Assessing Trustworthiness of AI Training Dataset using Subjective Logic -- A Use Case on Bias

Koffi Ismael Ouattara, Ioannis Krontiris, Theo Dimitrakos, Frank Kargl

发表机构 * Huawei Technologies(华为技术有限公司) Ulm Universität(乌尔姆大学)

AI总结 随着AI系统对训练数据的依赖日益增加,评估数据集的可信度变得尤为重要,尤其是在数据集层面出现的公平性或偏见等属性。本文首次提出了一种基于主观逻辑的正式框架,用于评估AI训练数据集的可信度,能够在证据不完整、分布或冲突的情况下对全局属性(如偏见)进行不确定性感知的评估。该方法在交通标志识别数据集上的实验表明,其能够有效捕捉类别不平衡现象,并在集中式和联邦学习场景中保持良好的可解释性和鲁棒性。

Comments Accepted at ECML PKDD Bias Workshop '25

详情
英文摘要

As AI systems increasingly rely on training data, assessing dataset trustworthiness has become critical, particularly for properties like fairness or bias that emerge at the dataset level. Prior work has used Subjective Logic to assess trustworthiness of individual data, but not to evaluate trustworthiness properties that emerge only at the level of the dataset as a whole. This paper introduces the first formal framework for assessing the trustworthiness of AI training datasets, enabling uncertainty-aware evaluations of global properties such as bias. Built on Subjective Logic, our approach supports trust propositions and quantifies uncertainty in scenarios where evidence is incomplete, distributed, and/or conflicting. We instantiate this framework on the trustworthiness property of bias, and we experimentally evaluate it based on a traffic sign recognition dataset. The results demonstrate that our method captures class imbalance and remains interpretable and robust in both centralized and federated contexts.

2508.06248 2026-05-12 cs.CV

Deepfake Detection that Generalizes Across Benchmarks

Andrii Yermakov, Jan Cech, Jiri Matas, Mario Fritz

发表机构 * Czech Technical University in Prague(捷克技术大学布拉格分校) CISPA Helmholtz Center for Information Security(CISPA海德堡信息安全中心)

AI总结 本文研究了如何使深度伪造检测方法在面对未知的伪造技术时仍具有良好的泛化能力。提出了一种名为GenD的方法,仅通过微调预训练视觉编码器中的层归一化参数(占总参数的0.03%),结合L2归一化和度量学习,实现了高效的泛化性能。实验表明,该方法在14个不同年份的基准数据集上取得了最先进的结果,证明了在保持模型简洁性的同时,也能实现强大的跨数据集检测能力。

详情
英文摘要

The generalization of deepfake detectors to unseen manipulation techniques remains a challenge for practical deployment. Although many approaches adapt foundation models by introducing significant architectural complexity, this work demonstrates that robust generalization is achievable through a parameter-efficient adaptation of one of the foundational pre-trained vision encoders. The proposed method, GenD, fine-tunes only the Layer Normalization parameters (0.03% of the total) and enhances generalization by enforcing a hyperspherical feature manifold using L2 normalization and metric learning on it. We conducted an extensive evaluation on 14 benchmark datasets spanning from 2019 to 2025. The proposed method achieves state-of-the-art performance, outperforming more complex, recent approaches in average cross-dataset AUROC. Our analysis yields two primary findings for the field: 1) training on paired real-fake data from the same source video is essential for mitigating shortcut learning and improving generalization, and 2) detection difficulty on academic datasets has not strictly increased over time, with models trained on older, diverse datasets showing strong generalization capabilities. This work delivers a computationally efficient and reproducible method, proving that state-of-the-art generalization is attainable by making targeted, minimal changes to a pre-trained foundational image encoder model. The code is at: https://github.com/yermandy/GenD

2508.05463 2026-05-12 cs.LG cs.AI physics.soc-ph

Task complexity shapes internal representations and robustness in neural networks

Robert Jankowski, Filippo Radicchi, M. Ángeles Serrano, Marián Boguñá, Santo Fortunato

发表机构 * Universitat de Barcelona(巴塞罗那大学) Universitat de Barcelona Institute of Complex Systems(巴塞罗那大学复杂系统研究所) Center for Complex Networks and Systems Research(复杂网络与系统研究所以) ICREA

AI总结 本研究探讨了神经网络内部表示和鲁棒性如何受任务复杂度的影响。通过引入一系列数据无关的分析方法,如剪枝、二值化、噪声注入等,研究发现任务难度显著影响多层感知机(MLP)的结构和性能表现。研究还揭示了任务复杂度可由全精度模型与二值化或随机化模型之间的性能差距来衡量,并指出保留符号结构而非精确权重大小即可维持较高准确率,为模型压缩和可解释性提供了新思路。

详情
英文摘要

Neural networks excel across a wide range of tasks, yet remain black boxes. In particular, how their internal representations are shaped by the complexity of the input data and the problems they solve remains obscure. In this work, we introduce a suite of five data-agnostic probes-pruning, binarization, noise injection, sign flipping, and bipartite network randomization-to quantify how task difficulty influences the topology and robustness of representations in multilayer perceptrons (MLPs). MLPs are represented as signed, weighted bipartite graphs from a network science perspective. We contrast easy and hard classification tasks on the MNIST and Fashion-MNIST datasets. We show that binarizing weights in hard-task models collapses accuracy to chance, whereas easy-task models remain robust. We also find that pruning low-magnitude edges in binarized hard-task models reveals a sharp phase-transition in performance. Moreover, moderate noise injection can enhance accuracy, resembling a stochastic-resonance effect linked to optimal sign flips of small-magnitude weights. Finally, preserving only the sign structure-instead of precise weight magnitudes-through bipartite network randomizations suffices to maintain high accuracy. These phenomena define a model- and modality-agnostic measure of task complexity: the performance gap between full-precision and binarized or shuffled neural network performance. Our findings highlight the crucial role of signed bipartite topology in learned representations and suggest practical strategies for model compression and interpretability that align with task complexity.