URL PDF HTML ☆

赞 0 踩 0

2606.16843 2026-06-16 cs.CL 新提交

Data-Driven Decoding of Russell's Circumplex Model of Affect

基于数据驱动的Russell情感环状模型解码

Amdjed Belaref, Samir Sadok, Zineb Noumir, Renaud Seguier

发表机构 * Alten ； CentraleSupélec IETR UMR CNRS 6164（中央理工-高等电力学院 IETR CNRS 6164 联合研究单位）； Inria at Univ. Grenoble Alpes, CNRS, LJK（法国国家信息与自动化研究所，格勒诺布尔阿尔卑斯大学，CNRS，LJK）

AI总结本文研究Transformer嵌入是否恢复Russell环状模型的几何规律，通过文本和语音模型实验，发现多模态融合完美对齐情感排序，零样本下细粒度情感词接近人类映射坐标。

Comments This work has been submitted to the IEEE for possible publication

详情

AI中文摘要

情感计算日益依赖深度学习来表示情感，然而潜在空间通常是不透明的高维黑箱。本文研究Transformer的嵌入是否恢复Russell环状模型的几何规律。我们统一了两个互补实验，检验以下假设：在文本和语音上训练模型后，其潜在空间编码了与效价-唤醒一致的拓扑结构，并再现了类似人类的邻域关系。具体而言，我们评估了基于Transformer的文本（RoBERTa）和语音（wav2vec 2.0）编码器以及多模态Transformer融合架构提取的深度表示，使用了MSP-Podcast等自然数据集和受控的LLM生成刺激。我们的分析表明，文本和音频的多模态融合与Russell的主要情感排序实现了完美的拓扑对齐。此外，在零样本设置中，使用通用文本嵌入，投影的细粒度情感术语接近其已建立的人类映射坐标。我们的贡献是一个新颖的数据驱动框架，用于验证情感模型，证明Russell环状结构内在地编码于这些模态的嵌入中，而不仅仅是人类标注的产物，从而弥合了心理学理论与表示学习之间的差距。

英文摘要

Affective computing increasingly relies on deep learning to represent emotions, yet latent spaces often remain opaque, high-dimensional black boxes. This paper investigates whether Transformers' embeddings recover the geometric regularities of Russell's circumplex model. We unify two complementary experiments testing the hypothesis that, after training models on text and speech, their resulting latent spaces encode a topology consistent with valence-arousal and reproduce human-like neighborhood relations. Specifically, we evaluate deep representations extracted from Transformer-based text (RoBERTa) and speech (wav2vec 2.0) encoders, along with a multimodal Transformer fusion architecture, across naturalistic datasets like MSP-Podcast and controlled LLM-generated stimuli. Our analysis reveals that multimodal fusion of text and audio yields perfect topological alignment with Russell's primary emotion ordering. Furthermore, in a zero-shot setting using generic text embeddings, projected fine-grained emotion terms fall close to their established human-mapped coordinates. Our contribution is a novel, data-driven framework for validating emotion models, demonstrating that Russell's circumplex structure is intrinsically encoded in the embeddings of these modalities rather than being solely an artifact of human labeling, thereby bridging the gap between psychological theory and representation learning.

URL PDF HTML ☆

赞 0 踩 0

2606.16837 2026-06-16 cs.CV cs.AI cs.SD 新提交

Robust Spoofed Speech Detection via Temporal Pyramid Modeling

基于时间金字塔建模的鲁棒语音伪造检测

Mahtab Masoudi Nezhad, Nima Karimian

发表机构 * Lane Department of Computer Science and Electrical Engineering, West Virginia University（西弗吉尼亚大学莱恩计算机科学与电气工程系）； Bellini College of Artificial Intelligence, Cybersecurity and Computing, University of South Florida（南佛罗里达大学贝利尼人工智能、网络安全与计算学院）

AI总结提出时间金字塔适配器，通过多尺度时间卷积捕获局部伪影和全局韵律异常，结合自监督XLS-R表示，在多个数据集上显著优于基线模型。

详情

自适应且显式安全：触发大型推理模型中的潜在安全意识

Ke Miao, Jiaxin Li, Hongliang Chen, Yuke Hu, Zhan Qin

发表机构 * The State Key Laboratory of Blockchain and Data Security, Zhejiang University（浙江大学区块链与数据安全全国重点实验室）； Hangzhou HighTech Zone (Binjiang) Blockchain and Data Security Research Institute, China（杭州高新区（滨江）区块链与数据安全研究院）； Li Auto Inc.（理想汽车）； Tsinghua University（清华大学）； King Abdullah University Of Science And Technology（阿卜杜拉国王科技大学）

AI总结针对大型推理模型易受越狱攻击的问题，提出Safe Trigger方法，通过SFT显式诱导安全标签触发安全分析，并用DPO优化，显著降低攻击成功率而不影响通用性能。

详情

AI中文摘要

尽管大型推理模型（LRMs）在复杂任务上表现出色，但它们仍然极易受到复杂的越狱攻击和直接的有害查询。为了解决这一脆弱性，先前的工作严重依赖外部手动数据注释进行安全对齐。然而，我们观察到，当原始查询与其自身的推理轨迹一起重新呈现时，LRMs能够固有地识别安全风险——我们将这种能力称为潜在安全意识。为了利用这种安全意识，我们首先采用监督微调（SFT）显式诱导安全标签，以在初始推理内容之后触发对不安全查询的安全分析和指导，同时保留对一般查询的标准响应以确保自适应触发。随后，我们应用直接偏好优化（DPO）进一步增强安全分析和指导的正确性和稳定性。值得注意的是，两个训练阶段所需的响应完全由正在优化的模型生成。通过（Safe Trigger）SFT和DPO，实验结果表明安全性显著增强。例如，DeepSeek-R1-Distill-Llama-8B在有害和越狱基准上的平均攻击成功率（ASR）分别下降了24.65%和36.72%。最后，我们的Safe Trigger方法对通用性能或用户体验几乎没有负面影响。

英文摘要

While Large Reasoning Models (LRMs) excel at complex tasks, they remain highly vulnerable to sophisticated jailbreaks and direct harmful queries. To address this vulnerability, prior works depend heavily on external manual data annotation for safety alignment. However, we observe that LRMs can inherently identify safety risks when being re-presented with original queries alongside their own reasoning trajectories -- a capability we term Latent Safety Awareness. To leverage this safety awareness, we first employ Supervised Fine-Tuning (SFT) to explicitly induce safe tags to trigger safety analysis and guidance following the initial reasoning content for unsafe queries, while preserving standard responses for general queries to ensure adaptive triggering. Subsequently, we apply Direct Preference Optimization (DPO) to further enhance the correctness and stability of the safety analysis and guidance. Notably, responses required for both training stages are entirely generated by models being optimized. With (Safe Trigger) SFT and DPO, experimental results demonstrate significant safety enhancement. For example, the Attack Success Rate (ASR) of DeepSeek-R1-Distill-Llama-8B, on average, drops 24.65% and 36.72% on harmful and jailbreak benchmarks, respectively. Finally, our Safe Trigger method exerts almost no negative impact on general performance or user experience.

URL PDF HTML ☆

赞 0 踩 0

2606.16806 2026-06-16 cs.CL 新提交

LLM-based Visual Code Completion for Aerospace Geometric Design

基于LLM的航空航天几何设计视觉代码补全

Hau Kit Yong, Robert Marsh, Edmar A. Silva, András Sóbester, Stuart E. Middleton

发表机构 * Faculty of Engineering and Physical Sciences, University of Southampton（南安普顿大学工程与物理科学学院）； School of Electronics and Computer Science, University of Southampton（南安普顿大学电子与计算机科学学院）

AI总结提出基于LLM的视觉编程副驾驶系统，结合ReAct方法和GPT 5.4，用于航空航天几何设计，并构建Wingbuilder插件库和AVPD数据集，用户试验表明系统能生成有用建议但推理速度慢。

详情

AI中文摘要

近年来，大型语言模型（LLMs）和视觉语言模型（VLMs）在视觉代码补全能力上取得了显著进步，但航空航天行业优先考虑安全性和可解释性而非快速采用LLM，目前尚无公开宣布的基于LLM的几何设计副驾驶系统在商业上被航空航天原始设备制造商（OEMs）使用。本文提出了一种基于LLM的视觉编程副驾驶应用，用于航空航天工程设计任务，采用ReAct方法的视觉编程变体和GPT 5.4。除了副驾驶系统，我们还描述了Wingbuilder，这是一个新的Grasshopper插件库，包含用于航空航天特定几何抽象的自定义组件，以及一个相关的航空航天视觉编程数据集（AVPD），包含18个由航空航天专家设计的不同难度级别的任务及其真实解决方案。我们通过用户试验评估了副驾驶应用，试验涉及来自一家大型飞机制造公司的两位经验丰富的航空航天工程师。我们发现，我们的副驾驶视觉编程ReAct方法成功生成了参与者认为有帮助的建议，但缓慢的ReAct推理时间限制了其在更复杂、耗时的任务中的实用性，因为等待好的副驾驶解决方案建议是值得的。参与者表示他们喜欢这个工具，并愿意在未来使用它。

英文摘要

Recent advances in both Large Language Models (LLMs) and Vision Language Models (VLMs) have seen a step change in their ability to perform visual code completion, but the aerospace industry, which prioritizes safety and explainabilty over rapid LLM adoption, currently has no publicly announced LLM-based geometric design copilot systems in commercial use by aerospace Original Equipment Manufacturers (OEMs). This paper presents a LLM-based visual programming copilot application for aerospace engineering design tasks, using a visual programming variant of the ReAct methodology and GPT 5.4. In addition to the copilot, we describe Wingbuilder, a new Grasshopper plugin library with custom components for aerospace-specific geometry abstraction, and an associated Aerospace Visual Programming Dataset (AVPD) with 18 aerospace expert designed tasks at different levels of difficulty alongside ground truth solutions. We evaluate our copilot application with a user trial involving two experienced aerospace engineers from a large aircraft manufacturing company. We find our copilot visual programming ReAct methodology was successful in generating suggestions that participants found helpful, but slow ReAct inference times limit its usefulness to more complex time-consuming tasks where waiting for good copilot solution suggestion was worthwhile. Participants reported they liked the tool and would be willing to use it in the future.

URL PDF HTML ☆

赞 0 踩 0

2606.16802 2026-06-16 cs.AI 新提交

决策加权流匹配用于上下文随机优化

Jize Xie, Haomiao Wu, Qiang Chen, Xiu Su, Yi Chen

发表机构 * Hong Kong University of Science and Technology（香港科技大学）； Central South University（中南大学）； Big Data Institute（大数据研究院）

AI总结提出决策加权流匹配（DW-FM）框架，通过重加权速度回归目标对齐下游遗憾，在CVaR基准上优于标准方法。

详情

AI中文摘要

条件生成模型越来越多地被用作随机优化的场景生成器，但标准训练目标强调均匀分布拟合，而非生成场景所引发的下游决策。这造成了目标不匹配：统计常见区域的误差对决策遗憾影响很小，而决策敏感区域的误差可能显著改变最优行动。我们提出决策加权流匹配（DW-FM），一种遗憾对齐的训练框架，它保留了标准流匹配的简单性，同时使用决策敏感的端点信息对其速度回归目标进行重加权。理论上，我们通过损失诱导的决策差异和伴随输运论证将下游遗憾与路径速度不匹配联系起来，得到一个理想的遗憾对齐替代目标以及具有遗憾保证的实用端点加权目标。实验上，我们在三个基于CVaR的上下文随机优化基准（涵盖合成投资组合、半真实金融和交通CVaR任务）上展示了DW-FM的有效性，其中DW-FM在标准基线上改善了下游遗憾。

英文摘要

Conditional generative models are increasingly used as scenario generators for stochastic optimization, but standard training objectives emphasize uniform distributional fit rather than the downstream decisions induced by generated scenarios. This creates an objective mismatch: errors in statistically common regions may have little effect on decision regret, whereas errors in decision-sensitive regions can substantially change the optimal action. We propose Decision-Weighted Flow Matching (DW-FM), a regret-aligned training framework that preserves the simplicity of standard flow matching while reweighting its velocity-regression objective using decision-sensitive endpoint information. Theoretically, we connect downstream regret to pathwise velocity mismatch through a loss-induced decision discrepancy and an adjoint transport argument, yielding an ideal regret-aligned surrogate and practical endpoint-weighted objectives with regret guarantees. Empirically, we demonstrate the effectiveness of DW-FM on three CVaR-based contextual stochastic optimization benchmarks spanning synthetic portfolio, semi-real financial, and traffic-CVaR tasks, where DW-FM improves downstream regret over standard baselines.

URL PDF HTML ☆

赞 0 踩 0

2606.16788 2026-06-16 cs.RO 新提交

SoK: Security and Privacy of Foundation-Model-Powered Robots

SoK: 基础模型驱动机器人的安全与隐私

Xueluan Gong, Chen Chen, Jinxin Liu, Qian Wang, Kwok-Yan Lam

发表机构 * College of Computing and Data Science, Nanyang Technological University（南洋理工大学计算与数据科学学院）； School of Cyber Science and Engineering, Wuhan University（武汉大学网络空间安全学院）

AI总结本文提出F-E-S-G结构边界框架，系统分析基础模型驱动机器人的安全与隐私风险，并基于96篇论文揭示威胁模式、防御不匹配和评估差距。

Comments 21 pages, 2 figures

详情

AI中文摘要

基础模型正在重塑机器人技术，使机器人能够解释开放式指令、推理多模态上下文并在复杂的开放世界环境中运行。然而，它们的集成也引入了安全与隐私（S&P）风险，这些风险从基础模型本身扩展到具身执行管道、支持生态系统以及更广泛的治理影响。现有文献综述提供了宝贵的见解，但通常侧重于特定的基础模型类型、风险类别、缓解策略或信任边界。因此，该领域缺乏一个统一的结构来分析风险源自何处、如何在机器人系统中传播以及缓解措施应在何处干预。为填补这一空白，我们提出了一个渐进式的F-E-S-G结构边界框架，用于分析基础模型驱动机器人的安全与隐私。该框架包含四个层次：基础模型层（F）、具身系统层（E）、支持生态系统层（S）和治理影响层（G）。基于此结构，我们开发了一个多级分类法，沿三个层次组织先前的研究：F-E-S-G信任边界、安全-隐私关注点以及风险-缓解视角。我们进一步使用细粒度编码属性对每项研究进行注释，包括目标、生命周期阶段、机制、系统访问和效果。在此框架和分类法的指导下，我们对96篇论文进行了系统化分析。我们的分析揭示了从单一边界视角难以识别的多种威胁模式、防御不匹配和评估差距。基于这些发现，我们确定了开放挑战和未来方向，为开发安全、隐私保护且负责任治理的基础模型驱动机器人系统提供了研究议程。

英文摘要

Foundation models are reshaping robotics by enabling robots to interpret open-ended instructions, reason over multimodal contexts, and operate in complex, open-world environments. However, their integration also introduces security and privacy (S&P) risks that extend beyond the FMs themselves to embodied execution pipelines, supporting ecosystems, and broader governance impacts. Existing literature reviews provide valuable insights but often focus on specific FM types, risk categories, mitigation strategies, or trust boundaries. Consequently, the field lacks a unified structure for analyzing where risks originate, how they propagate across robotic systems, and where mitigations should intervene. To address this gap, we propose a progressive F-E-S-G structural boundary framework for analyzing the S&P of FM-powered robots. The framework comprises four layers: the Foundation model layer (F), Embodied system layer (E), Supporting ecosystem layer (S), and Governance impact layer (G). Building on this structure, we develop a multi-level taxonomy that organizes prior studies along three levels: F-E-S-G trust boundary, security-privacy concerns, and risk-mitigation perspectives. We further annotate each study using fine-grained coding attributes, including target, lifecycle stage, mechanism, system access, and effect. Guided by this framework and taxonomy, we systematize 96 papers. Our analysis uncovers multiple threat patterns, defense mismatches, and evaluation gaps that are difficult to identify from a single-boundary perspective. Based on these findings, we identify open challenges and future directions to provide a research agenda for developing secure, privacy-preserving, and responsibly governed FM-powered robotic systems.

URL PDF HTML ☆

赞 0 踩 0

2606.16786 2026-06-16 cs.LG 新提交

We Need Explanation Cards to Connect Explanation Algorithms to the Real World

我们需要解释卡来连接解释算法与现实世界

Eric Günther, Balázs Szabados, Kristof Meding, Gunnar König, Sebastian Bordt, Ulrike von Luxburg

发表机构 * University of Tübingen（蒂宾根大学）； Tübingen AI Center（蒂宾根人工智能中心）； HUN-REN Institute for Computer Science and Control (SZTAKI), Budapest, Hungary（匈牙利科学院计算机科学与控制研究所（SZTAKI））

AI总结针对算法解释在实践中含义模糊且信息不足的问题，提出解释卡，通过补充鲁棒性和有效性信息及解释说明，帮助用户正确解读，并满足欧盟AI法案的可解释性要求。

详情

AI中文摘要

算法解释旨在帮助利益相关者理解不透明的算法决策，但在实践中往往达不到预期。首先，算法解释的含义通常不是人们直观期望的那样，因此需要专业知识才能正确解释。其次，最近的研究表明，流行的解释算法对于复杂决策函数的行为信息不足。这些共同导致了解释表面传达的内容与实际提供的内容之间的差距。在这项工作中，我们提出了解释算法的解释卡，它用关于鲁棒性和有效性的补充信息以及清晰的解释说明来增强标准解释。补充信息可以使原本无信息的解释变得实际有用，同时也有助于检测它们不适用的情况。重要的是，解释卡中的解释说明将责任从用户转移到提供者：提供者必须事先明确说明从解释中可以得出什么和不能得出什么，而不是期望用户自己识别。使用反事实解释和SHAP作为示例，我们展示了提供者如何构建解释卡，以及这些卡为用户提供了正确解释所需的指导。我们进一步论证了解释卡是实践欧盟AI法案可解释性规定的实用手段。总体而言，解释卡是使解释算法适应现实世界用例的重要一步。

英文摘要

Algorithmic explanations are intended to help stakeholders understand opaque algorithmic decisions, but in practice, they often fall short. First, the meaning of algorithmic explanations is often not what one might intuitively expect, so expert knowledge is required to interpret them correctly. Second, recent work has shown that popular explanation algorithms are uninformative about the behavior of complex decision functions. Together, these issues create a gap between what explanations appear to convey and what they actually provide. In this work, we propose Explanation Cards for Explanation Algorithms, which augment standard explanations with complementary information about robustness and validity, as well as clear instructions for interpretation. The complementary information can render otherwise uninformative explanations practically useful, while also helping to detect cases where they are not. Importantly, the interpretation instructions in explanation cards shift responsibility from users to providers: Rather than expecting users to recognize what can and cannot be concluded from an explanation, providers must make this explicit upfront. Using counterfactual explanations and SHAP as examples, we demonstrate how providers can construct explanation cards and that these cards provide users with the guidance needed for sound interpretation. We further argue that explanation cards offer a practical means of operationalising the explainability provisions of the EU AI Act. Overall, explanation cards are a significant step toward making explanation algorithms fit for real-world use cases.

URL PDF HTML ☆

赞 0 踩 0

2606.16783 2026-06-16 cs.CV cs.AI cs.LG 新提交

Gen-VCoT: Generative Visual Chain-of-Thought Reasoning via Diffusion-Based RGB Intermediate Representations

Gen-VCoT: 基于扩散的RGB中间表示的生成式视觉思维链推理

Zhiqiang Zhou, Junliang Dai, Xu ling

发表机构 * Hunan Chemical Industry Vocational and Technical College（湖南化工职业技术学院）

AI总结提出Gen-VCoT框架，利用专家视觉模型生成RGB图像作为推理中间步骤，通过自适应路由器选择推理深度，在空间和深度问题上分别提升25%和50%，但简单事实查询性能下降，表明最优表示依赖于任务。

Comments 12 pages, 5 figures

2606.16780 2026-06-16 cs.RO 新提交

驯服曲率：稳定Transformer训练的架构预热

Sameera Ramasinghe, Ajanthan Thalaiyasingam, Hadi Mohaghegh Dolatabadi, Chamin Hewa Koneputugodage, Gil Avraham, Violetta Shevchenko, Yan Zuo, Karol Pajak, Alexander Long

发表机构 * Pluralis Research

AI总结提出基于热启动幂迭代的快速在线曲率估计方法，并发现训练不稳定性与预条件曲率激增相关，进而提出渐进增加网络深度的架构预热策略，有效稳定大模型训练。

详情

AI中文摘要

训练数十亿参数的Transformer通常很脆弱，会出现瞬时的损失尖峰和发散，浪费计算资源。尽管最近发展的边缘稳定性（EoS）理论通过（预条件）曲率提供了理解和控制优化方法稳定性的强大工具，但由于曲率估计的复杂性，这些曲率控制方法在大规模Transformer训练中并不流行。为此，我们首先引入一种基于热启动变体的快速在线估计器，用于估计最大的（预条件）Hessian特征值（即曲率），该估计器使用Hessian-向量积进行幂迭代。我们从理论上证明，并通过实验验证，所提出的方法在十亿参数规模下使每次迭代的曲率跟踪变得可行，同时更加准确。利用这一工具，我们发现训练不稳定性与预条件曲率的激增同时发生，并且曲率随深度增加而增长。基于这些观察，我们提出架构预热：逐步增加网络深度，以仔细控制预条件Hessian并稳定训练。在大规模Transformer上的实验验证了我们的方法能够实现高效的曲率跟踪，并在不减慢收敛速度的情况下，与现有最先进的稳定技术相比减少了不稳定性。

英文摘要

Training billion-parameter Transformers is often brittle, with transient loss spikes and divergence that waste compute. Even though the recently developed Edge of Stability (EoS) theory provides a powerful tool to understand and control the stability of optimization methods via the (preconditioned) curvature, these curvature-controlling methods are not popular in large-scale Transformer training due to the complexity of curvature estimation. To this end, we first introduce a fast online estimator of the largest (preconditioned) Hessian eigenvalue (i.e., curvature) based on a warm-started variant for power iteration with Hessian-vector products. We show theoretically, and verify empirically, that the proposed method makes per-iteration curvature tracking feasible at billion parameter scale while being more accurate. Using this tool, we find that training instabilities coincide with surges in preconditioned curvature and that curvature grows with depth. Motivated by these observations, we propose architecture warm-up: progressively growing network depth to carefully control the preconditioned Hessian and stabilize training. Experiments on large Transformers validate that our approach enables efficient curvature tracking and reduces instabilities compared to existing state-of-the-art stabilization techniques without slowing down convergence.

URL PDF HTML ☆

赞 0 踩 0

2606.16767 2026-06-16 cs.CV 新提交

P3B3：用于测量大语言模型中欧洲和巴西葡萄牙语变体偏差的多轮对话基准

Rafael Ferreira, Inês Vieira, Inês Calvo, James Furtado, Iago Paulo, Diogo Tavares, Diogo Glória-Silva, David Semedo, João Magalhães

发表机构 * NOVA University of Lisbon（新里斯本大学）； NOVA LINCS（NOVA LINCS实验室）

AI总结提出P3B3基准，通过专家策划的对话提示和评估框架，测量大语言模型在葡萄牙语变体（欧洲vs巴西）上的偏差和可控性，发现多数模型偏向巴西葡萄牙语。

Comments Accepted at MeLLM Workshop at ACL 2026