arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.18932 2026-06-18 astro-ph.EP astro-ph.IM cs.AI cs.LG 新提交

TransitNet: A Compact Attention-Augmented Deep Learning Framework for Low-SNR Transit Blind Searches

TransitNet: 一种用于低信噪比凌星盲搜索的紧凑型注意力增强深度学习框架

Xingchen Yan, Jian Ge, Qingtian Liu, Kevin Willis, Quanquan Hu, Jiapeng Zhu

发表机构 * Shanghai Astronomical Observatory, Shanghai 200030, China（上海天文台，上海200030，中国）； University of Chinese Academy of Sciences, Yanqi Lake Campus, East Road 1, Huairou, Beijing 101408, China（中国科学院大学，燕琦湖校区，东路1号，北京101408，中国）； Science Talent Training Center, Gainesville, FL, 32606 USA（科学人才培训中心，佛罗里达州盖恩斯维尔，32606美国）

AI总结提出紧凑型注意力增强深度学习框架TransitNet，用于低信噪比凌星盲搜索，在SNR 6-8范围内达到95.2%准确率，恢复率93.0%，远超TLS和BLS，且模型仅1.5 MB，推理速度提升12-25倍。

Comments 24 pages, 23 figures, 3 tables, submitted to MNRAS

详情

AI中文摘要

受中长周期地球大小行星观测不完整性的启发，我们提出了TransitNet，一种用于低信噪比凌星盲搜索的紧凑型注意力增强深度学习框架。为了实现盲搜索条件下现实的方法开发和客观的阈值校准，我们开发了一个统一的数据集构建、基准测试和阈值选择框架。在由未见过的Kepler目标构建的恢复基准测试中，TransitNet在具有挑战性的信噪比6-8范围内达到了95.2%的准确率，并优于TLS和BLS，ROC-AUC和PR-AP值分别为0.974和0.982。在一次注入的地球大小和亚地球大小凌星恢复实验中，TransitNet实现了93.0%的恢复率，显著超过TLS（63.1%）和BLS（60.0%）。除了检测，TransitNet还提供了基于注意力的凌星窗口和中点估计。在一个独立评估集上，97.4%的注入凌星被估计的凌星窗口完全覆盖。应用于真实的Kepler观测，该模型成功恢复了所有34个选定的已确认Kepler行星，平均绝对凌星中点误差为1.24小时。该模型结合了约1.5 MB的紧凑体积和高推理效率，相对于CPU-TLS加速约12-25倍，相对于CPU-BLS加速约4-5倍。这些结果表明，TransitNet在测试范围内为低信噪比凌星盲搜索提供了一个准确、可扩展且计算高效的框架，并激励其扩展到更长周期的地球大小行星搜索。

英文摘要

Motivated by the observational incompleteness of intermediate-to-long-period Earth-size planets, we present TransitNet, a compact attention-augmented deep-learning framework for low-SNR transit blind searches. To enable realistic method development and objective threshold calibration under blind-search conditions, we develop a unified dataset construction, benchmarking, and threshold-selection framework. On recovery benchmarks constructed from unseen Kepler targets, TransitNet attains 95.2 percent accuracy in the challenging SNR range of 6 to 8 and outperforms both TLS and BLS, achieving ROC-AUC and PR-AP values of 0.974 and 0.982, respectively. In an injected Earth-size and sub-Earth-size transit recovery experiment, TransitNet achieves a recovery rate of 93.0 percent, substantially exceeding those of TLS (63.1 percent) and BLS (60.0 percent). In addition to detection, TransitNet provides attention-based estimates of transit windows and midpoints. On an independent evaluation set, 97.4 percent of injected transits are fully covered by the estimated transit window. Applied to real Kepler observations, the model successfully recovers all 34 selected confirmed Kepler planets, with a mean absolute transit midpoint error of 1.24 hours. The model combines a compact footprint of about 1.5 MB with high inference efficiency, yielding speed-ups of about 12 to 25 times relative to CPU-TLS and about 4 to 5 times relative to CPU-BLS. These results demonstrate that TransitNet provides an accurate, scalable, and computationally efficient framework for low-SNR transit blind searches in the tested regime and motivate its extension to longer-period Earth-size planet searches.

URL PDF HTML ☆

赞 0 踩 0

2606.18464 2026-06-18 astro-ph.IM astro-ph.EP cs.LG 新提交

Modeling Doppler Shifts in Radial-Velocity Data with Deep Learning toward Earth-mass Exoplanet Detection

利用深度学习建模径向速度数据中的多普勒频移以探测地球质量系外行星

Isidro Gómez-Vargas, Xavier Dumusque, Yinan Zhao, Khaled Al Moulla, Michael Cretignier

发表机构 * Department of Astronomy, University of Geneva 51 chemin de Pegasi, 1290 Versoix, Switzerland. Instituto de Astrofı\'isica de Andaluc\'ia (CSIC), Glorieta de la Astronom\'ia s/n, E-18008 Granada, Spain. Institute of Space Sciences (CSIC), Carrer de Can Magrans s/n, E-08193 Barcelona, Spain. Department of Astronomy, University of Texas at Austin, 2515 Speedway, Austin, TX 78712, USA. Instituto de Astrofísica e Ciências do Espaço, Universidade do Porto, CAUP, Rua das Estrelas, 4150-762 Porto, Portugal. Department of Physics, University of Oxford, OX13RH Oxford, UK.

AI总结针对恒星活动干扰，提出结合物理启发光谱表示与深度学习的框架，通过交叉验证和遗传算法优化，可靠恢复振幅≥25 cm/s、周期10-550天的行星信号，并发布Python包doppleriann。

Comments 20 pages, 14 figures. Accepted for publication in Astronomy & Astrophysics

详情

AI中文摘要

由于恒星活动的影响，在恒星径向速度测量中探测由地球质量行星引起的微小多普勒频移仍然极具挑战性。许多在模拟数据上表现良好的深度学习方法难以可靠地应用于真实恒星光谱。本工作的目标是开发一种深度学习框架，使其能够泛化到真实、未见过的光谱，并提高径向速度数据中地球质量行星的可探测性。我们在注入行星信号的HARPS-N太阳光谱上训练人工神经网络，使用基于通量和谱线形成温度的物理驱动光谱表示，以及它们的速度梯度。探索了两种训练策略：留出测试和交叉验证。通过基于遗传算法的超参数优化增强模型鲁棒性，并使用蒙特卡洛dropout量化预测不确定性。在交叉验证策略下，我们最精确的神经网络模型能够可靠地恢复振幅≥25 cm/s、周期在10到550天之间的行星信号的振幅、相位和轨道周期。此外，在所有测试案例中，成功恢复的信号对应于多普勒频移预测周期图中最显著的峰值。基于温度的光谱壳表示始终优于基于通量的壳。我们还发布了实现该框架的Python包doppleriann。我们的结果表明，将物理驱动的光谱表示与深度学习相结合，为从真实观测的径向速度数据中探测地球质量行星提供了一条有前景的途径，该建模框架既具有物理基础又具有统计严谨性，并包含了不确定性量化和优化的训练策略。

英文摘要

Detecting the tiny Doppler shifts induced by Earth-mass planets in stellar radial-velocity measurements remains extremely challenging due to stellar activity. Many deep-learning methods performing well on simulated data remain difficult to apply reliably on real stellar spectra. The aim of this work is to develop a deep-learning framework that generalizes to real, unseen spectra and improves the detectability of Earth-mass planets in radial-velocity data. We train artificial neural networks on HARPS-N solar spectra with injected planetary signals, using physics-motivated spectral representations based on flux and line-formation temperature, together with their velocity gradients. Two training strategies are explored: hold-out testing and cross-validation. Model robustness is enhanced through genetic-algorithm-based hyperparameter optimization, and predictive uncertainty is quantified using Monte Carlo dropout. Our most precise neural network model reliably retrieves, under the cross-validation strategy, the amplitudes, phases, and orbital periods of planetary signals with amplitudes greater than or equal to 25 cm/s and periods between 10 and 550 days. In addition, in all cases tested here, the successfully recovered signals correspond to the most significant peaks in the periodograms of the Doppler-shift predictions. Temperature-based spectral-shell representations consistently outperform flux-based shells. We also release doppleriann, a Python package implementing the proposed framework. Our results demonstrate that combining physically motivated spectral representations with deep learning provides a promising pathway toward the detection of Earth-mass planets in radial-velocity data from real observations, supported by a modeling framework that is both physically grounded and statistically rigorous, incorporating uncertainty quantification and optimized training strategies.

URL PDF HTML ☆

赞 0 踩 0

2606.17491 2026-06-18 stat.ML cs.LG stat.ME 新提交

A Bayesian Boolean Matrix Factorization with Application to Copy Number Analysis in Cancer

贝叶斯布尔矩阵分解及其在癌症拷贝数分析中的应用

Adolphus Wagala, Mehmet Samur, Giovanni Parmigiani

发表机构 * Department of Data Science, Dana-Farber Cancer Institute（数据科学部，达纳-法伯癌症研究所）； Department of Biostatistics, Harvard T.H. Chan School of Public Health（生物统计学部，哈佛T.H. 潘克学校公共卫生学院）

AI总结提出贝叶斯布尔矩阵分解（BBMF）模型，通过全共轭生成模型和稀疏先验实现布尔约束下的可解释因子分解，并应用于多发性骨髓瘤的染色体臂拷贝数变异分析，揭示肿瘤异质性的离散潜在结构。

详情

AI中文摘要

二值数据分解很常见，但实值方法忽略了离散性并产生难以解释的因子。布尔矩阵分解（BooMF）通过逻辑与和或运算将二值矩阵分解为两个低秩二值矩阵，将数据表示为可解释模式的布尔析取。在癌症基因组学中，BooMF可以揭示可能驱动肿瘤演化的协调特征变化，这与旋转或加性分解不同。大多数现有的BooMF方法是启发式的、贪婪的、对初始化敏感、容易陷入局部最优，并且不支持原则性的模型选择或不确定性量化。我们引入了贝叶斯布尔矩阵分解（BBMF），这是一个具有稀疏诱导先验的全共轭生成模型。它强制执行布尔约束，产生具有一致不确定性量化的可解释潜在因子，并允许具有封闭形式全条件分布的吉布斯采样。由于癌症演化通常涉及广泛、近乎同时的染色体数目变化（例如，全基因组复制后伴随不稳定性和选择），布尔分解比加性模型更自然地捕捉这些模式。应用于多发性骨髓瘤的臂级拷贝数变异数据（其中条目指示染色体臂扩增的存在/缺失），BBMF找到了一小组可解释的双团，将患者子集与反复共变的染色体臂联系起来，提供了肿瘤异质性的紧凑、生物学上有意义的总结，并展示了BBMF在复杂二值数据中发现离散潜在结构的实用性。

英文摘要

Binary data factorization is common, but real-valued methods ignore discreteness and yield hard-to-interpret factors. Boolean Matrix Factorization (BooMF) instead decomposes a binary matrix into two lower-rank binary matrices via logical AND and OR, expressing the data as a Boolean disjunction of interpretable patterns. In cancer genomics, BooMF can reveal coordinated feature changes that may drive tumor evolution, unlike rotational or additive decompositions. Most existing BooMF methods are heuristic, greedy, sensitive to initialization, prone to local optima, and do not support principled model selection or uncertainty quantification. We introduce Bayesian Boolean Matrix Factorization (BBMF), a fully conjugate generative model with sparsity-inducing priors. It enforces Boolean constraints, yields interpretable latent factors with coherent uncertainty quantification, and admits Gibbs sampling with closed-form full conditionals. Because cancer evolution often involves widespread, near-simultaneous chromosome-number changes (e.g., whole-genome duplication followed by instability and selection), Boolean factorizations capture these patterns more naturally than additive models. Applied to arm-level copy-number alteration data in multiple myeloma, where entries indicate presence/absence of chromosomal-arm amplifications, BBMF finds a small set of interpretable bicliques linking patient subsets to recurrently co-altered chromosomal arms, providing a compact, biologically meaningful summary of tumor heterogeneity and demonstrating BBMF's utility for uncovering discrete latent structure in complex binary data.

URL PDF HTML ☆

赞 0 踩 0

2606.18105 2026-06-18 cs.NI cs.LG 新提交

OmniPlan: An Adaptive Framework for Timely and Near-Optimal Network Planning Optimization

OmniPlan：一种用于及时且近乎最优的网络规划优化的自适应框架

Longlong Zhu, Jiashuo Yu, Zedi Chen, Yuhan Wu, Zhifan Jiang, Yuchen Xian, Yimeng Liu, Jiajie Su, Shaopeng Zhou, Xingyuan Li, Hongyan Liu, Xuan Liu, Dong Zhang, Chunming Wu, Xiang Chen

发表机构 * Zhejiang University（浙江大学）； Fuzhou University（福州市大学）； Yangzhou University（扬州大学）； The State Key Laboratory of Blockchain and Data Security（区块链与数据安全国家重点实验室）； College of Computer Science and Technology（计算机科学与技术学院）

AI总结提出OmniPlan自适应框架，利用大语言模型解析用户意图，通过混合专家架构动态选择MIP求解器、启发式算法或深度强化学习模型，实现网络规划优化的及时性与近乎最优性，在分布式机器学习推理卸载任务中延迟降低97.8%，资源消耗降低11.5%。

Comments Accepted by ACM KDD 2026

详情

AI中文摘要

网络规划优化是跨多个领域（包括交通系统、通信网络和电网）的基本问题。它需要在复杂约束下同时优化多个相互竞争的目标。现有的网络规划优化框架依赖混合整数规划（MIP）求解器、启发式算法和深度强化学习（DRL）模型来计算规划决策。然而，它们缺乏对多样化和动态用户意图的有效适应性，从而导致执行时间与最优性之间的权衡。在本文中，我们提出OmniPlan，一种自适应框架，在网络规划优化中同时实现及时性和近乎最优性。为了实现现有解决方案所缺乏的适应性，OmniPlan采用基于大语言模型（LLM）的解释器，将异构的自然语言意图转换为统一且可量化的用户偏好向量。然后，它采用混合专家架构，集成MIP求解器、启发式算法和DRL模型作为专门专家，OmniPlan通过动态选择及时且近乎最优的专家来适应多样化的意图。最后，它包含一个基于DRL的专家配置模块，该模块微调优化目标权重，使规划决策与用户特定偏好对齐。我们使用代表性的真实工作负载（即分布式机器学习（ML））评估OmniPlan，其中我们利用OmniPlan将广泛的ML推理任务（例如决策树、SVM、朴素贝叶斯、XGBoost和随机森林）卸载到硬件设备网络。我们在真实测试平台上的实验表明，OmniPlan为真实ML推理任务实现了近乎最优且低执行时间的卸载，延迟降低高达97.8%，网络设备资源消耗降低高达11.5%。

英文摘要

Network planning optimization is a fundamental problem across diverse domains, including transportation systems, communication networks, and power grids. It requires simultaneous optimization of multiple competing objectives under complex constraints. Existing network planning optimization frameworks rely on mixed integer programming (MIP) solvers, heuristics, and deep reinforcement learning (DRL) models to compute planning decisions. However, they lack effective adaptability to diverse and dynamic user intents, thus leading to the trade-off between execution time and optimality. In this paper, we propose OmniPlan, an adaptive framework that achieves both timeliness and near-optimality in network planning optimization. To achieve the adaptability lacking in existing solutions, OmniPlan employs a large language model (LLM)-based interpreter to convert heterogeneous natural-language intents into a unified and quantifiable user-preference vector. Then it employs a mixture-of-experts architecture that integrates MIP solvers, heuristics, and DRL models as specialized experts, where OmniPlan adapts to diverse intents by dynamically selecting timely and near-optimal experts. Finally, it incorporates a DRL-based expert configuration module that fine-tunes optimization objective weights to align planning decisions with user-specific preferences. We evaluate OmniPlan with a representative real-world workload, i.e., distributed machine learning (ML), where we leverage OmniPlan to offload a wide spectrum of ML inference tasks, e.g., decision trees, SVM, naive Bayes, XGBoost, and random forests, onto a network of hardware devices. Our experiments on a real-world testbed indicate that OmniPlan achieves near-optimal and low-execution-time offloading for real-world ML inference tasks, reducing latency by up to 97.8\% and network device resource consumption by up to 11.5\%.

URL PDF HTML ☆

赞 0 踩 0

2606.17276 2026-06-18 cs.IR cs.LG 新提交

On the Memorization Behavior of LLMs in Generative Recommendation: Observations, Implications, and Training Strategies

LLM在生成式推荐中的记忆行为：观察、启示与训练策略

Sunwoo Kim, Sunkyung Lee, Clark Mingxuan Ju, Donald Loveland, Bhuvesh Kumar, Kijung Shin, Neil Shah, Liam Collins

发表机构 * KAIST（韩国科学技术院）； Sungkyunkwan University（成均馆大学）； Snap Inc.（Snap公司）

AI总结研究LLM在生成式推荐中的记忆倾向，发现其过度依赖一跳记忆，提出IIRG训练策略以学习多跳协同与语义关系，显著提升对非一跳记忆用户的推荐效果。

详情

AI中文摘要

生成式推荐（GR）已成为推荐系统的一个有前景的方向。最近，大型语言模型（LLM）越来越多地被用于GR，因为其丰富的预训练知识有望帮助它们泛化到传统以记忆为导向的基线所能捕捉的常见用户行为模式之外。然而，现有的基于LLM的GR工作很大程度上忽略了LLM众所周知的记忆倾向，如果这种倾向存在于为GR微调的LLM中，将限制它们对预训练知识的利用。在这项工作中，我们通过检查一跳记忆（即模型推荐训练数据中项目的直接后继项目）来研究这一担忧。我们表明，LLM比非LLM的GR模型更频繁地这样做——事实上，它们相对于GR基线的大部分增益实际上来自那些目标项目可以通过一跳记忆预测的用户。我们直觉认为，提高剩余用户的性能需要LLM学习更丰富的项目-项目关系，超越一跳转换。为此，我们提出了IIRG，一种新颖的训练策略，教导LLM捕获：（1）从用户序列中跨多跳的项目共现导出的协同关系，以及（2）具有相似主题的项目之间的语义关系，这两者都可以作为有用的推荐信号。我们表明，IIRG显著优于仅使用标准下一项目预测训练的LLM，尤其是对于那些测试项目在训练时的一跳转换中未覆盖的用户，增益尤为显著。

英文摘要

Generative recommendation (GR) has emerged as a promising direction for recommender systems. Recently, large language models (LLMs) have been increasingly adopted for GR, as their rich pretrained knowledge is expected to help them generalize beyond common user behavior patterns that traditional memorization-oriented baselines can capture. However, existing LLM-based GR works largely ignore LLMs' well-known tendency to memorize, which, if present in LLMs fine-tuned for GR, would restrict their utilization of pretrained knowledge. In this work, we investigate this concern by examining one-hop memorization, where a model recommends items that are direct successors of items in the training data. We show that LLMs do this more than non-LLM-based GR models-in fact, the vast majority of their gains over GR baselines are actually on users whose target items can be predicted through one-hop memorization. We intuit that improving performance on the remaining users requires LLMs to learn richer item-item relations beyond one-hop transitions. To achieve this, we propose IIRG, a novel training strategy that teaches LLMs to capture: (1) collaborative relations derived from item co-occurrences across multiple hops in user sequences, and (2) semantic relations among items with similar themes, both of which can serve as useful recommendation signals. We show that IIRG significantly improves over LLMs trained solely with standard next-item prediction, with especially large gains for users whose test items are not covered by train-time one-hop transitions.

URL PDF HTML ☆

赞 0 踩 0

2606.17102 2026-06-18 physics.pop-ph cs.AI cs.ET cs.HC quant-ph 新提交

Quantum Cinema: An Interactive Cinematic Exploration of Quantum Computing Hardware via Generative World Models

量子影院：通过生成世界模型对量子计算硬件进行交互式电影探索

Aoyu Zhang, Dongping Liu, Luyao Zhang

发表机构 * Amazon Web Services（亚马逊网络服务）； Duke Kunshan University（杜克昆山大学）

AI总结本文提出量子影院，一个基于生成世界模型的开源交互式应用，通过四幕叙事将不可见的量子硬件转化为可探索的电影体验，旨在弥合量子计算与公众之间的想象鸿沟。

详情

AI中文摘要

量子计算有望在科学和工业领域带来变革性进步，但实现这些计算的物理硬件对公众而言仍然不可见：量子处理器在接近绝对零度的密封稀释制冷机内运行，使得直接观察成为不可能。这种量子计算日益增长的社会影响与公众可视化能力之间的“想象鸿沟”构成了量子素养和劳动力发展的重大障碍。我们提出量子影院，一个开源、基于浏览器的交互式应用，通过使用生成世界模型将不可见的量子硬件转化为可探索的电影体验，从而弥合这一鸿沟。量子影院引导用户经历四幕叙事——从获得诺贝尔奖的量子纠缠基础科学，通过策划的视频介绍三种主要量子计算架构（离子阱、中性原子和超导系统），进入沉浸式三维生成世界，使不可见的量子现象变得可观察，最后到基于真实量子设备规格的交互式雷达图比较。所有三维环境均使用WorldLabs的生成世界模型平台生成，并基于亚马逊云服务（AWS）Braket量子硬件策划的指标进行科学依据。量子影院无需安装、无需专用硬件、无需量子计算背景。它旨在服务于两个不同的群体：寻求复制或扩展平台的学者和开发者，以及寻求直观工具向不同受众解释量子硬件的教育者、研究人员和科学传播者。本文描述了系统架构、生成世界模型流程、两个群体的用例以及未来工作方向。

英文摘要

Quantum computing promises transformative advances across science and industry, yet the physical hardware that enables these computations remains invisible to the public: quantum processors operate inside sealed dilution refrigerators at temperatures near absolute zero, making direct observation impossible. This "imagination gap" between quantum computing's growing societal impact and the public's ability to visualize it represents a significant barrier to quantum literacy and workforce development. We present Quantum Cinema, an open-source, browser-based interactive application that closes this gap by transforming invisible quantum hardware into explorable, cinematic experiences using generative world models. Quantum Cinema guides users through a four-act narrative -- from the foundational Nobel Prize-winning science of quantum entanglement, through curated video introductions to three major quantum computing architectures (trapped-ion, neutral-atom, and superconducting systems), into immersive three-dimensional generative worlds that make invisible quantum phenomena observable, and finally to interactive radar-chart comparisons grounded in real quantum device specifications. All three-dimensional environments are generated using WorldLabs' generative world model platform and are scientifically grounded in curated metrics from Amazon Web Services (AWS) Braket quantum hardware. Quantum Cinema requires no installation, no specialized hardware, and no quantum computing background. It is designed to serve two distinct communities: scholars and developers seeking to replicate or extend the platform, and educators, researchers, and science communicators seeking an intuitive tool for explaining quantum hardware to diverse audiences. This paper describes the system architecture, the generative world model pipeline, use cases for both communities, and directions for future work.

URL PDF HTML ☆

赞 0 踩 0

2606.17077 2026-06-18 physics.chem-ph cs.AI cs.LG quant-ph 新提交

Comprehensive pKa Data Augmentation from Limited Real Data through an Engineered Models-Quantum Framework

基于工程化模型-量子框架从有限真实数据中全面增强pKa数据

Wang Rui, Liu Dinghao

发表机构 * Department of Chemistry, Tsinghua University（清华大学化学系）； Department of Chemical Engineering, Tsinghua University（清华大学化学工程系）； School of Science, China Pharmaceutical University（中国药科大学理学院）

AI总结针对pKa数据稀疏问题，提出量子辅助分子生成方法，利用优化机器学习模型预测和量子退火器采样，在相干伊辛机上实现极端值采样。

详情

AI中文摘要

质子解离常数(pKa)对于功能分子发现和分子建模至关重要。基于已建立的最大实验pKa数据库iBonD，我们和其他研究人员开发了多种方法，包括基于机器学习的经验预测和高精度能量计算。尽管如此，高质量pKa数据的快速增强仍然受到根本性限制。作为这项工作的一部分，我们使用一组经过广泛优化的机器学习模型，对未标记分子数据集进行了大规模基于回归的pKa预测。结果表明，由于未标记分子数据集的特征分布，pKa数据分布近似正态，尾部区域样本极度稀缺。尽管这种增强对于提高整体数据可用性和预测建模非常有价值，但对于高效发现具有广谱pKa性质的分子仍然不足。为了解决这个问题，我们探索从广阔的化学空间中定向生成具有稀疏pKa性质的分子。鉴于传统的连续潜在空间VAE-RNN分子生成方法稳定性不足，且在补充稀疏数据方面未能显示出明显优势，我们设计并实现了一种量子辅助的稀疏pKa分子生成。在模拟量子退火器上验证了可行性，并在物理相干伊辛机(CIM)上进一步实现了优越的极端值采样。(未完待续)

英文摘要

Proton dissociation constants (pKa) are critical for functional molecule discovery and molecular modeling. Building on iBonD, the largest experimental pKa database established, we and other researchers have developed several methods including machine-learning-based empirical prediction and high-accuracy energy calculations. Despite this foundation, the rapid augmentation of high-quality pKa data remains fundamentally constrained. As part of this work, we performed large-scale regression-based pKa prediction on unlabeled molecular datasets using a collection of extensively optimized machine-learning models. The results indicate that, since the feature distributions of unlabeled molecular datasets, the pKa data distribution approximates normality, with extreme scarcity of tail-region samples. Although such augmentation is highly valuable for improving overall data availability and predictive modeling, it remains insufficient for efficiently discovering molecules with broad-spectrum pKa properties. To address this, we explore the targeted generation of molecules with sparse pKa properties from the vast chemical space. Given that traditional continuous latent space VAE-RNN methods for molecular generation suffer from insufficient stability and fail to demonstrate clear advantages in complementing sparse data, we design and implement a quantum-assisted sparse-pKa molecular generation. Feasibility is validated on a simulated quantum annealer, and superior extreme-value sampling is further achieved on physical coherent Ising machines (CIMs). (to be continued)

URL PDF HTML ☆

赞 0 踩 0

2606.15604 2026-06-18 eess.IV cs.CV 新提交

Parameter-Efficient Adaptation of SAM 3 for Automated ITV Generation from 4DCT Images

基于参数高效微调SAM 3从4DCT图像自动生成内靶区

Changwoo Song

发表机构 * Oncosoft Inc.（Oncosoft公司）； Department of Computer Science & Engineering, Chungnam National University（忠南大学计算机科学与工程系）

AI总结提出轻量框架，通过LoRA参数高效微调SAM 3，结合硬负样本挖掘和相位相干滤波，仅用7个标注体数据实现高精度内靶区自动生成，中位Dice达0.968。

Comments 10 pages, 4 figures, 2 tables

详情

AI中文摘要

四维计算机断层扫描（4DCT）捕获了胸部解剖结构的完整呼吸周期，然而当前的内靶区勾画流程孤立处理每个相位，丢弃了时间相干性，使轮廓易受相位特定伪影影响。我们提出一个轻量框架，通过低秩适应（LoRA）对Segment Anything Model 3（SAM 3）进行参数高效微调，仅使用七个标注的3D CT体数据，将其文本提示分割与医学领域对齐。此外，该框架结合了硬负样本挖掘策略，以改善低对比度胸部区域的边界判别。在推理时，通过相位相干时间滤波和空间连通性分析细化逐相位预测。由于呼吸运动是连续且周期性的，真实解剖结构出现在连续的相位块中，而瞬态伪影零星出现，因此被有效抑制。在肺部和心脏结构上的实验分别产生中位Dice分数0.968和0.910，95百分位Hausdorff距离分别为0.998 mm和2.931 mm。所提框架有效消除了未适应SAM 3零样本推理中固有的严重假阳性预测。仅用七个标注体数据，框架保留了超过95%的全数据准确率，且整个流水线可在单个消费级GPU上训练，展示了自适应放疗中可扩展、数据高效的解决方案。

英文摘要

Four-dimensional computed tomography (4DCT) captures the full respiratory cycle of thoracic anatomy, yet current Internal Target Volume contouring workflows process each phase in isolation, discarding temporal coherence and leaving contours vulnerable to phase-specific artifacts. We present a lightweight framework that applies parameter-efficient fine-tuning to the Segment Anything Model 3 (SAM 3) via low-rank adaptation (LoRA) to align its text-prompted segmentation with the medical domain using only seven annotated 3D CT volumes. Furthermore, the framework incorporates a hard negative mining strategy to improve boundary discrimination in low-contrast thoracic regions. At inference, phase-wise predictions are refined through phase-coherent temporal filtering and spatial connectivity analysis. Since respiratory motion is continuous and periodic, genuine anatomy appears in contiguous blocks of phases, whereas transient artifacts appear sporadically and are thus effectively suppressed. Experiments on pulmonary and cardiac structures yield median Dice scores of 0.968 and 0.910 with 95th-percentile Hausdorff distances of 0.998 mm and 2.931 mm, respectively. The proposed framework effectively eliminates the severe false-positive predictions inherent in the zero-shot inference of the unadapted SAM 3. With only seven annotated volumes, the framework retains over 95% of full-data accuracy, and the entire pipeline is trainable on a single consumer-grade GPU, demonstrating a scalable, data-efficient solution for adaptive radiotherapy.

URL PDF HTML ☆

赞 0 踩 0

2606.15091 2026-06-18 cs.HC cs.AI 新提交

Sensory Restoration via Brain-Computer Interfaces: A Unified 2 x 2 Framework and Convergence Roadmap

通过脑机接口的感觉恢复：统一的2×2框架与融合路线图

Xuan-The Tran

发表机构 * School of Mechanical Engineering, Vietnam Maritime University（机械工程学院，越南海防大学）

AI总结本文提出一个统一的2×2框架，按侵入性和信号方向分类脑机接口，并定义恢复、替代和增强范式，同时给出近中长期的融合路线图。

2606.14824 2026-06-18 cs.AR cs.AI cs.LG 新提交

Running hardware-aware neural architecture search on embedded devices under 512MB of RAM

在512MB内存下的嵌入式设备上运行硬件感知的神经架构搜索

Andrea Mattia Garavagno, Edoardo Ragusa, Paolo Gastaldo, Antonio Frisoli

发表机构 * University of Bologna（博洛尼亚大学）； Politecnico di Milano（米兰理工学院）

AI总结提出一种在资源受限的嵌入式设备上直接运行的硬件感知神经架构搜索方法，生成针对低端MCU的微型CNN，在Visual Wake Word数据集上达到最先进水平。

Journal ref 2024 IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA, 2024, pp. 1-2

2606.14202 2026-06-18 cs.NE cs.AI 新提交

MeEvo: Metacognitive Evolution Combined with Natural Evolution for Automatic Heuristic Design

MeEvo: 元认知进化与自然进化相结合用于自动启发式设计

Zishang Qiu, Xinan Chen, Rong Qu, Ruibin Bai

发表机构 * School of Computer Science, University of Nottingham Ningbo China（诺丁汉大学宁波分校计算机科学学院）； School of Computer Science, University of Nottingham（诺丁汉大学计算机科学学院）

AI总结提出MeEvo框架，通过循环耦合自然进化（探索启发式代码）和元认知进化（反思历史生成改进启发式），解决现有方法知识继承弱、探索不足的问题，在五个优化问题上表现更优。

详情

AI中文摘要

大型语言模型（LLMs）通过推理和代码合成实现启发式生成，推动了自动启发式设计（AHD）的发展。现有的基于LLM的AHD架构主要遵循两种范式：自然进化，它使用交叉和变异来探索启发式程序；以及元认知进化，它通过反思来改进推理。然而，自然进化丢弃了推理轨迹，削弱了知识继承和利用，而元认知进化缺乏种群级别的重组，限制了探索并增加了过早收敛的风险。这些局限性降低了复杂问题的搜索效率、稳定性和解的质量。为了解决这一差距，我们提出了MeEvo，一种双层AHD框架，它循环耦合自然进化和元认知进化。自然进化探索启发式代码，同时将推理轨迹、适应度值和错误记录到共享历史中；然后元认知进化反思该历史以生成改进的启发式，这些启发式重新进入父代池以进行下一轮循环。这种设计使得种群驱动的探索和反思驱动的改进相互加强。在五个优化问题上的实验（使用两个LLM骨干）表明，MeEvo比现有的基于LLM的AHD架构实现了更强且更稳定的性能，尤其是在复杂约束任务上。

英文摘要

Large Language Models (LLMs) have advanced Automatic Heuristic Design (AHD) by enabling heuristic generation through reasoning and code synthesis. Existing LLM-based AHD architectures mainly follow two paradigms: Natural Evolution, which uses crossover and mutation to explore heuristic programs, and Metacognitive Evolution, which refines reasoning through reflection. However, Natural Evolution discards reasoning traces, weakening knowledge inheritance and exploitation, while Metacognitive Evolution lacks population-level recombination, limiting exploration and increasing the risk of premature convergence. These limitations reduce search efficiency, stability, and solution quality on complex problems. To address this gap, we propose MeEvo, a dual-layer AHD framework that cyclically couples Natural Evolution and Metacognitive Evolution. Natural Evolution explores heuristic code while recording reasoning traces, fitness values, and errors into a shared history; Metacognitive Evolution then reflects on this history to generate improved heuristics that re-enter the parent pool for the next cycle. This design enables population-driven exploration and reflection-driven refinement to reinforce each other. Experiments on five optimization problems with two LLM backbones show that MeEvo achieves stronger and more stable performance than existing LLM-based AHD architectures, especially on complex constrained tasks.

URL PDF HTML ☆

赞 0 踩 0

2606.12816 2026-06-18 quant-ph cs.ET cs.LG 新提交

Graph Reinforcement Learning for Calibration-Aware Quantum Circuit Routing

图强化学习用于校准感知的量子电路路由

Yash Vardhan Tomar, Dheeraj Peddireddy

发表机构 * University of California, Berkeley（加州大学伯克利分校）； National Institute of Standards and Technology（国家标准与技术研究院）

AI总结提出一种利用图强化学习进行校准感知的量子电路路由方法，通过IBM Heron r2校准数据选择SWAP操作，在MQT Bench电路上平均保真度达0.727，优于SABRE-best20的0.440。

详情

AI中文摘要

量子电路路由是在为噪声中等规模量子处理器编译程序时的关键步骤。通过标准开销指标看似高效的路由，在通过校准不良的耦合器时仍可能损失保真度。我们研究了一种校准感知的图强化学习路由器，该路由器使用当天的IBM Heron r2校准数据来选择硬件边缘SWAP。我们使用近端策略优化训练策略，并通过九个慕尼黑量子工具包（MQT）基准电路和三个校准快照的精确模拟保真度进行评估。在这些评估中，合并的平均精确保真度为$0.727$，而SABRE-best20为$0.440$，目标感知SABRE为$0.481$。保真度增益伴随着更高的路由双量子比特计数，并集中在5q和8q电路系列中；在固定树动作图下，所有10q系列都倾向于SABRE-best20。总体而言，我们的结果表明，校准感知的学习路由可以超越基于门计数的编译，提高保真度。

英文摘要

Quantum circuit routing is a key step in compiling programs for noisy intermediate-scale quantum processors. Routes that appear efficient by standard overhead metrics can still lose fidelity when they pass through poorly calibrated couplers. We study a calibration-aware graph reinforcement-learning router that uses same-day IBM Heron r2 calibration data to choose hardware-edge SWAPs. We train the policy with proximal policy optimization and evaluate it with exact simulated fidelity across nine Munich Quantum Toolkit (MQT) Bench circuits and three calibration snapshots. Across these evaluations, pooled mean exact fidelity is $0.727$, compared with $0.440$ for SABRE-best20 and $0.481$ for target-aware SABRE. We observed that fidelity gains came with higher routed two-qubit counts and were concentrated in 5 qubit and 8 qubit circuit families; under the fixed tree action graph, all 10 qubit families favored SABRE-best20. Overall, our results show that calibration-aware learned routing can improve fidelity beyond gate-count-driven compilation.

URL PDF HTML ☆

赞 0 踩 0

2606.09946 2026-06-18 cs.AR cs.CV 新提交

SPARX: Secure and Privacy-Aware Approximate CNN Acceleration with Edge RISC-V SoC

SPARX: 面向边缘RISC-V SoC的安全与隐私感知近似CNN加速

Sonu Kumar, Akash Sankhe, Mukul Lokhande, Santosh Kumar Vishvakarma

发表机构 * Dept of Science and Technology (DST), Govt of India（印度科学技术部）； MeitY/SMDP-C2S（印度电子与信息化部/SMDP-C2S）

AI总结提出SPARX框架，集成RISC-V指令扩展、近似对数CNN加速单元、差分隐私引擎和认证机制，通过近似感知决策框架选择最优乘法器，在边缘实现安全高效的CNN推理。

Comments Under review in 12th International Symposium on Smart Electronic Systems (iSES) 2026

详情

AI中文摘要

边缘AI系统日益需要在严格的能耗、性能、安全和隐私约束下进行实时CNN推理。近似计算通过利用神经网络工作负载的错误容忍性来提高硬件效率；然而，大多数近似CNN加速器并未联合考虑安全的、隐私感知的边缘部署。本文提出了SPARX，一个集成在异构RV32IMC RISC-V系统级芯片（SoC）内的安全与隐私感知近似CNN加速框架。SPARX结合了自定义RISC-V指令扩展、近似对数CNN加速单元、轻量级基于差分噪声的隐私引擎以及挑战-响应认证机制。为了指导算术选择，引入了一个近似感知决策框架，该框架使用近似严重性指数（ASI）、近似效率（AE）、近似质量（QoA）、近似品质因数（AFOM）和硬件加速效率（HAE）。对11种最先进的近似MAC架构的评估表明，迭代对数乘法器（ILM）是最合适的设计，与精确的基4 Booth MAC相比，面积减少51.7%，功耗降低81.5%，吞吐量提升2.13倍，而仅使ResNet-20/CIFAR-10的准确率降低2.82个百分点。在Xilinx VC707平台上的FPGA实现实现了250 MHz下58.4 GOPS/W的能效，而28纳米CMOS物理实现验证了ASIC的可行性。

英文摘要

Edge-AI systems increasingly require real-time CNN inference under strict energy, performance, security, and privacy constraints. Approximate computing improves hardware efficiency by exploiting the error resilience of neural network workloads; however, most approximate CNN accelerators do not jointly consider secure, privacy-aware edge deployment. This paper presents SPARX, a Secure and Privacy-Aware Approximate CNN Acceleration framework integrated within a heterogeneous RV32IMC RISC-V System-on-Chip (SoC). SPARX combines a custom RISC-V instruction extension, an approximate logarithmic CNN acceleration unit, a lightweight differential-noise-based privacy engine, and a challenge-response authentication mechanism. To guide arithmetic selection, an approximation-aware decision framework is introduced that uses the Approximation Severity Index (ASI), Approximation Efficiency (AE), Quality of Approximation (QoA), Approximation Figure-of-Merit (AFOM), and Hardware Acceleration Efficiency (HAE). Evaluation across 11 state-of-the-art approximate MAC architectures identifies the Iterative Logarithmic Multiplier (ILM) as the most suitable design, achieving 51.7% area reduction, 81.5% power reduction, and 2.13x throughput improvement compared with an accurate radix-4 Booth MAC, while only reducing ResNet-20/CIFAR-10 accuracy by 2.82 percentage points. FPGA implementation on a Xilinx VC707 platform achieves 58.4 GOPS/W energy efficiency at 250 MHz, while 28-nm CMOS physical implementation validates ASIC feasibility

URL PDF HTML ☆

赞 0 踩 0

2606.07150 2026-06-18 cs.CR cs.AI cs.MA cs.NI 新提交

From Privacy to Workflow Integrity: Communication-Graph Metadata in Autonomous Agent Interoperability

从隐私到工作流完整性：自主智能体互操作性中的通信图元数据

Bijaya Dangol

发表机构 * Independent Researcher（独立研究者）

AI总结针对智能体通信图元数据泄露问题，提出工作流完整性威胁模型，定义传输层与引导层隐私属性，并通过A2A案例验证元数据保护可有效抑制任务推断。

Comments 22 pages, 7 figures, 6 tables

详情

AI中文摘要

诸如A2A和MCP之类的智能体互操作性协议标准化了智能体之间的通信内容，但假设基于地址的HTTP(S)传输。此类传输保护消息内容，并越来越多地采用端到端加密。它们暴露在明文中的是通信图：哪个智能体联系哪个智能体、何时以及频率如何。在智能体系统中，该图比隐私框架所暗示的更具后果性。端点通常带有能力标签，工作流是结构化和链式的，交互与实际行动耦合，因此观察者恢复的不仅仅是过去的关系。它可以推断出待处理的工作流、正在组装的任务以及可能即将发生的行动。以机器速度，它可以在工作流完成之前根据该推断采取行动。因此，威胁是工作流完整性，而不仅仅是隐私：对自主行动的预测性杠杆。我们为智能体通信图提供了一个威胁模型；识别了使智能体元数据具有独特揭示性的因素（语义性、前瞻性、驱动性）；定义了传输层和引导层隐私属性，并评估了候选传输（SimpleX/SMP、Tor、混合网络）与这些属性的匹配程度；并提出了一个A2A案例研究，其中元数据保护绑定是可表达的，但揭示了协议的身份假设。我们在一个基于真实A2A捕获的生成模型上测试了这些。仅凭被动元数据，没有载荷，一个分类器从工作流的开头就能以远高于随机水平的概率恢复任务类别；应用这些属性后，该恢复被急剧拉回随机水平。除了观察者能恢复的内容外，我们衡量了利用泄露的杠杆：在工作流开头和固定预算下，选择对哪些工作流采取行动的对手在此模型中实现了大部分先知攻击者相对于元数据盲攻击者的优势，而相同的属性抑制了这一点。

英文摘要

Agent-interoperability protocols such as A2A and MCP standardize what agents say to one another but assume address-based transport. Whether over HTTP(S) or a content-protecting binding such as MLS-based SLIM, these transports protect message content yet leave the communication graph exposed: which agent contacts which, when, and how often. In agent systems this graph is more consequential than a privacy framing suggests. Endpoints are capability-labeled, workflows are structured and chained, and interactions are coupled to actions, so an observer recovers more than past relationships: it can recognize a recurring pending workflow from its opening and, at machine speed, act on it before it completes. The threat is one of workflow integrity, not privacy alone. We give a threat model for the communication graph and locate what makes its metadata distinctively consequential: not stronger fingerprinting but exposure across independent trust domains, coupled to autonomous action. We define transport- and bootstrap-layer privacy properties, give them an indistinguishability-game semantics, evaluate transports, and give an A2A case study where a metadata-protecting binding surfaces its implicit identity assumptions. On a corpus of real multi-agent A2A traffic from the official reference agents, on a live A2A binding, and with a generative model as a controlled instrument, a label-blind classifier recovers a task's class from passive metadata at 6x chance, and from only its opening; a defense-aware adversary does not overturn this, and only the full set of properties drives recovery toward chance. Acting on the leak is distinct from recoverability: under a fixed budget an adversary captures 0.63 of a clairvoyant attacker's advantage on the corpus (0.41 from a workflow's opening), governed by top-ranked precision rather than overall accuracy, so integrity and privacy come apart under defense.

URL PDF HTML ☆

赞 0 踩 0

2606.03745 2026-06-18 hep-ph cs.LG hep-ex physics.data-an 交叉投稿

Predicting the Neutrino Mass Ordering Using Neural Networks

利用神经网络预测中微子质量顺序

T. J. C. Bezerra, L. Asquith, E. Bannister, W. Shorrock

发表机构 * Department of Physics and Astronomy, University of Sussex（苏塞克斯大学物理与天文学系）

AI总结针对中微子质量顺序这一粒子物理核心问题，提出基于前馈神经网络分类器的机器学习方法，利用合成长基线数据集训练，并与标准χ²和logL方法对比，证明其性能相当，可作为独立交叉检验工具。

Comments 11 pages, 7 figures

详情

AI中文摘要

确定中微子质量顺序仍是粒子物理中的一个核心开放问题。虽然下一代长基线实验有望解决这一问题，但当前数据提供的灵敏度有限，因为正常顺序和倒置顺序之间的谱差异细微且与参数简并纠缠。我们研究了一种用于质量顺序确定的机器学习策略，使用前馈神经网络分类器，该分类器在合成长基线数据集上训练，这些数据集由三味振荡概率、物质效应和统计涨落生成。我们使用常见的判别指标（包括接收者操作特征曲线）将分类器与标准χ²和logL方法进行评估，以量化灵敏度并说明如何选择操作点以优先考虑纯度或效率。我们发现，在所研究的场景中，神经网络实现了与常规拟合相当的性能，为已有分析提供了灵活、独立的交叉检验。该框架可以扩展以包含系统不确定性并探索振荡参数的联合推断，也可作为在中微子物理中引入机器学习方法的教学工具。

英文摘要

Determining the neutrino mass ordering remains a central open problem in particle physics. While next-generation long-baseline experiments are expected to resolve this question, current data provide limited sensitivity because the spectral differences between normal and inverted ordering are subtle and entangled with parameter degeneracies. We investigate a machine-learning strategy for mass-ordering determination using a feed-forward neural-network classifier trained on synthetic long-baseline datasets generated with three-flavour oscillation probabilities, matter effects, and statistical fluctuations. We evaluate the classifier against standard $χ^2$ and $\log\mathcal{L}$ approaches using common discrimination metrics, including receiver-operating-characteristic curves, to quantify sensitivity and to illustrate how operating points can be selected to prioritise purity or efficiency. We find that the neural network achieves performance comparable to conventional fits for the scenarios studied, providing a flexible, independent cross-check of established analyses. The framework can be extended to incorporate systematic uncertainties and to explore joint inference of oscillation parameters, and it may also serve as a pedagogical tool for introducing machine-learning methods in neutrino physics.

URL PDF HTML ☆

赞 0 踩 0

2605.27729 2026-06-18 cs.CR cs.AI cs.ET quant-ph 交叉投稿

QSignAI: Quantum-Randomness-Seeded Identity Signatures at the Intersection of AI for Science and Science for AI

QSignAI: 量子随机性种子身份签名——AI for Science 与 Science for AI 的交汇

Dongping Liu, Aoyu Zhang, Luyao Zhang

发表机构 * Amazon Web Services（亚马逊网络服务）； Duke Kunshan University（杜克昆山大学）

AI总结提出 QSignAI 平台，通过云端量子电路生成量子随机性种子，为社交平台用户提供唯一身份签名，并借助 AI 机器人使量子现象对普通用户可感知。

详情

AI中文摘要

2024-2025 年的诺贝尔奖和图灵奖同时表彰了人工智能和量子科学——机器学习作为物理科学，人工智能解决了 50 年的科学问题，超导量子电路作为量子计算的硬件基础，量子信息原理作为计算的最高成就。然而，没有任何已部署的人工智能系统将这两者结合起来为公众服务：身份系统仍然依赖伪随机令牌，量子电路对于每天使用机器人支持的社交消息平台的数十亿人来说仍然不可见。本文介绍了 QSignAI，一个已部署到生产环境的开源平台，在实时事件参与系统中展示了人工智能与量子科学之间的双向关系。我们解决三个研究问题：第一，能否通过真实量子电路生成量子随机性，并将其嵌入到人工智能驱动的社交平台中，且延迟和成本可接受；第二，人工智能机器人能否使量子现象对没有技术背景的普通观众在感知上可理解；第三，结合这两个方向的系统在实践中是否有效。一个对话式人工智能机器人在云端量子模拟器上通过双电路量子管道路由每个参与者的第一条消息，为每个参与者生成唯一的量子随机性种子身份签名。前两个问题通过系统设计和定性部署证据得到回答；可衡量的比较被确定为优先的未来工作。

英文摘要

The 2024-2025 Nobel and Turing awards recognised AI and quantum science simultaneously. Yet no deployed system has brought these streams together for the public. This paper presents QSignAI, a production-deployed platform demonstrating a bidirectional AI-quantum relationship in a real-time event participation system. We address three questions: can quantum-randomness generation via a two-source extractor be embedded in an AI-driven social platform with acceptable latency; can an AI bot make quantum phenomena perceptually legible to general audiences; and does the combined system work in practice? A conversational bot routes each participant's first message through a quantum pipeline comprising a Toeplitz two-source extractor over independent single-qubit Hadamard measurements on SV1 and DM1 simulators, plus a 2-qubit Bell state, producing a unique quantum-randomness-seeded identity signature per participant. The first two questions are answered through system architecture and qualitative deployment evidence from live events; the third through successful production deployment. The current deployment uses cloud quantum simulators; physical QPU randomness is the near-term extension. Measurable benchmarks are identified as priority future work.

URL PDF HTML ☆

赞 0 踩 0

2601.23018 2026-06-18 cs.HC cs.AI cs.LG 交叉投稿

Integrating Multi-Label Classification and Generative AI for Scalable Analysis of User Feedback

整合多标签分类与生成式AI实现用户反馈的可扩展分析

Sandra Loop, Erik Bertram, Sebastian Juhl, Martin Schrepp

发表机构 * SAP SE（SAP公司）； Hochschule Fresenius Heidelberg（弗赖辛大学海德堡分校）； University of Missouri（密苏里大学）

AI总结提出结合监督多标签分类与生成式AI的方法，高效处理大量用户评论，自动分配主题标签并生成摘要，同时发现情感分析不能可靠反映产品满意度。

Comments 8 pages, 2 figures, submitted to Springer Nature

详情

AI中文摘要

在高度竞争的软件市场中，用户体验（UX）评估对于确保软件质量和促进产品长期成功至关重要。此类UX评估通常将标准化问卷的定量指标与通过开放式问题收集的定性反馈相结合。虽然开放式反馈为改进提供了有价值的见解，并有助于解释定量结果，但分析大量用户评论具有挑战性且耗时。在本文中，我们介绍了一家大型软件公司在长期UX测量项目中开发的技术，以高效处理和解释大量用户评论。为了提供收集到的评论的高层概述，我们采用监督机器学习方法，为每条评论分配有意义的预定义主题标签。此外，我们展示了如何利用生成式AI（GenAI）创建简洁且信息丰富的用户反馈摘要，促进向组织尤其是高层管理人员有效传达发现。最后，我们研究了用户评论中表达的情感是否可以作为整体产品满意度的指标。我们的结果表明，仅凭情感分析并不能可靠地反映用户满意度。相反，产品满意度需要在调查中明确评估，以衡量用户对产品的感知。

英文摘要

In highly competitive software markets, user experience (UX) evaluation is crucial for ensuring software quality and fostering long-term product success. Such UX evaluations typically combine quantitative metrics from standardized questionnaires with qualitative feedback collected through open-ended questions. While open-ended feedback offers valuable insights for improvement and helps explain quantitative results, analyzing large volumes of user comments is challenging and time-consuming. In this paper, we present techniques developed during a long-term UX measurement project at a major software company to efficiently process and interpret extensive volumes of user comments. To provide a high-level overview of the collected comments, we employ a supervised machine learning approach that assigns meaningful, pre-defined topic labels to each comment. Additionally, we demonstrate how generative AI (GenAI) can be leveraged to create concise and informative summaries of user feedback, facilitating effective communication of findings to the organization and especially upper management. Finally, we investigate whether the sentiment expressed in user comments can serve as an indicator for overall product satisfaction. Our results show that sentiment analysis alone does not reliably reflect user satisfaction. Instead, product satisfaction needs to be assessed explicitly in surveys to measure the user's perception of the product.

URL PDF HTML ☆

赞 0 踩 0

2606.06133 2026-06-18 cs.SE cs.AI cs.LG cs.LO 版本更新

TLA-Prover: Verifiable TLA+ Specification Synthesis via Preference-Optimized Low-Rank Adaptation

TLA-Prover: 通过偏好优化低秩适配实现可验证的 TLA+ 规范合成

Eric Spencer, Arslan Bisharat, Brian Ortiz, Khushboo Bhadauria, TaiNing Wang, George K. Thiruvathukal, Konstantin Laufer, Mohammed Abuhamad

发表机构 * Department of Computer Science, Loyola University Chicago（洛约拉芝加哥大学计算机科学系）

AI总结提出 TLA-Prover 模型，结合监督微调和基于修复的组相对策略优化，在 TLC 模型检查器上实现 TLA+ 规范合成，Gold/Diamond 级别通过率达 30%，约为未调优基线的 3.5 倍。

Comments 12 pages, 5 tables, 3 figures. Accepted at the 21st International Conference on Software Technologies (ICSOFT 2026)

详情

AI中文摘要

TLA+ 是一种用于验证分布式系统和安全关键协议的正式规范语言。大型语言模型（LLM）生成的 TLA+ 规范常常因语义原因无法通过 TLC 模型检查器。在 25 个 LLM 中，最佳公开基线的语法解析成功率为 26.6%，语义模型检查通过率为 8.6%。我们提出了 TLA-Prover，一个 200 亿参数的 TLA+ 规范合成模型。训练结合了在已验证示例上的监督微调（SFT）和基于修复的组相对策略优化（GRPO）。在 GRPO 阶段，模型学习修复自身被拒绝的规范。我们还从相同的 SFT 检查点训练了一个直接偏好优化（DPO）变体作为消融实验。TLC 直接提供奖励信号，无需学习奖励模型。每个输出分为四个等级：青铜（解析通过）、银（无警告）、金（通过 TLC）和钻石。要达到钻石级，模型的正确性属性会被自动微小修改；TLC 必须检测到违反。如果 TLC 仍然通过，则该属性始终为真且无贡献；输出无法达到钻石级。在一个保留的 30 问题基准上，TLA-Prover 在金级和钻石级均达到 9/30（即 pass@1 = 30%）。这大约是未调优基线 8.6% 的 3.5 倍。DPO 变体在钻石级达到 20%。金级和钻石级在每个检查点都一致；这防止了平凡属性失败模式。

英文摘要

TLA+ is a formal specification language for verifying distributed systems and safety-critical protocols. Large language models (LLMs) frequently produce TLA+ specifications that fail the TLC model checker for semantic reasons. Across 25 LLMs, the best public baseline is 26.6% syntactic parse and 8.6% semantic model-check. We present TLA-Prover, a 20-billion-parameter model for TLA+ specification synthesis. Training combines supervised fine-tuning (SFT) on verified examples with repair-based group-relative policy optimization (GRPO). In the GRPO stage, the model learns to fix its own rejected specifications. We also train a direct preference optimization (DPO) variant from the same SFT checkpoint as an ablation. TLC provides the reward signal directly, with no learned reward model. Four tiers grade each output: Bronze (parses), Silver (no warnings), Gold (passes TLC), and Diamond. To reach Diamond, the model's correctness property is automatically altered in a small way; TLC must then detect a violation. If TLC still passes, the property was always-true and contributes nothing; the output fails Diamond. TLA-Prover reaches 9/30 (i.e. pass@1 = 30%) at both Gold and Diamond on a held-out 30-problem benchmark. This is roughly 3.5x the 8.6% untuned baseline. The DPO variant reaches 20% at Diamond. Gold and Diamond coincide at every checkpoint; this prevents the trivial-property failure mode.

URL PDF HTML ☆

赞 0 踩 0

2606.04404 2026-06-18 stat.ML cs.LG 版本更新

Knockoffs-based False Discovery Rate Control and Simplification for Deep Neural Networks

基于Knockoffs的深度神经网络错误发现率控制与简化

Wenyu Liao, Yiqing Shi, Fang Xie

发表机构 * bnbu.edu.cn（北京理工大学）

AI总结本文基于knockoff方法和正则化神经网络，提出了三种在控制错误发现率条件下的变量筛选方法（单层过滤、多层过滤、变量权重聚合过滤），以简化深度神经网络并降低计算复杂度。

2606.00182 2026-06-18 cs.HC cs.AI cs.CY 版本更新

The New Social Image: How AI Competency and AI Proactivity Influence Self- and Peer-Perceptions in the Workplace

新社会形象：AI能力与AI主动性如何影响职场中的自我与同伴感知

Kuntal Ghosh, Marc Hassenzahl, Shadan Sadeghian

发表机构 * Autonomous Interactive Systems, University of Siegen（自主交互系统，锡根大学）； Experience & Interaction Design, University of Siegen（体验与交互设计，锡根大学）

AI总结通过2x2x2情景实验（n=50），研究AI能力与主动性水平对员工工作所有权、情感、意义感及角色动态的自我与同伴感知影响，发现低能力或低主动性的AI通常提升积极感知，但高能力与高主动性可能带来负面影响。

Comments Updated metadata following publication in Interacting with Computers. Added DOI and publication information

详情

DOI: 10.1093/iwc/iwag033

AI中文摘要

人机协作被视为将AI融入职场的最有前景方式。然而，这种协作的体验后果尚未被探索。具体而言，在与AI组成的团队中，人类如何感知自己（自我感知）以及同事如何看待他们（同伴感知）在工作所有权和工作意义方面。在一项2x2x2情景研究（n=50）中，参与者对所有权、情感、工作意义和满意度以及角色动态的感知进行了评分，其中AI主动性和AI能力作为被试内因素（低/高两个水平），视角（自我感知/同伴感知）作为被试间因素。我们的结果表明，低能力或低主动性的AI通常提升了与所有权、意义感、满意度和角色动态相关的感受，并增加了积极情感，减少了消极情感。然而，这些效应往往受到视角的影响。例如，低AI主动性从自我感知而非同伴感知中带来了更高的工作满意度。基于我们的发现，我们认为仅围绕绩效指标设计未来工作的AI可能并不足够。高能力和高主动性的AI驱动系统可能对所有权感知、工作身份、社会形象和团队动态产生不良影响，进而影响工作意义。

英文摘要

Human-AI collaboration is considered the most promising way to incorporate AI in the workplace. What remains unexplored are the experiential consequences of this teaming. More specifically, in a team with AI, how humans perceive themselves (self-perception) and how they are perceived by their coworkers (peer perception) in terms of work ownership and job meaningfulness. In a 2x2x2 vignette study (n=50), participants rated perceptions of ownership, affect, job meaningfulness and satisfaction, and role dynamics across two levels (low/high) of AI proactivity and AI competency as within-subject factors, with point-of-view (self perception/peer perception) as between-subjects. Our results showed that AI with low competency or low proactivity generally improved feelings related to ownership, meaningfulness, satisfaction, and role dynamics, and also increased positive affect while reducing negative affect. However, these effects were often influenced by point-of-view. For instance, low AI proactivity resulted in higher job satisfaction from self-perception rather than peer perception. Based on our findings, we argue that designing AI for the future of work solely around performance metrics may not be adequate. Highly competent and proactive AI-driven systems can have undesirable impacts on perceptions of ownership, job identity, social image and team dynamics, and consequently, job meaningfulness.

URL PDF HTML ☆

赞 0 踩 0

2605.28690 2026-06-18 quant-ph cs.LG 版本更新

Latent-Conditioned Parameterized Quantum Circuits as Universal Approximators for Distributions over Quantum States

潜在条件参数化量子电路作为量子态分布的通用近似器

Quoc Hoan Tran, Koki Chinzei, Yasuhiro Endo, Hirotaka Oshima

发表机构 * Quantum Laboratory, Fujitsu Research, Fujitsu Limited（Fujitsu 研究所量子实验室， Fujitsu 有限公司）

AI总结提出潜在条件参数化量子电路（LPQC），通过经典神经网络将潜在变量映射到量子电路参数，证明其在1-Wasserstein距离下是密度算子概率测度的通用近似器，并引入多模态潜在先验和专家混合电路架构缓解贫瘠高原问题。

Comments 21 pages, 11 figures (fix the proof and update appendix for barren plateaus analysis)

详情

AI中文摘要

量子模拟、量子化学和量子机器学习中的许多应用不仅需要单个量子态，还需要表征目标系统异质性的量子态系综。在变分和容错设置中，逐个状态地准备这样的系综是不可行的，这激发了生成式建模方法。我们引入了潜在条件参数化量子电路（LPQC），这是一种混合量子-经典框架，其中经典神经网络将从先验分布中采样的潜在变量映射到参数化量子电路的参数。我们证明了LPQC在1-Wasserstein距离下是密度算子概率测度的通用近似器，将经典通用近似定理扩展到量子分布设置。我们还引入了多模态潜在先验和专家混合电路架构，并表明它在优化过程中经验性地缓解了贫瘠高原问题。数值实验在合成多簇混合量子态系综和QM9衍生的3D分子结构系综上验证了该框架。在这些任务中，LPQC优于最近的量子生成基线，同时与典型的经典基线相比，在输出维度大幅降低的情况下保持竞争力。通过利用潜在空间中的经典表达能力，LPQC为量子生成建模提供了一条可行的途径。

英文摘要

Many applications in quantum simulation, quantum chemistry, and quantum machine learning require not a single quantum state but an ensemble of states characterizing the heterogeneity of a target system. Preparing such ensembles state-by-state is prohibitive in both variational and fault-tolerant settings, thereby motivating a generative modeling approach. We introduce latent-conditioned parameterized quantum circuits (LPQCs), a hybrid quantum-classical framework in which classical neural networks map a latent variable sampled from a prior distribution to the parameters of a parameterized quantum circuit. We prove that LPQCs are universal approximators for probability measures over density operators in the 1-Wasserstein distance, extending classical universal approximation theorems to the quantum-distribution setting. We additionally introduce a multimodal latent prior and a mixture-of-experts circuit architecture, and show empirically that the latent-conditioned parameterization alleviates the barren plateau problem during optimization, a behavior for which we provide rigorous partial guarantees. Numerical experiments validate the framework on a synthetic multi-cluster ensemble of mixed quantum states and on a QM9-derived ensemble of 3-D molecular structures. In these tasks, LPQC outperforms recent quantum generative baselines and matches the generation quality of a classical neural-network baseline, while requiring an output dimension that grows only linearly with the number of qubits rather than exponentially. By leveraging classical expressivity in the latent space, LPQCs offer a tractable route to quantum generative modeling.

URL PDF HTML ☆

赞 0 踩 0

2605.26903 2026-06-18 cs.CR cs.AI 版本更新

Practical Anonymous Two-Party Gradient Boosting Decision Tree

实用的匿名两方梯度提升决策树

Chenyu Huang, Fan Zhang, Minxin Du, Sherman S. M. Chow, Huangxun Chen, Huaming Rao, Danqing Huang, Bo Qian, Peng Chen

发表机构 * Tencent（腾讯）； Hong Kong Polytechnic University（香港理工大学）； Chinese University of Hong Kong（香港中文大学）； HKUST-GZ

AI总结针对两方垂直分割数据上的梯度提升决策树训练，提出一种基于双电路隐私集合求交和遗忘可编程伪随机函数的匿名协议，在隐藏记录标识符的同时保持效率。

Comments 19 pages; 2026 IEEE Symposium on Security and Privacy (SP)

Journal ref 2026 IEEE Symposium on Security and Privacy (SP)

详情

DOI: 10.1109/SP63933.2026.00084

AI中文摘要

梯度提升决策树（GBDT）擅长处理结构化数据，通常用于在互不信任的各方之间垂直分割的特征上进行训练。高速和可解释性使得GBDT在金融和医疗领域广受欢迎，而神经网络在这些领域可能表现不佳。为GBDT启用安全计算带来了独特的挑战，需要安全的记录对齐以进行比较。依赖隐私集合求交（PSI）是一种事实上的方法。将PSI误认为是安全措施实际上会暴露数据集中哪些记录标识符（ID）是共享的。尽管电路PSI可以提供帮助，但对于通用用途来说成本高昂。需要新的思路来在“黑暗森林”中高效训练。为了隐藏ID，我们启动了对两方持有的分割数据上的匿名GBDT训练的研究。我们设计中的双电路PSI让双方交替作为接收者，对本地特征执行“选取后求和”。通过遗忘可编程伪随机函数，我们将电路PSI的输出作为共享状态在运行之间传播。避免通用对齐，我们解决了被忽视的困境：隐藏ID会带来与域大小成比例的成本。接下来，我们将用于将单指令多数据同态加密从（环）学习误差转换的密文打包成本减半，相比之前的安全GBDT（Usenix Security' 23）和相关安全机器学习计算。对比实验表明，我们的协议在效率上与有泄漏的方法相比仍具有竞争力。通过启用隐藏ID的聚合，我们的技术可以扩展到其他垂直分割的分析场景。

英文摘要

Structured data is well handled by gradient-boosted decision trees (GBDT), which are usually trained on vertically partitioned features across mutually distrustful parties. High speed and interpretability make GBDTs popular in finance and healthcare, where neural networks may fall short. Enabling secure computation for GBDTs poses unique challenges, requiring secure record alignment for comparison. Relying on private set intersection (PSI) is a de facto approach. Mistaking PSI for a safety measure actually exposes which record identifiers (IDs) are shared between the datasets. Although circuit-PSI could help, it is costly for generic uses. New ideas are needed to efficiently train in a "dark forest". Aiming to hide the IDs, we initiate the study of anonymous GBDT training on split data held by two parties. Dual circuit-PSI in our design lets the parties alternate as receiver to run pick-then-sum over local features. Via oblivious programmable pseudorandom functions, we propagate circuit-PSI outputs as shared state across runs. Avoiding universal alignment, we resolve the neglected dilemma that ID hiding incurs a cost that scales with domain size. Next, we halve the cost of ciphertext packing used to convert single-instruction multiple-data homomorphic encryption from (ring) learning with errors in prior secure GBDT (Usenix Security' 23) and related secure machine-learning computations. Comparative experiments show our protocol remains competitive with leaky approaches in efficiency. Enabling ID-hiding aggregation, our techniques can extend to other vertically partitioned analytics.

URL PDF HTML ☆

赞 0 踩 0

2605.26672 2026-06-18 cs.MM cs.SD 版本更新

Can We Hear from Events? Generating Speech from Event Camera

我们能从事件中听到声音吗？从事件相机生成语音

Jingping Fang, Lin Chen, Chenyang Xu, Tong Zhao, Weidong Cai, Xiaoming Chen

发表机构 * Beijing Technology and Business University（北京技术与商业大学）； Xidian University（西安电子科技大学）； Tongji University（同济大学）； University of Sydney（悉尼大学）

AI总结提出EventSpeech框架，利用神经形态事件相机的高时间精度解决传统RGB语音生成中的时间粒度不匹配问题，实现情感丰富且抗运动模糊的语音生成。

详情

AI中文摘要

传统的基于RGB的语音生成面临时间粒度不匹配问题，因为固定的相机曝光时间不可避免地模糊了渲染情感语音所需的高频发音瞬态。为了打破这一限制，我们提出EventSpeech，这是一个新颖的文本条件框架，率先利用神经形态事件进行表达性语音生成，因为这些微秒级精确的事件自然与声学波形动态对齐。我们的架构集成了一个专用的事件编码器来建模稀疏的神经形态事件，以及一个多尺度音频编码器，其中包含分层小波上下文器（HWC）。双向对齐机制无缝地将语言内容和视觉动态与密集的声学特征同步。此外，我们构建了EVT-SPK作为第一个基准，包括大规模合成数据和来自专用神经形态硬件的真实世界记录。大量评估表明，EventSpeech通过保留细粒度情感和抵抗运动模糊，显著优于当前基线，为多模态语音生成建立了新范式。代码和演示可在https://xrfang-0102.github.io/EventSpeechWeb/获取。

英文摘要

Traditional RGB-based speech generation faces Temporal Granularity Mismatch since fixed camera exposure times inevitably blur the high-frequency articulatory transients essential for rendering emotional speech. To break this ceiling, we propose EventSpeech as a novel text-conditioned framework pioneering the use of neuromorphic events for expressive speech generation, since these microsecond-precise events naturally align with acoustic waveform dynamics. Our architecture integrates a dedicated Event Encoder to model sparse neuromorphic events alongside a multi-scale Audio Encoder featuring a Hierarchical Wavelet Contextualizer (HWC). A bidirectional alignment mechanism seamlessly synchronizes linguistic content and visual dynamics with dense acoustic features. Furthermore, we construct EVT-SPK as the first benchmark comprising large-scale synthetic data and real-world recordings from specialized neuromorphic hardware. Extensive evaluations demonstrate that EventSpeech significantly outperforms current baselines by preserving fine-grained emotions and resisting motion blur to establish a new paradigm for multimodal speech generation. Code and demo are available at https://xrfang-0102.github.io/EventSpeechWeb/.

URL PDF HTML ☆

赞 0 踩 0

2605.26631 2026-06-18 stat.AP cs.LG 版本更新

Data-driven sparse identification of governing PDEs via knockoff filters and multi-criteria trade-offs

基于Knockoff滤波器与多准则权衡的数据驱动稀疏识别控制偏微分方程

Pongpisit Thanasutives, Naichang Ke, Yoshinobu Kawahara

发表机构 * RIKEN Center for Advanced Intelligence Project (AIP)（RIKEN先进人工智能项目中心）； The University of Osaka（大阪大学）

AI总结提出KO-PDE-IDENT框架，通过模型-X knockoff滤波器控制错误发现率，结合递归特征消除和多准则决策，从噪声数据中稀疏识别偏微分方程。

Comments 44 pages, 5 figures, 11 tables

详情

AI中文摘要

我们提出KO-PDE-IDENT，一个用于识别简洁偏微分方程（PDE）并控制错误发现率（FDR）的数据驱动框架。从噪声观测中发现PDE常常受到候选项之间极端多重共线性的阻碍，这导致典型的稀疏回归方法选择虚假项。为了解决这个问题，KO-PDE-IDENT首先通过具有有限样本FDR控制的模型-X knockoff滤波器挖掘潜在候选项的支持集，然后对存活的PDE备选方案进行细化和排序。该框架整合了三个组成部分。首先，通过将$\ell_{0}$约束的自适应最佳子集选择与SHapley Additive exPlanations（SHAP）相结合，构建knockoff特征统计量，产生有效且计算高效的差异统计量。其次，递归特征消除（RFE）过程去除边际贡献可省略的项，并通过knockoff扰动假设检验评估统计必要性。第三，最终模型选择被表述为一个多准则决策（MCDM）问题，其中最优控制方程是在预测精度、模型复杂度和系数不确定性等广泛准则之间取得最佳平衡的备选方案。我们在严重噪声污染下对五个经典PDE验证了KO-PDE-IDENT。实验结果表明，我们的框架可以精确恢复真实的PDE结构，消除错误发现同时保留所有真实潜在项，且系数估计误差低。

英文摘要

We propose KO-PDE-IDENT, a data-driven framework for identifying parsimonious partial differential equations (PDEs) with false discovery rate (FDR) control. PDE discovery from noisy observations is often hindered by extreme multicollinearity among candidate terms, which causes typical sparse-regression methods to select spurious terms. To address this problem, KO-PDE-IDENT initially mines a support set of potential candidate terms via model-X knockoff filters with finite-sample FDR control, then refines and ranks the surviving PDE alternatives. The framework integrates three components. First, knockoff feature statistics are constructed by coupling $\ell_{0}$-constrained adaptive best-subset selection with SHapley Additive exPlanations (SHAP), yielding an effective and computationally efficient difference statistic. Second, a recursive feature elimination (RFE) procedure removes terms whose marginal contributions are dispensable and assesses statistical necessity through knockoff-perturbed hypothesis testing. Third, the final model selection is formulated as a multi-criteria decision-making (MCDM) problem, where the optimal governing equation is the alternative that best balances a wide range of criteria such as predictive accuracy, model complexity and coefficient uncertainty. We evaluate KO-PDE-IDENT on five canonical PDEs under severe noise corruption. Empirical results show that our framework can exactly recover the true PDE structure, eliminating false discoveries while retaining all true underlying terms, with low coefficient estimation error.

URL PDF HTML ☆

赞 0 踩 0

2605.27478 2026-06-18 stat.ML cs.LG math.PR 版本更新

Triangular-Reference Schrödinger Bridges for Time Series Generation

三角参考薛定谔桥用于时间序列生成

Gabriele Bocchi

发表机构 * Arakne S.r.l.（阿拉克内公司）

AI总结提出三角参考薛定谔桥框架，通过区间冻结的退化扩散参考和层次化潜在波动率结构，实现时间序列的保守生成，并保持熵最小化的变分核心。

详情

AI中文摘要

我们引入了用于时间序列的三角参考薛定谔桥（TR-SBTS），这是SBTS框架的一种保守扩展，其中布朗参考被替换为区间冻结的、可能退化的扩散参考，在潜在波动率水平的层次上呈三角形。该构造是在增广状态空间上的单一熵投影，变分约束在时间和潜在水平上联合施加，并通过相对熵的分解层次展开。SBTS的变分核心得以保留：熵最小化器是参考的h-变换，在每个冻结区间上，最优动力学在活跃协方差方向的仿射叶上具有对数梯度漂移公式，即使冻结协方差是秩亏的也成立。我们建立了冻结近似的稳定性以及相应正则化核估计量的收敛性。该构造通过一个有限维条件映射实现，该映射由三种互补的过去约简组成——块PCR摘要、由运行时冻结协方差累积量诱导的过去增量的参考感知马氏核，以及在同一参考度量下的过去窗口WLS漂移回归器——以及一个耦合的状态-协方差桥步骤，其中每个潜在水平为上一水平产生动态参考，并由协方差描述符总结；该构造在数值实验上进行了评估。

英文摘要

Schrödinger bridges for time series (SBTS) generate synthetic paths by projecting, in relative entropy, a Brownian reference onto the path laws that match the joint distribution of the data on the observation grid. The Brownian reference, however, fixes the quadratic variation of the generated paths, which is restrictive when stochastic volatility, correlated noise, or rank-deficient covariance structures must be reproduced. We introduce "Triangular-Reference Schrödinger Bridges for Time Series" (TR-SBTS), which keeps the entropy-projection backbone of SBTS but replaces the Brownian reference by a triangular, volatility-informed, intervalwise frozen reference on a state augmented with latent covariance descriptors. The construction remains a single entropy projection on the augmented state: the minimiser is the $h$-transform of the reference, and on each frozen interval the optimal drift has the logarithmic-gradient form $b^\star(t,x)=A\,\nabla\log H(t,x)$, intrinsic to the active covariance directions when the frozen covariance $A$ is degenerate. We prove stability of the frozen approximation and consistency of the associated regularised kernel estimators, describe a reference-aware Nadaraya--Watson implementation of the conditional next-increment law, and evaluate the construction on numerical experiments.

URL PDF HTML ☆

赞 0 踩 0

2605.25929 2026-06-18 cs.MA cs.LG 版本更新

Multi-Agent Systems are Mixtures of Experts: Who Becomes an Influencer?

多智能体系统是专家混合：谁成为影响者？

Franka Bause, Jonas Niederle, Martin Pawelczyk, Rebekka Burkholz

发表机构 * CISPA Helmholtz Center for Information Security（CISPA海德堡信息安全中心）； Faculty of Computer Science, University of Vienna（维也纳大学计算机科学系）

AI总结本文通过Friedkin-Johnsen意见动力学模型分析多智能体LLM协商机制，揭示输入依赖的FJ参数使系统成为专家混合，并探讨基于自信度、感知自信度和初始观点对齐的影响者形成机制。

Comments Accepted at the 2nd Workshop on Compositional Learning at ICML 2026

2605.17986 2026-06-18 cs.CR cs.AI 版本更新

LivePI: More Realistic Benchmarking of Agents Against Indirect Prompt Injection

LivePI：更真实的智能体对抗间接提示注入基准测试

Lei Zhao, Abhay Bhaskar, Edgar Dobriban

发表机构 * University of Pennsylvania（宾夕法尼亚大学）

AI总结提出LivePI基准，覆盖7种输入表面、12种攻击/渲染家族和5种恶意目标，在真实虚拟机环境中评估多个AI智能体，发现攻击成功率10.7%-29.6%，并验证了两层防御的有效性。

详情

AI中文摘要

诸如OpenClaw之类的AI智能体越来越多地部署在本地工作流中，并能够访问外部工具。这带来了间接提示注入（IPI）风险：智能体可能会执行嵌入在不可信输入（如电子邮件、下载文件、网页、仓库或群聊消息）中的有害指令。现有的评估通常规模较小、完全模拟或仅关注狭窄的通道。我们引入了LivePI（实时提示注入），这是一个在生产类似但测试可控环境中的IPI风险结构化基准。LivePI覆盖了七个输入表面、十二个攻击/渲染家族和五个恶意目标，包括受保护信息窃取、未经授权的安全控制更改、不安全的代码检索或执行、收件箱摘要窃取以及加密货币转账。我们在一个真实的虚拟机上运行LivePI，该虚拟机具有实时但测试可控的电子邮件、聊天、网页、本地文件、仓库和钱包接口。在GPT-5.3-Codex、Claude Opus 4.6、Gemini 3.1 Pro、Kimi K2.5和GLM-5上，总攻击成功率范围为10.7%至29.6%。群聊注入在我们部署中评估的所有骨干模型上均成功，而仓库链接攻击尽管分母较小，仍产生了高严重性失败。我们还评估了一种由提示级过滤和执行前工具调用授权组成的两层防御。在GPT-5.3-Codex设置中，该防御在LivePI中拦截了所有测试的恶意目标完成，同时保留了PinchBench衍生工作负载上的良性效用。

英文摘要

AI agents such as OpenClaw are increasingly deployed in local workflows with access to external tools. This creates indirect prompt-injection (IPI) risk: an agent may execute harmful instructions embedded in untrusted inputs such as email, downloaded files, webpages, repositories, or group-chat messages. Existing evaluations are often small, purely simulated, or focused on a narrow set of channels. We introduce LivePI (Live Prompt Injection), a structured benchmark for IPI risk in a production-like but test-controlled environment. LivePI covers seven input surfaces, twelve attack/rendering families, and five malicious goals, including protected-information exfiltration, unauthorized security-control changes, unsafe code retrieval or execution, inbox-summary exfiltration, and cryptocurrency transfer. We run LivePI on a real virtual machine with live but test-controlled email, chat, web, local-file, repository, and wallet interfaces. Across GPT-5.3-Codex, Claude Opus 4.6, Gemini 3.1 Pro, Kimi K2.5, and GLM-5, total attack success rates range from 10.7% to 29.6%. Group-chat injection is uniformly successful across the evaluated backbones in our deployment, and repository-link attacks produce high-severity failures despite a small denominator. We also evaluate a two-layer defense consisting of prompt-level filtering and pre-execution tool-call authorization. In the GPT-5.3-Codex setting, the defense intercepts all tested malicious-goal completions in LivePI before execution while preserving benign utility on PinchBench-derived workloads.

URL PDF HTML ☆

赞 0 踩 0

2603.28707 2026-06-18 cs.CE cs.AI 版本更新

A Convex Route to Thermoelasticity: Learning Internal Energy and Dissipation

热力学的凸路径：学习内能和耗散

Hagen Holthusen, Paul Steinmann, Ellen Kuhl

发表机构 * Institute of Applied Mechanics, University of Erlangen-Nuremberg, Egerlandstra{\ss}e 5, 91058 Erlangen, Germany（埃尔兰根-纽伦堡应用力学研究所，埃尔兰根大学，德国）； Department of Mechanical Engineering, Stanford University, United States（机械工程系，斯坦福大学，美国）

AI总结提出基于物理的神经网络框架，通过输入凸神经网络表示内能和耗散势，自动满足热力学第二定律，实现全耦合热力学本构建模。

Comments 31 pages, 16 figures, 4 tables

详情

DOI: 10.1016/j.cma.2026.119082

AI中文摘要

我们提出了一个基于物理的神经网络框架，用于发现全耦合热力学中的本构模型。与基于亥姆霍兹能量的经典公式不同，我们采用内能和耗散势作为主要本构函数，以变形和熵为变量。这一选择避免了强制混合凸-凹条件，并促进了热力学原理的一致纳入。在本文中，我们关注没有优先方向或内变量的材料。尽管公式以熵表示，但温度被视为独立可观测量，熵通过本构关系内部推断，从而在不需要熵数据的情况下实现热力学一致建模。网络的热力学可接受性通过构造保证。内能和耗散势由输入凸神经网络表示，确保凸性和符合第二定律。客观性、材料对称性和归一化通过基于不变量的表示和零锚定公式直接嵌入架构中。我们在合成和实验数据集上展示了所提出框架的性能，包括纯热问题以及软组织和填充橡胶的全耦合热力学响应。结果表明，学习模型准确捕捉了潜在的本构行为。所有代码、数据和训练模型均通过 https://doi.org/10.5281/zenodo.19248596 公开提供。

英文摘要

We present a physics-based neural network framework for the discovery of constitutive models in fully coupled thermomechanics. In contrast to classical formulations based on the Helmholtz energy, we adopt the internal energy and a dissipation potential as primary constitutive functions, expressed in terms of deformation and entropy. This choice avoids the need to enforce mixed convexity--concavity conditions and facilitates a consistent incorporation of thermodynamic principles. In this contribution, we focus on materials without preferred directions or internal variables. While the formulation is posed in terms of entropy, the temperature is treated as the independent observable, and the entropy is inferred internally through the constitutive relation, enabling thermodynamically consistent modeling without requiring entropy data. Thermodynamic admissibility of the networks is guaranteed by construction. The internal energy and dissipation potential are represented by input convex neural networks, ensuring convexity and compliance with the second law. Objectivity, material symmetry, and normalization are embedded directly into the architecture through invariant-based representations and zero-anchored formulations. We demonstrate the performance of the proposed framework on synthetic and experimental datasets, including purely thermal problems and fully coupled thermomechanical responses of soft tissues and filled rubbers. The results show that the learned models accurately capture the underlying constitutive behavior. All code, data, and trained models are made publicly available via https://doi.org/10.5281/zenodo.19248596.

URL PDF HTML ☆

赞 0 踩 0

2601.12805 2026-06-18 q-bio.GN cs.AI cs.CL 版本更新

SciHorizon-GENE: Benchmarking LLM for Life Sciences Inference from Gene Knowledge to Functional Understanding

SciHorizon-GENE：从基因知识到功能理解的生命科学推理基准测试

Xiaohan Huang, Meng Xiao, Chuan Qin, Qingqing Long, Jinmiao Chen, Yuanchun Zhou, Hengshu Zhu

发表机构 * Computer Network Information Center, Chinese Academy of Sciences（中国科学院计算机网络信息中心）； University of the Chinese Academy of Sciences（中国科学院大学）； DUKE-NUS Medical School, National University of Singapore（新加坡国立大学杜克-新加坡医学学校）； Singapore Immunology Network, Agency for Science, Technology and Research（新加坡免疫网络，科技研究局）

AI总结针对大语言模型在基因级推理能力上的不足，构建了包含超过19万个人类基因和54万问题的基准SciHorizon-GENE，从研究关注敏感性、幻觉倾向、答案完整性和文献影响力四个生物学关键维度评估模型，揭示了模型在生成忠实、完整且基于文献的功能解释方面的持续挑战。

Comments Accepted by SIGKDD 2026. 12 pages

详情

AI中文摘要

大型语言模型（LLMs）在生物医学研究中展现出日益增长的潜力，尤其是在知识驱动的解释任务中。然而，它们从基因知识到功能理解的可靠推理能力——这是知识增强型细胞图谱解释的核心要求——仍然在很大程度上未被探索。为了填补这一空白，我们引入了SciHorizon-GENE，这是一个基于权威生物数据库构建的大规模基因中心基准。该基准整合了超过19万个人类基因的 curated 知识，包含超过54万个问题，涵盖了与细胞类型注释、功能解释和机制导向分析相关的多种基因到功能推理场景。受初步检查中观察到的行为模式启发，SciHorizon-GENE从四个生物学关键角度评估LLMs：研究关注敏感性、幻觉倾向、答案完整性和文献影响力，明确针对限制LLMs在生物解释管道中安全采用的失败模式。我们系统评估了多种最先进的通用和生物医学LLMs，揭示了基因级推理能力的显著异质性，以及在生成忠实、完整且基于文献的功能解释方面的持续挑战。我们的基准为在基因尺度上分析LLM行为建立了系统基础，并为模型选择和发展提供了见解，与知识增强型生物解释直接相关。

英文摘要

Large language models (LLMs) have shown growing promise in biomedical research, particularly for knowledge-driven interpretation tasks. However, their ability to reliably reason from gene-level knowledge to functional understanding, a core requirement for knowledge-enhanced cell atlas interpretation, remains largely underexplored. To address this gap, we introduce SciHorizon-GENE, a large-scale gene-centric benchmark constructed from authoritative biological databases. The benchmark integrates curated knowledge for over 190K human genes and comprises more than 540K questions covering diverse gene-to-function reasoning scenarios relevant to cell type annotation, functional interpretation, and mechanism-oriented analysis. Motivated by behavioral patterns observed in preliminary examinations, SciHorizon-GENE evaluates LLMs along four biologically critical perspectives: research attention sensitivity, hallucination tendency, answer completeness, and literature influence, explicitly targeting failure modes that limit the safe adoption of LLMs in biological interpretation pipelines. We systematically evaluate a wide range of state-of-the-art general-purpose and biomedical LLMs, revealing substantial heterogeneity in gene-level reasoning capabilities and persistent challenges in generating faithful, complete, and literature-grounded functional interpretations. Our benchmark establishes a systematic foundation for analyzing LLM behavior at the gene scale and offers insights for model selection and development, with direct relevance to knowledge-enhanced biological interpretation.

URL PDF HTML ☆

赞 0 踩 0

2605.22845 2026-06-18 cs.CE cs.LG 版本更新

A finite-element-inspired bipartite graph learned simulator for manufacturability assessment in large-deformation sheet forming

基于交叉注意力的二分图神经网络用于大变形板材成形中节点和单元场的耦合预测

Yingxue Zhao, Haoran Li, Haosu Zhou, Tobias Pfaff, Nan Li

发表机构 * Dyson School of Design Engineering（设计工程学院）； Imperial College London（帝国理工学院伦敦分校）； NVIDIA（NVIDIA公司）

AI总结提出交叉注意力二分图神经网络（CAtt-BiGNN），通过节点-单元二分图结构和边感知交叉注意力机制，实现大变形板材成形中节点位移增量和单元减薄量的耦合预测。

详情

AI中文摘要

大变形板材成形的有限元模拟涉及节点运动学与单元级变形度量之间的节点-单元耦合。机器学习代理可以加速此类模拟，但大多数基于图的模型使用以节点为中心的表示。这种表示对于单元级量是间接的，通常通过插值或后处理从节点预测中恢复。它也可能模糊有限元更新背后的节点-单元耦合结构。本文提出了一种基于交叉注意力的二分图神经网络（CAtt-BiGNN），用于节点位移增量和单元减薄量的耦合预测。该图将网格节点和单元表示为不同但相连的实体，通过有向节点-单元边连接，从而在它们本征的离散域上预测节点场和单元场。边感知交叉注意力处理器根据几何边特征自适应地调节节点-单元耦合权重，实现节点运动状态与单元变形状态之间的双向消息传递。层次化扩展CAtt-BiUGNN将CAtt-BiGNN与图下采样-上采样相结合，以改善在较大网格上的信息传播。进一步评估了自适应高斯噪声作为可选的展开稳定策略。模型在两个具有不同图尺寸的代表性成形案例上进行了测试。与以节点为中心的基线和二分消融变体相比，CAtt-BiGNN改善了位移和减薄预测之间的平衡，而CAtt-BiUGNN在较大图设置下给出了最强的整体性能。结果表明，所提出的模型为大变形板材成形提供了一个有效的代理框架。

英文摘要

Explicit dynamic finite element (FE) simulations are widely used for large deformation engineering analysis, but repeated simulations remain costly during design space exploration and optimisation. In explicit FE analysis, nodal kinematics and element level deformation measures evolve through coupled node element updates. This motivates graph learned simulators that approximate one step FE state transitions and roll them out autoregressively. However, many mesh based graph surrogates are node centred, which makes element level variables and native nodal elemental exchange less direct to represent. This work proposes CAttBiGNN, a cross attention based bipartite graph neural network for coupled nodal elemental learning. The graph represents FE mesh nodes and elements as distinct entities linked by directed node element edges, enabling nodal displacement increments and element level deformation states to be predicted on their native discretisation domains. An edge aware cross attention processor uses geometric edge embeddings to modulate directional node element message passing. For larger graphs, CAttBiUGNN combines the bipartite processor with graph downsampling and upsampling to improve long-range information propagation. The method is evaluated on dome shaped cold forming and corner shaped hot forming benchmarks. Comparisons with node centred baselines and bipartite and attention ablations show improved accuracy and balance in nodal displacement and elemental thinning prediction during autoregressive rollout. The results indicate that the proposed finite element inspired learned simulator can support manufacturability oriented field prediction and efficient design space exploration in large deformation sheet material forming.

URL PDF HTML ☆

赞 0 踩 0

AI 大模型

视觉与机器人

科学与医疗

TransitNet: A Compact Attention-Augmented Deep Learning Framework for Low-SNR Transit Blind Searches

Modeling Doppler Shifts in Radial-Velocity Data with Deep Learning toward Earth-mass Exoplanet Detection

A Bayesian Boolean Matrix Factorization with Application to Copy Number Analysis in Cancer

OmniPlan: An Adaptive Framework for Timely and Near-Optimal Network Planning Optimization

On the Memorization Behavior of LLMs in Generative Recommendation: Observations, Implications, and Training Strategies

Quantum Cinema: An Interactive Cinematic Exploration of Quantum Computing Hardware via Generative World Models

Comprehensive pKa Data Augmentation from Limited Real Data through an Engineered Models-Quantum Framework

Parameter-Efficient Adaptation of SAM 3 for Automated ITV Generation from 4DCT Images

Sensory Restoration via Brain-Computer Interfaces: A Unified 2 x 2 Framework and Convergence Roadmap

Running hardware-aware neural architecture search on embedded devices under 512MB of RAM

MeEvo: Metacognitive Evolution Combined with Natural Evolution for Automatic Heuristic Design

Graph Reinforcement Learning for Calibration-Aware Quantum Circuit Routing

SPARX: Secure and Privacy-Aware Approximate CNN Acceleration with Edge RISC-V SoC

From Privacy to Workflow Integrity: Communication-Graph Metadata in Autonomous Agent Interoperability

Predicting the Neutrino Mass Ordering Using Neural Networks

QSignAI: Quantum-Randomness-Seeded Identity Signatures at the Intersection of AI for Science and Science for AI

Integrating Multi-Label Classification and Generative AI for Scalable Analysis of User Feedback

TLA-Prover: Verifiable TLA+ Specification Synthesis via Preference-Optimized Low-Rank Adaptation

Knockoffs-based False Discovery Rate Control and Simplification for Deep Neural Networks

The New Social Image: How AI Competency and AI Proactivity Influence Self- and Peer-Perceptions in the Workplace

Latent-Conditioned Parameterized Quantum Circuits as Universal Approximators for Distributions over Quantum States

Practical Anonymous Two-Party Gradient Boosting Decision Tree

Can We Hear from Events? Generating Speech from Event Camera

Data-driven sparse identification of governing PDEs via knockoff filters and multi-criteria trade-offs

Triangular-Reference Schrödinger Bridges for Time Series Generation

Multi-Agent Systems are Mixtures of Experts: Who Becomes an Influencer?

LivePI: More Realistic Benchmarking of Agents Against Indirect Prompt Injection

A Convex Route to Thermoelasticity: Learning Internal Energy and Dissipation

SciHorizon-GENE: Benchmarking LLM for Life Sciences Inference from Gene Knowledge to Functional Understanding

A finite-element-inspired bipartite graph learned simulator for manufacturability assessment in large-deformation sheet forming