arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2329
2605.12286 2026-05-13 q-bio.GN cs.AI

Set-Aggregated Genome Embeddings for Microbiome Abundance Prediction

Younhun Kim, Georg K. Gerber, Travis E. Gibson

AI总结 该研究探讨了是否仅通过微生物群落成员的原始DNA序列即可预测其群落层面的丰度特征。研究提出了一种基于集合聚合基因组嵌入(SAGE)的方法,结合基因组语言模型(GLMs)的少样本学习能力,用于预测微生物群落的丰度分布。实验表明,该方法在新型基因组上的泛化能力优于传统生物信息学方法,并验证了群落层面潜在表示对性能提升的关键作用。

Comments 11 pages, 7 figures

详情
英文摘要

Microbiome functions are encoded within the genes of the community-wide metagenome. A natural question is whether properties of a microbial community can be predicted just from knowing the raw DNA sequences of its members. In this work, we employ set-aggregated genome embeddings (SAGE) to predict community-level abundance profiles, exploiting the few-shot learning capabilities of genomic language models (GLMs). We benchmark this approach to show improved generalization on novel genomes compared to classical bioinformatics approaches. Model ablation shows that community-level latent representations directly result in improved performance. Lastly, we demonstrate the benefits of intermediate transformations between latent representations and demonstrate the differences between GLM embedding choices.

2605.12280 2026-05-13 cs.SE cs.AI

Iterative Audit Convergence in LLM-Managed Multi-Agent Systems: A Case Study in Prompt Engineering Quality Assurance

Elias Calboreanu

AI总结 本文研究了在大型语言模型(LLM)管理的多智能体系统中,通过迭代审计实现规范收敛的问题,以AEGIS系统为案例,探讨提示规范的质量保证。研究采用由 Claude 子代理执行的检查表驱动审计方法,发现了51个提示规范一致性缺陷,并提出了七类缺陷的分类体系及编码规则。实验表明,随着审计范围的扩展,缺陷收敛呈现非单调变化,且单一文件审查无法发现所有问题,研究还提炼出一套可复现的审计协议。

Comments 13 pages, 3 figures, 6 tables. Companion preprint at arXiv:2604.05000. Submitted to MDPI Software, Special Issue on Software Reliability, Security and Quality Assurance

详情
英文摘要

Prompt specifications for multi-agent large language model (LLM) systems carry data contracts and integration logic across many interdependent files but are rarely subjected to structured-inspection rigor. This paper reports a single-system empirical case study of iterative, agent-driven auditing applied to AEGIS (Autonomous Engineering Governance and Intelligence System), a production seven-lane orchestration pipeline whose prompt-specification surface comprises approximately 7150 lines: 6907 across seven lane PROMPT.md files and a 245-line shared Ticket Contract. Nine sequential audit rounds, executed by Claude sub-agents using a checklist-driven walkthrough adapted from Weinberg and Freedman, surfaced 51 prompt-specification consistency defects, distinct from the 51 STRIDE-categorized adversarial code findings reported in the companion preprint. Per-round counts were 15, 8, 12, 2, 8, 1, 4, 1, and 0. We report a seven-category post-hoc defect taxonomy with explicit coding rules, observed non-monotonic convergence consistent with cascading edits and audit-scope expansion, and an audit protocol distilled from the study, with the final locked checklist released as a reproducibility appendix. Single-file review missed defect classes that were surfaced only by later expanded-scope rounds in this system. The same LLM family authored and audited the specifications; replication with dissimilar models and human reviewers is required before generalization.

2605.12264 2026-05-13 cs.CR cs.CL cs.LG

Reconstruction of Personally Identifiable Information from Supervised Finetuned Models

Sae Furukawa, Alina Oprea

AI总结 本文首次研究了从监督微调(SFT)模型中重建个人身份信息(PII)的问题。作者构建了包含PII的医疗和法律场景下的多轮问答数据集,用于评估模型在微调过程中可能泄露的隐私信息。研究提出了一种名为COVA的新解码算法,在前缀攻击下显著优于现有方法,实验表明即使攻击者仅掌握部分微调数据知识,也能有效重建PII,且不同类型PII的泄露程度存在显著差异。

详情
英文摘要

Supervised Finetuning (SFT) has become one of the primary methods for adapting a large language model (LLM) with extensive pre-trained knowledge to domain-specific, instruction-following tasks. SFT datasets, composed of instruction-response pairs, often include user-provided information that may contain sensitive data such as personally identifiable information (PII), raising privacy concerns. This paper studies the problem of PII reconstruction from SFT models for the first time. We construct multi-turn, user-centric Q&A datasets in sensitive domains, specifically medical and legal settings, that incorporate PII to enable realistic evaluation of leakage. Using these datasets, we evaluate the extent to which an adversary, with varying levels of knowledge about the fine-tuning dataset, can infer sensitive information about individuals whose data was used during SFT. In the reconstruction setting, we propose COVA, a novel decoding algorithm to reconstruct PII under prefix-based attacks, consistently outperforming existing extraction methods. Our results show that even partial attacker knowledge can significantly improve reconstruction success, while leakage varies substantially across PII types.

2605.12263 2026-05-13 cs.DL cs.AI

Reconnecting Fragmented Citation Networks with Semantic Augmentation

Vu Thi Huong, Annika Buchholz, Imene Khebouri, Thorsten Koch, Tim Kunt, Wolfgang Peters-Kottig, Tomasz Stompor, Janina Zittel

AI总结 本文研究了如何通过语义增强方法修复科学文献引用网络中的碎片化问题。作者提出了一种结合引用拓扑结构和基于大语言模型的文本相似度的高效混合框架,通过添加语义边和调整现有引用权重来增强原始引用网络。该方法在保持学科同质性的同时显著减少了网络碎片,并在大规模数据集上表现出良好的扩展性,为改进基于引用的科学评价指标提供了实用策略。

Comments 11 pages, 4 figures, 3 tables

详情
英文摘要

Citation graphs are fundamental tools for modeling scientific structure, but are often fragmented due to missing citations of scientifically connected articles. To address this issue, we propose a computationally efficient hybrid framework integrating citation topology with large language model (LLM)-based text similarity. Using 662,369 Web of Science publications in Mathematics and Operations Research & Management Science, we augment the original graph by adding semantic edges from small, disconnected components and weighting existing citations according to textual similarity. Semantic augmentation substantially reduces fragmentation while preserving disciplinary homogeneity. Compared to embedding-only clustering, cluster detection on augmented graphs using the Leiden algorithm retains structural interpretability while offering multi-scale organization. The method scales efficiently to large datasets and offers a practical strategy for strengthening citation-based indicators without collapsing disciplinary boundaries.

2605.12241 2026-05-13 eess.SP cs.AI cs.LG

Pretraining Strategies and Scaling for ECG Foundation Models: A Systematic Study

M A Al-Masud, Nils Strodthoff

AI总结 本文系统研究了心电图(ECG)基础模型的预训练策略及其规模扩展,评估了五种不同的自监督学习目标,并在最多1100万条公开数据上分析了模型性能随数据量增长的变化趋势。研究发现,对比预测编码(CPC)在多种临床任务中表现出最佳的迁移能力,且随着数据量增加,大多数目标的性能仍有显著提升。此外,研究还表明结构化状态空间模型在ECG表示学习中优于Transformer和CNN模型,其强归纳偏置可能是提升模型性能的关键因素。

Comments 59 pages, 16 figures, 59 Tables. Code available at https://anonymous.4open.science/r/ecg-pretraining-strategies-4DE3

详情
英文摘要

Specialized foundation models are beginning to emerge in various medical subdomains, but pretraining methodologies and parametric scaling with the size of the pretraining dataset are rarely assessed systematically and in a like-for-like manner. This work focuses on foundation models for electrocardiography (ECG) data, one of the most widely captured physiological time series world-wide. We present a comprehensive assessment of pretraining methodologies, covering five different contrastive and non-contrastive self-supervised learning objectives for ECG foundation models, and investigate their scaling behavior with pretraining dataset sizes up to 11M input samples, exclusively from publicly available sources. Pretraining strategy has a meaningful and consistent impact on downstream performance, with contrastive predictive coding (slightly ahead of JEPA) yielding the most transferable representations across diverse clinical tasks. Scaling pretraining data continues to yield meaningful improvements up to 11M samples for most objectives. We also compare model architectures across all pretraining methodologies and find evidence for a clear superiority of structured state space models compared to transformers and CNN models. We hypothesize that the strong inductive biases of structured state space models, rather than pretraining scale alone, are the primary driver of effective ECG representation learning, with important implications for future foundation model development in this and potentially other physiological signal domains.

2605.12239 2026-05-13 cs.PL cs.AI math.CT

Harness Engineering as Categorical Architecture

Bogdan Banu

AI总结 本文探讨了基于大语言模型的智能体系统中“代理框架”(harness)的设计问题,提出了一种基于范畴论的架构三元组(G, Know, Phi)作为形式化理论,用于描述和规范代理系统的组成、属性保持和跨框架比较。研究将代理外部化的四个核心要素——记忆、技能、协议和框架工程——映射到该架构的三个组成部分,并通过编译器验证结构保证的保持性。实验验证了该理论在多个实际框架中的适用性,并展示了其在质量驱动的智能体升级中的有效性。

详情
英文摘要

The agent harness, the system layer comprising prompts, tools, memory, and orchestration logic that surrounds the model, has emerged as the central engineering abstraction for LLMbased agents. Yet harness design remains ad hoc, with no formal theory governing composition, preservation of properties under compilation, or systematic comparison across frameworks. We show that the categorical Architecture triple (G, Know, Phi) from the ArchAgents framework provides exactly this formalization. The four pillars of agent externalization (Memory, Skills, Protocols, Harness Engineering) map onto the triple's components: Memory as coalgebraic state, Skills as operad-composed objects, Protocols as syntactic wiring G, and the full Harness as the Architecture itself. Structural guarantees-integrity gates, quality-based escalation, supported convergence checks-are Know-level certificates whose preservation is structural replay: our compiler checks identity and verifier replay, not output-layer correctness or model behavior. We validate this correspondence with a reference implementation featuring compiler functors targeting Swarms, DeerFlow, Ralph, Scion, and LangGraph: the four configuration compilers preserve three named certificate types by identity or replay, and LangGraph preserves the same certificates through its shared per-stage execution path. The LangGraph compiler creates one node per stage using the same per-stage method as the native runtime, providing LangGraph-native observability without reimplementing harness logic. An end-to-end escalation experiment with real LLM agents confirms that the quality-based escalation control path is model-parametric in this two-model, one-task experiment. The result positions categorical architecture as the formal theory behind harness engineering.

2605.12235 2026-05-13 stat.ML cs.LG

Optimal Policy Learning under Budget and Coverage Constraints

Giovanni Cerulli

AI总结 本文研究在预算和最低覆盖约束下的最优策略学习问题,揭示了该问题具有类似于背包问题的结构,并证明最优策略可通过结合预算和覆盖影子价格的线性阈值规则来刻画。研究还表明其组合优化的线性规划松弛具有常数积分间隙,意味着离散分配与最优解在渐近情况下等价。基于此,作者提出了两种可实施的算法——贪心拉格朗日算法和排序-切割算法,并通过实验验证了它们在不同条件下的近似最优性能。

详情
英文摘要

We study optimal policy learning under combined budget and minimum coverage constraints. We show that the problem admits a knapsack-type structure and that the optimal policy can be characterized by an affine threshold rule involving both budget and coverage shadow prices. We establish that the linear programming relaxation of the combinatorial solution has an O(1) integrality gap, implying asymptotic equivalence with the optimal discrete allocation. Building on this result, we analyze two implementable approaches: a Greedy-Lagrangian (GLC) and a rank-and-cut (RC) algorithm. We show that the GLC closely approximates the optimal solution and achieves near-optimal performance in finite samples. By contrast, RC is approximately optimal whenever the coverage constraint is slack or costs are homogeneous, while misallocation arises only when cost heterogeneity interacts with a binding coverage constraint. Monte Carlo evidence supports these findings.

2605.12217 2026-05-13 cs.AR cs.AI

Heterogeneous SoC Integrating an Open-Source Recurrent SNN Accelerator for Neuromorphic Edge Computing on FPGA

Michelangelo Barocci, Vittorio Fra, Enrico Macii, Gianvito Urgese

AI总结 本文提出了一种异构系统级芯片(SoC),集成开源的循环脉冲神经网络(SNN)加速器ReckOn,旨在推动边缘端神经形态计算的发展。该设计结合了RISC-V开源微控制器X-HEEP和Zynq Ultrascale系统中的ARM处理器,通过在FPGA上实现ReckOn的物理版本,验证了其分类性能与实际硬件的一致性,并进一步评估了其在线学习能力,用于盲文数字数据集的分类任务。该研究为开放源码的神经形态硬件设计提供了一种灵活且成本较低的实现方案。

Comments Deep Learning meets Neuromorphic Hardware Workshop at ECML-PKDD 2024 Conference in Vilnius, Lithuania

详情
Journal ref
Machine Learning and Principles and Practice of Knowledge Discovery in Databases 3 (2026) 128-143
英文摘要

The growing popularity of Spiking Neural Networks (SNNs) and their applications has led to a significant fast-paced increase of neuromorphic architectures capable of mimicking the spike-based data processing typical of biological neurons. The efficient power consumption and parallel computing capabilities of the SNNs lead researchers towards the development of digital accelerators, which exploit such features to bring fast and low-power computation on edge devices. The spread of digital neuromorphic hardware however is slowed down by the prohibitive costs that the silicon tape out of circuits brings, that's why targeting Field Programmable Gate Arrays (FPGAs) could represent a viable alternative, offering a flexible and cost-effective platform for implementing digital neuromorphic systems and helping the spread of open-source hardware designs. In this work we present an heterogeneous System-on-Chip (SoC) where the operations of ReckOn, a Recurrent SNN accelerator, are managed through the integration with traditional processors. These include the RISC-V-based, open-source microcontroller X-HEEP and the ARM processor featured in Zynq Ultrascale systems. We validate our design by reproducing the classification results through the implementation on FPGA of the taped-out version of ReckOn in order to check the equivalence of the accuracy and the characteristics in terms of physical implementation. In a second set of experiments, we evaluate the online learning capability of the solution in classifying a subset of the Braille digit dataset recently used to compare neuromorphic frameworks and platforms.

2605.12201 2026-05-13 cs.SE cs.AI

Uncertainty Quantification for LLM-based Code Generation

Senrong Xu, Yuhao Tan, Yanke Zhou, Guangyuan Wu, Zenan Li, Yuan Yao, Taolue Chen, Feng Xu, Xiaoxing Ma

AI总结 本文研究了基于大语言模型(LLM)的代码生成任务中的不确定性量化问题,提出了一种名为RisCoSet的新方法。该方法通过多假设检验构建风险可控的预测集,能够在保证高置信度包含正确解的前提下,有效减少生成代码的冗余。实验表明,与现有方法相比,RisCoSet在多个LLM上均表现出更优的性能,最多可减少24.5%的代码移除量。

详情
英文摘要

Prediction sets provide a theoretically grounded framework for quantifying uncertainty in machine learning models. Adapting them to structured generation tasks, in particular, large language model (LLM) based code generation, remains a challenging problem. An existing attempt proposes PAC prediction sets but is limited by its strong monotonicity assumption on risk and single-label classification framework, which severely limits the space of candidate programs and cannot accommodate the multiple valid outputs inherent to code generation. To address these limitations, we propose an approach RisCoSet that leverages multiple hypothesis testing to construct risk-controlling predictions for LLM-based code generation. Given a trained code generation model, we produce a prediction set represented by a partial program, which is guaranteed to contain a correct solution with high confidence. Extensive experiments on three LLMs demonstrate the effectiveness of the proposed method. For instance, compared with the state-of-the-art, our method can significantly reduce the code removal by up to 24.5%, at the same level of risk.

2605.12194 2026-05-13 cond-mat.mtrl-sci cs.LG

Probing Non-Equilibrium Grain Boundary Dynamics with XPCS and Domain-Adaptive Machine Learning

Mouyang Cheng, Bowen Yu, Chu-Liang Fu, Nina Andrejevic, Matthias T. Agne, Riley Hanus, Qiwei Wan, Nathan C. Drucker, Thanh Nguyen, Andrei Fluerasu, Lutz Wiegart, Xiaoqian M Chen, Daniel Pajerowski, Yongqiang Cheng, Joshua J Turner, G. Jeffrey Snyder, Mingda Li

AI总结 该研究结合X射线光子相关谱(XPCS)与领域自适应机器学习方法,探索纳米晶材料中晶界在非平衡状态下的动态行为。通过温度和晶粒尺寸依赖的两时间XPCS测量,揭示了晶界弛豫过程在实验时间尺度上远未达到平衡的现象。研究提出了一种半监督学习框架,通过领域自适应表示对齐技术,将连续介质模拟中的物理参数标签迁移至实验XPCS数据,从而直接提取出晶界扩散率、刚度和有效浓度等关键动力学参数,为研究固体中非平衡缺陷运动提供了新的方法。

Comments 14 pages, 4 figures

详情
英文摘要

Grain-boundary (GB) dynamics control the stability, mechanical, and functional response of nanocrystalline materials, but direct experimental access to their slow non-equilibrium motion has been limited. Here we establish X-ray photon correlation spectroscopy (XPCS), combined with domain-adaptive machine learning, as a quantitative probe of GB dynamics. Temperature- and grain-size-dependent two-time XPCS measurements in nanocrystalline silicon reveal pronounced departures from time-translation invariance, showing that GB relaxation can remain far from equilibrium over experimental timescales. However, direct extraction of quantitative physical information from these high-dimensional, noisy fluctuation maps faces a significant challenge. To overcome this barrier, we develop a semi-supervised learning framework that transfers physical parameter labels from continuum simulations to unlabeled experimental XPCS maps through domain-adaptive representation alignment. This AI-augmented approach enables the extraction of key kinetic parameters, including bulk diffusivity, GB stiffness, and effective GB concentration, directly from experimental XPCS measurements. Our results show how machine learning can transform indirect fluctuation signals into quantitative materials dynamics, providing a general route to study non-equilibrium defect motion in solids.

2605.12190 2026-05-13 stat.ML cs.LG

Information-Theoretic Generalization Bounds for Sequential Decision Making

Futoshi Futami, Masahiro Fujisawa

AI总结 本文研究了序贯决策问题中的泛化界分析,针对在线学习、流式主动学习和多臂老虎机等场景,提出了一个序贯超样本框架。该方法通过分离学习者的过滤过程与用于幽灵坐标比较的证明扩展,引入了基于轮次选择器-损失信息项的序贯条件互信息(CMI)来控制泛化差距,并在适当方差条件下建立了伯恩斯坦型改进,提升了收敛速率。该方法适用于多种序贯决策场景,为算法依赖的泛化分析提供了新工具。

详情
英文摘要

Information-theoretic generalization bounds based on the supersample construction are a central tool for algorithm-dependent generalization analysis in the batch i.i.d.~setting. However, existing supersample conditional mutual information (CMI) bounds do not directly apply to sequential decision-making problems such as online learning, streaming active learning, and bandits, where data are revealed adaptively and the learner evolves along a causal trajectory. To address this limitation, we develop a sequential supersample framework that separates the learner filtration from a proof-side enlargement used for ghost-coordinate comparisons. Under a row-wise exchangeability assumption, the sequential generalization gap is controlled by sequential CMI, a sum of roundwise selector--loss information terms. We also establish a Bernstein-type refinement that yields faster rates under suitable variance conditions. The selector-SCMI proof strategy applies to online learning, streaming active learning with importance weighting, and stochastic multi-armed bandits.

2605.12180 2026-05-13 cs.IT cs.AI math.IT

A Deep Learning-based Receiver for Asynchronous Grant-Free Random Access in Control-to-Control Networks

Massimo Battaglioni, Edoardo Carnevali, Dania De Crescenzo, Enrico Testi, Marco Baldi, Enrico Paolini

AI总结 本文研究了室内共享无线信道中异步无授权控制到控制(C2C)通信系统中的接收机设计问题。每个通信节点发送包含可变长度LDPC编码数据的命令单元,并由起始序列和尾序列标识。由于异步接入,接收端观测到的是多个节点发送信号的叠加。本文提出了一种基于卷积神经网络的接收机架构,能够直接从接收信号中检测命令单元的边界,并利用LDPC译码的软信息和信道估计提升尾序列检测性能。仿真结果表明,该接收机在高负载和无协调条件下仍能实现可靠的包边界识别和低端到端丢包率。

Comments Submitted to IEEE Transactions on Communications

详情
英文摘要

In this paper, we study grant-free, asynchronous control-to-control (C2C) communications in an indoor scenario with a shared wireless channel. Each communication node transmits command units, each consisting of a variable-length low-density parity-check (LDPC)--coded payload preceded by a start sequence and followed by a tail sequence. Due to the asynchronous nature of the access, transmissions from different nodes are not aligned over time. As a result, each receiving controller observes the superposition of multiple command units transmitted by different nodes over a receiver-defined superframe interval. Each node transmits one or more replicas of the same command unit. We propose a receiver architecture in which the detection of command unit boundaries (start/tail sequences) is carried out by a single convolutional neural network (CNN) operating directly on the received signal. We show that, while start-sequence detection must rely only on the received waveform, tail-sequence detection can additionally exploit the soft information produced by the LDPC decoder, together with channel estimates. Finally, once commands units are successfully decoded, successive interference cancellation (SIC) can be applied. Simulation results demonstrate that the receiver we propose achieves reliable packet-boundary identification and a low end-to-end packet loss rate, even under uncoordinated and high-traffic operating conditions.

2605.12165 2026-05-13 physics.ins-det cs.LG physics.comp-ph

Machine Learning for neutron source distributions

Jose Ignacio Robledo, Norberto Schmidt, Klaus Lieutenant, Jingjing Li, Stefan Kesselheim, Paul Zakalek

AI总结 本文提出了一种基于概率生成模型的新方法,用于中子源分布的估计。该方法利用蒙特卡洛粒子列表进行训练,训练完成后模型可独立于原始数据进行高效、快速且无需额外内存的采样。研究对比了变分自编码器、归一化流、生成对抗网络和去噪扩散模型等多种生成模型,并与现有方法进行了比较,展示了概率生成模型在中子源分布建模中的可行性和优势。

Comments Under review at Machine Learning: Science & Technology

详情
英文摘要

In light of the recent advancements in machine learning, we propose a novel approach to neutron source distribution estimation through the utilisation of probabilistic generative models. The estimation is based on a Monte Carlo particle list, which is only required during the training stage of the machine learning model. Once the source distribution has been learned, the model is independent of the original particle list, allowing for further sampling in an efficient, rapid, and memory-costless manner. The performance of various generative models is evaluated, including a variational autoencoder, a normalizing flow, a generative adversarial network, and a denoising diffusion model. These approaches are then compared to existing source distribution estimations, and the advantages and disadvantages of each approach are discussed. The results demonstrate that source distributions can be modeled through the use of probabilistic generative models, which paves the way for further advancements in this field.

2605.12153 2026-05-13 cs.SE cs.AI

CIDR: A Large-Scale Industrial Source Code Dataset for Software Engineering Research

Vladislav Savenkov

AI总结 本文介绍了CIDR,一个通过与12家工业合作伙伴直接合作收集的大型工业源代码数据集,包含2440个软件仓库,涵盖138种编程语言,总代码量达3.73亿行,并附有结构化元数据。与现有基于开源平台的代码语料库不同,CIDR仅包含在正式数据共享协议下提供的专有生产代码,覆盖企业级Web与移动开发、金融科技和定制软件咨询等领域。该数据集经过多阶段处理流程,包括结构化合作伙伴接入、两阶段质量筛选和确定性匿名化处理,旨在支持代码智能、软件质量分析、代码语言模型预训练与微调、开发者行为研究以及智能体评估基准构建等方向的研究。

Comments 34 pages, 9 figures, 4 appendices. Dataset access: https://fermatix.ai/#Contact. Anonymization tool: https://github.com/Fermatix/repo-sanitizer. Metadata utility: https://github.com/Fermatix/repo_metadata_cli

详情
英文摘要

We present Curated Industrial Developer Repository (CIDR), a large-scale dataset of real-world software repositories collected through direct collaboration with 12 industrial partner organizations. The dataset comprises 2,440 repositories spanning 138 programming languages and totalling 373 million lines of code, accompanied by structured per-repository metadata. Unlike existing code corpora derived from public open-source platforms, CIDR consists exclusively of proprietary production codebases contributed under formal data sharing agreements, covering application domains including enterprise web and mobile development, fintech, and custom software consultancy. All repositories were processed through a multi-stage pipeline encompassing structured partner onboarding, two-stage quality selection combining automated metadata filtering with manual code review, and a deterministic anonymization pipeline covering the full version control history. The dataset is intended to support research in code intelligence, software quality analysis, pre-training and fine-tuning of code language models, developer behaviour studies, and construction of agent evaluation benchmarks. Access is provided under a restricted commercial license; details are available at https://fermatix.ai/#Contact.

2605.12147 2026-05-13 cs.CR cs.LG

PrivacySIM: Evaluating LLM Simulation of User Privacy Behavior

James Flemings, Murali Annavaram

AI总结 本文提出PrivacySIM,用于评估大型语言模型(LLMs)在模拟用户隐私行为方面的表现。研究通过分析1000名用户的实际隐私决策数据,探讨用户人口统计信息、过往经验及隐私态度等特征对LLM模拟效果的影响。实验表明,基于用户画像的条件模拟能提升模型表现,但现有模型仍难以准确还原个体隐私决策,尤其对高AI使用但隐私态度不明确的用户模拟难度较大。PrivacySIM为评估和改进LLM隐私行为模拟能力提供了重要工具。

详情
英文摘要

Large language models (LLMs) are increasingly used to simulate human behavior, but their ability to simulate $individual$ privacy decisions is not well understood. In this paper, we address the problem of evaluating whether a core set of user persona attributes can drive LLMs to simulate individual-level privacy behavior. We introduce PrivacySIM, an evaluation suite that benchmarks LLM simulation of user privacy behavior against the ground-truth responses of 1,000 users. These users are drawn from five published user studies on privacy spanning LLM healthcare consultations, conversational agents, and chatbots. Drawing on these user studies, we hypothesize three persona facets as plausible predictors of privacy decision-making: demographics, previous experiences, and stated privacy attitudes. We condition nine frontier LLMs on subsets of these three facets and measure how often each model's response to a data-sharing scenario matches the user's actual response. Our findings show that (1) privacy persona conditioning consistently improves simulation quality over no-persona conditioning, but even the strongest model (40.4\% accuracy) remains far from faithfully simulating individual privacy decisions. (2) A user's stated privacy attitudes alone may not be the best predictor because they often diverge from the user's actual privacy behavior. (3) Users with high AI/chatbot experience but low stated privacy attitudes are the most challenging to simulate. PrivacySIM is a first step toward understanding and improving the capabilities of LLMs to simulate user privacy decisions. We release PrivacySIM to enable further evaluation of LLM privacy simulation.

2605.12129 2026-05-13 cs.SE cs.AI cs.OS

It's Not the Size: Harness Design Determines Operational Stability in Small Language Models

Yong-eun Cho

AI总结 本文研究了小语言模型(2-3B参数)的操作稳定性如何受“框架设计”影响,而非模型规模。通过对比三种不同框架条件(仅模型、最小外壳、四阶段流水线)在24个任务中的表现,发现四阶段流水线显著提升了任务成功率,尤其在Gemma4 E2B模型上达到了95.2%的任务成功率和100%的有效任务成功率。研究还揭示了框架缺失可能导致模型结构崩溃,并发现规划和恢复机制对性能提升贡献显著。

详情
英文摘要

This paper experimentally analyzes how the level of harness engineering affects the operational performance of small language models (SLMs, 2-3B parameters). Three harness conditions - model-only (raw prompt), minimal-shell (wrapper tags), and a 4-stage pipeline (plan->execute->verify->recover) - are applied to three models (Gemma4 E2B, Qwen3.5:2B, LLaMA 3.2 3B) across 24 tasks, comparing Task Success Rate (TSR) and Valid TSR (VTSR). The pipeline harness achieves TSR=0.952 and VTSR=1.000 on Gemma4 E2B (T1-T5, 21 tasks). A non-monotonic phenomenon - minimal-shell TSR < model-only TSR - is observed in two models. In LLaMA 3.2 3B model-only, seven format violations yield TSR=0.429, revealing scaffold collapse: the model abandons JSON structure under complex format requirements without harness support. Ablation shows planning and recovery each contribute approximately 24.7% of total gain. VCR (Verification Catch Rate)=0.625 across all pipeline runs.

2605.12078 2026-05-13 cs.SE cs.AI

Property-Level Reconstructability of Agent Decisions: An Anchor-Level Pilot Across Vendor SDK Adapter Regimes

Oleg Solozobov

AI总结 该研究探讨了智能体决策在不同供应商SDK适配环境下的可重构性问题,旨在评估决策过程的可追溯程度。研究采用未修改的决策轨迹重构器,对六个公共SDK体系中的固定示例进行分析,按属性分类判断其可填充程度。结果表明,不同体系下决策属性的可重构性存在显著差异,揭示了在治理完整性方面存在的多层级差距,为跨体系的智能体行为分析提供了新的评估框架。

Comments 23 pages, 3 tables; reproducibility package: https://doi.org/10.5281/zenodo.20077961; GitHub: https://github.com/agent-runtime-evidence/anchor-level-reconstructability-pilot

详情
英文摘要

Agentic AI failures need post-hoc reconstruction: what the agent did, on whose authority, against which policy, and from what reasoning. Cross-regime feasibility remains unmeasured under one property-level schema. We apply the Decision Trace Reconstructor unmodified to pinned worked-example anchors from six public vendor SDK regimes spanning cloud-agent, observability, tool-use, telemetry, and protocol traces, plus two comparator columns. Each Decision Event Schema (DES) property is classified as fully fillable, partially fillable, structurally unfillable, or opaque. Per-property reconstructability of an agent decision already varies between regimes at this anchor scale. Strict-governance-completeness separates into three tiers ranging from 42.9% to 85.7%, yielding one regime-independent gap (reasoning trace), four regime-dependent gaps, and one Mixed property; the pilot is single-annotator, one anchor per cell, descriptive, with outputs checksum-verifiable from a deposited reproducibility package.

2605.12075 2026-05-13 cs.CR cs.AI

The Deepfakes We Missed: We Built Detectors for a Threat That Didn't Arrive

Shaina Raza

AI总结 近年来,深度伪造(deepfake)检测的研究主要围绕2017至2019年间提出的威胁模型展开,重点关注公众人物的面部替换和语音操控等大规模虚假信息风险。然而,2022年至2026年的实际案例显示,当前主要威胁已转变为非自愿亲密影像、语音克隆诈骗和情感操控欺诈等新型问题。本文指出,研究方向与现实威胁的脱节已成为深度伪造防御的主要瓶颈,并呼吁学界重新调整研究重点,以应对当前日益增长的实际危害。

详情
英文摘要

Nearly a decade of Machine Learning (ML) research on deepfake detection has been organized around a threat model inherited from 2017--2019, revolving around face-swap and talking-head manipulation of public figures, motivated by concerns about large-scale misinformation and video-evidence fraud. This position paper argues that the threat the field prepared for did not arrive, and the threats that did arrive are substantially different. An accounting of deepfake incidents in 2022--2026 shows that the dominant observed harms are peer-generated Non-Consensual Intimate Imagery (NCII), voice-clone scam calls targeting families and finance workers, and emotional-manipulation fraud. The predicted large-scale public-figure deepfake catastrophe did not materialize during the 2024 global information environment despite extensive preparation. Meanwhile, research effort, benchmarks, and detection methods remain concentrated on the inherited threat model. The central claim of this paper is that this misalignment is now the dominant bottleneck on real-world deepfake defense, not model capability. We argue the ML research community should substantially rebalance its research agenda toward the harm categories that are actually growing. We support this position with empirical accounting of research effort and harm distribution, identify the structural reasons the misalignment persists, and outline three concrete technical research agendas for the under-defended harm categories.

2605.12073 2026-05-13 cs.CC cs.AI

Clausal Deletion Backdoors for QBF: a Parameterized Complexity Approach

Leif Eriksson, Victor Lagerkvist, Sebastian Ordyniak, George Osipov, Fahad Panolan, Mateusz Rychlicki

AI总结 该论文研究了量化布尔公式(QBF)的可满足性问题,提出了一种新的参数化复杂性方法,基于“子句删除后门”(CC-backdoor)的大小来分析求解效率。作者考虑了三个经典的易解QBF子类——Horn、2-CNF和线性方程,并证明除了Horn类外,其余两类在给定CC-backdoor大小为$k$时具有固定参数可解性(FPT)。研究揭示了QBF参数化复杂性中的关键区分点,并展示了不同求解技术在该框架下的应用潜力。

详情
英文摘要

Determining the validity of a quantified Boolean formula (QBF) is a PSPACE-complete problem with rich expressive power. Despite interest in efficient solvers, there is, compared to problems in NP, a lack of positive theoretical results, and in the parameterized complexity setting one often has to restrict the quantifier prefix (e.g., bounding alternations) to obtain fixed parameter tractability (FPT). We propose a new parameter: the number of variables in clauses that has to be removed before reaching a tractable class (a clause covering (CC) backdoor). We are then interested in solving QBF in FPT time given a CC-backdoor of size $k$. We consider the three classical, tractable cases of QBF as base classes: Horn, 2-CNF, and linear equations. We establish W[1]-hardness for Horn but prove FPT for the others, and prove that in a precise, algebraic sense, we are only missing one important case for a full dichotomy. Our algorithms are non-trivial and depend on propagation, and Gaussian elimination, respectively, and are comparably unexplored for QBF.

2605.12059 2026-05-13 cs.HC cs.RO

RoboBlockly Studio: Conversational Block Programming with Embodied Robot Feedback for Computational Thinking

Leyi Li, Chenyu Du, Jiafei Sun, Erick Purwanto, Qing Zhang

AI总结 本文介绍了一款名为 RoboBlockly Studio 的交互式编程学习系统,旨在通过结合积木式编程、对话式AI教学代理和实体机器人执行,提升学生计算思维能力。该系统通过编程、运行、观察和修改的紧密循环,帮助学习者更好地理解程序逻辑与实际效果之间的联系。研究基于对编程教师的访谈设计,支持学习者自主性、程序行为的透明性、课堂任务的具身化以及通过AI对话引导反思等目标,并通过与高中生的实际应用验证了其有效性。

Comments Accepted to ACM DIS 2026. Camera-ready version

详情
英文摘要

Computational thinking (CT) is increasingly promoted as a core literacy, yet learners and teachers face challenges in connecting abstract program logic to meaningful outcomes. We design and evaluate RoboBlockly Studio, an integrated interactive system that combines block-based programming, a conversational AI teaching agent, and embodied robot execution. RoboBlockly Studio creates a tight iterative loop of authoring, running, observing, and revising. Informed by interviews with five programming teachers, the system was designed to support four goals: (1) preserving learner agency in computational thinking, (2) making program behavior transparent and interpretable, (3) grounding programming in embodied, classroom-aligned tasks, and (4) scaffolding reflection through pedagogically grounded AI dialogue. We deployed RoboBlockly Studio with 32 high school students, observing how robot and AI feedback influenced students' interactions with code, reflections on problem-solving strategies, and understanding of CT concepts. We discuss design insights and implications for creating interactive, embodied learning environments that integrate AI and robotics to support CT learning in computing education.

2605.12046 2026-05-13 quant-ph cs.AI cs.LG

Rethink the Role of Neural Decoders in Quantum Error Correction

Ge Yan, Shanchuan Li, Yuxuan Du

AI总结 本文重新审视了神经解码器在量子纠错中的作用,针对表面码解码问题,在明确的精度与延迟约束下,对多种神经解码器架构进行了统一与改进,并开发了端到端压缩流程以评估其在FPGA硬件上的部署性能。研究发现,短期内解码性能更依赖于数据规模而非架构复杂度,适当的归纳偏置对实现高精度至关重要,且INT4量化是满足微秒级延迟需求的必要条件,为可扩展的实时神经量子纠错解码提供了具体指导。

Comments Accepted to ICML 2026; 33 Pages, 9 figures

详情
英文摘要

Quantum error correction (QEC) is essential for enabling quantum advantages, with decoding as a central algorithmic primitive. Owing to its importance and intrinsic difficulty, substantial effort has been made to QEC decoder design, among which neural decoders have recently emerged as a promising data-driven paradigm. Despite this progress, practical deployment remains hindered by a fundamental accuracy-latency tradeoff, often on the microsecond timescale. To address this challenge, here we revisit neural decoders for surface-code decoding under explicit accuracy-latency constraints, considering code distances up to d=9 (161 physical qubits). We unify and redesign representative neural decoders into five architectural paradigms and develop an end-to-end compression pipeline to evaluate their deployability and performance on FPGA hardware. Through systematic experiments, we reveal several previously underexplored insights: (i) near-term decoding performance is driven more by data scale than architectural complexity; (ii) appropriate inductive bias is essential for achieving high decoding accuracy; and (iii) INT4 quantization is a prerequisite for meeting microsecond-scale latency requirements on FPGAs. Together, these findings provide concrete guidance toward scalable and real-time neural QEC decoding.

2605.12001 2026-05-13 cs.IT cs.AI math.IT

CR^2: Cost-Aware Risk-Controlled Routing for Wireless Device-Edge LLM Inference

Nan Xue, Shengkang Chen, Zhiyong Chen, Jiangchao Yao, Yaping Sun, Zixia Hu, Meixia Tao

AI总结 随着大语言模型(LLM)从集中式云平台向移动边缘环境迁移,如何在有限的设备-边缘资源下高效平衡延迟、能耗与精度成为关键问题。本文提出CR²,一种面向无线设备-边缘环境的成本感知风险控制路由框架,通过解耦设备端的轻量边缘门和边缘端的效用选择器,实现对查询的延迟路由决策。CR²引入了符合风险控制校准方法,能够在有限信息下显式控制决策风险,并在实验中表现出优于现有方法的精度-成本帕累托前沿性能。

Comments submitted to IEEE Journal

详情
英文摘要

As large language models (LLMs) move from centralized clouds to mobile edge environments, efficient serving must balance latency, energy consumption, and accuracy under constrained device-edge resources. Query-level routing between lightweight on-device models and stronger edge models provides a flexible mechanism to navigate this trade-off. However, existing routers are designed for centralized cloud settings and optimize token-level costs, failing to capture the dynamic latency and energy overheads in wireless edge deployments. In this paper, we formulate mobile edge LLM routing as a deployment-constrained, cost-aware decision problem, and propose CR^2, a two-stage device-edge routing framework. CR^2 decouples a lightweight on-device margin gate from an edge-side utility selector for deferred queries. The margin gate operates on frozen query embeddings and a user-specified cost weight to predict whether local execution is utility-optimal relative to the best edge alternative under the target operating point. We further introduce a conformal risk control (CRC) calibration procedure that maps each operating point to an acceptance threshold, enabling explicit control of the marginal false-acceptance risk under the full-information utility reference. Experiments on the routing task show that CR^2 closely matches a full-information reference router using only device-side signals before deferral. Compared with strong query-level baselines, CR^2 consistently improves the deployable accuracy-cost Pareto frontier and reduces normalized deployment cost by up to 16.9% at matched accuracy.

2605.11999 2026-05-13 cs.DC cs.AI cs.LG cs.PF

The Illusion of Power Capping in LLM Decode: A Phase-Aware Energy Characterisation Across Attention Architectures

Bole Ma, Ayesha Afzal, Jan Eitzinger, Gerhard Wellein

AI总结 本文研究了在大语言模型推理过程中,功率限制(Power Capping)在实际应用中的效果问题,发现其在主流的自回归解码阶段效果并不明显。通过在多种注意力架构上进行能效分析,作者指出解码阶段主要受限于内存带宽而非计算能力,导致功率限制机制无法触发。研究提出通过时钟锁定(SM clock locking)替代功率限制,能够更有效地优化能效,在保持吞吐量损失最小的前提下,提升解码阶段的能源效率,并揭示了不同架构下的动态电压频率调节(DVFS)行为模式。

详情
英文摘要

Power capping is the standard GPU energy lever in LLM serving, and it appears to work: throughput drops, power readings fall, and energy budgets are met. We show the appearance is illusory for the phase that dominates production serving: autoregressive decode. Across four attention paradigms -- GQA, MLA, Gated DeltaNet, and Mamba2 -- on NVIDIA H200, decode draws only 137--300\,W on a 700\,W GPU; no cap ever triggers, because memory-bound decode saturates HBM bandwidth rather than compute and leaves power headroom untouched. Firmware-initiated clock throttling compounds the illusion: these deviations can corrupt any throughput measurement that attributes them to the cap. SM clock locking dissolves both confounds. By targeting the lever that is actually on the critical path, clock locking Pareto-dominates power capping universally, recovering up to 32\% of decode energy at minimal throughput loss. We identify three architecture-dependent DVFS behavioural classes and characterise a common energy pattern across novel attention replacements: a heavy prefill cost recouped by efficient decode, eventually halving total request energy relative to GQA at production batch sizes.

2605.11981 2026-05-13 physics.flu-dyn cs.AI

High-lift Wing Separation Control via Bayesian Optimization and Deep Reinforcement Learning

Ricard Montalà, Bernat Font, Oriol Lehmkuhl, Ricardo Vinuesa, Ivette Rodriguez

AI总结 本研究利用壁面解析的大涡模拟方法,探讨了在雷诺数 $Re_c = 450,000$ 和攻角 $α = 23^\circ$ 下,通过合成射流对30P30N高升力翼型进行主动流动控制的问题。研究对比了开环贝叶斯优化和闭环深度强化学习两种优化策略,结果表明贝叶斯优化能有效提升气动效率,而深度强化学习由于奖励函数设计的限制,仅取得有限的改进。该工作为高雷诺数下基于深度强化学习的流动控制方法提供了重要的优化方向和实践经验。

详情
英文摘要

This study investigates active flow control (AFC) of a 30P30N high-lift wing at a Reynolds number Re$_c$ = 450,000 and angle of attack $α$ = 23$^\circ$ using wallresolved large-eddy simulations (LES). Two optimization strategies are explored: open-loop Bayesian optimization (BO) and closed-loop deep reinforcement learning (DRL), both targeting the mitigation of stall and the improvement of aerodynamic efficiency via synthetic jets on the slat, main, and flap elements. The uncontrolled configuration was validated against literature data, confirming the reliability of the LES setup. The BO framework successfully identified steady jet velocities that increased efficiency by +10.9% through a -9.7% drag reduction while maintaining lift. In contrast, the DRL agent, despite leveraging instantaneous flow information from distributed sensors, achieved only minor improvements in lift and drag, with negligible efficiency gain. Training analysis indicated that the penalty-dominated reward constrained exploration. These results highlight the need for carefully designed rewards and computational acceleration strategies in DRL-based flow control at high Reynolds numbers.

2605.11922 2026-05-13 cs.SE cs.CL

StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning

Hao Wang, Rui Li, Lei Sha, Jie M. Zhang

AI总结 现有的代码推理方法主要关注最终输出结果,忽视了中间推理过程,容易导致奖励黑客问题。为此,本文提出StepCodeReasoner框架,通过强化学习引入显式的中间执行状态监督,将代码推理转化为可验证的逐步执行建模问题。该方法在多个基准测试中表现出色,显著优于现有模型,在代码推理和生成任务中均取得提升。

详情
英文摘要

Existing code reasoning methods primarily supervise final code outputs, ignoring intermediate states, often leading to reward hacking where correct answers are obtained through inconsistent reasoning. We propose StepCodeReasoner, a framework that introduces explicit intermediate execution-state supervision. By automatically inserting structured print-based execution-trace anchors into code, the model is trained to predict runtime states at each step, transforming code reasoning into a verifiable, stepwise execution modeling problem. Building on this execution-aware method, we introduce Bi-Level GRPO, a reinforcement learning algorithm for structured credit assignment at two levels: inter-trajectory, comparing alternative execution paths, and intra-trajectory, rewarding intermediate accuracy based on its impact on downstream correctness. Extensive experiments demonstrate that StepCodeReasoner achieves SOTA performance in code reasoning. In particular, our 7B model achieves 91.1\% on CRUXEval and 86.5\% on LiveCodeBench, outperforming the CodeReasoner-7B baseline (86.0\% and 77.7\%) and GPT-4o (85.6\% and 75.1\%). Furthermore, on the execution-trace benchmark REval, our model scores 82.9\%, outperforming baseline CodeReasoner-7B (72.3\%), its 14B counterpart (81.1\%), and GPT-4o (77.3\%). Additionally, our approach also improves code generation performance, demonstrating that explicit execution modeling enhances both code reasoning and code generation.

2605.11901 2026-05-13 cs.CR cs.AI

AccLock: Unlocking Identity with Heartbeat Using In-Ear Accelerometers

Lei Wang, Jiangxuan Shen, Xi Zhang, Dalin Zhang, Jingyu Li, Haipeng Dai, Chenren Xu, Daqing Zhang, He Huang

AI总结 本文提出了一种基于耳内加速度计的被动身份认证系统 AccLock,通过提取耳内血压波(BCG)信号的独特特征实现无需用户主动参与的高安全性身份验证。该系统采用两阶段去噪方案和基于解耦的深度学习模型 HIDNet 提取用户特定特征,并结合 Siamese 网络构建可扩展的认证框架,有效提升了环境噪声下的鲁棒性和实用性。实验表明,AccLock 在 33 名参与者中实现了平均误拒率(FAR)3.13% 和误接受率(FRR)2.99%,验证了其实际可行性。

详情
英文摘要

The widespread use of earphones has enabled various sensing applications, including activity recognition, health monitoring, and context-aware computing. Among these, earphone-based user authentication has become a key technique by leveraging unique biometric features. However, existing earphone-based authentication systems face key limitations: they either require explicit user interaction or active speaker output, or suffer from poor accessibility and vulnerability to environmental noise, which hinders large-scale deployment. In this paper, we propose a passive authentication system, called AccLock, which leverages distinctive features extracted from in-ear BCG signals to enable secure and unobtrusive user verification. Our system offers several advantages over previous systems, including zero-involvement for both the device and the user, ubiquitous, and resilient to environmental noise. To realize this, we first design a two-stage denoising scheme to suppress both inherent and sporadic interference. To extract user-specific features, we then propose a disentanglement-based deep learning model, HIDNet, which explicitly separates user-specific features from shared nuisance components. Lastly, we develop a scalable authentication framework based on a Siamese network that eliminates the need for per-user classifier training. We conduct extensive experiments with 33 participants, achieving an average FAR of 3.13% and FRR of 2.99%, which demonstrates the practical feasibility of AccLock.

2605.11891 2026-05-13 cs.CR cs.AI

Proteus: A Self-Evolving Red Team for Agent Skill Ecosystems

Zhaojiacheng Zhou

AI总结 该研究提出了一种名为Proteus的自我进化的红队框架,用于评估基于技能的智能体生态系统中的安全风险。面对第三方技能可能在部署后通过迭代修改绕过审核并造成运行时危害的问题,Proteus通过模拟攻击者的行为,在形式化的五维攻击空间中搜索潜在威胁,并利用审核反馈进行跨轮次的技能变异与优化。实验表明,Proteus在多个测试场景中表现出较高的攻击成功率,揭示了当前技能审核机制在应对自适应攻击时存在显著的漏检风险。

详情
英文摘要

Agent skills extend LLM agents with reusable instructions, tool interfaces, and executable code, and users increasingly install third-party skills from marketplaces, repositories, and community channels. Because a skill exposes both executable behavior and context-setting documentation, its deployment risk cannot be measured by single-shot audits or prompt-level red teams alone: a realistic attacker can use audit and runtime feedback to repeatedly rewrite the skill. We frame this risk as \emph{adaptive leakage} -- whether a budgeted attacker can iteratively revise a skill until it passes audit and produces verified runtime harm -- and present \ours{}, a grey-box self-evolving red-team framework for measuring it. Proteus searches a formalized five-axis skill-attack space. Each candidate is evaluated through a unified audit-sandbox-oracle pipeline that returns structured audit findings and runtime evidence to guide cross-round mutation. Beyond initial evasion, Proteus performs path expansion, which finds alternative implementations of successful attacks, and surface expansion, which transfers learned implementation patterns to new attack objectives beyond the original seed catalogue. Across eight phase-1 cells, Proteus reaches 40--90\% Attack Success Rate at $5$ rounds (ASR@5) with positive learning-curve slopes on both evaluated auditors. Phase-2 path/surface expansion produces 438 jointly bypassing and lethal variants, with SkillVetter bypassed at $\geq 93\%$ in every cell and AI-Infra-Guard, the strongest public auditor we evaluate, still admitting up to 41.3\% joint-success. These results show that current skill vetting substantially underestimates residual risk when evaluated against adaptive, feedback-driven attackers.

2605.11875 2026-05-13 eess.SP cs.AI

Modulation Consistency-based Contrastive Learning for Self-Supervised Automatic Modulation Classification

Chenxu Wang, Shuang Wang, Lirong Han, Xinyu Hu, Hanlin Mo, Hantong Xing, Licheng Jiao

AI总结 本文针对自动调制分类(AMC)任务中自监督学习方法依赖任务无关预训练目标、导致表征受干扰因素影响的问题,提出了一种基于调制一致性的对比学习框架Mod-CL。该方法利用同一信号不同时间片段之间调制类型一致但波形不同的特性,构建正样本对以学习共享的调制信息并抑制干扰因素。实验表明,Mod-CL在多个RadioML数据集上显著优于现有方法,尤其在标签稀缺场景下表现出色。

详情
英文摘要

Deep learning-based AMC methods have achieved remarkable performance, but their practical deployment remains constrained by the high cost of labeled data. Although self-supervised learning (SSL) reduces the reliance on labels, existing SSL-based AMC methods often rely on task-agnostic pretext objectives misaligned with modulation classification, leading to representations entangled with nuisance factors such as symbol, channel, and noise. In this paper, we identify intra-instance modulation consistency as a task-aware structural prior, whereby different temporal segments of the same signal may differ in waveform while preserving the same modulation type, thus providing a principled cue for task-aligned self-supervision. Based on this prior, we propose Mod-CL, a Modulation consistency-based Contrastive Learning framework that constructs positive pairs from different temporal segments of the same signal instance, to encourage the model to learn shared modulation information while suppressing nuisance variations. We further develop a contrastive objective tailored to Mod-CL, which jointly exploits temporal segmentation and data augmentation to pull together views sharing the same modulation semantics while avoiding supervisory conflicts within each signal instance. Extensive experiments on RadioML datasets show that Mod-CL consistently outperforms strong baselines, especially in low-label regimes, achieving substantial improvements in linear probing accuracy.

2605.11868 2026-05-13 cs.CR cs.AI

IPI-proxy: An Intercepting Proxy for Red-Teaming Web-Browsing AI Agents Against Indirect Prompt Injection

Chia-Pei, Chen, Kentaroh Toyoda, Anita Lai, Alex Leung

AI总结 本文提出IPI-proxy,一个用于对抗间接提示注入(IPI)的开源拦截代理工具,旨在评估和增强浏览网页的AI代理的安全性。该工具通过实时修改白名单域名的HTTP响应,嵌入从多个基准库中提取的攻击载荷,支持多种嵌入方式和位置参数化配置,实现无需模拟页面的参数扫描测试。IPI-proxy填补了现有红队工具在真实部署环境中测试IPI漏洞的空白,为AI安全团队提供了一种可复现的测试平台。

Comments code: https://github.com/VulcanLab/IPI-Proxy/

详情
英文摘要

Web-browsing AI agents are increasingly deployed in enterprise settings under strict whitelists of approved domains, yet adversaries can still influence them by embedding hidden instructions in the HTML pages those domains serve. Existing red-teaming resources fall short of this scenario: prompt-injection benchmarks ship pre-built adversarial pages that whitelisted agents cannot reach, and generic LLM scanners probe the model API rather than its retrieved content. We present IPI-proxy, an open-source toolkit for red-teaming web-browsing agents against indirect prompt injection (IPI). At its core is an intercepting proxy that rewrites real HTTP responses from whitelisted domains in flight, embedding payloads drawn from a unified library of 820 deduplicated attack strings extracted from six published benchmarks (BIPIA, InjecAgent, AgentDojo, Tensor Trust, WASP, and LLMail-Inject). A YAML-driven test harness independently parameterizes the payload set, the embedding technique (HTML comment, invisible CSS, or LLM-generated semantic prose), and the HTML insertion point (6 locations from \icode{head\_meta} to \icode{script\_comment}), enabling parameter-sweep evaluation without mock pages or sandboxed environments. A companion exfiltration tracker logs successful callbacks. This paper describes the threat model, situates IPI-proxy among contemporary IPI benchmarks and red-teaming tools, and details its architecture, design decisions, and configuration interface. By bridging static benchmarks and live deployment, IPI-proxy gives AI security teams a reproducible substrate for measuring and hardening web-browsing agents against indirect prompt injection on the same retrieval surface attackers exploit in production.

2605.11865 2026-05-13 stat.ML cs.LG

Variance-aware Reward Modeling with Anchor Guidance

Shuxing Fang, Ruijian Han, Liangyu Zhang, Fan Zhou

AI总结 本文研究了在人类偏好多样化的情况下,如何改进奖励模型以更准确地反映偏好不确定性。提出了一种基于锚点引导的方差感知奖励建模方法,通过引入两个粗粒度的响应级锚点标签,解决了高斯奖励模型在仅依赖成对偏好数据时的基本不可识别性问题。该方法在理论分析和多个实际数据集上均表现出优越的奖励建模性能和强化学习效果。

详情
英文摘要

Standard Bradley--Terry (BT) reward models are limited when human preferences are pluralistic. Although soft preference labels preserve disagreement information, BT can only express it by shrinking reward margins. Gaussian reward models provide an alternative by jointly predicting a reward mean and a reward variance, but suffer from a fundamental non-identifiability from pairwise preferences alone. We propose Anchor-guided Variance-aware Reward Modeling, a framework that resolves this non-identifiability by augmenting preference data with two coarse response-level anchor labels. Building on this, we prove that two anchors are sufficient for identification, develop a joint training objective and establish a non-asymptotic convergence rate for both the estimated reward mean and variance functions. Across simulation studies and four real-world diverging-preference datasets, our method consistently improves reward modeling performance and downstream RLHF, including PPO training and best-of-$N$ selection.