arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2136
专题追踪
2606.10660 2026-06-10 cs.CY cs.AI 新提交

Accounting for AI Inference in Corporate GHG Inventories: A Four-Tier Methodology for Scope 3 Category 1 Reporting

企业温室气体清单中AI推理的核算:范围3类别1报告的四层方法

Guillermo Llopis

发表机构 * SOMA AI

AI总结 针对CSRD要求下AI推理服务在范围3类别1中缺乏标准核算方法的问题,提出基于token物理估算的四层框架,通过GPU能耗基准和区域电网碳强度精确估算排放,并揭示水碳权衡。

Comments Preprint. Data repository: https://doi.org/10.5281/zenodo.20443586. 18 pages, 3 figures, 6 tables

详情
AI中文摘要

AI推理服务——API订阅、企业聊天工具和嵌入AI功能的SaaS产品——明确属于《企业可持续发展报告指令》(CSRD)下的范围3类别1,该指令要求自2024年1月开始的财年进行披露。然而,目前尚无标准方法将其纳入企业温室气体清单。现行实践要么完全忽略该类别,要么应用针对整个ICT行业校准的通用经济投入产出(EEIO)因子,导致AI推理排放被高估10-40倍(相对于物理衍生方法)。我们提出了一个四层框架,将估算精度与组织实际可获取的数据相匹配,从基于token的直接物理估算(使用GPU能耗基准和区域电网碳强度)逐步降级到基于支出的EEIO后备方法(用于无使用数据的服务)。排放因子来源于同行评审的GPU能耗基准(此http URL排行榜v3)、确认的电网碳强度(EPA eGRID 2023;Ember 2023)以及已发布的水利用效率数据(Li等人,2025)。应用于一家200人的欧洲企业,该框架得出的总排放量低于1 tCO2e,表明合规挑战在于方法论而非规模。我们进一步记录了当前ESG工具未揭示的水碳权衡:瑞典以水电为主的电网在数据集中碳强度最低,但水足迹最高,这对数据中心选址策略有直接影响。

英文摘要

AI inference services -- API subscriptions, enterprise chat tools, and SaaS products with embedded AI features -- fall unambiguously within Scope 3 Category 1 under the Corporate Sustainability Reporting Directive (CSRD), which requires disclosure for fiscal years starting January 2024. Yet no standardised methodology exists for including them in corporate GHG inventories. Current practice either omits the category entirely or applies a generic economic input-output (EEIO) factor calibrated to the ICT sector as a whole, overestimating AI inference emissions by 10-40x relative to physically derived alternatives. We propose a four-tier framework that matches estimation precision to the data organisations can realistically obtain, progressing from direct token-based physical estimation -- using GPU energy benchmarks and regional grid carbon intensities -- down to a spend-based EEIO fallback for services where no usage data exists. Emission factors are derived from peer-reviewed GPU energy benchmarks (ML.ENERGY Leaderboard v3), confirmed grid carbon intensities (EPA eGRID 2023; Ember 2023), and published water use effectiveness data (Li et al., 2025). Applied to a 200-person European firm, the framework yields a total below 1 tCO2e, illustrating that the compliance challenge is methodological rather than magnitude-driven. We further document a water-carbon trade-off that current ESG tools do not surface: Sweden's hydro-dominated grid delivers the lowest carbon intensity in our dataset but the highest water footprint, with direct implications for data centre location strategy.

2606.10658 2026-06-10 cs.CR cs.AI cs.CE q-fin.CP 新提交

Post-Quantum Secure Federated DeFi for Inclusive Banking

面向普惠银行的后量子安全联邦DeFi

Swati Sachan, Dale Fickett, Richard Buchinger, Theo Miller

发表机构 * AI FinTech Group, University of Liverpool(人工智能金融科技组,利物浦大学) RVA Works and University of Richmond(RVA Works和里士满大学) Chain Crunch Labs(Chain Crunch实验室)

AI总结 提出后量子安全联邦DeFi框架,利用格基全同态加密和NASA-IBM地理空间基础模型,实现银行间加密协作以提升信用不足个体的金融普惠性。

详情
AI中文摘要

近期纠错量子比特的进展加速了实用量子计算的时间表,这对用于保护金融系统、政府基础设施、通信网络和DeFi(去中心化金融)生态系统的密码原语构成威胁。本文提出一个后量子安全的联邦DeFi框架,支持银行间协作,以改善因有限金融历史而受到当地贷款机构服务不足的个体的普惠性。多家银行将加密信息批次贡献给一个虚拟服务器,其中基于格的完全同态加密(FHE)实现了端到端的同态计算。服务器以加密格式融合本地数据驱动的概率评估、专家信念以及由NASA-IBM Prithvi地理空间基础模型(GFM)生成的可验证证据。采用去中心化技术确保机构与服务器之间所有加密数据交换的防篡改证据和可审计问责性。该框架在弗吉尼亚州农村借款人的农业贷款决策上进行了测试。

英文摘要

Recent advances in error-corrected qubits have accelerated the timeline for practical quantum computing. It poses a threat to cryptographic primitives used to secure financial systems, government infrastructure, communication networks, and DeFi (Decentralized Finance) ecosystems. This paper introduces a post-quantum secure federated DeFi framework that enables inter-bank collaboration to improve the inclusivity of individuals underserved by local lenders due to limited financial histories. Multiple banks contribute encrypted information batches to a virtual server, where lattice-based Fully Homomorphic Encryption (FHE) enables end-to-end homomorphic computation. The server fuses local data-driven probabilistic assessments, expert beliefs, and verifiable evidence generated by the NASA-IBM Prithvi Geospatial Foundation Model (GFM), in encrypted format. Decentralized technologies are employed to ensure tamper-proof evidence and auditable accountability for all encrypted data exchanges between institutions and the server. The framework is tested on agricultural lending decisions for rural borrowers in Virginia.

2606.10627 2026-06-10 cs.HC cs.LG cs.SD 新提交

Profy: Interpretable Visualization of Expertise-Dependent Motor Skills Toward Supporting Piano Practice

Profy: 面向钢琴练习的、可解释的专业技能依赖性运动技能可视化

Kazuki Kawamura, Fujiki Nakamura, Hayato Nishioka, Momoko Shioki, Shinichi Furuya, Jun Rekimoto

发表机构 * The University of Tokyo(东京大学) Sony Computer Science Laboratories(索尼计算机科学实验室) NeuroPiano Institute(神经钢琴研究所)

AI总结 提出弱监督系统Profy,利用听众评分标签学习时间对齐的高亮,帮助钢琴学习者定位需重点练习的段落,在无局部标签下与专家标注高度一致。

Comments Designing Interactive Systems Conference (DIS '26), June 13-17, 2026, Singapore, Singapore

详情
AI中文摘要

钢琴演奏的质量取决于微妙的时机、发音和动态控制,但练习反馈通常是基于总结的且难以付诸行动。我们介绍了Profy,一个弱监督系统,它从聚合听众评分(专家标记与业余标记)中学习片段级标签,生成时间对齐的高亮,用于钢琴练习中的回顾。我们收集了73名钢琴家的同步1 kHz键运动与音频数据,并使用1083个有效片段进行建模和评估。模型在共享的重采样模型时间基上输出片段级预测和证据分数以进行可视化。在21名专家钢琴家标注的20个业余短技术练习片段上,尽管训练时没有局部标签,显示的高亮分数与专家标记用于回顾的段落一致(Pearson r=0.61,ROC-AUC 0.75)。Profy不是用一个全局分数总结一个片段,而是通过支持与专家-业余差异相关的时间局部段落的擦洗、循环和聚焦回放,帮助学习者决定下一步检查哪里。

英文摘要

The quality of piano performance depends on nuanced timing, articulation, and dynamic control, but practice feedback is often summary-based and hard to act on. We introduce Profy, a weakly supervised system that learns from take-level labels derived from aggregated listener ratings (expert-labeled vs. amateur-labeled) to produce time-aligned highlights for review during piano practice. We collected synchronized 1 kHz key-motion and audio from 73 pianists and used 1,083 valid takes for modeling and evaluation. The model outputs clip-level predictions together with evidence scores on a shared resampled model time base for visualization. On 20 amateur clips from short technique studies annotated by 21 expert pianists, the displayed highlight score aligns with passages that expert pianists marked for review despite training without localized labels (Pearson r=0.61, ROC-AUC 0.75). Rather than summarizing a take with a single global score, Profy helps learners decide where to inspect next by supporting scrubbing, looping, and focused replay of time-localized passages associated with expert-amateur differences.

2606.10621 2026-06-10 cs.IR cs.AI 新提交

STORM: Stepwise Token Optimization with Reward-Guided Beam Search

STORM: 基于奖励引导束搜索的逐步令牌优化

Arthur Satouf, Giulio D'Erasmo, Yuxuan Zong, Habiboulaye Amadou Boubacar, Pablo Piantanida, Benjamin Piwowarski

发表机构 * MILA – Quebec AI Institute & ILLS(魁北克人工智能研究所与ILLs) Université Paris-Saclay & CentraleSupélec & CNRS(巴黎-萨克雷大学及CentraleSupélec与CNRS) Air Liquide Sorbonne Université & ISIR & CNRS(索邦大学及ISIR与CNRS) Sapienza, University of Rome(罗马大学Sapienza)

AI总结 提出STORM框架,通过检索奖励引导的束搜索在每一步优化令牌选择,实现词汇检索的查询扩展,在多个基准上匹配或超越大模型重写器,并零样本迁移至18种语言。

详情
AI中文摘要

现代检索越来越依赖密集和学习的稀疏神经模型,这些模型有效但需要将整个语料库编码为专门的索引,并在模型变化时重建。像BM25这样的词汇检索器在标准倒排索引上保持高效和透明,无需随模型演变而改变,但存在词汇不匹配问题。LLM查询重写可以提供帮助,但提示式重写器会生成格式良好但检索无效或有害的术语,而针对检索奖励进行训练仅提供延迟的、序列级别的监督,掩盖了哪些术语有帮助。我们引入了STORM(基于奖励引导束搜索的逐步令牌优化),一个用于词汇查询扩展的自监督框架。STORM通过检索指标引导生成来训练重写器:在每一步,候选扩展根据BM25索引进行评分,并剪枝低奖励的延续,将检索奖励转化为令牌级别的信号,集中探索检索有效的词汇。在TREC DL和BEIR上,STORM使0.6B-8B的骨干模型匹配或超越有竞争力的LLM重写器,同时检索速度与普通BM25一样快;在8B规模上,它可与更大的专有重写器相媲美。它进一步零样本迁移到18种语言(MIRACL),平均击败了专门的多语言密集检索器,使STORM成为密集神经检索的一种有竞争力、基础设施轻量级的替代方案。

英文摘要

Modern retrieval increasingly relies on dense and learned-sparse neural models that are effective but require encoding the entire corpus into a specialized index, rebuilt whenever the model changes. Lexical retrievers like BM25 stay efficient and transparent on a standard inverted index that need not change as models evolve, but suffer from vocabulary mismatch. LLM query rewriting can help, yet prompted rewriters emit well-formed but retrieval-ineffective or harmful-terms, and training against a retrieval reward gives only delayed, sequence-level supervision that obscures which terms helped. We introduce STORM (Stepwise Token Optimization with Reward-guided beaM search), a self-supervised framework for lexical query expansion. STORM trains the rewriter through generation guided by retrieval metrics: at each step, candidate expansions are scored against the BM25 index and low-reward continuations pruned, turning the retrieval reward into a token-level signal that concentrates exploration on retrieval-effective vocabulary. Across TREC DL and BEIR, STORM lets 0.6B-8B backbones match or surpass competitive LLM rewriters while retrieving as fast as plain BM25; at 8B it rivals far larger proprietary rewriters. It further transfers zero-shot to 18 languages (MIRACL), beating dedicated multilingual dense retrievers on average, making STORM a competitive, infrastructure-light alternative to dense neural retrieval.

2606.10600 2026-06-10 eess.SY cs.LG cs.SY 新提交

Toward Proactive RF Charging Scheduling: Generative AI for Decision Support

面向主动射频充电调度:用于决策支持的生成式人工智能

Amirhossein Azarbahram, Osmel M. Rosabal, David Ernesto Ruiz-Guirola, Melike Erol-Kantarci, Kaibin Huang, Onel L. A. López

发表机构 * Centre for Wireless Communications (CWC), University of Oulu, Finland(芬兰奥卢大学无线通信中心) School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, Canada(加拿大渥太华大学电气与计算机科学学院) Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong(香港大学电子与电气工程系)

AI总结 本文提出将生成式AI作为不确定性感知支持层,辅助射频无线充电调度器在有限资源和不确定条件下做出鲁棒充电决策,并通过仓库案例验证其有效性。

详情
AI中文摘要

射频无线能量传输(RF-WPT)是一种支持未来物联网系统不间断通信的使能技术,通过减少电池更换需求和缓解电池废弃物相关问题。在大规模RF-WPT部署中,主要挑战之一是调度器级别的资源分配。具体而言,发射器必须在有限的充电资源、不完整的接收端信息以及不确定的近未来充电条件下,决定输送多少能量、何时输送以及向谁输送。本文将生成式人工智能(GenAI)定位为一种有前景的工具,因为它能够基于粗略的操作上下文和接收端信息,预见多种可能的充电场景。我们提出GenAI作为RF-WPT调度器的不确定性感知支持层,而非独立的预测或决策工具。为此,我们首先重新审视RF-WPT调度面临的主要挑战,并讨论主要GenAI系列如何通过为下游任务生成基于场景的输入来支持不确定性感知的充电决策。然后,我们通过一个仓库式案例研究表明,与确定性预测和简单的无学习基线相比,通过生成模型的采样能力保留不确定性可以改善鲁棒充电决策,尤其是在风险敏感目标下。最后,我们指出了关键开放挑战并提出了未来研究方向。

英文摘要

Radio frequency wireless power transfer (RF-WPT) is an enabling technology for supporting uninterrupted communications in future Internet of Things systems by reducing the need for battery replacement and mitigating battery-waste-related issues. For large-scale RF-WPT deployment, one of the main challenges is the scheduler-level resource allocation. Specifically, the transmitter must decide how much energy to deliver, when, and to whom, under limited charging resources, incomplete receiver-side information, and uncertain near-future charging conditions. This article positions generative artificial intelligence (GenAI) as a promising tool for this setting because it can foresee multiple plausible charging scenarios conditioned on coarse operational context and receiver-side information. We propose GenAI to act as an uncertainty-aware support layer for the RF-WPT scheduler rather than as a standalone forecasting or decision-making tool. To this end, we first revisit the main challenges of RF-WPT scheduling, and discuss how major GenAI families can support uncertainty-aware charging decisions by generating scenario-based inputs for downstream tasks. We then present a warehouse-style case study showing that preserving uncertainty through the sampling capability of generative models can improve robust charging decisions compared with deterministic prediction and simple non-learning baselines, especially under risk-sensitive objectives. Finally, we identify key open challenges and present some directions for future research.

2606.10595 2026-06-10 cs.CR cs.AI 新提交

From Data Heterogeneity to Convergence: A Data-Centric Review of Federated Learning

从数据异质性到收敛:联邦学习的数据中心综述

Huong Nguyen, Mickaël Bettinelli, Amirhossein Ghaffari, Alexandre Benoit, Hong-Tri Nguyen, Susanna Pirttikangas, Lauri Lovén

发表机构 * Oulu University(奥卢大学) University of Southern Brittany(南 Brittany 大学) Aalto University(Aalto 大学)

AI总结 本文从数据视角系统分析联邦学习中数据异质性对收敛的影响,提出可测量特征分类、连接实验分割与真实现象、评估数据相关脆弱性与防御对收敛的影响,为设计可预测收敛的系统提供指导。

详情
AI中文摘要

联邦学习(FL)已成为集中式学习中数据饥饿问题的有前途解决方案。这种范式使得多个客户端能够在隐私保护下协作训练共享任务模型,而无需暴露其本地数据。虽然数据是任何学习系统中的关键组成部分,但它也是漏洞和挑战的主要来源,并且是稳定且良好收敛训练的主要决定因素。现有的FL综述描述了通用基础、安全实践、机遇、挑战和应用,但没有深入探讨数据的各个方面以及从数据角度考虑问题。它们很少提供一种数据视角的综合,将具体的数据属性、分割协议和防御与收敛速度和稳定性联系起来。本综述通过三个进展填补了这一空白。首先,我们将非独立同分布(non-IID)分析为可测量的特征,并根据其对收敛的影响将其排序为强、中、轻,解释每种影响背后的机制,并调和图像、文本和图上的证据。其次,我们将实验分割实践与它们模拟的真实现象联系起来,揭示它们引入的伪影,并展示这些伪影如何影响目标精度。第三,我们分析了数据相关的脆弱性及其提出的防御如何影响收敛,报告在干净和对抗条件下的性能,使收敛-鲁棒性权衡明确。据我们所知,这是第一个提供对支配FL的数据相关挑战的完整理解的综述。针对每个问题提炼出清晰的要点,我们的工作可作为可操作的指南,帮助从业者设计具有可预测收敛和稳定性的系统。

英文摘要

Federated Learning (FL) has emerged as a promising solution for data hunger in centralized learning. This paradigm enables privacy with multiple clients to train a shared-task model collaboratively without exposing their local data. While being a key component in any learning system, data is also a primary source of vulnerabilities and challenges, and a major determinant of a stable and well-converged training. Existing FL reviews describe general foundations, security practices, opportunities, challenges, and applications, without delving into diverse aspects of data and considering problems from the data perspective. They rarely provide a data-lens synthesis that links concrete data properties, split protocols, and defenses to convergence speed and stability. This survey fills that gap with three advances. First, we analyze non-IID into measurable traits and rank their influence on convergence as strong, medium, or light, explaining the mechanisms behind each and reconciling evidence across images, texts, and graphs. Second, we connect experimental splitting practices to the real phenomena they emulate, expose the artifacts they introduce, and show how those artifacts affect target accuracy. Third, we analyze how data-related vulnerabilities and their proposed defenses affect convergence, reporting performance under clean and adversarial conditions to make the convergence-robustness trade-off explicit. To our knowledge, this is the first survey to provide a complete understanding of data-related challenges that govern FL. With clear takeaways distilled for each concern, our work serves as actionable guidance, helping practitioners design their system with predictable convergence and stability.

2606.10525 2026-06-10 cs.CR cs.AI 新提交

Assessing Automated Prompt Injection Attacks in Agentic Environments

评估智能体环境中的自动化提示注入攻击

David Hofer, Edoardo Debenedetti, Florian Tramèr

发表机构 * ETH Zurich(苏黎世联邦理工学院)

AI总结 研究在智能体环境中,黑盒优化方法(TAP)比梯度方法(GCG)更有效,且攻击效果依赖于攻击者模型,任务通用攻击可迁移但跨模型迁移受限。

详情
AI中文摘要

间接提示注入对与不可信外部数据交互的LLM智能体构成严重威胁,然而在现实智能体环境中,自动化攻击方法(已被证明对越狱有效)仍未得到充分探索。我们针对LLM智能体进行了自动化提示注入攻击的全面实证评估,将白盒(GCG)和黑盒(TAP)方法都适配到AgentDojo框架中的智能体设置。我们在跨越四个领域和多个模型的80个任务对上进行评估,发现黑盒优化显著优于基于梯度的方法,我们将这一差距归因于GCG在合理计算预算下的优化不稳定性。我们还发现TAP的有效性取决于攻击者模型,因为通用能力和安全调优都会影响攻击成功率——更强的模型产生更有效的注入,而安全调优的攻击者可能拒绝生成对抗性提示。任务通用攻击有效迁移到未见过的任务和分布外领域,但在较小开源模型上优化的攻击不会迁移到GPT-5等前沿模型。这些发现表明自动化提示注入是一种可信但依赖于模型的威胁,对于模型无关的利用仍存在重大障碍。

英文摘要

Indirect prompt injection poses a critical threat to LLM agents that interact with untrusted external data, yet automated attack methods--proven effective for jailbreaking--remain underexplored in realistic agentic settings. We present a comprehensive empirical evaluation of automated prompt injection attacks against LLM agents, adapting both white-box (GCG) and black-box (TAP) methods to the agentic setting within the AgentDojo framework. We evaluate across 80 task pairs spanning four domains and multiple models, and find that black-box optimization substantially outperforms gradient-based methods, a gap we attribute to GCG's optimization instability under reasonable compute budgets. We also find that TAP's effectiveness depends on the attacker model, as both general capability and safety tuning affect attack success--stronger models produce more effective injections, while safety-tuned attackers can refuse to generate adversarial prompts. Task-universal attacks transfer effectively to unseen tasks and out-of-distribution domains, but attacks optimized on smaller open-source models do not transfer to frontier models like GPT-5. These findings highlight automated prompt injection as a credible but model-dependent threat, with significant barriers remaining for model-agnostic exploitation.

2606.10493 2026-06-10 cs.DC cs.AI cs.LG cs.NE 新提交

Achieving Cloud-Grade SLOs for Local Mixture-of-Experts Inference through CPU-GPU Hybrid Design

实现本地混合专家模型推理的云级SLO:CPU-GPU混合设计

Wenxin Wang, Yule Hou, Yu Ji, Peng Qu, Youhui Zhang

发表机构 * Tsinghua University(清华大学) Xingyun Integrated Circuits Co., Ltd.(星云集成电路有限公司) Beijing National Research Center for Information Science and Technology(北京信息科学与技术国家研究中心)

AI总结 针对本地MoE推理在低并发下仍无法达到云级服务质量的问题,提出CPU-GPU混合系统,通过流加载预填充、分布式SLP、节点内预填充-解码分离、AVX-512优化FP8 GEMV内核和细粒度CPU并行,在消费级硬件上实现云级SLO。

Comments Accepted to the 20th USENIX Symposium on Operating Systems Design and Implementation (OSDI '26). The official version will appear in the OSDI '26 proceedings published by USENIX

详情
AI中文摘要

本地部署大型混合专家(MoE)模型即使在低并发工作负载下也无法达到云级环境中的服务质量。我们识别出本地MoE推理中的四个关键差距:依赖容量缩减模型(量化、蒸馏、重路由)、无法满足长预填充(超过12K)的30秒TTFT、低于基线的解码吞吐量(低于20 tokens/s)、以及在混合预填充-解码和批量解码工作负载下的并发性差。我们提出一个CPU-GPU混合系统,通过以下方式在双插槽商用CPU和消费级GPU上实现云级SLO:(1)流加载预填充(SLP),将预填充吞吐量提升至1,200 tokens/s,并在30秒内支持32K提示;(2)采用SmallEP专家并行的分布式SLP(DSLP),在两张RTX 5090上达到1,800 tokens/s和45K提示;(3)节点内预填充-解码分离,具有零拷贝共享权重和双批次注意力-MoE重叠方案,在延迟增加低于15%且吞吐量提升50%的情况下维持并发性;(4)AVX-512优化的FP8 GEMV内核,实现原生CPU FP8推理,同时降低4-5倍CPU延迟;(5)细粒度CPU并行,在INT4 DeepSeek-V3上达到28 tokens/s,在完整FP8 V3上达到21.5 tokens/s。评估表明,我们的系统在消费级CPU-GPU平台上为旗舰MoE模型提供云级QoS,通过完整原始精度推理重塑本地部署,无需数据中心基础设施即可实现高质量、经济高效的访问。

英文摘要

Local deployment of large Mixture-of-Experts (MoE) models falls short of the service quality achieved in cloud-scale environments, even under low-concurrency workloads. We identify four key gaps in local MoE inference: reliance on capacity-reduced models (quantized, distilled, rerouted), inability to meet 30-second TTFT for long prefills (more than 12K), sub-baseline decode throughput (under 20 tokens/s), and poor concurrency under mixed prefill-decode and batched decode workloads. We present a CPU-GPU hybrid system that achieves cloud-level SLOs on dual-socket commodity CPUs and consumer GPUs by (1) stream-loading prefill (SLP), boosting prefill throughput to 1,200 tokens/s and enabling 32K prompts within 30 seconds; (2) distributed SLP (DSLP) with SmallEP expert parallelism, reaching 1,800 tokens/s and 45K prompts in 30 seconds on two RTX 5090s; (3) intra-node prefill-decode disaggregation with zero-copy shared weights and a dual-batch attention-MoE overlap scheme, sustaining concurrency with under 15 percent latency increase and 50 percent throughput gains; (4) an AVX-512-optimized FP8 GEMV kernel, enabling native CPU FP8 inference while delivering 4-5x lower CPU latency; and (5) fine-grained CPU parallelism that attains 28 tokens/s on INT4 DeepSeek-V3 and 21.5 tokens/s on intact FP8 V3. Evaluations show our system delivers cloud-level QoS for flagship MoE models on consumer CPU-GPU platforms, reshaping local deployment with intact, original-precision inference and enabling high-quality, cost-effective access without datacenter infrastructure.

2606.10475 2026-06-10 cs.MA cs.AI cs.CL 新提交

Decoupling Thought from Speech: Knowledge-Grounded Counterfactual Reasoning for Resilient Multi-Agent Argumentation

思想与言语解耦:基于知识反事实推理的鲁棒多智能体辩论

Jakub Masłowski, Jarosław A. Chudziak

发表机构 * Institute of Computer Science, Warsaw University of Technology(华沙技术大学计算机科学学院)

AI总结 提出知识反事实推理(KG-CFR)双阶段架构,通过私有规划缓冲与公共执行层分离,在动态资源分配环境下将扰动后论证质量从0.694提升至0.822,并减少语义循环。

Comments Accepted for publication in the Proceedings of the 30th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES 2026)

详情
AI中文摘要

多智能体辩论框架已被证明能提升大语言模型在收敛任务上的表现,但目前优化方式过度偏向最终输出准确性而非过程稳定性。在长时间交互中,持续扰动下的反应式系统常出现逻辑退化、论点重复和角色漂移。为从结构上防止身份丢失并保持过程保真度,我们引入知识反事实推理(KG-CFR),一种双阶段架构,在私有检索增强规划缓冲区和公共执行层之间强制执行严格关注点分离。我们在不确定性下动态资源分配(DRAU)这一专用1v1v1环境中评估该系统,引入与标准辩论设置不同的多样性。在270次完全析因危机模拟轨迹(含随机环境冲击)中,KG-CFR在超过95%的扰动运行中防止了裁判检测到的关键冲击后退化(定义为质量偏移Δ ≤ -0.20),将整体论证质量从0.694提升至0.822。我们的主要贡献是证明架构解耦是在持续压力下不损失质量而增强系统鲁棒性的重要因素。此外,我们引入了用于话语发散和计划执行对齐的自定义向量度量,为操作稳定性提供了强有力且方向一致的证据。消融实验表明,适当的教义基础与前瞻规划对论证质量同等重要。根据初步度量评估,KG-CFR通过保持智能体与原始计划的一致性减少了语义循环。

英文摘要

Multi-agent debate frameworks have been shown to improve large language model performance in convergent tasks, but they are currently optimized in a way that heavily favors final output accuracy rather than stability of the process. During long-horizon exchanges reactive systems under sustained perturbations often experience logic degradation, argument repetition, and role drift. To structurally prevent the identity loss and maintain the process fidelity, we introduce Knowledge-Grounded Counterfactual Reasoning (KG-CFR), a dual-stage architecture that enforces a strict separation of concerns between a private, retrieval-augmented planning buffer, and a public execution layer. We assess this system in Dynamic Resource Allocation under Uncertainty (DRAU), a dedicated 1v1v1 environment, introducing diversity as distinct from standard debate settings. Over 270 completely factorial crisis simulation trajectories with stochastic environmental shocks, KG-CFR prevents judge-detected critical post-shock degradation (defined as a quality shift, $Δ\le -0.20$) in more than 95% of perturbed runs, increasing the overall argument quality from 0.694 to 0.822. Our primary contribution is the demonstration of architectural decoupling being an important factor of systemic resilience enhancement under sustained pressure without quality loss. Furthermore, we introduce custom vector metrics for discourse divergence and plan-execution alignment that provide strong, directionally consistent evidence of operational stability. Our ablation experiments suggest that the proper doctrinal grounding can be an equally important factor for argument quality, as the prospective planning. KG-CFR, according to our initial metric evaluations, reduces semantic looping, by preserving the agent's consistency with the original plan.

2606.10472 2026-06-10 cs.GT cs.LG 新提交

Trading Utility for Dynamic Fairness in Multiple Resource Division with Sequential Demand

在顺序需求的多资源分配中权衡效用与动态公平性

Kaiqi Jiang, Karim El Husseini, Wenzhe Fan, Xinhua Zhang

发表机构 * Computer Science Dept. University of Illinois Chicago(伊利诺伊大学芝加哥分校计算机科学系)

AI总结 提出一种神经分配机制,通过多目标优化在顺序分配中平衡公平与效用,实现更高的效用同时保持可比公平性。

详情
AI中文摘要

动态多资源分配是共享计算环境中的一个核心问题,其中用户的需求顺序到达,且必须在不知道未来需求的情况下公平分配资源。现有方法强调公平性保证,如共享激励、无嫉妒和动态帕累托最优性,但往往忽略系统效用。此外,这些公平性标准互不兼容,无法同时严格实施。我们提出一种神经分配机制,通过在顺序展开过程中进行多目标优化来调和公平性与效用。我们首先通过共享激励、无嫉妒和动态帕累托最优性的逐步损失函数形式化动态环境中的公平性,从而实现可微训练。利用非浪费性,我们通过将分配约束在需求子空间内来参数化解,同时允许在资源可用时进行弹性过度分配。实验结果表明,我们学习的分配器在可比公平性水平下实现了显著更高的效用,揭示了跨指标的清晰帕累托前沿式权衡。

英文摘要

Dynamic multi-resource allocation is a central problem in shared computing environments, where users' demands arrive sequentially and resources must be distributed fairly without knowledge of future demands. Existing methods emphasize fairness guarantees such as Sharing Incentive, Envy Freeness, and Dynamic Pareto Optimality, but often overlook system utility. Moreover, these fairness criteria are mutually incompatible, preventing strict enforcement of them at the same time. We propose a neural allocation mechanism that reconciles fairness with utility through multi-objective optimization during sequential rollout. We first formalize fairness in the dynamic setting via stepwise loss functions for Sharing Incentive, Envy Freeness, and Dynamic Pareto Optimality, enabling differentiable training. Leveraging non-wastefulness, we parameterized the solutions by constraining allocations to the subspace of demand while allowing elastic over-allocation when resources remain available. Empirical results demonstrate that our learned allocator achieves substantially higher utility at comparable levels of fairness, uncovering clear Pareto-frontier-like tradeoffs across metrics.

2606.10459 2026-06-10 cs.SI cs.CL 新提交

Leveraging Social Media Data for COVID-19 Studies

利用社交媒体数据进行COVID-19研究

Nur Hafieza Ismail, Nur Shazwani Kamarudin, Nurol Husna Che Rose

发表机构 * Faculty of Computing, University Malaysia Pahang(马来西亚乌拉大学 computing 学院) Faculty of Electronic Engineering Technology, University Malaysia Perlis(马来西亚霹雳大学电子工程技术学院)

AI总结 本文探讨社交媒体在COVID-19大流行期间的作用,分类使用数据,介绍机器学习、特征工程、自然语言处理和调查方法,并指出未来研究方向。

Comments 8 pages, 1 figure

详情
AI中文摘要

如今,社交网络已成为广泛偏好的信息来源。特别是在2019冠状病毒病(COVID-19)大流行期间,社交媒体已成为获取与COVID-19相关最新新闻和信息的最常用平台之一。社交媒体之所以受欢迎,是因为它们为注册用户提供免费访问,并允许他们发布、传播信息以及回复他人的帖子。全球有近46亿社交媒体用户,因此这些平台上共享的大量信息可能影响人们如何看待和应对当前面临的大流行,这并不令人惊讶。通过合理使用,社交媒体可以成为传播可靠新闻和提高患者、临床医生及社会公众意识的有益数字工具。具体而言,本章描述了用户披露中表达的语言、视觉和情感指标。因此,本章详细探讨和讨论了COVID-19大流行期间社交媒体平台使用的相关研究。本章还对所使用的社交媒体数据进行了分类,介绍了不同的部署机器学习、特征工程、自然语言处理和调查方法,并概述了未来研究的方向。

英文摘要

Nowadays, social media networks have become widely preferred sources of information. Especially during the time of the Coronavirus disease 2019 COVID 19 pandemic, social media has been one of the most used platforms to get the latest news and information related to COVID 19. Social media are popular because they offer free access to their registered users and allow them to do posting, disseminate information, and respond to others postings. With almost 4.6 billion social media users worldwide, it is not surprising the significant amount of information shared through these platforms could affect how people perceive and cope with the pandemic that we are facing right now. With decent use, social media can be a beneficial digital tool to spread reliable news and public awareness for patients, clinicians, and society. Specifically, this chapter describes linguistic, visual, and emotional indicators expressed in user disclosures. Thus, in this chapter, the related studies of social media platforms usage during the COVID 19 pandemic are explored and discussed in detail. This chapter also categorizes social media data used, introduces different deployed machine learning, feature engineering, natural language processing, and survey methods, and outlines directions for future research.

2606.10440 2026-06-10 cs.DC cs.LG cs.NI 新提交

ASTRA-sim 3.0: Next-Level Distributed Machine Learning Simulations via High-Fidelity GPU and Infrastructure Modeling

ASTRA-sim 3.0:通过高保真GPU和基础设施建模实现下一代分布式机器学习模拟

William Won, Jinsun Yoo, Tuan Ta, Moumita Dey, Andy Balogh, Pradosh Datta, Furkan Eris, Conor Green, Winston Liu, Changhai Man, Kingshuk Mandal, Amos Rai, Vinay Ramakrishnaiah, Ruchi Shah, David Sidler, Harsh Sikhwal, Hanjiang Wu, Tushar Krishna, Bradford M. Beckmann

发表机构 * AMD Research and Advanced Development(AMD研究与高级开发) Georgia Institute of Technology(佐治亚理工学院) Keysight Purdue University(普渡大学)

AI总结 针对分布式机器学习中延迟敏感通信建模的不足,提出ASTRA-sim 3.0,通过细粒度缓存行级负载存储模拟和标准化基础设施表示InfraGraph,实现高保真模拟,支持优化集合算法、网络需求和GPU架构的设计空间探索。

Comments 10 pages, 15 figures, one table

详情
AI中文摘要

分布式机器学习是当今大规模人工智能应用的关键范式。随着模型推理成为重要用例,对延迟敏感的集合通信进行忠实建模从未如此重要。因此,如今必须高保真地捕获设备架构并建模控制和数据路径。拥有分布式机器学习基础设施的通用、详细表示也至关重要。我们重新审视了有前途的开源社区驱动模拟器:ASTRA-sim。在这项工作中,我们识别了当前ASTRA-sim模拟器的局限性,并为其增加了新功能。为此,我们通过标准化的基础设施表示实现了细粒度、高保真的模拟,开辟了新的设计空间探索机会。我们提出了缓存行大小的负载存储粒度的模拟,并带有详细的图形处理单元(GPU)执行模型,以平衡模拟的可扩展性和保真度。我们还引入了InfraGraph,一种标准化表示,用于详细捕获分布式机器学习网络基础设施。使用更新的ASTRA-sim 3.0模拟器,我们展示了设计优化集合算法、网络需求和GPU架构的有趣设计空间探索。

英文摘要

Distributed machine learning (ML) is a key paradigm for today's large-scale artificial intelligence applications. As model inference arises as an important use case, faithful modeling of latency-sensitive collective communication has never been more important. Capturing the device architecture and modeling control and data paths at high fidelity is therefore a necessity today. Having a common, detailed representation for distributed ML infrastructure is also crucial. We revisit the promising open-source, community-driven simulator: ASTRA-sim. In this work, we identify limitations of the current ASTRA-sim simulator and augment it with new features. To this end, we enable fine-grained, high-fidelity simulation with a standardized infrastructure representation, opening new design space exploration opportunities. We propose the simulation at cache-line-sized load-store granularity, with a detailed graphics processing unit (GPU) execution model, to balance simulation scalability and fidelity. We also introduce InfraGraph, a standardized representation to capture distributed ML network infrastructure in detail. Using the updated ASTRA-sim 3.0 simulator, we showcase interesting design space explorations for designing optimized collective algorithms, network requirements, and GPU architectures.

2606.10398 2026-06-10 cs.IR cs.CL cs.HC cs.SI 新提交

Selection, Not Salience: The Shape and Limits of Personalization in Social Highlighting

选择而非显著性:社交高亮中个性化的形态与局限

Kazuki Nakayashiki, Keisuke Watanabe

发表机构 * Glasp Inc.(Glasp公司)

AI总结 通过社交高亮和共读身份控制实验,发现个性化主要作用于文档选择层(约+0.13),而非句子显著性层,且效果主要由主题偏好驱动。

Comments 9 pages, 1 figure, 3 tables

详情
AI中文摘要

个性化读者所见内容是否值得,其边界在哪里?利用社交网页高亮器和共读身份控制(同一文档被多个用户高亮,固定文档和主题,询问个人历史是否比另一个读者的历史更好地预测其标记),我们绘制了跨阅读层次的个性化形态与局限。在文档层次,我们给出了干净、无泄漏、身份控制的测量,而先前的下一文档评估只能给出上界:个人历史能识别共读邻域中哪些文档属于该用户,自身与其他的差距为+0.169(相对于社区负例)和+0.119(相对于主题匹配的难负例),两者均高度显著;基于内容的实验表明该信号并非纯粹由标题驱动,而主要是主题性的。这与我们先前工作中跨度级的选择信号(+0.14)相当:选择信号在不同层次上幅度相近(+0.12至+0.17),其中大部分是稳定的主题偏好。在句子层次,两阶段个性化自动高亮(非个性化模型提出候选,个性化模型重新排序)并未优于其非个性化基线:两个现成的零样本大语言模型(包括前沿模型)预测高亮位置的效果不如首句基线,且即使在最高召回率的候选池中,个性化重排序也被显著性顺序击败,因此零结果并非仅仅是第一阶段的天花板效应。可测量的个性化主要出现在选择层:适度(约+0.13)、以主题为主,在显著性层没有可靠增益。我们还发现了一个控制负例偏差,该偏差在审计前将我们的文档差距膨胀到虚假的+0.227。超越共享显著性层可能更适合通过聚合个体而非加强个性化来实现。

英文摘要

Does personalizing what a reader sees pay off, and where does it stop? Using a social web highlighter and a co-readership identity control (the same document highlighted by many users, which holds document and topic fixed and asks whether a person's own history predicts their marks better than another reader's does), we map the shape and limits of personalization across reading altitudes. At the document altitude we give the clean, leakage-free, identity-controlled measurement that prior next-document evaluations could only upper-bound: a person's history identifies which documents in a co-reading neighborhood are theirs, with an own-versus-other gap of +0.169 against community negatives and +0.119 against topic-matched hard negatives (both highly significant); a content-based arm suggests the signal is not purely title-driven but is largely thematic. This is comparable to the span-level selection signal (+0.14) from our prior work: the selection signal is of comparable magnitude across altitudes (+0.12 to +0.17), most of it stable topic preference. At the sentence altitude, a two-stage personalized auto-highlight (an impersonal model proposes candidates, a personal model re-ranks them) does not improve on its impersonal baseline: two off-the-shelf zero-shot LLMs, including a frontier model, predict highlight locations worse than a lead baseline, and personal re-ranking is beaten by the salience order even on the highest-recall candidate pool, so the null is not merely a Stage-1 ceiling artifact. Measurable personalization appears primarily at the selection layer: modest (~+0.13), topic-dominated, with no reliable gain at the salience layer. We also surface a control-in-negatives bias that inflated our document gap to a spurious +0.227 until audited. Going beyond the shared salience layer may be better approached by aggregating individuals than by personalizing them harder.

2606.10388 2026-06-10 cs.IR cs.AI 新提交

SkillResolve-Bench: Measuring and Resolving Same-Capability Ambiguity in Agent Skill Retrieval

SkillResolve-Bench:衡量和解决智能体技能检索中的同能力歧义

Jiandong Ding

发表机构 * Huawei Technologies Ltd(华为技术有限公司)

AI总结 针对智能体技能库中同一能力族内不同技能的执行风险,提出SkillResolve-Bench基准和SkillResolve方法,通过候选族解析和代表性选择,在保持高召回率的同时将有害技能暴露率降至0。

Comments Preprint

详情
AI中文摘要

智能体技能库正成为可路由的软件资产:检索到的技能可以为智能体提供指令、脚本、资源绑定和执行假设。这使得技能检索不仅仅是广泛的相关性匹配。检索器可以找到正确的能力族,却暴露出错误的同能力代表。我们将这种失败研究为同能力执行风险检索。每个查询将一个有用的技能与一个特定于查询的有风险兄弟技能配对,该兄弟技能共享能力族,但可能导致执行指向过时资源、缺失前提或错误程序。我们引入了SkillResolve-Bench 1.0,这是一个针对该场景的可审计基准,包含661个有用/有风险对、源角色和准入证据、线索/泄漏检查、查询不相交划分,以及一个包含6,660个公共SkillRet候选的7,982个候选池。该基准报告有用性排名以及有害兄弟率(HSR@K),即前K个中暴露有风险兄弟的比例。我们还提供了SkillResolve,一种参考方法,它解析活跃候选族,从易混淆的库负样本和契约配置文件线索中评分查询条件效用,并在最终前K列表之前从每个族中选择一个代表。在已发布族关系下,SkillResolve达到Recall@3 0.766和NDCG@3 0.699,同时保持HSR@3=0。与SkillRouter相比,Recall@3提升0.112,NDCG@3提升0.165,同时将HSR@3从0.693降至0。如果没有代表性选择,在相同评分器下HSR@3升至0.236,这表明族内代表性选择是将能力检索转化为更安全过程暴露的机制。

英文摘要

Agent skill libraries are becoming routable software assets: a retrieved skill can contribute instructions, scripts, resource bindings, and execution assumptions to an agent. This makes skill retrieval more than broad relevance matching. A retriever can find the right capability family yet expose the wrong same-capability representative. We study this failure as same-capability execution-risk retrieval. Each query pairs a helpful skill with a query-specific risky sibling that shares the capability family but can lead execution toward a stale resource, missing precondition, or wrong procedure. We introduce SkillResolve-Bench 1.0, an auditable benchmark for this setting with 661 helpful/risky pairs, source-role and admission evidence, cue/leakage checks, query-disjoint splits, and a 7,982-candidate pool that includes 6,660 public SkillRet candidates. The benchmark reports helpful ranking together with harmful sibling rate (HSR@K), the top-K exposure of the risky sibling. We also provide SkillResolve, a reference method that resolves active candidate families, scores query-conditioned utility from confusable library negatives and contract-profile cues, and selects one representative from each family before the final top-K list. Under the released family relation, SkillResolve reaches Recall@3 0.766 and NDCG@3 0.699 while keeping HSR@3=0. It improves over SkillRouter by 0.112 Recall@3 and 0.165 NDCG@3 while reducing HSR@3 from 0.693 to 0. Without representative selection, HSR@3 rises to 0.236 under the same scorer, identifying within-family representative choice as the mechanism that turns capability retrieval into safer procedural exposure.

2606.10357 2026-06-10 cs.IR cs.AI 新提交

Atomic Intent Reasoning: Bringing LLM Semantics to Industrial Cross-Domain Recommendations

原子意图推理:将LLM语义引入工业跨域推荐

Zhuohang Jiang, Yuxin Chen, Shijie Wang, Haohao Qu, Zhou Jindong, Wenqi Fan, Li Qing, Dongxu Liang, Jun Wang

发表机构 * The Hong Kong Polytechnic University(香港理工大学) Kuaishou Technology(快手科技)

AI总结 提出AIR框架,通过离线LLM推理与在线高效检索组合,实现工业级跨域推荐,在快手电商中GMV提升3.446%。

Journal ref Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2 (KDD '26), August 09--13, 2026, Jeju Island, Republic of Korea

详情
AI中文摘要

跨域推荐是内容到电子商务平台的核心问题。其目标是利用用户与内容的交互来推断电子商务端的潜在购买意图,从而提高转化率和商业价值。然而,在真实的工业场景中,跨域推荐面临多重挑战:不同领域之间存在显著的语义鸿沟,用户跨域行为序列通常规模庞大且噪声丰富。尽管大型语言模型(LLM)具有强大的语义理解和推理能力,但其毫秒级的推理延迟使得直接应用于在线推荐系统变得困难。为了解决这些问题,本文介绍了AIR(原子意图推理),一个为工业级部署设计的LLM驱动的跨域推荐框架。通过将LLM推理迁移到离线阶段,并在在线操作期间通过高效检索和组合动态构建用户意图表示,它在保持语义一致性的同时实现了约400倍的推理加速。在多个公共数据集上的实验结果表明,我们的方法在跨域推荐任务中达到了最先进的性能。此外,在快手电商真实业务场景中进行的大规模在线A/B测试显示,我们的方法在多个核心业务指标上取得了稳定且显著的提升,包括GMV增长+3.446%,充分验证了其在工业级推荐系统中的有效性和实用价值。

英文摘要

Cross-domain recommendation is a core problem in content-to-e-commerce platforms. Its objective is to leverage user interactions with content to infer potential purchasing intent on the e-commerce side, thereby enhancing conversion rates and commercial value. However, in real industrial scenarios, cross-domain recommendation faces multiple challenges: significant semantic gaps exist between different domains, and user cross-domain behavior sequences are often massive in scale and rich in noise. Although large language models (LLMs) possess powerful semantic understanding and reasoning capabilities, their millisecond-level inference latency makes direct application in online recommendation systems difficult. To address these issues, this paper introduces AIR (Atomic Intent Reasoning), an LLM-driven cross-domain recommendation framework designed for industrial-grade deployment. By migrating LLM inference to the offline phase and dynamically constructing user intent representations through efficient retrieval and composition during online operations, it achieves approximately 400* inference acceleration while maintaining semantic consistency. Experimental results across multiple public datasets demonstrate that our method achieves state-of-the-art performance in cross-domain recommendation tasks. Furthermore, large-scale online A/B testing conducted in Kuaishou E-commerce's real-world business scenarios shows that our approach delivers stable and significant improvements across multiple core business metrics, including a +3.446% increase in GMV, fully validating its effectiveness and practical value in industrial-scale recommendation systems.

2606.10281 2026-06-10 cs.CR cs.CL 新提交

Benchmarking and Exploring the Capabilities of LLMs for Attack Investigations

基准测试与探索LLM在攻击调查中的能力

Aniket Anand, Yiwei Hou, Daniel Fields, Alex Kantchelian, David Tao, Kurt Thomas, Grant Ho

发表机构 * University of Chicago(芝加哥大学) University of California, Berkeley(加州大学伯克利分校) Google(谷歌)

AI总结 提出AuditBench基准数据集,评估LLM在安全审计日志分析中的性能,涵盖四种常见调查任务,揭示模型在不同设计选择下的表现差异与错误类型。

详情
AI中文摘要

本文提出了AuditBench,一个新的基准数据集,用于评估LLM在调查安全相关系统审计日志方面的能力。我们设计并使用该基准来探索LLM在事件响应团队通常执行的四种日志调查任务上的表现,范围从对检测器生成的警报进行分类到识别受损系统上的持久性机制。AuditBench包含从Linux和Windows机器收集的系统审计日志,涵盖50多种不同的安全调查场景,包括恶意和良性活动。利用我们的基准,我们评估并分析了五个前沿LLM在分析审计日志以进行攻击调查方面的性能。我们的分析揭示了LLM性能和错误概况如何根据不同的设计选择而变化,例如模型大小、数据表示、提示构建和特定调查任务的差异。此外,我们描述了LLM生成的解释质量以及模型在我们的基准中犯的错误类型。总的来说,我们的工作为评估LLM调查安全日志的能力提供了基础,为在安全运营中使用LLM的从业者提供了新颖的见解,并为未来研究指明了重要方向。

英文摘要

This paper presents AuditBench, a new benchmark dataset for evaluating the capabilities of LLMs at investigating security-related system audit logs. We design and use this benchmark to explore the performance of LLMs on four log-investigation tasks that incident response teams commonly perform, ranging from triaging alerts generated by detectors to identifying persistence mechanisms on compromised systems. AuditBench consists of system audit logs collected from Linux and Windows machines, and spans over 50 different security investigation scenarios, including both malicious and benign activity. Using our benchmark, we evaluate and analyze the performance of five frontier LLMs at analyzing audit logs for attack investigations. Our analysis illuminates how LLM performance and error profiles vary according to different design choices, such as differences in model size, data representation, prompt construction, and specific investigation tasks. Additionally, we characterize the quality of the explanations produced by LLMs and the types of errors that models make across our benchmark. Collectively, our work provides a foundation for assessing the capabilities of LLMs for investigating security logs, novel insights for practitioners using LLMs in security operations, and important directions for future research.

2606.10173 2026-06-10 cs.CR cs.AI 新提交

Local Is Not a Sufficient Privacy Boundary: Governing OS-Integrated On-Device AI

本地并非充分的隐私边界:治理操作系统集成的设备端AI

Jonghyun Chung, Sanket Badhe

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 本文提出一个以操作系统为中心的隐私框架,将隐私视为制度问责问题而非部署属性,通过威胁模型、六部分隐私风险分类、隐私架构控制和四级审计标准来治理设备端AI。

详情
AI中文摘要

随着AI系统进入操作系统,隐私不再仅仅取决于模型是否在本地运行。本地助手可能整合电子邮件、日历条目、文件、截图、通知和应用意图;保留嵌入或摘要;调用工具;发送遥测数据;或将困难请求路由到云基础设施。本地推理减少了一些暴露,但它只回答了一个问题:计算发生在哪里。它没有回答谁可以整合上下文、哪些派生状态持续存在、哪些操作被授权,或者更新如何改变系统的权限。我们为设备端AI开发了一个以操作系统为中心的隐私框架,将隐私视为一个制度问责问题,而不是一个部署属性。该框架指定了一个威胁模型、一个六部分隐私风险分类、隐私架构控制和一个四级审计标准。我们通过一个文档约束的比较来展示该标准,比较对象包括Apple Intelligence/Foundation Models、Android AICore/Gemini Nano和Microsoft Recall。设备端AI中有意义的隐私取决于受限的信息流、有限的权限、可见的用户控制以及在操作系统生命周期中可审计的治理。

英文摘要

As AI systems move into operating systems, privacy no longer turns only on whether a model runs locally. A local assistant may assemble email, calendar entries, files, screenshots, notifications, and app intents; retain embeddings or summaries; invoke tools; emit telemetry; or route difficult requests to cloud infrastructure. Local inference reduces some exposure, but it answers only one question: where computation occurs. It does not answer who may assemble context, what derived state persists, which actions are authorized, or how updates change the system's authority. We develop an OS-centered privacy framework for on-device AI that treats privacy as an institutional accountability problem rather than a deployment attribute. The framework specifies a threat model, a six-part privacy risk taxonomy, privacy-by-architecture controls, and a four-level audit rubric. We demonstrate the rubric through a documentation-bounded comparison of Apple Intelligence/Foundation Models, Android AICore/Gemini Nano, and Microsoft Recall. Meaningful privacy in on-device AI depends on constrained information flow, bounded authority, visible user control, and auditable governance across the operating-system lifecycle.

2606.10156 2026-06-10 cs.IR cs.AI cs.CL 新提交

$τ$-Rec: A Verifiable Benchmark for Agentic Recommender Systems

$τ$-Rec:面向智能推荐系统的可验证基准

Bharath Sivaram Narasimhan, Karthik R Narasimhan

发表机构 * Independent Researcher(独立研究员) Princeton University(普林斯顿大学)

AI总结 针对多轮对话式智能推荐系统评估中主观性强、成本高的问题,提出$τ$-Rec基准,通过可验证奖励和揭示标记引导机制,结合pass^k可靠性指标,系统评估模型推理一致性,发现当前最佳模型可靠性仅约57%。

详情
AI中文摘要

随着推荐系统向智能、多轮对话界面转变,评估范式难以跟上步伐。当前的基准通常依赖“LLM作为评判者”的评估,这引入了主观性、高成本和不一致性。我们提出了$τ$-Rec,一个用于智能推荐系统的基准,它用可验证奖励取代主观评估,并采用揭示标记引导(RTE)机制来控制任务约束在对话中如何呈现。通过针对结构化目录谓词测试智能体,并采用pass^k可靠性指标,$τ$-Rec为一致的推理提供了系统测试。我们对五个模型家族(GPT-5.4、Claude Sonnet 4.6、Gemini 2.5 Flash、DeepSeek V4 Flash、Qwen3-32B和GPT-5 mini)的九种配置进行了评估,揭示了一个陡峭的可靠性悬崖,即使是最好的模型在pass^1上也仅达到约57%,在pass^4上约38%,突显了当前对话智能体部署中的关键差距。所有代码和数据均在此https URL公开。

英文摘要

As recommender systems transition toward agentic, multi-turn conversational interfaces, evaluation paradigms have struggled to keep pace. Current benchmarks often rely on "LLM-as-a-judge" evaluations, which introduce subjectivity, high costs and inconsistency. We present $τ$-Rec, a benchmark for agentic recommender systems that replaces subjective evaluation with verifiable rewards and a reveal-tagged elicitation (RTE) mechanism that controls how task constraints surface during dialogue. By testing agents against structured catalog predicates and employing a pass^k reliability metric, $τ$-Rec provides a systematic test for consistent reasoning. Our evaluation of nine configurations across five model families -- GPT-5.4, Claude Sonnet 4.6, Gemini 2.5 Flash, DeepSeek V4 Flash, Qwen3-32B and GPT-5 mini -- reveals a steep reliability cliff, where even the best model achieves only ~57% at pass^1 and ~38% at pass^4, highlighting a critical gap in current conversational agent deployment. All code and data are publicly available at https://github.com/nbharaths/tau-rec.

2606.10112 2026-06-10 cs.GT cs.AI cs.LG econ.TH 新提交

Duality for Optimal Multi-Item, Multi-Bidder Auction Design: Revenue Certificates through Deep Learning

最优多物品多竞拍者拍卖设计的对偶性:通过深度学习的收入证书

Yanchen Jiang, David C. Parkes, Tonghan Wang

发表机构 * Harvard University(哈佛大学) College of AI, Tsinghua University(清华大学人工智能学院)

AI总结 提出首个直接处理多物品多竞拍者拍卖对偶问题的计算框架,通过神经网络参数化拉格朗日乘子并引入提升技术,生成可证明的收入上界,为连续类型提供近最优性证书。

详情
AI中文摘要

刻画多物品、多竞拍者设置下的收入最优拍卖仍然是一个基本开放问题,除了限制性的二元类型实例外,没有已知的闭式解。这激发了人们对最优拍卖设计的计算方法的兴趣。在本文中,我们引入了第一个直接处理多物品、多竞拍者拍卖和占优策略激励相容(DSIC)的对偶问题的计算框架,生成有证书的收入上界。我们的方法使用神经网络参数化具有结构保证的严格流量守恒性质的拉格朗日乘子,从而通过梯度下降对可行对偶解进行高效优化。为了弥合离散计算方法与连续类型的理论保证之间的差距,我们开发了一种新颖的提升技术,将对偶证书从粗离散化映射到精细细化。我们证明,对于具有连续均匀估值的多物品、多竞拍者拍卖,提升给出了有效的收入上界。此外,我们给出了任意连续分布的广义提升构造,并证明了这些提升对偶在离散极限下收敛到原始连续问题的收入。我们通过恢复典型实例的已知分析机制,验证了该对偶拍卖设计问题的计算框架。对于多物品多竞拍者问题,我们的框架在最优收入与已知最佳DSIC机制之间建立了小差距,提供了近最优性的计算证书。

英文摘要

Characterizing revenue-optimal auctions for multi-item, multi-bidder settings remains a fundamental open problem, with no known closed-form solution existing beyond restrictive binary-type instances. This has motivated interest in computational approaches to optimal auction design. In this paper, we introduce the first computational framework that directly tackles the dual problem for multi-item, multi-bidder auctions and dominant-strategy incentive compatibility (DSIC), generating certified revenue upper bounds. Our approach parametrizes Lagrange multipliers with a structurally guaranteed strict flow-conservation property using neural networks, enabling efficient optimization over feasible dual solutions via gradient descent. To bridge the gap between discrete computational methods and theoretical guarantees for continuous types, we develop a novel lifting technique that maps dual certificates from coarse discretizations to fine refinements. We prove that lifting gives valid revenue upper bounds for multi-item, multi-bidder auctions with continuous uniform valuations. Furthermore, we give a generalized lifting construction for arbitrary continuous distributions and demonstrate that these lifted duals converge to the revenue of the original continuous problem in the discrete limit. We validate this computational framework for the dual auction design problem by recovering known analytical mechanisms for canonical instances. For multi-item multi-bidder problems, our framework establishes a small gap between the optimal revenue and best-known DSIC mechanisms, providing computational certificates of near-optimality.

2606.10106 2026-06-10 cs.SE cs.AI 新提交

What makes a harness a harness: necessary and sufficient conditions for an agent harness

什么使一个工具成为工具:智能体工具的必要和充分条件

Sanderson Oliveira de Macedo

发表机构 * Federal Institute of Goiás(戈亚斯联邦理工学院)

AI总结 本文通过概念分析,定义了智能体工具的必要和充分条件,并提供了包含/排除测试,以区分智能体工具与智能体框架、SDK、IDE插件等。

详情
AI中文摘要

术语“智能体工具”现在在软件工程中随着生成式人工智能广泛流传。它指的是包裹语言模型并将其转化为能够在仓库上行动的编码智能体的层。该用法松散且多义。有时该术语指整个产品(Claude Code, Codex CLI);有时指运行智能体执行任务的评估脚手架(SWE-bench工具);有时它与智能体框架、SDK、IDE插件或编排器混为一谈。缺失的是一个作为工具的参考定义,能够一致地包含和排除案例。我们通过概念分析构建该定义,结合了具有持久标识符的作品和主要灰色文献来源,如官方文档、词汇表和工程报告。我们重构了该术语的谱系,从马具到经典测试工具,到机器学习评估工具,最后到智能体工具。然后我们提出一个构成性定义,陈述一个系统成为智能体工具的必要和充分条件,我们将其操作化为包含和排除测试,并绘制该概念与智能体框架、智能体SDK、IDE插件、评估工具和编排器的边界。我们将该定义应用于六个真实工具(Claude Code, Codex CLI, Aider, Cline, OpenHands和SWE-agent)以及故意的边缘案例;测试一致地包含和排除。最后我们以按设计张力轴组织的研究议程结束。贡献是智能体工具的操作性定义,具有共享词汇,能够指导工程实践和智能体系统的科学比较。

英文摘要

The term agent harness now circulates widely in software engineering with generative artificial intelligence. It names the layer that wraps a language model and turns it into a coding agent able to act on a repository. The usage is loose and polysemous. Sometimes the term denotes the whole product (Claude Code, Codex CLI); sometimes it denotes the evaluation scaffold that runs an agent against tasks (the SWE-bench harness); sometimes it gets conflated with an agent framework, an SDK, an IDE plugin, or an orchestrator. What is missing is a reference definition that works as an instrument, one that includes and excludes cases consistently. We build that definition through a conceptual analysis that combines works with persistent identifiers and primary grey-literature sources, such as official documentation, glossaries, and engineering reports. We reconstruct the genealogy of the term, from the horse's tack to the classic test harness, to the machine-learning evaluation harness, and finally to the agent harness. We then propose a constitutive definition that states the necessary and sufficient conditions for a system to be an agent harness, we operationalize it as an inclusion and exclusion test, and we draw the boundary of the concept against an agent framework, an agent SDK, an IDE plugin, an eval harness, and an orchestrator. We apply the definition to six real harnesses (Claude Code, Codex CLI, Aider, Cline, OpenHands, and SWE-agent) and to deliberate edge cases; the test includes and excludes consistently. We close with a research agenda organized by design tension axes. The contribution is an operational definition of agent harness, with a shared vocabulary, able to guide engineering practice and the scientific comparison of agentic systems.

2606.10091 2026-06-10 cs.CR cs.LG 新提交

SoK: Colluding Adversaries in Machine Learning Pipelines

SoK: 机器学习流水线中的合谋攻击者

Vasisht Duddu, Lipeng He, Asim Waheed, N. Asokan

发表机构 * University of Waterloo(滑铁卢大学) KTH Royal Institute of Technology(皇家理工学院)

AI总结 本文提出一个系统框架,研究机器学习流水线中训练阶段与推理阶段攻击者之间的合谋行为,通过五个实证案例验证了合谋的潜在风险,并讨论了攻击者特征对合谋可能性的影响。

Comments USENIX Security Symposium, 2026

详情
AI中文摘要

机器学习模型容易受到各种安全、隐私和公平性风险的影响。具有不同特征(即目标、知识和能力)的攻击者可以通过执行一种攻击来放大其他攻击,从而进行合谋。现有工作缺乏一个系统框架来探索攻击者之间的合谋,以及研究攻击者特征的影响。我们提出了一个涵盖(a)训练阶段和推理阶段攻击者之间,以及(b)推理阶段攻击者之间的合谋框架。我们的框架考虑了促成攻击者之间合谋的因素。我们提出了一种指南,利用促成因素推测合谋的可能性。我们用它来解释先前的工作,推测未探索的合谋,并实证验证了五个这样的案例。最后,我们讨论了攻击者特征如何影响合谋的可能性。

英文摘要

Machine learning (ML) models are susceptible to various security, privacy, and fairness risks. Adversaries with different characteristics (i.e., objectives, knowledge, and capabilities) can collude by executing one attack to amplify others. Existing work lacks a systematic framework to explore collusion among adversaries, and to study the implications of the adversaries' characteristics. We present a framework covering collusion (a) between train- and inference-time adversaries, and (b) among inference-time adversaries. Our framework accounts for factors enabling collusion between adversaries. We propose a guideline to conjecture about the potential for collusion using enabling factors. We use it to explain prior work, conjecture about unexplored collusions, and empirically validate five such cases. Finally, we discuss how adversaries' characteristics influence the potential for collusion.

2606.10059 2026-06-10 cs.FL cs.CL 新提交

Compiling Rewrite Rules to Finite-State Transducers with the Worsening Trick

使用恶化技巧将重写规则编译为有限状态转录机

Mans Hulden, Michael Ginn

发表机构 * New College of Florida(佛罗里达新学院) University of Colorado(科罗拉多大学)

AI总结 提出基于“恶化技巧”的紧凑编译方案,将重写规则编译为有限状态转录机,支持多种上下文和重写模式,实现简单且易于扩展。

Comments 17 pages, 6 figures, tool track proceedings at CIAA 2026

详情
AI中文摘要

有限状态转录机(FST)对于计算语言学和自然语言处理(NLP)中的字符串重写建模至关重要,特别是对于音韵和形态重写规则。编译形式为 $A \ o B / L \, \_ \, R$ 的一般重写规则(其中 $A$、$B$、$L$ 和 $R$ 是任意正则语言)由于重叠匹配和上下文约束而复杂。传统方法(如 Kaplan 和 Kay 或 Karttunen 的方法)依赖于带有辅助标记的复杂转录机组合。本文提出了一种基于“恶化技巧”的紧凑编译方案:生成所有合法的重写候选,然后过滤那些对于相同输入比其他候选更差的候选。该构造作为 PyFoma 中的内置重写编译器实现,支持多个上下文、任意转录、标记、定向重写、权重和并行重写。得到的公式简短且统一,并且在语义一致的情况下,它们重现了与早期方法相同的规则转录机,同时更易于扩展。该实现已在大量重写语法集合和涵盖主要重写模式的自动回归测试套件上针对 foma 进行了验证,得到的转录机除了状态编号外完全匹配。

英文摘要

Finite-state transducers (FSTs) are essential for modeling string rewriting in computational linguistics and natural language processing (NLP), particularly for phonological and morphological rewrite rules. Compiling general rewrite rules of the form $A \to B / L \, \_ \, R$, where $A$, $B$, $L$, and $R$ are arbitrary regular languages, is complex due to overlapping matches and context constraints. Traditional methods, such as those by Kaplan and Kay or Karttunen, rely on intricate transducer compositions with auxiliary markers. This paper presents a compact compilation scheme based on the "worsening trick'': generate all legal rewrite candidates, then filter candidates that are worse than another candidate for the same input. Implemented as the built-in rewrite compiler in PyFoma, the construction supports multiple contexts, arbitrary transductions, markup, directed rewriting, weights, and parallel rewriting. The resulting formulas are short and uniform, and where semantics coincide, they reproduce the same rule transducers as earlier approaches while remaining easier to extend. The implementation has been validated against foma on both a substantial collection of rewrite grammars and an automated regression suite covering the major rewrite modalities, with the resulting transducers matching exactly apart from state numbering.

2606.10050 2026-06-10 cs.GR cs.CV 新提交

Continuous Neural Reparameterization as a Deep Geometric Prior for Robust Fixed-Chart UV Repair

连续神经重参数化作为鲁棒固定图表UV修复的深度几何先验

Mohammad Sadegh Salehi

发表机构 * Zero One Creative London, UK(伦敦零一创意公司)

AI总结 提出将固定图表UV展开视为连续神经重参数化,使用未训练的SIREN网络优化几何目标,结合谱初始化、Tutte残差预热等策略,实现零翻转的鲁棒图表求解。

详情
AI中文摘要

传统的UV展开依赖于几何畸变能量的直接优化,可能因无效初始化、局部最小值或拓扑翻转而失败。我们将固定图表UV展开重新定义为连续神经重参数化:一个未训练的SIREN将每个顶点的网格特征映射到UV坐标,其权重针对几何目标进行优化。实际贡献是一个鲁棒的图表求解器配方,结合了Laplace-Beltrami谱输入、Tutte残差预热、$C^2$行列式扩展、单射性屏障以及有效性检查的重试/回退路由,而非声称任何单一组件能保证有效性或应取代重切割方法。NTK-LBO诊断表明,谱条件改变更新几何,尤其在初始化和中秩子空间,但本身不能预测图表成功。在紧凑预切割图表和47图表分层Thingi10K/xatlas切割基准上,神经求解器在所有紧凑图表上产生零翻转,并在42/47个分层求解中有效零翻转。与BFF和OptCuts的比较明确了范围:允许时重切割可以更快且畸变更低,而神经求解器针对提供图表的有效性和验证优先的图集构建。在Amara Spatial生成的网格上,完整的图集构建路径在25个资产集上提供打包图集覆盖,并在大规模Rust图集运行中通过回退路由实现1000/1000严格局部有效且零UV翻转的图集。

英文摘要

Traditional UV unwrapping relies on direct optimization of geometric distortion energies and can fail through invalid initialization, local minima, or topological foldovers. We recast fixed-chart UV unwrapping as continuous neural reparameterization: an untrained SIREN maps per-vertex mesh features to UV coordinates, and its weights are optimized for a geometric objective. The practical contribution is a robust chart-solver recipe, combining Laplace--Beltrami spectral inputs, Tutte residual warm-up, a $C^2$ determinant extension, an injectivity barrier, and validity-checked retry/fallback routing, rather than a claim that any single component guarantees validity or that recutting methods should be replaced. NTK--LBO diagnostics show that spectral conditioning changes update geometry, especially at initialization and mid-rank subspaces, but does not by itself predict chart success. On compact pre-cut charts and a 47-chart stratified Thingi10K/xatlas-cut benchmark, the neural solver produces zero flips on all compact charts and 42/47 valid zero-flip stratified solves. BFF and OptCuts comparisons sharpen the scope: recutting can be faster and lower-distortion when allowed, while the neural solver targets supplied-chart validity and validation-first atlas construction. On Amara Spatial generated meshes, the full atlas construction path gives packed-atlas coverage on a 25-asset set and 1000/1000 strict locally valid atlases with zero UV flips in a large-scale Rust atlas run after fallback routing.

2606.10008 2026-06-10 cs.NE cs.LG 新提交

Spiking Neural Network inference on FPGAs with hls4ml

基于hls4ml的FPGA上脉冲神经网络推理

Barry M. Dillon

发表机构 * ISRC, Ulster University(ISRC、乌斯特大学)

AI总结 本文扩展hls4ml工具,实现将PyTorch训练的脉冲神经网络(SNN)部署到FPGA固件上,在Heidelberg Spiking Digits数据集上达到约34μs的推理延迟。

详情
AI中文摘要

脉冲神经网络(SNN)提供了一种自然的时序机器学习框架。它们的神经元维持内部状态并通过离散脉冲传播信息,从而实现低延迟的时序推理。尽管SNN通常与异步神经形态处理器相关联,但许多科学实时推理系统依赖于传统的同步现场可编程门阵列(FPGA)和高级综合(HLS)工作流程。在本文中,我们提出了hls4ml的扩展,使得在PyTorch中训练的SNN能够以时钟驱动方式部署到FPGA固件上。我们使用在Heidelberg Spiking Digits数据集上训练的密集量化SNN演示了该工作流程,其推理延迟约为34μs。通过软件参考比较、HLS C仿真、HLS综合、导出和Vivado综合报告,我们验证了生成的设计。这项工作将hls4ml工具包开放给神经形态计算,允许对SNN模型进行流线型优化、综合和部署,用于实时推理。

英文摘要

Spiking Neural Networks (SNNs) provide a naturally temporal machine-learning framework. Their neurons maintain an internal state and propagate information through discrete spikes, enabling low-latency temporal inference. Although SNNs are often associated with asynchronous neuromorphic processors, many scientific real-time inference systems rely on conventional synchronous field-programmable gate arrays (FPGAs) and high-level synthesis (HLS) workflows. In this paper we present an extension of hls4ml that enables clock-driven deployment of SNNs trained in pytorch onto FPGA firmware. We demonstrate the workflow using a dense quantised SNN trained on the Heidelberg Spiking Digits dataset where it achieves inference latencies of approximately $34μ$s. We validate the generated design through software reference comparisons, HLS C simulation, HLS synthesis, export, and Vivado synthesis reports. This work opens up the hls4ml toolkit to neuromorphic computing, allowing streamlined optimisation, synthesis, and deployment of SNN models for real-time inference.

2606.09957 2026-06-10 cs.SE cs.LG 新提交

Data-aware Static Analysis: Improving Detection of Semantic Faults in Machine Learning Code Using Data Characteristics

数据感知静态分析:利用数据特征改进机器学习代码中语义故障的检测

Willem Meijer, Kristian Sandahl, Dániel Varró

发表机构 * Knut and Alice Wallenberg Foundation(Knut和Alice沃尔贝格基金会) Software Center Project 61(软件中心项目61) Vinnova CoDig competence center(Vinnova CoDig专业中心)

AI总结 提出一种数据感知静态分析方法,结合数据流与控制流分析及API契约,在编写代码时而非训练后检测机器学习代码中的语义故障,如误用未缩放数据训练尺度敏感模型。

Comments 6 pages, 3 figures, 2 listings, 1 table; To be published in "2026 IEEE/ACM 48th International Conference on Software Engineering (ICSE-NIER '26)"

详情
AI中文摘要

机器学习模型使用中的语义故障是机器学习开发者常见的问题,会导致预测次优、计算成本高或输出错误。例如,有人可能错误地使用未缩放的数据来训练尺度敏感模型。机器学习开发者在训练模型后手动分析结果来检测这些故障,这使得过程效率低下。我们提出了一种新颖的数据感知静态分析方法来检测机器学习代码中的语义故障,使开发者能够在编写代码时而不是在训练模型后揭示这些错误。我们的方法结合了数据流和控制流分析以及API契约,能够在高抽象层次上对机器学习代码进行数据感知推理。通过分析真实世界的机器学习笔记本样本,我们展示了我们解决方案的潜力,发现我们可以检测需要数据感知方法的故障。

英文摘要

Semantic faults specific to the use of machine learning models are a common problem for machine learning developers, causing suboptimal predictions, high computational cost, or incorrect outputs. For example, one may erroneously use unscaled data to train a scale-sensitive model. Machine learning developers detect these faults after training their models and manually analyzing the results, making it an inefficient process. We propose a novel data-aware static analysis approach to detect semantic faults in machine learning code, allowing developers to reveal these bugs while writing code instead of after training the model. Our approach uses combined data and control flow analysis, and API contracts, enabling data-aware reasoning about machine learning code at a high level of abstraction. We highlight the potential of our solution by analyzing a sample of real-world machine learning notebooks, finding that we can detect faults that require a data-aware approach.

2606.09956 2026-06-10 cs.SE cs.LG 新提交

Multi-task LLMs for Bug Classification: Efficient Inference with Auxiliary Decoding Heads

多任务大语言模型用于缺陷分类:基于辅助解码头的高效推理

Nikolai Rozanov

发表机构 * Recurse Ltd. Department of Computing, Imperial College London(帝国理工学院计算机系)

AI总结 提出一种轻量级多任务大语言模型(MLC),通过令牌对齐算法和优化训练策略,实现全文件上下文下的行级缺陷定位,性能与代理方法相当但推理延迟降低数个数量级。

Comments 8 pages, 6 pages appendix

详情
AI中文摘要

基于大语言模型的代码生成技术被迅速采用,极大地加速了软件开发,但有效的验证方法仍然严重不足。现有的缺陷定位技术要么成本过高(每个文件需要数分钟的代理推理和数千个生成令牌),要么以粗粒度的函数级别运行,不适合精确调试。而专注于行级粒度且更轻量的工作往往在性能或上下文大小上受到限制。我们提出了一种新颖的行级缺陷定位方法,通过三个关键贡献解决了这些限制:(1)一种令牌对齐算法,克服了先前工作中的基本令牌化挑战;(2)一种轻量级多任务大语言模型用于缺陷定位(MLC),实现高效的行级缺陷分类;(3)一种针对多行预测的优化训练策略。我们的方法在全文件上下文下的行级缺陷定位中,在类似设置中达到了最先进的性能。同时,在Defects4J和PypiBugs基准测试中,我们达到了与代理方法相当的性能,同时将推理延迟降低了数个数量级,每个文件仅需生成一个令牌。我们还通过引入并在一个小型域外评估数据集(Python)上进行评估,进一步证明了强大的泛化能力。我们将在论文被接收后开源我们的代码、模型和数据集。

英文摘要

The rapid adoption of LLM-powered code generation has dramatically accelerated software development, yet effective verification methods remain severely underdeveloped. Existing bug localization techniques are either prohibitively expensive, requiring minutes of agentic reasoning and thousands of generated tokens per file, and/or operate at coarse function-level granularity unsuitable for precise debugging. While works that focus on line-level granularity and are more light-weight are often limited in their performance or context size. We introduce a novel line-level bug localization approach that addresses these limitations through three key contributions: (1) a token alignment algorithm that overcomes fundamental tokenization challenges in previous work, (2) a lightweight multi-task LLM for bug localization (MLC) enabling efficient line-level bug classification, and (3) an optimized training recipe for multi-line prediction. Our method achieves state-of-the-art performance among similar setups on line-level bug localization with full-file context. At the same time we reach comparable performance to agentic approaches on Defects4J and PypiBugs benchmarks while reducing inference latency by orders of magnitudes, requiring only a single generated token per file. We further demonstrate strong generalization by introducing and evaluating on a small out-of-domain evaluation datasets in Python. We will open source our code, models, and datasets upon acceptance.

2606.09935 2026-06-10 cs.CR cs.AI 新提交

GitInject: Real-World Prompt Injection Attacks in AI-Powered CI/CD Pipelines

GitInject: AI驱动的CI/CD流水线中的真实提示注入攻击

Jafar Isbarov, Umid Suleymanov, Ilia Shumailov, Murat Kantarcioglu

发表机构 * Virginia Tech(弗吉尼亚理工学院) AI Sequrity Company(AI安全公司)

AI总结 提出GitInject框架,在真实GitHub工作流中评估AI代理的提示注入漏洞,发现所有测试提供商均存在结构性风险,并给出最低成本防护措施。

详情
AI中文摘要

AI代理越来越多地嵌入持续集成和持续交付/部署(CI/CD)流水线中,以自主审查拉取请求(PR)、分类问题和维护代码库。这些代理在操作时摄入不可信内容,同时拥有提升的仓库权限,使其成为具有供应链后果的提示注入攻击的自然目标。我们提出GitInject,一个开源框架,用于评估真实、活跃的GitHub工作流(CI/CD流水线的广泛部署实例)中的提示注入漏洞。与先前模拟工具调用的代理安全基准不同,GitInject提供临时仓库并触发实际工作流运行,因此沙箱约束、凭证处理和权限边界的行为与生产环境完全一致。使用GitInject,我们研究了四个AI提供商的工作流配置,并记录了十一种命名攻击,涵盖配置文件注入、凭证窃取、判断操纵和可用性。我们发现,所有测试的提供商在其默认配置中至少容易受到一类攻击,且最关键的漏洞是结构性的:它们源于CI/CD基础设施处理凭证和配置文件的方式,而非任何特定模型的行为。对于每个确认的攻击类别,我们确定了最低成本的工作流级对策,并分析了其覆盖范围和局限性。GitInject已公开发布,以促进这一方向的进一步研究。

英文摘要

AI-powered agents are increasingly embedded in continuous integration and continuous delivery/deployment (CI/CD) pipelines to autonomously review pull requests (PRs), triage issues, and maintain codebases. These agents ingest untrusted content while operating with elevated repository permissions, making them a natural target for prompt injection attacks with supply chain consequences. We present GitInject, an open-source framework for evaluating prompt injection vulnerabilities in real, live GitHub workflows, a widely deployed instance of CI/CD pipelines. Unlike prior agent security benchmarks that simulate tool calls, GitInject provisions ephemeral repositories and triggers actual workflow runs, so that sandbox constraints, credential handling, and permission boundaries behave exactly as in production. Using GitInject, we study workflow configurations across four AI providers and document eleven named attacks spanning config-file injection, credential exfiltration, judgment manipulation, and availability. We find that all tested providers are susceptible to at least one attack class in their default configuration, and that the most critical vulnerabilities are structural: they arise from how CI/CD infrastructure handles credentials and configuration files, not from any specific model's behavior. For each confirmed attack class, we identify the minimum-cost workflow-level countermeasure and analyze its coverage and limitations. GitInject is released publicly to facilitate further research in this direction.

2606.09931 2026-06-10 cs.GT cs.AI 新提交

A Note on the Strategic Confinement Problem

关于战略约束问题的一个注记

Christian Schroeder de Witt

发表机构 * University of Oxford(牛津大学)

AI总结 本文引入战略约束问题,指出当通信方为具有共享协调资源的战略智能体时,即使信道容量极小,也可能导致机密信息的高影响泄露,并论证学习型战略智能体系统自然实例化该问题。

详情
AI中文摘要

Lampson的约束问题询问如何防止处理机密信息的程序将其泄露给第三方。我们引入战略约束问题,当通信方是具有共享协调资源的战略智能体时出现该问题。在此设置中,剩余通信能力可以集中在机密数据的低熵、高影响谓词上。因此,信息泄露的界限不一定导致最坏情况危害的相应界限:一个容量可忽略的信道仍可能足以选择破坏性结果。我们认为,学习型战略智能体系统自然实例化此问题,因为它们不允许完整的行为规范,它们习得的惯例通常无法被外部观察者预测或重现,并且足够能力的智能体可以构建难以检测或消除的隐蔽通信方案。因此,我们的贡献不是一种新的通信理论,而是在存在战略智能体的情况下对约束的重新解释。经典约束限制了可能流动的信息;战略约束强调这不一定限制战略智能体可以共同实现的目标。

英文摘要

Lampson's confinement problem asks how to prevent a program that processes confidential information from leaking it to a third party. We introduce the strategic confinement problem, which arises when the communicating parties are strategic agents with shared coordination resources. In this setting, residual communication capacity can be concentrated on low-entropy, high-impact predicates of the confidential data. Consequently, bounds on information leakage need not induce corresponding bounds on worst-case harm: a channel with negligible capacity may still suffice to select damaging outcomes. We argue that systems of learnt strategic agents naturally instantiate this problem because they do not admit complete behavioural specifications, their learnt conventions generally cannot be predicted or reproduced by an external observer, and sufficiently capable agents can construct covert communication schemes that are difficult to detect or eliminate. Our contribution is therefore not a new theory of communication, but a reinterpretation of confinement in the presence of strategic agents. Classical confinement bounds what information may flow; strategic confinement highlights that this need not bound what strategic agents can jointly achieve.

2606.09909 2026-06-10 cs.CR cs.AI cs.CV 新提交

Bypassing Copyright Protection in Diffusion-based Customization via Two-Stage Latent Feature Optimization

通过两阶段潜在特征优化绕过基于扩散的定制中的版权保护

Ziang Xu, Wenbo Yu, Hongyao Yu, Hao Fang, Jiawei Kong, Bin Chen, Hao Wu, Shu-Tao Xia, Zhiyong Wu

发表机构 * Harbin Institute of Technology, Shenzhen(哈尔滨工业大学(深圳)) Tsinghua Shenzhen International Graduate School(清华大学深圳国际研究生院)

AI总结 提出两阶段潜在特征优化(TS-LFO)攻击方法,通过潜在去噪和重建阶段恢复被防御破坏的映射,有效绕过扩散模型定制中的版权保护。

Comments accepted by KDD 2026

详情
AI中文摘要

随着基于扩散的定制中版权侵权问题的日益关注,对抗性攻击已成为一种突出的防御策略,以防止个性化图像生成中的恶意内容伪造。然而,当前的防御通常会在潜在扩散模型(LDM)的潜在空间中引入持久扰动,这些扰动仍然容易被对手自适应绕过。在本文中,我们引入了两阶段潜在特征优化(TS-LFO),一种针对受保护的基于扩散的定制的高效且有效的版权窃取攻击。我们首先观察到现有防御主要破坏输入图像与其潜在表示之间的映射,从而降低模型生成个性化输出的能力。为了应对这一点,TS-LFO通过两阶段优化过程恢复被破坏的映射。在潜在去噪阶段,我们通过联合最小化潜在-图像对齐损失和具有时间步长依赖权重的潜在扩散损失来增强潜在代码与输入图像之间的语义一致性,有效抑制防御引入的高频噪声。在潜在重建阶段,我们使用像素级约束恢复低频语义信息以细化潜在特征。大量实验表明,TS-LFO持续绕过最先进的(SOTA)版权防御,并在各种设置下优于SOTA版权攻击,如DiffPure、GrIDPure和IMPRESS。

英文摘要

With the growing concerns over copyright infringement in diffusion-based customization, adversarial attacks have emerged as a prominent defense strategy to prevent malicious content forgery in personalized image generation. However, current defenses typically introduce persistent perturbations in the latent space of Latent Diffusion Models (LDMs), which remain susceptible to adaptive bypasses by adversaries. In this paper, we introduce Two-Stage Latent Feature Optimization (TS-LFO), an efficient and effective copyright-stealing attack against protected diffusion-based customization. We begin by observing that existing defenses primarily disrupt the mapping between input images and their latent representations, thereby degrading the model's ability to produce personalized outputs. To counteract this, TS-LFO restores the broken mapping through a two-stage optimization process. In the Latent Denoising Stage, we enhance semantic consistency between latent codes and input images by jointly minimizing a Latent-Image Alignment Loss and a Latent Diffusion Loss with timestep-dependent weights, effectively suppressing the high-frequency noise introduced by defenses. In the Latent Reconstruction Stage, we recover low-frequency semantic information using pixel-level constraints to refine the latent features. Extensive experiments show that TS-LFO consistently bypasses state-of-the-art (SOTA) copyright defenses and outperforms SOTA copyright attacks such as DiffPure, GrIDPure and IMPRESS across diverse settings.

2606.09908 2026-06-10 cs.CR cs.AI 新提交

IDP-Bench: Benchmarking ability of LLMs to protect personal information in interdependent privacy contexts

IDP-Bench:评估大语言模型在相互依赖隐私上下文中保护个人信息能力的基准

Ayana Hussain, Soumya Sharma, Golnoosh Farnadi, Nicholas Vincent, Héber Hwang Arcolezi, Ulrich Aïvodji

发表机构 * Simon Fraser University(西蒙弗雷泽大学) McGill University(麦吉尔大学) Mila(Mila研究所) ÉTS

AI总结 提出IDP-Bench,首个基于情境完整性框架的LLM相互依赖隐私基准,评估8个开源模型在三个推理层次上的表现,发现模型在识别CI参数和IDP特定参数方面存在弱点。

详情
AI中文摘要

大语言模型(LLMs)正被广泛部署为个人AI助手,可访问敏感用户数据,这使得隐私成为其设计和评估的主要挑战。先前的工作主要关注个体层面的风险,忽视了\textbf{相互依赖隐私(IDP)}——即一个人的数据可能在未经其知情或同意的情况下被他人泄露。我们通过引入\textbf{IDP-Bench}来填补这一空白:这是首个针对IDP场景的LLM基准,基于情境完整性(CI)框架。我们使用两个LLM评判员,评估了八个开源LLM在三个IDP推理层次上对IDP场景的理解。结果显示,模型对共同所有权有较强的识别能力(6/8模型超过90%),但在识别CI参数(信息属性、主要主体)和IDP特定参数(如次要主体)方面持续存在弱点,其中7/8模型得分低于74%。模型在判断共享适当性方面也存在困难(5/8模型得分低于77%)。虽然判断共享适当性的能力随规模提升而提高,但较小模型的性能趋于下降,且IDP特定问题的提示敏感性仍然很高——这凸显了在LLM隐私研究中需要更有针对性地研究IDP。数据和代码可在此处获取:\href{ this https URL }{here}。

英文摘要

Large language models (LLMs) are becoming widely deployed as personal AI assistants with access to sensitive user data, making privacy a major challenge for their design and evaluation. Prior work focuses mainly on individual-level risks, overlooking \textbf{interdependent privacy (IDP)}--where one person's data may be revealed by others without their knowledge or consent. We address this gap by introducing \textbf{IDP-Bench}: the first LLM benchmark for IDP scenarios, grounded in the Contextual Integrity (CI) framework. We evaluate eight open-source LLMs on their understanding of IDP scenarios across three levels of IDP reasoning using two LLM judges. Results show strong co-ownership recognition (6/8 models exceed 90\%) but persistent weaknesses in identifying CI parameters (information attribute, primary subject) and IDP-specific parameters such as secondary subjects, where 7/8 models score below 74\%. Models also struggle to judge sharing appropriateness (5/8 scoring below 77\%). While the ability to judge the appropriateness of sharing improves with scale, performance tends to decline in smaller models, and prompt sensitivity remains high on IDP-specific questions--highlighting the need for more targeted study of IDP in LLM privacy research. Data \& code available \href{https://github.com/tisl-lab/Interdependent_Privacy_Bench}{here}.