arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2136
专题追踪
2606.10809 2026-06-10 q-bio.PE 新提交

Chaos and stability in the marine trophic network: the importance of interactions over complexity

海洋营养网络中的混沌与稳定性:相互作用比复杂性更重要

Ilaria Cunico, Guido Occhipinti, Gregor Fussmann, Paolo Lazzari

AI总结 通过数值模拟研究复杂海洋营养网络动力学,发现较长的营养链和更多的消费者增加混沌性,而杂食性相互作用促进稳定性,表明相互作用而非复杂性是稳定性的关键驱动因素。

详情
AI中文摘要

理解现实世界复杂网络的动力学对于评估其可预测性、恢复力以及改善生态系统管理至关重要,尤其是在气候变化的背景下。生态网络中稳定性与复杂性之间的关系在文献中仍存在争议。在这项建模研究中,我们探讨了一个以多种营养相互作用和环境约束为特征的复杂海洋营养网络是否表现出主要稳定、周期或混沌动力学。我们将微生物环纳入营养网络模型,该模型包括一到三个初级生产者、一个或两个消费者,以及多达三个营养级的捕食者。微生物环是一个关键过程,其中细菌将来自较高营养级的碎屑回收为可供初级生产者生长的营养物质,确保系统内的质量守恒。我们进行数值模拟以研究网络的动态行为,通过关闭物种间的捕食-被捕食链接并探索高维参数空间,考察了几种配置。我们的结果表明:(i) 较长的营养链和 (ii) 更多的消费者增加了系统的混沌性,而 (iii) 杂食性相互作用促进了稳定性。值得注意的是,许多配置表现出高比例的混沌行为。反馈环分析表明,负反馈和正反馈之间的平衡在系统趋向稳态的过程中起着关键作用。这项研究表明,相互作用和反馈,而非复杂性,是稳定性的关键驱动因素,指出了稳定性-复杂性关系的不明确性,反而强调了稳定性-相互作用的依赖性。混沌动力学也可能发挥重要作用,对可预测性和生态系统管理具有潜在影响。

英文摘要

Understanding the dynamics of real world complex networks is crucial for assessing their predictability, resilience, and improving ecosystem management, especially in the context of climate change. The relationship between stability and complexity in ecological networks is still debated in the literature. In this modeling study, we investigate whether a complex marine trophic network, characterized by multiple trophic interactions and environmental constraints, exhibits predominantly stable, periodic or chaotic dynamics. We incorporate the microbial loop into a trophic network model, which includes one to three primary producers, one or two consumers, and up to three trophic levels of predators. The microbial loop is a key process in which bacteria recycle detritus from higher trophic levels into nutrients available for the growth of primary producers, ensuring mass conservation within the system. We perform numerical simulations to investigate the dynamic behavior of the network, exploring several configurations by turning off predator prey links between species and varying the high dimensional parameter space. Our results show that (i) longer trophic chains and (ii) a higher number of consumers increase system chaoticity, whereas (iii) omnivorous interactions promote stability. Notably, many of the configurations exhibit high percentages of chaotic behavior. Feedback loop analysis suggests that the balance between negative and positive interactions plays a key role in the convergence of the system toward a steady state. This study shows that interactions and feedback, rather than complexity, are key drivers of stability, pointing to the absence of a clear stability complexity relationship and instead highlighting a stability interaction dependence. Chaotic dynamics may also play an important role, with potential implications for predictability and ecosystem management.

2606.10636 2026-06-10 q-bio.OT 新提交

Compositional proofreading through critical self-tuning

通过临界自调谐的组合校对

Omer Karin

AI总结 提出临界调谐机制通过竞争实现多组分系统的校对,将群体集中于持久组分,并预测去钉扎转变可能解释癌症、免疫缺陷和衰老中的异常激活。

详情
AI中文摘要

高维多组分系统,包括免疫和表观遗传库,必须选择性地保留稀有、有益的组分,同时清除大量涌入的次优变体。我们证明,通过竞争对组分控制参数进行临界调谐,自然地在这些系统中实现了校对。对共享输入的竞争将系统钉扎在最持久物种的边缘稳定阈值上。这赋予优势物种更长的寿命,将群体集中在优势组分中,同时迫使稳定性较差的变体进入快速漂移驱动的更替。当总驱动力超过特征尺度时,这种钉扎失效,产生非选择性状态,其中组分寿命随总驱动力呈通用幂律标度。将该框架应用于生物记忆,我们在浆细胞积累动力学中识别出这种效应的特征,并提出去钉扎转变可能代表跨生物领域的失效点,包括癌症、免疫缺陷以及衰老过程中有害基因组元件的异常激活。

英文摘要

High-dimensional multicomponent systems, including immune and epigenetic repertoires, must selectively retain rare, beneficial components while purging a massive influx of suboptimal variants. We demonstrate that critical tuning of component control parameters through competition naturally implements proofreading in these systems. Competition for shared inputs pins the system to the marginal stability threshold of the most persistent species. This grants dominant species extended lifetimes, concentrating the population into dominant components while forcing less-stable variants into rapid drift-driven turnover. When aggregate drive exceeds a characteristic scale, this pinning fails, producing a non-selective state where component lifetimes scale as a universal power law with aggregate drive. Applying this framework to biological memory, we identify the hallmarks of this effect in plasma cell accumulation dynamics and propose that de-pinning transitions may represent failure points across biological domains, including cancer, immunodeficiencies, and the aberrant activation of harmful genomic elements during ageing.

2606.10605 2026-06-10 q-bio.PE 新提交

Modeling pest dynamics in trap cropping to improve yield: the effects of attraction, retention, and land allocation

模拟诱集作物中的害虫动态以提高产量:吸引力、滞留力和土地分配的影响

Matthew H Holden

AI总结 通过产量最大化模型,研究诱集作物的吸引力、害虫滞留力及土地分配比例对害虫防治效果和可行性的影响,发现降低害虫从诱集作物向主作物的扩散可大幅减少所需诱集面积。

详情
AI中文摘要

诱集作物通过将害虫吸引离开主作物来减少对主作物的损害。然而,当害虫重新扩散回主作物时,这种保护作用会被削弱。本文重点关注防止这种回流的重要性,表明有效的诱集作物取决于害虫被吸引到诱集植物的强度以及它们离开诱集植物的频率。结合用于诱集作物的土地比例,这些过程决定了诱集作物在商业规模上的有效性和可行性。我们使用一个简单的产量最大化框架来形式化这种关系,其中种植者权衡害虫抑制效益与牺牲给诱集作物的土地。模型表明,当从诱集植物扩散的害虫数量等于从主作物扩散的数量时,最优诱集覆盖率可能超过景观的20%至30%,这一水平很少被种植者接受。然而,将害虫从诱集植物扩散的比例降低到主作物扩散的四分之一,可将最优所需诱集面积降至约5%,从而使诱集作物从不可行变为可行。理解这些关系可以指导诱集作物的设计,从植物选择到减少害虫移动的针对性干预措施,以最小化损害、最大化产量,并使诱集作物成为可持续害虫管理的可靠组成部分。

英文摘要

Trap crops reduce damage to a cash (main) crop by attracting pests away from it. Yet this protection is weakened when pests disperse back into the cash crop. In this paper, we focus on the importance of preventing this backflow, showing that effective trap cropping depends jointly on how strongly pests are attracted to trap plants and how rarely they leave them. Together with the proportion of the field devoted to trap plants, these processes determine both the efficacy and feasibility of trap cropping at commercial scales. We formalise this relationship using a simple yield-maximisation framework, in which growers weigh pest suppression benefits against the land sacrificed to trap plants. The model shows that when dispersal from trap plants equals that from the cash crop, optimal trap coverage can exceed 20 to 30 percent of the landscape, levels rarely acceptable to growers. However, reducing pest dispersal off trap plants to just one-quarter of cash crop dispersal lowers the optimal required trap area to approximately 5 percent, transforming trap cropping from impractical to feasible. Understanding these relationships can guide trap-cropping design, from plant choice to targeted interventions that reduce pest movement, to minimise damage, maximise yield, and make trap cropping a reliable component of sustainable pest management.

2606.10109 2026-06-10 q-bio.OT 新提交

When is Enough Enough? A Proposed Termination Point for the Number of Replicates in Computational Simulations

何时足够?计算模拟中重复次数的终止点提议

Eric T. Lofgren, Kellen Myers, Nina H. Fefferman

AI总结 针对计算模拟中通过增加试验次数来获得统计显著性的问题,提出Ω检验作为确定模拟重复次数的终止标准,以提高效率并统一理解。

详情
AI中文摘要

计算模拟为计算机实验提供了强大的工具包。然而,尽管该领域已经为这类模型的设计和实施制定了最佳实践,但在讨论如何理解和/或解释其结果时仍存在模糊性,因为其固有的能力可以通过简单地增加模拟试验次数来压倒传统的频率统计。这从两个方面使学科失效:首先,它使社区不确定什么是统一理解的最佳实践;其次,它可能使计算研究负担过重,这些研究消耗时钟周期仅仅是为了确保“足够的运行以满足同行”,而没有任何关于“足够”定义的理论基础。我们提出了一个简单直接的停止模拟额外试验的标准,即Ω检验,其设计类似于传统频率P检验的功能。社区采用合理且统一的标准将允许更高效的计算实验,并清晰地沟通/解释以此方式发现的发现。

英文摘要

Computational simulation provides a powerful toolkit for in silico experimentation. However, while the field has developed best practices for the design and implementation of such models, there remains ambiguity in discussions about how to understand and/or interpret their results due to their inherent ability to overwhelm traditional frequentist statistics by simply increasing the number of trials simulated. This fails the discipline in two ways: first, it leaves the community unsure of what constitutes a best practice for uniform understanding, and second, it potentially overburdens computational studies that burn clock cycles solely to ensure "enough runs to satisfy peers" without any theoretical underpinning for a definition of "enough". We propose a simple and straightforward standard for when to stop simulating additional trials, the Ω test, designed to be analogous to the function of traditional frequentist P-tests. Community adoption of a reasonable and uniform standard will permit more efficient computational experimentation and clearly communication/interpretation of the findings discovered in this way.

2606.10631 2026-06-10 econ.GN cs.CR q-fin.EC 新提交

From Transactions to Records: Reconceptualizing Blockchain Systems through a Lifecycle Lens

从交易到记录:通过生命周期视角重新概念化区块链系统

Tom Barbereau, Ruggero Montalto, Christian Beyer

AI总结 本文引入ISO 15489-1:2016记录管理原则,提出区块链数据的七阶段生命周期模型,应用于比特币、同质化代币和非同质化代币,论证区块链系统不仅是交易基础设施,更是具有独特特征的记录管理系统。

详情
AI中文摘要

当前的区块链研究和分析倾向于优先考虑可观察的链上交易,掩盖了加密货币创建、公开、保留和处置的过程。为此,本文从ISO 15489-1:2016的记录管理原则出发,考虑分布式账本技术。首先指定相似之处——即交易作为“记录”,加密资产单元作为“信息资产”,区块链作为“聚合”——我们引入了区块链数据的七阶段生命周期。我们将该框架应用于比特币、同质化代币和非同质化代币。在此基础上,我们认为区块链系统不仅仅是交易基础设施,而是具有独特特征的记录管理系统。我们讨论了链上/链下边界和隐私增强技术如何使生命周期可见性复杂化,这对加密犯罪研究和调查尤为重要。作为一个元级框架,生命周期视角能够定位现有研究,按阶段分解法律、监管、技术和运营挑战,并为区块链治理、分析和监管提供生命周期感知的方法。

英文摘要

Current blockchain research and analytics tend to prioritize observable on-chain transactions, obscuring the processes through which cryptocurrencies are created, publicised, retained, and disposed of. In response, this paper considers distributed ledger technologies from records management principles in ISO 15489-1:2016. Setting off by specifying the parallels -- that is transactions as "records", crypto-asset units as "information assets", and blockchains as "aggregations" -- we introduce a seven-stage lifecycle for blockchain data. We apply the framework to Bitcoin, a fungible token, and a non-fungible token. On this basis, we argue that blockchain systems are not merely transactional infrastructures but record management systems with distinctive characteristics. We discuss how the on-chain/off-chain boundary and privacy-enhancing technologies can complicate lifecycle visibility, with particular relevance for crypto-crime research and investigation. As a meta-level framework, the lifecycle perspective enables positioning existing research, decomposing legal, regulatory, technological, and operational challenges by stage, and informing lifecycle-aware approaches to blockchain governance, analytics, and regulation.

2606.11175 2026-06-10 cs.CR 新提交

Anchors that Don't Lift: Understanding Supply Chain Driven Kernel Lock-In and Governance-Mediated Mitigation Strategies in SOHO Devices

无法提升的锚点:理解SOHO设备中供应链驱动的内核锁定及治理中介的缓解策略

Ritwik Badola, Rajdeep Ghosh, Ashita Gupta, Chester Rebeiro, Mainack Mondal

AI总结 通过分析306款SOHO设备的900多个固件,发现SoC供应商的SDK导致内核版本锁定,造成漏洞债务沿供应链传递,而社区参与是唯一有效的缓解途径。

Comments This extended version of our USENIX Security '26 paper on supply chain analysis of SOHO devices includes appendices for interested readers

详情
AI中文摘要

小型办公室/家庭办公室(SOHO)设备广泛流行,但由于固件中的安全漏洞经常受到攻击,影响数千台设备。这些安全漏洞通常源于SOHO设备固件中包含的过时Linux内核版本。自然,先前的工作通过简单的Linux版本提取和基于版本号的漏洞映射来审计此问题的范围和影响。然而,尚不清楚这些预期漏洞中有多少实际存在于高度定制的SOHO内核中,以及是否存在更新SOHO固件中Linux内核的障碍。为了解决这一差距,我们使用高精度的基于模板的CVE检测机制,对来自这些设备的900多个固件的GPL源代码发布版,揭示了306款SOHO设备中实际存在的内核相关漏洞。接下来,作为首次尝试,我们大规模追踪了这些易受攻击的SOHO设备的供应链,并将内核锁定识别为一个重要的安全问题——由于使用的片上系统(SoC)SDK,SOHO供应商实际上被锁定在特定(通常是较旧)的内核版本上。这种内核锁定产生了漏洞债务,该债务沿供应链从SoC供应商继承到固件创建者(ODM/OEM),再到路由器/IP摄像头供应商,最终由最终用户承担。我们数据集中的所有五家SoC供应商都使用了Linux内核已达到生命周期结束(EoL)超过一年的SDK。最后,我们通过分析社交媒体帖子、法规和社区努力,探讨了个人、监管和社区治理的缓解潜力。我们的结果表明,法规合规性不足,只有与社区合作进行内核升级的SoC供应商才提供了可行的缓解途径。这项工作的数据和代码可在https://this https URL获取。

英文摘要

Small Office/Home Office (SOHO) devices are widely popular, yet often attacked due to security vulnerabilities in their firmware, affecting thousands of devices. These security vulnerabilities often stem from outdated Linux kernel versions included in SOHO device firmware. Naturally, prior work audited the extent and impact of this issue by simple Linux version extraction and version number based vulnerability mapping. However, it is unclear how many of these anticipated vulnerabilities actually exist in the heavily customized SOHO kernels and if there are any barriers towards updating Linux kernels in SOHO firmwares. To address this gap, we uncover actual kernel-related vulnerabilities found in 306 SOHO devices using a high-precision template-based CVE detection mechanism on GPL source releases of more than 900 firmwares from these devices. Next, as a first, we traced the supply chain of these vulnerable SOHO devices at scale and identify kernel lock-in as a significant security issue -- SOHO vendors are effectively locked to specific (often older) kernel versions due to the system-on-chip (SoC) SDKs they use. This kernel lock-in produces a vulnerability debt that is inherited along the supply chain from SoC vendor to firmware creators (ODM/OEM) to router/IP-camera vendor and ultimately borne by end users. All five SoC vendors in our dataset had used SDKs with Linux kernels that had reached EoL more than a year before their usage in a SOHO device. Finally, we explore the mitigation-potential of individual, regulatory and community governance by analyzing social media posts, regulations and community efforts. Our results show that regulation compliance is insufficient and only SoC vendors who engage with communities for kernel upgradation offered a viable path towards mitigation. The data and code for this work is available at https://doi.org/10.5281/zenodo.20433799

2606.11163 2026-06-10 cs.DC cs.AR 新提交

Revisiting "Cooler is Better": ITD-Aware Per-CPU Thermal Optimization for Sustainable Data Center Operation

重新审视“更冷更好”:面向可持续数据中心运行的ITD感知每CPU热优化

Jason Crop, Hayden Moore, Sudeep Pasricha

AI总结 本文发现现代低压CPU存在逆温度依赖(ITD)导致低温下功耗增加,通过实验表征并利用ITD感知热分组和入口温度调整,可实现数据中心总能耗降低4-13%而不牺牲性能。

详情
AI中文摘要

随着数据中心能源需求接近电网限制,优化传统服务器基础设施对于可持续增长至关重要。长期以来的假设“更冷更好”,即较低的CPU温度可降低功耗,对于现代低压CPU并不完全成立,因为逆温度依赖(ITD)在较低温度下驱动更高的供电电压。这产生了非单调的性能功耗比曲线,其中效率在中间热点达到峰值。在本文中,我们首次在生产的Intel Xeon CPU上经验性地表征了ITD,并证明了效率最优温度是CPU部件特定的,并且通常高于典型的数据中心运行条件。来自商业云数据中心平台(Amazon、Equinix)的测量显示,大约一半的现代高功率CPU在其效率最优热点以下约10°C运行。通过实施ITD感知的CPU热分组和入口温度调整,数据中心运营商可以优化设施级冷却和整体可持续性。我们的案例研究表明,这种方法可以在不牺牲性能或可靠性的情况下,将数据中心总能耗降低4-13%。

英文摘要

As data center energy demand approaches grid-level constraints, optimizing conventional server infrastructure is essential for sustainable growth. The long-standing assumption that "cooler is better", i.e., lower CPU temperatures reduce power, does not fully hold for modern low-voltage CPUs, where inverse temperature dependence (ITD) drives higher supply voltages at lower temperatures. This creates a non-monotonic performance-per-watt curve where efficiency peaks at an intermediate thermal point. In this paper, for the first time, we empirically characterize ITD on production Intel Xeon CPUs and demonstrate that efficiency-optimal temperatures are CPU part-specific, and frequently higher than typical data center operating conditions. Measurements from commercial cloud data center platforms (Amazon, Equinix) reveal that approximately half of modern high-power CPUs operate about 10°C below their efficiency-optimal thermal point. By implementing ITD-aware thermal grouping of CPUs and inlet temperature adjustments, data center operators can optimize facility-level cooling and overall sustainability. Our case study shows that this approach can reduce total data center energy by 4-13% without sacrificing performance or reliability.

2606.11160 2026-06-10 cs.GT 新提交

Fractional Pareto-Optimality in Multiwinner Voting

多赢家投票中的分数帕累托最优性

Patrick Becker, Niclas Boehmer, Fabian Frank, Lara Glessen

AI总结 研究多赢家投票中分数帕累托最优性(fPO)的概念,通过加权功利福利最大化刻画,提出多项式时间验证算法,并揭示其与平等导向目标的不兼容性。

详情
AI中文摘要

多赢家投票中的效率最自然地由帕累托最优性(PO)捕捉,但这一概念在计算和结构上难以处理。因此,我们研究了分数帕累托最优性(fPO),在该概念下,一个委员会甚至不能被分数委员会(即委员会的任意凸组合)支配。fPO 被证明是 PO 的一个自然细化,因为它恰好保留了那些在候选人的均匀克隆下效率稳健的帕累托最优委员会。此外,fPO 委员会保证存在并具有强结构性质。我们给出了 fPO 在加权功利福利最大化方面的刻画,这产生了一个用于验证 fPO 的多项式时间算法,并表明 fPO 委员会的集合满足委员会单调性,且在单候选人交换下是连通的。通过 fPO 的视角分析福利主义规则,我们进一步揭示了 fPO 与平等导向目标之间的不兼容性。最值得注意的是,我们表明比例批准投票(PAV)在批准设置中违反了 fPO。最后,我们指出了偏好域(包括各种一维域),在这些域上 PO 和 fPO 合并为一个概念。

英文摘要

Efficiency in multiwinner voting is most naturally captured by Pareto-optimality (PO), yet this notion is computationally and structurally difficult to handle. We therefore study fractional Pareto-optimality (fPO), under which a committee may not be dominated even by a fractional committee, i.e., any convex combination of committees. fPO turns out to be a natural refinement of PO as it retains exactly those Pareto-optimal committees whose efficiency is robust under uniform cloning of candidates. Furthermore, fPO committees are guaranteed to exist and have strong structural properties. We present a characterization of fPO in terms of weighted utilitarian welfare maximization, which yields a polynomial-time algorithm for verifying fPO and shows that the set of fPO committees satisfies committee monotonicity and is connected under single-candidate swaps. Analyzing welfarist rules through the lens of fPO, we further uncover an incompatibility between fPO and equality-oriented objectives. Most notably, we show that proportional approval voting (PAV) violates fPO in the approval setting. We close by pinpointing preference domains, including various one-dimensional ones, on which PO and fPO collapse into one notion.

2606.11158 2026-06-10 cs.AR cs.PL 新提交

Defeat the Heap: Zero-Copy Data Movement in AXI4MLIR

击败堆:AXI4MLIR中的零拷贝数据移动

Elam Cohavi, Nicolas Bohm Agostini, Jude Haris, Antonino Tumeo, David Kaeli, José Cano

AI总结 针对AXI4MLIR框架中非零拷贝数据移动导致的运行时开销,提出通过直接分配DMA映射内存实现零拷贝,消除冗余暂存拷贝,最高可减少2倍主存数据移动并提升加速器利用率。

Comments Accepted to the 7th Compilers for Machine Learning Workshop (C4ML), co-located with CGO 2026

详情
AI中文摘要

随着定制硬件加速器在机器学习工作负载中日益重要,高效的数据传输对于最大化线性代数内核上的加速器性能至关重要。AXI4MLIR是多级中间表示(MLIR)编译器框架的扩展,用于自动生成主机-加速器驱动代码,但由于非零拷贝的CPU-加速器数据移动,其运行时开销显著。在从主机到加速器的传输过程中,数据从堆分配的缓冲区复制到连续的DMA映射缓冲区。本文将此复制识别为冗余暂存操作,并通过零拷贝数据移动消除它。该优化扩展了AXI4MLIR引入的MLIR方言accel,并实现了降低支持,直接在DMA映射内存中分配缓冲区,从而省略暂存拷贝。我们使用可配置的矩阵-矩阵乘法加速器评估所提出的方案,并表明零拷贝优化将主存数据移动减少高达2倍,从而提高了整体加速器利用率。

英文摘要

As custom hardware accelerators become increasingly central to machine learning workloads, efficient data transfer is critical for maximizing accelerator performance on linear algebra kernels. AXI4MLIR, an extension of the Multi-Level Intermediate Representation (MLIR) compiler framework for automated generation of host-accelerator driver code, incurs significant runtime overhead due to non-zero-copy CPU-accelerator data movement. During transfers from the host to the accelerator, data is copied from heap-allocated memory buffers into contiguous Direct Memory Access (DMA)-mapped buffers. This work identifies this copy as a redundant staging operation and eliminates it through zero-copy data movement. The optimization extends accel, an MLIR dialect introduced by AXI4MLIR, and implements lowering support that allocates buffers directly within DMA-mapped memory, thereby omitting the staging copy. We evaluate the proposed scheme using a configurable matrix-matrix multiplication accelerator and show that the zero-copy optimization reduces main memory data movement by up to 2x, increasing overall accelerator utilization.

2606.11145 2026-06-10 cs.CR 新提交

OpenPCC: Open and Confidential LLM Serving on Commodity TEEs

OpenPCC:在商用TEE上提供开放且保密的LLM服务

Haoling Zhou, Shixuan Zhao, Chao Wang, Zhiqiang Lin

AI总结 针对云推理服务中用户请求包含敏感信息的问题,提出OpenPCC框架,利用商用TEE实现开放、保密的LLM服务,并通过原型验证其可行性与安全性。

详情
AI中文摘要

生成式AI应用,如个人AI代理、图像生成器和聊天助手,提供了先进功能以改善用户体验。在幕后,为这些服务提供动力的大型语言模型(LLM)需要大量计算,通常部署在云端,以API形式提供,这意味着用户的请求必须发送到云推理服务(CIS)进行处理。然而,LLM的强大能力也意味着用户的请求现在包含更多个人敏感或企业机密信息,要求CIS提供同样强大的保护。尽管早期的行业努力,如Apple Private Cloud Compute(PCC)和Google Private AI Compute,已经显示出安全CIS的潜力,但由于它们依赖专有硬件和封闭生态系统,无法被其他人部署采用。此外,它们都存在自身的设计缺陷,可能削弱为最终用户带来真正隐私保护的雄心目标。在本文中,我们分析了构建安全且开放的CIS的基本要求。然后,我们提出了OpenPCC,一个不依赖专有硬件而是使用商用TEE的保密CIS框架。我们实现了一个开源原型,并在Llama-3 8B vLLM工作负载上对其进行端到端表征,将OpenPCC自身的成本与底层TEE硬件分离。我们的分析和评估证明了该系统的可行性和安全性。

英文摘要

Generative AI applications such as personal AI agents, image generators, and chat assistants offer advanced capabilities to improve user experience. Behind the scenes, Large Language Models (LLMs) that power these services require a massive amount of computation and are usually deployed in the cloud, available as APIs, meaning that a user's request has to be sent to a Cloud Inference Service (CIS) for processing. However, the strong capabilities of LLM also mean that user's requests now contain much more personal sensitive or enterprise confidential information, demanding equally strong protection in CIS. While early industry efforts such as Apple Private Cloud Compute (PCC) and Google Private AI Compute have emerged to show the potential of secure CIS, they are not adoptable for deployment by others due to their reliance on proprietary hardware and closed ecosystem. In addition, they all suffer from their own design glitches that can undermine the ambitious goal of bringing in true privacy protection to end users. In this paper, we present our analysis of the fundamental requirements of building a secure yet open CIS. We then present OpenPCC, a Confidential CIS framework that does not rely on proprietary hardware but instead uses commercially available TEEs. We implement an open-source prototype and characterize it end-to-end on a Llama-3 8B vLLM workload, separating OpenPCC's own cost from the underlying TEE hardware. Our analysis and evaluation demonstrated the feasibility and security of the system.

2606.11132 2026-06-10 cs.SE 新提交

Operationalizing Property-Based Testing for Data-Intensive Scalable Computing Systems

为数据密集型可扩展计算系统实现基于属性的测试的操作化

Yaoxuan Wu, Ingrid Lee, Ahmad Humayun, Muhammad Ali Gulzar, Miryung Kim

AI总结 提出DiscPBT引擎,通过可复用的元属性和生成器,在Apache Spark中实现基于属性的测试,以检测语义漂移和优化错误,相比模糊测试覆盖率更高。

详情
AI中文摘要

虽然模糊测试能有效捕获崩溃,但其浅层预言机常常遗漏数据密集型可扩展计算(DISC)框架中的语义漂移和优化相关错误。基于属性的测试(PBT)通过检查跨不同工作负载和输入的通用语义不变量(而非依赖特定预期输出)来解决这一局限。然而,为DISC系统系统地操作化PBT仍然困难,因为它既需要可复用的属性定义,也需要有效实例化为有效的工作负载和数据。我们提出了DiscPBT,一个用于Apache Spark的基于属性的测试引擎。DiscPBT引入了八个可复用的元属性用于DISC语义测试,涵盖等价重写、数据分解、计算分解和算子局部语义关系。为了操作化这些元属性,DiscPBT提供了可复用的生成器用于合成有效的工作负载骨架和输入数据,以及一个实例化框架,通过兼容的算子、表达式和UDF在模式兼容的上下文中实现每个元属性。我们在PySpark上的评估显示,DiscPBT比CometFuzz实现了1.2倍的分支覆盖率和1153倍的计划多样性。在66个具体属性中,DiscPBT揭示了跨版本语义漂移以及涉及NaN和空输入的微妙边界情况陷阱,这些是仅基于崩溃的模糊测试无法捕获的。这些结果证明了系统化PBT在揭示DISC框架中语义问题方面的价值。

英文摘要

While fuzzing effectively catches crashes, its shallow oracles often miss semantic drifts and optimization-related errors in data-intensive scalable computing (DISC) frameworks. Property-based testing (PBT) addresses this limitation by checking general semantic invariants across diverse workloads and inputs, rather than relying on specific expected outputs. However, systematically operationalizing PBT for DISC systems remains difficult because it requires both reusable property definitions and effective instantiation into valid workloads and data. We present DiscPBT, a property-based testing engine for Apache Spark. DiscPBT introduces eight reusable meta-properties for DISC semantic testing, spanning equivalence rewriting, data decomposition, computation decomposition, and operator-local semantic relations. To operationalize these meta-properties, DiscPBT provides reusable generators for synthesizing valid workload skeletons and input data, together with an instantiation framework that realizes each meta-property in schema-compatible contexts through compatible operators, expressions, and UDFs. Our evaluation on PySpark shows that DiscPBT achieves 1.2$\times$ higher branch coverage and 1153$\times$ greater plan diversity than CometFuzz. Across 66 concrete properties, DiscPBT reveals cross-version semantic drift as well as subtle corner-case pitfalls involving NaN and empty inputs, that are not captured by crash-based fuzzing alone. These results demonstrate the value of systematic PBT for uncovering semantic issues in DISC frameworks.

2606.11113 2026-06-10 cs.DC 新提交

A Neurosymbolic Prolog Skill for LLM-Driven Service Placement

一种用于LLM驱动服务放置的神经符号Prolog技能

Jacopo Massa, Giuseppe Bisicchia, Patrizio Dazzi, Antonio Brogi

AI总结 提出一种基于Prolog技能的神经符号方法,将服务放置意图转化为符号事实和规则,利用Prolog进行约束验证和推理,实现可检查的策略感知放置。

详情
AI中文摘要

云边连续体中的服务放置需要在多种约束(包括延迟、位置和策略要求)下将应用组件分配给异构资源。现有方法依赖于需要显式建模的优化模型或启发式方法,而神经方法缺乏透明度和形式化保证。本文提出一种基于Prolog技能的神经符号替代方案,这是一种用于模式约束事实生成和查询的可重用接口,用于约束感知放置。该技能使语言模型能够将放置意图结构化为符号事实、规则和查询,同时将验证和推理委托给Prolog。这种设计桥接了高级意图和形式化约束评估,使得在云边环境中能够实现可检查且策略感知的放置决策。

英文摘要

Service placement in the cloud-edge continuum requires assigning application components to heterogeneous resources under multiple constraints, including latency, locality, and policy requirements. Existing approaches rely on optimisation models or heuristics that require explicit modelling, while neural methods lack transparency and formal guarantees. This work proposes a neuro-symbolic alternative based on a Prolog skill, a reusable interface for schema-constrained fact generation and querying, for constraint-aware placement. The skill enables a language model to structure placement intent into symbolic facts, rules, and queries, while delegating validation and reasoning to Prolog. This design bridges high-level intent and formal constraint evaluation, enabling inspectable and policy-aware placement decisions in cloud-edge environments.

2606.11112 2026-06-10 cs.ET 新提交

Toward a Full-Stack Framework for Industrial Augmented Reality: Benefits, Risks, and Design Considerations for Dependable Deployment in Manufacturing

面向工业增强现实的全面框架:在制造中可靠部署的益处、风险与设计考量

Narges Chinichian, Maximilian Anton Palm

AI总结 提出一个六轴全栈框架,综合价值、技术、人因、组织、数据安全和治理,通过部署清单指导工业增强现实从演示到可靠基础设施的过渡。

详情
AI中文摘要

工业增强现实(AR)已从实验室演示发展到设计、培训、装配、维护和质量保证等领域的操作试点,但在制造业中广泛、可靠的部署仍然是个例外。我们将现有证据综合成一个全栈部署框架,该框架沿着六个不同但相互耦合的决策轴构建:(i) 价值与益处,(ii) 技术与集成约束,(iii) 人因与安全,(iv) 组织与经济考量,(v) 数据、安全与隐私,以及(vi) 治理、伦理与长期风险。在每个轴内,我们区分了(a)益处、(b)故障模式和(c)设计考量,并通过一个部署清单将它们相互关联,工程经理和供应商在规划项目时可以应用该清单。该贡献是概念性和实践导向的:基于文献和公开部署报告的综合。我们标记了证据基础成熟(例如装配任务时间、培训效果)、新兴(例如认知工作负载权衡、协作机器人安全区域)或推测性(例如元宇宙规模治理)的领域,并识别了那些解决方案决定从演示到可靠基础设施过渡的开放性问题。

英文摘要

Industrial Augmented Reality (AR) has progressed from laboratory demonstrations to operational pilots across design, training, assembly, maintenance and quality assurance, yet broad, dependable deployment in manufacturing remains the exception. We synthesise existing evidence into a full-stack deployment framework structured along six distinct but coupled decision axes: (i) value and benefits, (ii) technical and integration constraints, (iii) human factors and safety, (iv) organisational and economic considerations, (v) data, security and privacy, and (vi) governance, ethics and long-term risk. Within each axis we separate (a)benefits, (b)failure modes and (c)design considerations, and cross-link them through a deployment checklist that engineering managers and vendors can apply when scoping projects. The contribution is conceptual and practice-oriented: a synthesis grounded in the literature and public deployment reports. We mark where the evidence base is mature (e.g. assembly task time, training efficacy), emerging (e.g. cognitive workload trade-offs, cobot safety zones), or speculative (e.g. metaverse-scale governance), and identify open questions whose resolution conditions the transition from demos to dependable infrastructure.

2606.11111 2026-06-10 cs.CR 新提交

A Longitudinal Study of Recently Observed Malicious Domains: Characteristics, Infrastructure, and Abuse Patterns

近期观察到的恶意域名的纵向研究:特征、基础设施和滥用模式

Fathima Mashood, Mohamed Nabeel

AI总结 本研究对VirusTotal上约152万个恶意域名进行纵向分析,发现攻击者创建的域名多为短期注册,少数注册商和TLD集中了大部分滥用,Cloudflare基础设施被广泛用于域名前置,且存在大规模批量注册和品牌冒充现象。

Comments malicious domains, 5 pages, virustotal, analysis

详情
AI中文摘要

我们对2026年1月至5月期间在VirusTotal(VT)上观察到的约152万个恶意域名进行了纵向研究。域名选择基于至少五个独立VT扫描引擎的检测,且首次出现日期在研究窗口内。我们将数据集分为被入侵域名和攻击者创建的域名,后者约占数据集的89.3%。结合WHOIS注册记录和被动DNS(PDNS)数据与VT数据集,我们从八个维度刻画攻击者行为:时间分布、此http URL分类、首次检测时的域名年龄、注册商和TLD偏好、作为损害代理的DNS查询量、托管基础设施集中度(IP和ASN级别)、批量注册模式以及品牌冒充。主要发现包括:大多数攻击者创建的域名是短期注册,在创建后几周内使用;少数注册商和TLD占据了大部分滥用;Cloudflare基础设施被广泛用于域名前置;涉及单个注册商在单日内注册数千个域名的批量注册事件很普遍;多个全球品牌,特别是WhatsApp和Google,被大量冒充。我们在GitHub仓库https://github.com/...(此https URL)中共享了带注释的数据集以供进一步研究。

英文摘要

We present a longitudinal study of approximately 1.52 million malicious domains observed on VirusTotal (VT) between January and May 2026. Domains were selected on the basis of detection by at least five independent VT scanning engines and a first-seen date within the study window. We group the dataset into compromised domains and attacker created domains, which account for approximately 89.3% of the dataset. Combining WHOIS registration records and passive DNS (PDNS) data with the VT dataset, we characterise attacker behaviour across eight dimensions: temporal distribution, compromisedvs.attack classification, domain age at first detection, registrar and TLD preferences, DNS query volume as a damage proxy, hosting infrastructure concentration (IP and ASN level), bulk registration patterns, and brand impersonation. Key findings include: the majority of attacker created domains are short lived registrations used within weeks of creation; a small number of registrars and TLDs account for most abuse; Cloudflare infrastructure is heavily exploited for domain fronting; bulk registration events involving thousands of domains from a single registrar on a single day are widespread; and several global brands, particularly WhatsApp and Google, are heavily impersonated. We share the annotated dataset in the GitHub repo https://github.com/mufimash/malicious_domains for further research.

2606.11091 2026-06-10 eess.SY cs.SY q-bio.NC 新提交

QUIET: Quantifying Underutilized Influential Edges for Targeted Synchronization

QUIET: 量化未充分利用的影响边以实现目标同步

Sovesh Mohapatra, Christoffer G. Alexandersen, Panagiotis Fotiadis, Max B. Kelz, John A. Detre, Fabio Pasqualetti, Dani S. Bassett

AI总结 提出边中心框架QUIET,结合结构可控性和功能互信息识别能量高效的同步路径,验证其在合成网络和人类连接组中的有效性。

Comments 38 Pages; 6 Figures; 8 SIs

详情
AI中文摘要

网络控制理论可用于建模内在和外在策略以引导神经动力学。标准方法是节点中心、结构性的,并专注于实现期望的瞬时状态。在这里,我们开发了一种边中心方法,该方法结合了结构和功能,以实现由期望同步状态表征的扩展神经动力学模式。我们的方法,量化未充分利用的影响边以实现目标同步(QUIET),是一个边中心框架,它整合了个体白质连接的结构可控性和成对功能时间序列之间的互信息,以识别能量高效的同步路径。QUIET识别安静高速公路,即结构上有影响力但功能上未充分利用的边,以优化区域同步。我们在75种合成配置上验证了QUIET,其中QUIET排名的边集在93%的情况下显著优于随机选择(p<0.01)。该框架在人类连接组计划参与者上测试,揭示了显著性网络同步所需的控制能量与流体智力相关。将QUIET应用于接受右美托咪定诱导无反应的健康成年人,显示额顶叶和默认模式网络在清醒和镇静状态下均表现出同步所需的最大控制能量。QUIET作为独立软件发布,用于研究理论上定义的同步路径,进而可为扰动研究中的可测试假设提供信息。

英文摘要

Network control theory can be used to model intrinsic and extrinsic strategies to steer neural dynamics. Standard approaches are node-centric, structural, and focused on achieving desired instantaneous states. Here, we develop an edge-centric approach which incorporates both structure and function to achieve extended patterns of neural dynamics characterized by desired synchronization states. Our method, Quantifying Underutilized Influential Edges for Targeted Synchronization (QUIET), is an edge-centric framework that integrates structural controllability of individual white matter connections and mutual information between pairwise functional timeseries to identify energy-efficient synchronization pathways. QUIET identifies quiet highways, edges that are structurally influential but functionally underutilized, to optimize regional synchronization. We validated QUIET across 75 synthetic configurations, where QUIET-ranked edge sets significantly outperformed random selection in 93% of cases (p<0.01). The framework, tested on Human Connectome Project participants, revealed that the control energy required for synchronization of the salience network correlates with fluid intelligence. QUIET, applied to healthy adults undergoing dexmedetomidine-induced unresponsiveness, showed that the frontoparietal and default-mode networks exhibited the largest control energy required for synchronization in both awake and sedated states. QUIET is released as a stand-alone software to be used to study theoretically-defined synchronization pathways, which in turn could inform testable hypotheses in perturbative studies.

2606.11067 2026-06-10 cs.DS 新提交

Enumerating Inclusion-Maximal Arithmetic Progressions

枚举包含极大算术级数

Brian Bemman, Maximilien Gadouleau, Oliver W. Gnilke, George B. Mertzios

AI总结 提出一个O(n^2 log N / log log N + N)算法,用于枚举整数序列中的包含极大算术级数(IMAP),并证明随机序列中IMAP的期望数量,实验表明算法显著优于先前方法。

详情
AI中文摘要

我们提出了一个简单的 $\mathcal{O}\left( n^2 \frac{ \log N }{ \log \log N } + N \right)$ 枚举算法,用于解决数学和计算音乐分析中的一个问题:给定一个严格递增的整数序列 $S$,包含 $n$ 个元素,最大值为 $N$,任务是枚举该序列中所有 $m$ 个 $\textit{包含极大算术级数(IMAPs)}$。一个 IMAP 是一个子序列 $S' \subseteq S$,包含 $k>2$ 个整数,满足 (i) 任意两个连续整数之差为相同常数 $d$(即 $S'$ 是一个 $\textit{算术级数}$),(ii) $S'$ 不能向左或向右扩展 $S$ 中的任何整数而仍保持为算术级数(即 $S'$ 是一个 $\textit{极大}$ 算术级数),(iii) 不存在另一个极大算术级数 $S'' \subseteq S$ 真包含 $S'$(即 $S'$ 是一个 $\textit{包含极大}$ 算术级数)。我们进一步提供了随机整数序列 $S$ 中 IMAP 的期望数量及其增长阶的界。最后,我们进行了实验比较:(a) 所提算法与先前已知的具有更高时间复杂度 $\mathcal{O}(N^{2+o(1)}n)$ 的算法的实际运行时间性能,以及 (b) 实际枚举的 IMAP 数量与其数学期望数量。值得注意的是,所提算法在运行时间上相比先前已知算法有显著改进,并在直接实际应用中将允许对大型且节奏复杂的音乐作品进行更高效的分析。

英文摘要

We present a simple $\mathcal{O}\left( n^2 \frac{ \log N }{ \log \log N } + N \right)$ enumeration algorithm for solving a problem from mathematical and computational music analysis where, given a strictly increasing integer sequence, $S$, with $n$ entries and maximum value $N$, the task is to enumerate all $m$ $\textit{inclusion-maximal arithmetic progressions (IMAPs)}$ in this sequence. An IMAP is a subsequence, $S' \subseteq S$ with $k>2$ integers, in which (i) the difference between any two consecutive integers is the same number, $d$ (i.e., $S'$ is an $\textit{arithmetic progression}$), (ii) $S'$ cannot be further extended to the left or to the right with any additional integers from $S$ while still remaining an arithmetic progression (i.e., $S'$ is a $\textit{maximal}$ arithmetic progression), and (iii) there is no other maximal arithmetic progression, $S'' \subseteq S$, which $\textit{properly}$ contains $S'$ (i.e., $S'$ is an $\textit{inclusion-maximal}$ arithmetic progression). We further provide proofs for the expected number of IMAPs in random integer sequences, $S$, and a bound on their order of growth. Finally, we provide empirical experiments comparing both (a) the practical running time performance of the proposed algorithm against that of a previously known algorithm which has higher time complexity $\mathcal{O}(N^{2+o(1)}n)$, and (b) the actual enumerated number of IMAPs to that of their mathematically expected number. Notably, the proposed algorithm demonstrates a significant improvement in running time over the previously known algorithm, and in immediate practical applications, will allow for more efficient analysis of large and rhythmically complex musical pieces.

2606.11065 2026-06-10 cs.AR 新提交

Arithmetic Packing on Wide Integer Datapaths in DSP Primitives of Modern FPGA Devices

现代FPGA器件DSP原语中宽整数数据路径上的算术打包

Titus Bornträger, Shane Fleming, Philipp Holzinger, Dietmar Fey, Michaela Blott, Thomas B. Preußer

AI总结 提出利用DSP内部预加法器动态打包任意位宽多输入到宽乘法器路径的方法,并设计两种架构分别优化矩阵向量乘和卷积,集成到FINN框架后相比参考实现LUT减少21%,FPS/DSP提升36%。

Comments 8 pages, 9 figures, 4 tables

详情
AI中文摘要

深度神经网络越来越多地采用低精度量化来降低计算需求。虽然FPGA非常适合异构精度的工作负载,但其专用的数字信号处理(DSP)切片仅具有固定宽度的数据路径,低比特宽算术会显著利用不足。虽然先前的方法已经引入了将多个值打包到同一宽DSP数据路径上,但它们要么只支持特定的固定位宽,要么在DSP外部额外使用支持逻辑造成浪费。本文提出了一种高效的方法,通过利用DSP的内部预加法器,动态地将多个(无)符号输入与任意位宽打包到宽乘法器路径中。在此基础上,我们提出了两种不同的架构,一种针对矩阵向量乘法优化,另一种针对卷积优化。我们的实现已集成到AMD的FINN框架中。通过这些优化,与FINN参考相比,UltraNet模型的LUT利用率降低了21%,FPS/DSP提高了36%。

英文摘要

Deep Neural Networks increasingly employ low-precision quantization to reduce computational requirements. While FPGAs are well suited for workloads with heterogeneous precisions, their dedicated digital signal processing (DSP) slices only feature fixed-width datapaths that are significantly underutilized by low-bitwidth arithmetic. While previous approaches have already introduced the packing of multiple values onto the same wide DSP datapath, they either only support specific fixed bitwidths or are wasteful regarding the use of additional support logic external to the DSP. This paper proposes an efficient method to dynamically pack multiple (un-)signed inputs with arbitrary bitwidths into a wide multiplier path by leveraging the DSP's internal pre-adder. Building on this, we present two distinct architectures, one optimized for matrix-vector multiplications and the other for convolutions. Our implementations are integrated into AMD's FINN framework. With these optimizations, we reduce the LUT utilization by 21% and increase the FPS/DSP by 36% for the UltraNet model compared to the FINN reference.

2606.11051 2026-06-10 cs.SE cs.HC 新提交

Making Software Meaningful

使软件有意义

Eagon Meng, Abutalib Namazov, Carmel Schare, Alcino Cunha, Daniel Jackson

AI总结 提出通过承诺显式意义来改善软件可用性、模块化和可问责性,构建领域行为表示作为通用词汇,并应用于设计、代码生成和智能体问责。

详情
AI中文摘要

采用单一度量可以提高软件的可用性、模块化和可问责性:承诺显式意义。这需要构建并商定软件行为的表示,该表示在应用领域中观察得到。构成该行为的现象成为词汇,支撑所有利益相关者之间、所有工件和活动中关于软件的所有讨论。这些现象是个体、它们参与的动作以及动作产生的事实。通过划分动作集,可以将它们组织成概念,提供更大的意义单元。给出了利用意义的三个领域示例:为可用性设计(通过使用户和设计者在单一共享意义上对齐);使用LLM生成模块化代码(通过将意义单元映射到代码单元,实现模块化和可读性);以及使智能体可问责(通过让它们遵守定义其预期行为的行为准则)。

英文摘要

Adopting a single measure can improve the usability, modularity and accountability of software: a commitment to explicit meaning. This entails constructing and agreeing upon a representation of the behavior of the software, as observed in the domain of application. The phenomena comprising this behavior become a vocabulary that grounds all discourse about the software, among all stakeholders, and for all artifacts and activities. These phenomena are individuals; actions they participate in; and facts that result from actions. They can be organized, by partitioning the set of actions, into concepts, offering larger units of meaning. Examples of exploiting meaning are given in three areas: designing for usability (by aligning user and designer on a single shared meaning); generating modular code with LLMs (by mapping units of meaning to units of code, achieving not only modularity but also legibility); and making agents accountable (by having them adhere to a code of conduct that defines their intended behavior).

2606.11050 2026-06-10 cs.MA cs.GT cs.SY eess.SY 新提交

LLM-Mediated Demand Response Coordination in Smart Microgrids

LLM介导的智能微电网需求响应协调

J. de Curtò, I. de Zarzà

AI总结 针对智能微电网中产消者自愿合作的需求响应协调问题,提出一种结合博弈论与LLM叙事评估的混合决策架构,通过结构化指令实现33.3%的需求削减合作率,优于非结构化消息和基线。

Comments Accepted for publication in 18th International Conference on Sustainability in Energy and Buildings (SEB-26), to appear in Springer Nature proceedings (KES Smart Innovation Systems and Technologies). The final authenticated version will be available online at Springer

详情
AI中文摘要

智能微电网中的有效需求响应要求产消者在战略自利下自愿合作,这一协调问题在结构上等价于社交网络上的重复囚徒困境。本文提出一个多智能体模拟,其中大型语言模型(LLM)影响编译器向异质产消者智能体群体发布结构化需求响应指令,每个智能体由混合决策架构控制,该架构结合了博弈论基础概率(基于收益历史、邻居模仿和利用记忆)与对传入协调信号的LLM叙事评估。该混合架构解决了一个关键方法论挑战:通过人类反馈强化学习(RLHF)对齐的LLM在作为直接决策者时表现出强烈的合作偏差,无论电网条件如何都产生平坦的动态。通过将战略推理与基于情境的叙事评估分离,该模型在六种人格原型中生成真实的产消者行为,基线合作率接近50%,并在影响下表现出明显分化。编译的结构化指令实现了33.3%的需求削减合作率,而非结构化消息为27.0%,无干预基线为28.0%(Δ_comp = +0.063),该优势在基于真实和理想化的智能体基质(Δ = +0.083)以及所有抵抗水平(R = 0.1至0.7)中均保持。通过高中心性网络节点的枢纽定向传播优于外围或随机传播,证实电网拓扑提供了独立于消息内容的机制放大。这些结果表明,结构化LLM编译、基于情境的智能体推理和网络感知传播是可扩展、可解释的需求响应协调的互补设计原则,适用于智慧城市能源系统。

英文摘要

Effective demand response in smart microgrids requires prosumers to cooperate voluntarily under strategic self-interest, a coordination problem structurally equivalent to a repeated Prisoner's Dilemma on a social network. This paper presents a multi-agent simulation in which a Large Language Model (LLM) Influence Compiler issues structured demand-response directives to a population of heterogeneous prosumer agents, each governed by a hybrid decision architecture combining game-theoretic base probability (derived from payoff history, neighbour imitation, and exploitation memory) with LLM narrative evaluation of incoming coordination signals. The hybrid architecture resolves a key methodological challenge: LLMs aligned via Reinforcement Learning from Human Feedback (RLHF) exhibit strong cooperation bias when used as direct decision-makers, producing flat dynamics regardless of grid conditions. By separating strategic reasoning from grounded narrative evaluation, the model generates realistic prosumer behaviour across six personality archetypes, with baseline cooperation near 50% and clear differentiation under influence. Compiled structured directives achieve 33.3% demand-curtailment cooperation versus 27.0% for unstructured messaging and 28.0% for a no-intervention baseline ($Δ_\mathrm{comp} = +0.063$), with the advantage preserved across both grounded and idealized agent substrates ($Δ= +0.083$) and across all resistance levels ($R = 0.1$ to $0.7$). Hub-targeted dissemination via high-centrality network nodes outperforms peripheral or random targeting, confirming that grid topology provides mechanistic amplification independent of message content. These results suggest that structured LLM compilation, grounded agent reasoning, and network-aware targeting are complementary design principles for scalable, interpretable demand-response coordination in smart-city energy systems.

2606.11049 2026-06-10 eess.SY cs.SY 新提交

Free Parametrization of L_2-Bounded Structured State-Space Controllers for Nonlinear Control with Stability Guarantees

具有稳定性保证的非线性控制中L2有界结构化状态空间控制器的自由参数化

Muhammad Zakwan, Leonardo Massai, Efe C. Balta, Giancarlo Ferrari-Trecate

AI总结 提出一种新的线性时不变系统L2增益自由参数化方法,构建L2RU层,通过小增益定理保证非线性系统闭环稳定性,实现无约束优化。

详情
AI中文摘要

为非线性系统设计稳定控制策略同时优化复杂目标仍然是一个艰巨的挑战。神经网络尽管具有表达能力,但对小的输入扰动高度敏感,容易使闭环系统失稳。现有方法通常对控制器参数施加显式约束以确保稳定性,但这通常会导致额外的计算开销。为解决此问题,我们利用最近提出的结构化状态空间模型(SSM)来参数化非线性系统的离散时间控制策略。我们的关键贡献是一种新的具有规定L2增益的线性时不变(LTI)系统的自由参数化。我们利用这一结果构建了L2递归单元(L2RU),这是一种通过设计强制实现所需L2界的SSM层。所得架构可通过小增益定理或所谓的性能提升框架保证闭环稳定性,而与控制器的优化参数无关,从而实现对一般非线性目标的完全无约束优化。此外,所提参数化引入的结构使得能够高效处理长输入序列,因为它通过并行扫描等算法高度可并行化。我们在移动机器人编队控制任务上展示了该方法的有效性,其中基于L2RU的控制器在保持稳定性和性能的同时确保碰撞和障碍物避免。

英文摘要

Designing stabilizing control policies for nonlinear systems while optimizing complex objectives remains a formidable challenge. Neural networks (NNs), despite their expressive power, can be highly sensitive to small input perturbations and can easily destabilize the closed-loop system. Existing approaches often impose explicit constraints on the controller's parameters to ensure stability, but this typically leads to additional computational overhead. To address this issue, we leverage recently proposed structured state-space models (SSMs) to parametrize discrete-time control policies for nonlinear systems. Our key contribution is a new free parametrization of linear time-invariant (LTI) systems with a prescribed L2 gain. We use this result to construct the L2-Recurrent Unit (L2RU), an SSM layer that enforces the desired L2 bound by design. The resulting architecture can be used to guarantee closed-loop stability via the small-gain theorem or the so-called performance-boosting framework, independently of the controller's optimization parameters, thereby enabling fully unconstrained optimization of general nonlinear objectives. Furthermore, the structure induced by the proposed parametrization enables the efficient processing of long input sequences, as it is highly parallelizable through algorithms such as parallel scan. We demonstrate the effectiveness of this approach on a formation-control task for mobile robots, where the L2RU-based controller ensures collision and obstacle avoidance while maintaining stability and performance.

2606.11040 2026-06-10 cs.NI cs.CY 新提交

Internet Quality Barometer (IQB): A preliminary data-driven evaluation of the IQB framework

互联网质量晴雨表 (IQB): 对 IQB 框架的初步数据驱动评估

Pavlos Sermpezis, Zeynep Arslan

AI总结 提出IQB框架将原始网速数据转化为可解释的互联网质量评分(0-1),通过开源库和原型应用实现全球评估,并初步分析参数敏感性以识别关键影响因素和潜在异常。

详情
AI中文摘要

互联网质量晴雨表 (IQB) 框架旨在将原始互联网测量数据转化为关于互联网质量的可操作见解。具体来说,该框架将原始网速测试测量结果映射到网络需求(如吞吐量、延迟),将这些需求映射到代表性互联网用例(如视频流或网页浏览),并最终将跨用例的性能聚合为单个 IQB 分数。IQB 分数是一个从 0 到 1 的综合指数,旨在以既可解释又可跨地点比较的方式捕获整体互联网质量。我们通过开发开源 IQB 库和原型 Web 应用,在实践中实现了 IQB 框架。这些工具使我们能够大规模计算 IQB 分数,包括在国家、地区和城市层面聚合的全球估计。在本报告中,我们对 IQB 框架进行了初步敏感性分析,研究不同参数选择如何影响最终的 IQB 分数,识别框架最敏感的参数,并强调可能导致异常值或潜在误导结果的情况。

英文摘要

The Internet Quality Barometer (IQB) framework was designed to transform raw Internet measurement data into actionable insights about Internet quality. Specifically, the framework maps raw speed test measurements to network requirements (e.g., throughput, latency), maps these requirements to representative Internet use cases (such as video streaming or web browsing), and finally aggregates performance across use cases into a single IQB score. The IQB score is a composite index ranging from 0 to 1, intended to capture overall Internet quality in a way that is both interpretable and comparable across locations. We implemented the IQB framework in practice by developing an open-source IQB library and a prototype web application. These tools enabled us to compute IQB scores at scale, including global estimates aggregated at the level of countries, regions, and cities. In this report we conduct a preliminary sensitivity analysis of the IQB framework, investigating how different parameter choices affect the resulting IQB scores, identifying which parameters the framework is most sensitive to, and highlighting cases that may lead to outliers or potentially misleading results.

2606.11035 2026-06-10 cs.SE 新提交

GapFuzz: Cross-Plane Divergence Fuzzing for Distributed SDN Controllers

GapFuzz: 面向分布式SDN控制器的跨平面分歧模糊测试

Moustapha Awwalou Diouf, Samuel Ouya, Jacques Klein, Tegawendé F. Bissyandé

AI总结 提出GapFuzz,通过注入矛盾北向请求并重构跨平面状态,检测分布式SDN集群中因异步复制导致的控制-数据平面分歧,在ONOS 2.7上81.7%尝试产生分歧。

详情
AI中文摘要

分布式软件定义网络(SDN)集群在主节点及其备份之间异步复制流状态,留下一个窗口,在此期间两个备份节点可以各自提交一条矛盾的规则,主节点可以将两者序列化到数据平面,并且内核数据路径可以锁定在一个没有节点认为权威的动作上。现有的SDN模糊测试器未能发现此故障:它们将预言限制在控制平面,针对单个控制器,或者不引导并发以引发复制竞争。我们提出GapFuzz,一个用于分布式SDN集群的有状态并发模糊测试器。GapFuzz在两个非主节点上注入一对矛盾的北向请求,并控制注入间延迟Δt,通过查询每个副本以及通过ovs-appctl ofproto/trace查询内核数据路径动作来重建全局跨平面状态。两阶段时序搜索检测是否存在分歧,然后对Δt进行加倍和二分以界定注入时间窗口;生命周期探测将每个判定标记为瞬态或持久,并将其分配给从ONOS 2.7源代码派生的四个跨平面状态类别之一。在一个三节点ONOS 2.7集群上,GapFuzz在81.7%的尝试中产生分歧判定(N=50,Wilson 95% CI [77.3, 85.4]%);每个分歧都位于集群的权威状态和内核数据路径之间。第二阶段将一个模板的5 ms竞争窗口与六个其他模板的加倍机制(Δt_max=10.24 s)分开,并且99.4%的分歧持续超过30秒。将内核数据路径探测替换为先前模糊测试器使用的OpenFlow用户空间探测,整体检测率下降26.6个百分点,在排除规范化强制判定后下降46.5个百分点。

英文摘要

Distributed Software-Defined Networking (SDN) clusters replicate flow state asynchronously between a master node and its backups, leaving a window during which two backup nodes can each commit a contradictory rule, the master can serialize both into the data plane, and the kernel datapath can latch onto an action that no node believes authoritative. Existing SDN fuzzers miss this fault: they confine their oracle to the control plane, target a single controller, or do not steer concurrency to provoke replication races. We present GapFuzz, a stateful concurrency fuzzer for distributed SDN clusters. GapFuzz injects pairs of contradictory Northbound requests on two non-master nodes with controlled inter-injection delay $Δt$, and reconstructs the global cross-plane state by querying every replica and the kernel-datapath action through ovs-appctl ofproto/trace. A two-phase timing search detects whether a divergence exists, then doubles and bisects on $Δt$ to bound the injection-time window; a lifetime probe labels each verdict transient or persistent and assigns it to one of four cross-plane state classes derived from the ONOS 2.7 source. On a three-node ONOS 2.7 cluster, GapFuzz produces a divergent verdict in 81.7% of attempts ($N=50$, Wilson 95% CI $[77.3, 85.4]$%); every divergence sits between the cluster's authoritative state and the kernel datapath. Phase 2 separates a 5 ms race window for one template from a doubling-cap regime ($Δt_{\max}=10.24$ s) for six others, and 99.4% of divergences persist past 30 s. Replacing the kernel-datapath probe with the OpenFlow user-space probe used by prior fuzzers drops detection by 26.6 percentage points overall and by 46.5 points after excluding canonicalization-forced verdicts.

2606.11022 2026-06-10 cs.CR 新提交

When Discovery Outpaces Remediation: Modeling AI-Accelerated Vulnerability Discovery in Interconnected Systems

当发现速度超过修复速度:建模AI加速的互联系统漏洞发现

Mohamamad Reza Faghani

AI总结 针对AI加速漏洞发现可能压垮修复管线的问题,提出基于排队论和网络理论的模型,分析漏洞积压与级联风险,发现修复能力不足时风险非线性增长,结合修复自动化与网络解耦是最优策略。

详情
AI中文摘要

用于代码分析、二进制分析、模糊测试编排和渗透测试规划的高级AI系统可能会显著提高潜在漏洞的发现速度。虽然改进的发现可以惠及防御者,但也可能使修复管线过载并加速对手的武器化。本文开发了一个排队论和网络理论模型,用于研究互联系统中AI加速的漏洞发现。我们将企业表示为一个加权依赖图,其中包含可补充的漏洞池、有限的修复能力、分类降级、漏洞利用窗口压缩和动态传播。我们推导了漏洞积压的稳定性条件,建立了未解决积压与级联风险之间的动态耦合,并通过模拟评估了缓解策略。结果表明,当可操作的发现到达率超过修复吞吐量时,积压迅速增长,系统风险非线性增加。在中心化拓扑中,分段比单独提高修复速度更能有效减少传播的破坏,而最强的防御是将修复自动化与减少网络耦合相结合。

英文摘要

Advanced AI systems for code analysis, binary analysis, fuzzing orchestration, and penetration-test planningmay significantly increase the rate at which latent vulnerabilities are discovered. While improved discovery can benefit defenders, it can also overload remediation pipelines and accelerate adversarial weaponization. This paper develops a queueing and network-theoretic model of AI-accelerated vulnerability discovery in interconnected systems. We represent an enterprise as a weighted dependency graph with replenishing vulnerability pools, finite remediation capacity, triage degradation, exploit window compression, and dynamic compromise propagation. We derive stability conditions for vulnerability backlogs, formulate a dynamic coupling between unresolved backlog and cascade risk, and evaluate mitigation strategies through simulation. Results indicate that when actionable discovery arrivals exceed remediation throughput, backlogs grow rapidly and systemic risk increases nonlinearly. In hub-dominated topologies, segmentation can reduce propagated compromise more effectively than remediation speed alone, while the strongest defense combines remediation automation with reduced network coupling.

2606.10997 2026-06-10 eess.SY cs.CY cs.SY 新提交

A Companion App for an Autonomous Family Vehicle: Identification of Values for an Autonomous Mobility System

面向自主家庭车辆的伴侣应用:自主出行系统的价值识别

Leon Johann Brettin, Tobias Schräder, Kerstin Kuhlmann, Vanessa Schmidt, Markus Maurer

AI总结 本文提出一款面向需要陪同人员的用户群体的自主车辆伴侣应用,支持可信人员跟踪行程和远程设置车辆,并识别影响乘客和可信人员的价值观。

Comments Accepted to be published in the 2026 IEEE Intelligent Vehicles Symposium (IV)

详情
AI中文摘要

在本文中,我们提出了一款面向通常需要陪同人员驾驶的用户群体的自主车辆伴侣应用。本文介绍了伴侣应用的两个方面:首先,可信人员能够跟踪需要支持人员的行程;其次,将需要支持人员的车辆设置交由可信人员掌控。此外,本文描述了需求和相关价值观,并讨论了此类伴侣应用的安全相关方面。我们还讨论并识别了影响使用伴侣应用的乘客和可信人员的价值观。总体而言,伴侣应用可以为需要支持的人员提供新的视角和机会,使他们能够利用自主车辆提供的功能。它使可信人员能够根据乘客需求配置车辆。同时,此类应用可以作为一种机制,让可信人员参与车辆提供的选项,并使他们能够根据需要支持人员的情况调整车辆。

英文摘要

In this paper, we present a companion app for an autonomous vehicle aimed at user groups who would normally require an accompanying person to drive them. Two aspects of a companion app are presented in this paper: First, the possibility for a trusted person to track the ride of the person in need of support and second, to put the settings of the vehicle for persons in need of support in the hands of a trusted person. In addition, this article describes the requirements and addressed values and discusses the safety-relevant aspects of such a companion app. We also discuss and identify the values that influence passengers and trusted persons using the companion app. Overall, a companion app can provide new perspectives and opportunities for people in need of support, allowing them to take advantage of the features offered by autonomous vehicles. It enables trusted individuals to configure the vehicle according to the passengers needs. Also such an app can be a mechanism to involve trusted persons in the options given by the vehicle and give them the possibility to adapt the vehicle to the needs of the person in need of support.

2606.10961 2026-06-10 cs.LO 新提交

Labelled Process Logic

带标签的过程逻辑

Yuanrui Zhang

AI总结 提出带标签的循环证明框架,解决一阶过程逻辑中同时处理轨迹信息和一阶计算的难题,并证明其可靠性与完备性。

详情
AI中文摘要

本文为过程逻辑(一种动态逻辑的扩展,其中公式指定执行轨迹的属性而非仅最终状态)开发了一个循环带标签的证明论框架。主要困难在于一阶过程逻辑必须推理具体计算,同时沿正则程序轨迹保留时间信息。现有的组合演算覆盖了重要片段,但未能完整处理正则程序上的一阶过程逻辑。我们通过用标签丰富过程逻辑公式来解决这一困难,这些标签在推导过程中显式记录轨迹和更新信息。基于这一构造,我们定义了命题和一阶过程逻辑的循环带标签证明系统,分别记为G3PPL和G3FOPL。我们通过使用循环条件在良基多重集排序中获得无限下降来证明可靠性,并通过证明带标签系统可以推导过程逻辑和一阶动态逻辑的已有证明规则来证明完备性。结果是一个统一的过程逻辑框架,其中首次可以在同一证明结构中处理基于轨迹的程序属性和一阶计算。

英文摘要

This paper develops a cyclic labelled proof-theoretic framework for process logic -- an extension of dynamic logic in which formulas specify properties of execution traces rather than only final states. The main difficulty is that first-order process logic must reason about concrete computations while preserving temporal information along regular-program traces. Existing compositional calculi cover important fragments, but do not provide a complete treatment of full first-order process logic over regular programs. We address this difficulty by enriching process-logic formulas with labels that explicitly record trace and update information during derivations. Based on this construction, we define cyclic labelled proof systems for propositional and first-order process logic, respectively denoted by G3PPL and G3FOPL. We prove the soundness by using the cyclic conditions to obtain an infinite descent in a well-founded multiset ordering, and prove the completeness by showing that the labelled systems can derive the established proof rules of process logic and first-order dynamic logic. The result is a uniform framework for process logic in which for the first time, trace-based program properties and first-order computations can be handled within the same proof structure.

2606.10958 2026-06-10 cs.CY 新提交

Dismantle and Dissolve, (Re)build, Remix: A Research-creation Inquiry into the Political Economy of Graphics Cards

拆解与溶解、(重)建、混合:图形卡政治经济的研究-创作探究

Cyrus Khalatbari

AI总结 通过四年研究-创作实践,拆解图形卡(GPU)微型化的政治经济,揭示人工智能的物质基础与环境基础设施,提出研究-创作作为理解技术黑箱的完整认识论。

Comments Paper in Proceedings of LIMITS 2026: 12th Workshop on Computing within Limits, 2026-06-23-25, Online

详情
AI中文摘要

本文追踪了为期四年(2022-2026)的图形卡微型化政治经济研究。它从这样一个前提开始:重新思考我们与人工智能及其社会技术纠缠的关系,需要揭开这一技术对象的黑箱。在我们的算法文化中,图形卡(GPU)实现了大规模数据集的并行处理,使得支撑智能系统的模型训练成为可能。GPU微型化同样至关重要:作为物联网的关键驱动力,这一社会技术现象使得这些卡能够被纳入日益紧凑和强大的系统中,同时也能更好地管理能源资源。然而,这些日常物品和技术的发展加剧了几个主要问题。借鉴社会科学以及研究-创作的批判性、反思性、思辨性和虚构性方法论,作者开发了几个调查实地——包括台湾的液氮超频者和加纳的城市矿工——并对约五十块获取的图形卡进行了情境化实验。本文围绕三个主题(拆解与溶解、重建、混合)展开,展示了研究-创作方法如何构成完整的认识论,以把握那些看似外在、不透明或不可及的事物,并将人工智能还原为其有形的物质性。通过这样做,它通过确认研究-创作作为解开计算系统所依赖和掩盖的物质与环境基础设施的严谨手段,为信息通信技术促进可持续发展领域做出了贡献。

英文摘要

This contribution follows a four-year investigation (2022--2026) into the political economy of graphics card miniaturization. It begins from the premise that rethinking our relationship to artificial intelligence and its sociotechnical entanglements requires demystifying and opening the black box of this technical object. Within our algorithmic culture, the graphics card (GPU) enables the massive, parallel processing of large datasets, making possible the training of the models that underpin our intelligent systems. GPU miniaturization is equally crucial: as a key driver of the Internet of Things, this sociotechnical phenomenon enables the inclusion of these cards in increasingly compact and powerful systems while also enabling better management of energy resources. The development of these everyday objects and technologies nevertheless reinforces several major problems. Drawing on both the social sciences and the critical, reflexive, speculative, and fictional methodologies of research-creation, the author developed several investigative fieldwork sites -- among liquid nitrogen overclockers in Taiwan and urban miners in Ghana -- and conducted situated experimentations on some fifty acquired graphics cards. Structured around three themes (dismantle and dissolve, rebuild, remix), this paper demonstrates how research-creation methods constitute full epistemologies for apprehending what seems a priori external, opaque, or inaccessible, and for restoring artificial intelligence to its tangible materialities. In doing so, it contributes to the field of ICT for sustainability by affirming research-creation as a rigorous means of disentangling the material and environmental infrastructures that computational systems both depend on and obscure.

2606.10947 2026-06-10 cs.DM 新提交

Enumerating tuples of spanning trees

枚举生成树的元组

Rahul CS, Michal Wlodarczyk

AI总结 本文基于Kaiser对Tutte-Nash-Williams定理的替代证明,提出一种多项式延迟算法,用于枚举图中所有k边不交生成树的集合。

详情
AI中文摘要

判断一个图是否具有k条边不交生成树是一个被充分研究的问题。我们考虑以多项式延迟枚举所有生成树集合的问题。这项工作基于Kaiser [1] 对Tutte和Nash-Williams关于具有k条边不交生成树的图的特征的替代证明。其思想是为所有森林包维护一个决策树,并对其进行深度优先搜索。我们通过使用Kaiser技术 [1] 的变体计算每个节点的子节点,归纳地构建这个决策树。

英文摘要

Deciding whether a graph has k-edge-disjoint spanning trees is a well-studied problem. We consider the problem of enumerating all sets of spanning trees with polynomial delay. This work is based on the alternate proof of Tutte and Nash-Williams' characterization of graphs with k edge-disjoint spanning trees by Kaiser [1]. The idea is to maintain a decision tree for all forest-packs and perform a Depth-First Search over it. We build this decision tree inductively by computing each node's children using a variant of Kaiser's technique [1].

2606.10945 2026-06-10 cs.CR cs.SE 新提交

Context-Based Adversarial Attacks on AI Code Generators: Vulnerability Analysis and Implications

基于上下文的AI代码生成器对抗攻击:漏洞分析与影响

Walther A. Del Orbe, John D. Hastings, Varghese Vaidyan

AI总结 研究通过2800次实验,发现上下文对抗攻击使代码生成漏洞增加10.7倍,并提出双层防御框架实现89.1%检测率。

Comments 6 pages, 8 tables

详情
AI中文摘要

AI驱动的代码生成系统已转变软件开发,但引入了关键的推理时安全漏洞。本研究对基于上下文的对抗攻击进行了系统调查,其中策略性构建的上下文输入(包括注释、文档、变量名)使大型语言模型偏向生成可利用的代码。通过在CodeT5+、CodeLlama、GPT-3.5-Turbo和GPT-4上进行的2800次受控实验,我们量化了攻击有效性和防御机制。结果表明,对抗条件使漏洞生成增加10.7倍(从3.5%到37.4%),其中直接指令攻击在GPT-3.5-Turbo上达到100%成功率。跨模型迁移率达到60-100%,表明存在系统架构漏洞而非模型特定缺陷。我们的双层防御框架实现了89.1%的检测率,0.3%的误报率和520ms延迟,证明了在开发环境中实时部署的实践可行性。

英文摘要

AI-powered code generation systems have transformed software development but introduce critical inference-time security vulnerabilities. This research presents a systematic investigation of context-based adversarial attacks, where strategically crafted contextual inputs, including comments, documentation, variable names, bias large language models toward generating exploitable code. Through 2,800 controlled experiments across CodeT5+, CodeLlama, GPT-3.5-Turbo, and GPT-4, we quantify attack effectiveness and defense mechanisms. Results demonstrate that adversarial conditions increase vulnerability generation 10.7x (from 3.5% to 37.4%), with direct instruction attacks achieving 100% success on GPT-3.5-Turbo. Cross-model transferability reaches 60-100%, indicating systemic architectural vulnerabilities rather than model-specific flaws. Our dual-layer defense framework achieves 89.1% detection rate with 0.3% false positives and 520ms latency, demonstrating practical feasibility for real-time deployment in development environments.

2606.10930 2026-06-10 cs.FL cs.DM 新提交

On the Complexity of the Bi-infinite Post Correspondence Problem

双无限波斯特对应问题的复杂性

Olivier Finkel, Vesa Halava

AI总结 研究双无限波斯特对应问题在算术层级中的复杂性,通过归约证明其位于第2层(Σ₂⁰ \ (Π₁⁰ ∪ Σ₁⁰)),并证明相关变体均为Π₁⁰-完全。

详情
AI中文摘要

在双无限波斯特对应问题($\Z$PCP)中,询问是否可以从给定的有限对词集合中相应地构造出相同的双无限词。本文研究其在算术层级中的复杂性,并证明它属于$\Si^0_2 \setminus (\Pi^0_1 \cup \Si^0_1)$,因此位于算术层级的第2层。为了证明,我们提出了一系列归约,从图灵机的不停机问题通过无限PCP、$s$-移位无限PCP和$s$-移位$\Z$PCP(对所有自然数$s$)最终归约到$\Z$PCP。在此过程中,我们证明了对于单射态射的无限PCP是不可判定的,并且无限单射PCP、$s$-移位无限PCP、$s$-移位$\Z$PCP以及(确定性和可逆)半Thue系统的不终止问题都是$\Pi^0_1$-完全的。

英文摘要

In the bi-infinite Post Correspondence Problem ($\Z$PCP), it is asked whether the same bi-infinite word can be constructed correspondingly from a given finite set of pairs of words. In this article, we study its complexity with respect to the arithmetical hierarchy and prove that it is in $\Si^0_2 \setminus (Π^0_1 \cup \Si^0_1)$ and, therefore, at the level 2 of the arithmetical hierarchy. For the proof, we present a sequence of reductions starting from the nonhalting of the Turing machine all the way to $\Z$PCP via infinite PCP, an $s$-shift infinite PCP and $s$-shift $\Z$PCP for all natural numbers $s$. In the process, we prove that the infinite PCP is undecidable for injective morphisms, and that the infinite injective PCP, $s$-shift infinite PCP, $s$-shift $\Z$PCP and the non-termination problem for (deterministic and reversible) semi-Thue systems are all $Π^0_1$-complete.

2606.10925 2026-06-10 cs.SE 新提交

Early Comparative Evaluation of Transformer Models for Multilingual Software Vulnerability Detection

用于多语言软件漏洞检测的Transformer模型的早期比较评估

Fiza Naseer, Javad Khan, Muhammad Yaqoob, Alexios Mylonas

AI总结 本文使用CVEFixes数据集和分层交叉验证,比较BERT、RoBERTa和CodeBERT在HTML、Python、JavaScript和PHP上的漏洞检测性能,发现跨语言性能差异显著,需更语言感知的Transformer策略。

详情
AI中文摘要

随着现代应用程序结合多种编程语言,软件漏洞检测变得越来越重要。本文使用CVEFixes数据集和按语言的三折分层交叉验证,对BERT、RoBERTa和CodeBERT在HTML、Python、JavaScript和PHP上的二进制漏洞检测进行了早期比较评估。结果显示跨语言存在明显的性能差异,表明多语言漏洞检测需要更语言感知和鲁棒的基于Transformer的建模策略。

英文摘要

Software vulnerability detection is increasingly important as modern applications combine multiple programming languages. This paper presents an early comparative evaluation of BERT, RoBERTa, and CodeBERT for binary vulnerability detection across HTML, Python, JavaScript, and PHP using the CVEFixes dataset and language-wise three-fold stratified cross-validation. The results show clear performance differences across languages, indicating that multilingual vulnerability detection requires more language-aware and robust transformer-based modelling strategies.