arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.19657 2026-04-22 cs.CR cs.AI cs.OS

An AI Agent Execution Environment to Safeguard User Data

Robert Stanley, Avi Verma, Lillian Tsai, Konstantinos Kallas, Sam Kumar

详情

英文摘要

AI agents promise to serve as general-purpose personal assistants for their users, which requires them to have access to private user data (e.g., personal and financial information). This poses a serious risk to security and privacy. Adversaries may attack the AI model (e.g., via prompt injection) to exfiltrate user data. Furthermore, sharing private data with an AI agent requires users to trust a potentially unscrupulous or compromised AI model provider with their private data. This paper presents GAAP (Guaranteed Accounting for Agent Privacy), an execution environment for AI agents that guarantees confidentiality for private user data. Through dynamic and directed user prompts, GAAP collects permission specifications from users describing how their private data may be shared, and GAAP enforces that the agent's disclosures of private user data, including disclosures to the AI model and its provider, comply with these specifications. Crucially, GAAP provides this guarantee deterministically, without trusting the agent with private user data, and without requiring any AI model or the user prompt to be free of attacks. GAAP enforces the user's permission specification by tracking how the AI agent accesses and uses private user data. It augments Information Flow Control with novel persistent data stores and annotations that enable it to track the flow of private information both across execution steps within a single task, and also over multiple tasks separated in time. Our evaluation confirms that GAAP blocks all data disclosure attacks, including those that make other state-of-the-art systems disclose private user data to untrusted parties, without a significant impact on agent utility.

URL PDF HTML ☆

赞 0 踩 0

2604.19639 2026-04-22 eess.SY cs.AI cs.SY

Safety-Critical Contextual Control via Online Riemannian Optimization with World Models

Tongxin Li

Comments 20 pages, 12 figures

2604.19566 2026-04-22 cs.IR cs.CL

Diagnosable ColBERT: Debugging Late-Interaction Retrieval Models Using a Learned Latent Space as Reference

François Remy

2604.19540 2026-04-22 cs.MA cs.AI

Mesh Memory Protocol: Semantic Infrastructure for Multi-Agent LLM Systems

Hongwei Xu

Comments 23 pages, 2 figures, 2 listings, 1 table. MMP v0.2.3 specification at https://sym.bot/spec/mmp (CC BY 4.0). Reference implementations on npm (@sym-bot/sym, @sym-bot/mesh-channel; Apache 2.0)

详情

英文摘要

Teams of LLM agents increasingly collaborate on tasks spanning days or weeks: multi-day data-generation sprints where generator, reviewer, and auditor agents coordinate in real time on overlapping batches; specialists carrying findings forward across session restarts; product decisions compounding over many review rounds. This requires agents to share, evaluate, and combine each other's cognitive state in real time across sessions. We call this cross-session agent-to-agent cognitive collaboration, distinct from parallel agent execution. To enable it, three problems must be solved together. (P1) Each agent decides field by field what to accept from peers, not accept or reject whole messages. (P2) Every claim is traceable to source, so returning claims are recognised as echoes of the receiver's own prior thinking. (P3) Memory that survives session restarts is relevant because of how it was stored, not how it is retrieved. These are protocol-level properties at the semantic layer of agent communication, distinct from tool-access and task-delegation protocols at lower layers. We call this missing protocol layer "semantic infrastructure," and the Mesh Memory Protocol (MMP) specifies it. Four composable primitives work together: CAT7, a fixed seven-field schema for every Cognitive Memory Block (CMB); SVAF, which evaluates each field against the receiver's role-indexed anchors and realises P1; inter-agent lineage, carried as parents and ancestors of content-hash keys and realising P2; and remix, which stores only the receiver's own role-evaluated understanding of each accepted CMB, never the raw peer signal, realising P3. MMP is specified, shipped, and running in production across three reference deployments, where each session runs an autonomous agent as a mesh peer with its own identity and memory, collaborating with other agents across the network for collective intelligence.

URL PDF HTML ☆

赞 0 踩 0

2604.19526 2026-04-22 cs.CR cs.LG cs.SE

Evaluating LLM-Generated Obfuscated XSS Payloads for Machine Learning-Based Detection

Divyesh Gabbireddy, Suman Saha

2604.19505 2026-04-22 cs.IR cs.CL cs.DL

Enhancing Unsupervised Keyword Extraction in Academic Papers through Integrating Highlights with Abstract

Yi Xiang, Chengzhi Zhang

Comments Scientometrics

2604.19468 2026-04-22 cs.CY cs.AI cs.HC

Fairness Audits of Institutional Risk Models in Deployed ML Pipelines

Kelly McConvey, Dipto Das, Maya Ghai, Angelina Zhai, Rosa Lee, Shion Guha

2604.19431 2026-04-22 cs.LO cs.AI

Counting Worlds Branching Time Semantics for post-hoc Bias Mitigation in generative AI

Alessandro G. Buda, Giuseppe Primiero, Leonardo Ceragioli, Melissa Antonelli

2604.19414 2026-04-22 cs.IR cs.LG

CAST: Modeling Semantic-Level Transitions for Complementary-Aware Sequential Recommendation

Qian Zhang, Lech Szymanski, Haibo Zhang, Jeremiah D. Deng

Comments 10 pages, 5 figures

详情

英文摘要

Sequential Recommendation (SR) aims to predict the next interaction of a user based on their behavior sequence, where complementary relations often provide essential signals for predicting the next item. However, mainstream models relying on sparse co-purchase statistics often mistake spurious correlations (e.g., due to popularity bias) for true complementary relations. Identifying true complementary relations requires capturing the fine-grained item semantics (e.g., specifications) that simple cooccurrence statistics would be unable to model. While recent semantics-based methods utilize discrete semantic codes to represent items, they typically aggregate semantic codes into coarse item representations. This aggregation process blurs specific semantic details required to identify complementarity. To address these critical limitations and effectively leverage semantics for capturing reliable complementary relations, we propose a Complementary-Aware Semantic Transition (CAST) framework that introduces a new modeling paradigm built upon semantic-level transitions. Specifically, a semantic-level transition module is designed to model dynamic transitions directly in the discrete semantic code space, effectively capturing fine-grained semantic dependencies often lost in aggregated item representations. Then, a complementary prior injection module is designed to incorporate LLM-verified complementary priors into the attention mechanism, thereby prioritizing complementary patterns over co-occurrence statistics. Experiments on multiple e-commerce datasets demonstrate that CAST consistently outperforms the state-of-the-art approaches, achieving up to 17.6% Recall and 16.0% NDCG gains with 65x training acceleration. This validates its effectiveness and efficiency in uncovering latent item complementarity beyond statistics. The code will be released upon acceptance.

URL PDF HTML ☆

赞 0 踩 0

2604.19383 2026-04-22 cond-mat.mtrl-sci cs.AI

Multimodal Transformer for Sample-Aware Prediction of Metal-Organic Framework Properties

Seunghee Han, Jaewoong Lee, Jihan Kim

Comments 22 pages, 7 figures

2604.19343 2026-04-22 cs.NE cs.LG

Scalable Memristive-Friendly Reservoir Computing for Time Series Classification

Coşku Can Horuz, Andrea Ceni, Claudio Gallicchio, Sebastian Otte

Comments 12 pages, 3 figures, 7 tables

2604.19340 2026-04-22 physics.ao-ph cs.LG

Improvements to the post-processing of weather forecasts using machine learning and feature selection

Kazuma Iwase, Tomoyuki Takenawa

Comments 24 pages

2604.18603 2026-04-22 q-bio.QM cs.LG

Dual Triangle Attention: Effective Bidirectional Attention Without Positional Embeddings

Logan Hallee, Jason P. Gleghorn

2604.16529 2026-04-22 cs.SE cs.AI cs.CL cs.LG

Scaling Test-Time Compute for Agentic Coding

Joongwon Kim, Wannan Yang, Kelvin Niu, Hongming Zhang, Yun Zhu, Eryk Helenowski, Ruan Silva, Zhengxing Chen, Srinivasan Iyer, Manzil Zaheer, Daniel Fried, Hannaneh Hajishirzi, Sanjeev Arora, Gabriel Synnaeve, Ruslan Salakhutdinov, Anirudh Goyal

Comments 70 pages, 26 figures, 12 tables

2603.09172 2026-04-22 math.CO cs.AI cs.CC

Reinforced Generation of Combinatorial Structures: Ramsey Numbers

Ansh Nagda, Prabhakar Raghavan, Abhradeep Thakurta

2602.12851 2026-04-22 cs.NI cs.AI cs.CR cs.LG

Chimera: Neuro-Symbolic Attention Primitives for Trustworthy Dataplane Intelligence

Rong Fu, Xiaowen Ma, Kun Liu, Wangyu Wu, Ziyu Kong, Jia Yee Tan, Tailong Luo, Xianda Li, Zeli Su, Youjin Wang, Yongtai Liu, Simon Fong

Comments 22 pages, 10 figures

2602.05416 2026-04-22 cs.CE cs.AI cs.LG physics.ao-ph physics.flu-dyn

Reduced-Order Surrogates for Forced Flexible Mesh Coastal-Ocean Models

Freja Høgholm Petersen, Jesper Sandvig Mariegaard, Rocco Palmitessa, Allan P. Engsig-Karup

Comments Submitted for peer-review in a journal. v2: revised version submitted to journal after minor revisions

2602.02063 2026-04-22 cs.HC cs.AI

See2Refine: Vision-Language Feedback Improves LLM-Based eHMI Action Designers

Ding Xia, Xinyue Gui, Mark Colley, Fan Gao, Zhongyi Zhou, Dongyuan Li, Renhe Jiang, Takeo Igarashi

Comments Accepted to ACL2026

2601.03442 2026-04-22 eess.SY cs.LG cs.SY

Local Updates in Distributed Optimization: Provable Acceleration and Topology Effects

Zuang Wang, Yongqiang Wang

2601.00679 2026-04-22 cs.NE cs.AI cs.LG

QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models

Rachmad Vidya Wicaksana Putra, Pasindu Wickramasinghe, Muhammad Shafique

Comments Accepted at the Design, Automation and Test in Europe Conference (DATE) 2025 on April 20th-22nd, 2026 in Verona, Italy

详情

英文摘要

Large Language Models (LLMs) have been emerging as prominent AI models for solving many natural language tasks due to their high performance (e.g., accuracy) and capabilities in generating high-quality responses to the given inputs. However, their large computational cost, huge memory footprints, and high processing power/energy make it challenging for their embedded deployments. Amid several tinyLLMs, recent works have proposed spike-driven language models (SLMs) for significantly reducing the processing power/energy of LLMs. However, their memory footprints still remain too large for low-cost and resource-constrained embedded devices. Manual quantization approach may effectively compress SLM memory footprints, but it requires a huge design time and compute power to find the quantization setting for each network, hence making this approach not-scalable for handling different networks, performance requirements, and memory budgets. To bridge this gap, we propose QSLM, a novel framework that performs automated quantization for compressing pre-trained SLMs, while meeting the performance and memory constraints. To achieve this, QSLM first identifies the hierarchy of the given network architecture and the sensitivity of network layers under quantization, then employs a tiered quantization strategy (e.g., global-, block-, and module-level quantization) while leveraging a multi-objective performance-and-memory trade-off function to select the final quantization setting. Experimental results indicate that our QSLM reduces memory footprint by up to 86.5%, reduces power consumption by up to 20%, maintains high performance across different tasks (i.e., by up to 84.4% accuracy of sentiment classification on the SST-2 dataset and perplexity score of 23.2 for text generation on the WikiText-2 dataset) close to the original non-quantized model while meeting the performance and memory constraints.

URL PDF HTML ☆

赞 0 踩 0

2512.19442 2026-04-22 eess.SP cs.LG cs.SD

Real-Time Streamable Generative Speech Restoration with Flow Matching

Simon Welker, Bunlong Lay, Maris Hillemann, Tal Peer, Timo Gerkmann

Comments This work has been submitted to the IEEE for possible publication

2510.19341 2026-04-22 math.OC cs.LG

Nonmonotone subgradient methods based on a local descent lemma

Francisco J. Aragón-Artacho, Rubén Campoy, Pedro Pérez-Aros, David Torregrosa-Belén

2510.17925 2026-04-22 cs.SE cs.AI

SpecAgent: A Speculative Retrieval and Forecasting Agent for Code Completion

George Ma, Anurag Koul, Qi Chen, Yawen Wu, Sachit Kuhar, Yu Yu, Aritra Sengupta, Varun Kumar, Murali Krishna Ramanathan

Comments In Proceedings of the Sixty-Fourth Annual Meeting of the Association for Computational Linguistics (2026)

2510.10866 2026-04-22 stat.ML cs.LG

Quantifying Data Similarity Using Cross Learning

Shudong Sun, Hao Helen Zhang, Joseph C Watkins

2509.04072 2026-04-22 eess.AS cs.CL cs.SD

Computational Narrative Understanding for Expressive Text-to-Speech

Gaspard Michel, Elena V. Epure, Christophe Cerisara

Comments Findings of ACL 2026

2508.12268 2026-04-22 cs.HC cs.CV

iTrace: Click-Based Gaze Visualization on the Apple Vision Pro

Esra Mehmedova, Santiago Berrezueta-Guzman, Stefan Wagner

Comments Paper submitted to review

2506.14665 2026-04-22 physics.chem-ph cs.AI cs.CE cs.LG physics.comp-ph

Accurate and scalable exchange-correlation with deep learning

Giulia Luise, Chin-Wei Huang, Thijs Vogels, Derk P. Kooi, Sebastian Ehlert, Stephanie Lanius, Klaas J. H. Giesbertz, Amir Karton, Deniz Gunceler, Stefano Battaglia, Gregor N. C. Simm, P. Bernát Szabó, Megan Stanley, Wessel P. Bruinsma, Lin Huang, Xinran Wei, José Garrido Torres, Abylay Katbashev, Rodrigo Chavez Zavaleta, Bálint Máté, Sékou-Oumar Kaba, Roberto Sordillo, Yingrong Chen, David B. Williams-Young, Christopher M. Bishop, Jan Hermann, Rianne van den Berg, Paola Gori-Giorgi

Comments The Skala model and inference code are available under MIT license at https://github.com/microsoft/skala

2501.02200 2026-04-22 cs.NE cs.AI cs.CV cs.LG

Learning Evolution via Optimization Knowledge Adaptation

Chao Wang, Lingling Li, Licheng Jiao, Jiaxuan Zhao, Fang Liu, Shuyuan Yang

Comments This work has been accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence

2410.16893 2026-04-22 math.OC cs.LG

Global Optimization of Gaussian Process Acquisition Functions Using a Piecewise-Linear Kernel Approximation

Yilin Xie, Shiqiang Zhang, Joel A. Paulson, Calvin Tsay

Comments 17 pages, 4 figures, 5 tables

2409.06080 2026-04-22 cond-mat.mtrl-sci cs.LG

Regression with Large Language Models for Materials and Molecular Property Prediction

Ryan Jacobs, Maciej P. Polak, Lane E. Schultz, Hamed Mahdavi, Vasant Honavar, Dane Morgan