arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.01591 2026-05-05 cs.IR cs.CL

Led to Mislead: Adversarial Content Injection for Attacks on Neural Ranking Models

Amin Bigdeli, Amir Khosrojerdi, Radin Hamidi Rad, Morteza Zihayat, Charles L. A. Clarke, Ebrahim Bagheri

详情

英文摘要

Neural Ranking Models (NRMs) are central to modern information retrieval but remain highly vulnerable to adversarial manipulation. Existing attacks often rely on heuristics or surrogate models, limiting effectiveness and transferability. We propose CRAFT, a supervised framework for black-box adversarial rank attacks powered by large language models (LLMs). CRAFT operates in three stages: adversarial dataset generation via retrieval-augmented generation and self-refinement, supervised fine-tuning on curated adversarial examples, and preference-guided optimization to align generations with rank-promotion objectives. Extensive experiments on the MS MARCO passage dataset, TREC Deep Learning 2019, and TREC Deep Learning 2020 benchmarks show that CRAFT significantly outperforms state-of-the-art baselines, achieving higher promotion rates and rank boosts while preserving fluency and semantic fidelity. Moreover, CRAFT transfers effectively across diverse ranking architectures, including cross-encoder, embedding-based, and LLM-based rankers, underscoring vulnerabilities in real-world retrieval systems. This work provides a principled framework for studying adversarial threats in NRMs, underscores the risks of generative AI in rank manipulation, and provides a foundation for developing more robust retrieval systems. To support reproducibility, we publicly release our source code, trained models, and prompt templates.

URL PDF HTML ☆

赞 0 踩 0

2605.01582 2026-05-05 cs.IR cs.AI

KG-First, LLM-Fallback: A Hybrid Microservice for Grounded Skill Search and Explanation

Ngoc Luyen Le, Marie-Hélène Abel, Bertrand Laforge

2605.01579 2026-05-05 stat.ME cs.LG

Minimum Specification Perturbation: Robustness as Distance-to-Falsification in Causal Inference

Hoang Dang, Luan Pham, Minh Nguyen

Comments 36 pages, 2 figures

2605.01567 2026-05-05 cs.SE cs.CL cs.LG

Feedback-Normalized Developer Memory for Reinforcement-Learning Coding Agents: A Safety-Gated MCP Architecture

Mehmet Iscan

Comments 25 pages, 5 figures, 7 tables. Preprint. Implementation and supplementary artifacts are available at the project repository

详情

英文摘要

Large language model (LLM) coding agents increasingly operate over repositories, terminals, tests, and execution traces across long software-engineering episodes. Persistent memory is useful, but static vector stores or generic retrieval-augmented generation (RAG) are insufficient for reinforcement-learning (RL) code development, where small details can alter Bellman targets, terminal masks, gradient flow, or validation claims. This paper presents RL Developer Memory, a local-first, Model Context Protocol (MCP)-native developer-memory architecture for RL coding agents. It treats memory selection as a logged contextual decision process: issue_match ranks candidates and records telemetry, issue_feedback maps raw labels to bounded rewards, and issue_record_resolution links verified resolutions to earlier retrieval events. A deterministic ranker remains deployed, while a contextual-bandit residual policy runs in shadow mode and can affect canary behavior only through conservative off-policy-evaluation (OPE) gates. RL/control memories require theory-to-code metadata and review-gated governance. The system is evaluated on a deterministic 200-case benchmark with RL algorithm bugs, hard negatives, review-gated RL/control cases, and low-risk failures. In the same-commit comparison, deterministic control and full shadow/OPE both achieve 80.0% expected-decision accuracy and 100.0% hard-negative suppression; the full configuration adds learning telemetry rather than accuracy gain. Static validation passed 11/11 checks; dynamic integration passed 10/10 cases. The evidence reports limits: active learned-policy deployment and official-client MCP interoperability are unsupported, live full-configuration latency regresses, and 40 residual non-RL failures remain. The contribution is an auditable memory-control architecture with explicit claim boundaries, not a universal coding-agent improvement claim.

URL PDF HTML ☆

赞 0 踩 0

2605.01562 2026-05-05 cs.SE cs.AI

Neuro-Symbolic Agents for Hallucination-Free Requirements Reuse

Ahmed Ibrahim

2605.01561 2026-05-05 econ.EM cs.LG physics.soc-ph

Hall-Like Transversal Stress and Sandpile Criticality on Real Production Networks

Diego Vallarino

2605.01546 2026-05-05 cs.NI cs.AI

6G Needs Agents: Toward Agentic AI-Native Networks for Autonomous Intelligence

Mohamed Amine Ferrag, Abderrahmane Lakas, Merouane Debbah

2605.01492 2026-05-05 stat.ML cs.IT cs.LG math.IT

Stabilizing Private LASSO under Heterogeneous Covariates via Anisotropic Objective Perturbation

Haruka Tanzawa, Ayaka Sakata

Comments 6 pages, 5 figures

2605.01471 2026-05-05 cs.SE cs.AI

Practical Limits of Autonomous Test Repair: A Multi-Agent Case Study with LLM-Driven Discovery and Self-Correction

Hyukjoo Lee

Comments Industrial case study; submitted for review

2605.01467 2026-05-05 math.OC cs.CV cs.NA math.NA

Quaternion Nonlinear Transform-Induced Nuclear Norm for Low-Rank Tensor Completion

Biswarup Karmakar, Ratikanta Behera

Comments 25 pages

2605.01452 2026-05-05 stat.ME cs.LG

Stable Localized Conformal Prediction via Transduction

Yinjie Min, Liuhua Peng, Changliang Zou

2605.01449 2026-05-05 cs.CR cs.AI

VisInject: Disruption != Injection -- A Dual-Dimension Evaluation of Universal Adversarial Attacks on Vision-Language Models

Pang Liu, Yingjie Lao

2605.01423 2026-05-05 hep-ex cs.AI cs.MA

HepScript: A Dual-Use DSL for Human-AI Collaborative Data Analysis Workflows in High-Energy Physics

Junkun Jiao, Tong Liu, Ke Li, Weimin Song, Yipu Liao, Bolun Zhang, Beijiang Liu, Chang-Zheng Yuan, Yue Sun

2605.01416 2026-05-05 cs.CY cs.CL

Who Decides What Is Harmful? Content Moderation Policy Through A Multi-Agent Personalised Inference Framework

Ewelina Gajewska, Michal Wawer, Katarzyna Budzynska, Jaroslaw A. Chudziak

Comments The paper has been accepted to the 34th European Conference on Information Systems (ECIS 2026). The official paper version will appear in the conference proceedings

2605.01407 2026-05-05 cs.IR cs.CL

The Pre-Training Study of Expanded-SPLADE Models on Web Document Titles

Hiun Kim, Tae Kwan Lee, Taeryun Won

2605.01404 2026-05-05 cs.AR cs.AI

AMSnet-q: Unsupervised Circuit Identification and Performance Labeling for AMS Circuits

Ze Zhang, Junzhuo Zhou, Yichen Shi, Zhuofu Tao, Rui Ji, Zhiping Yu, Quan Chen, Ting-Jung Lin, Lei He

2605.01400 2026-05-05 cs.HC cs.AI cs.CY cs.IR

Investigating the Effects of Different Levels of User Control in an Interactive Educational Recommender System

Qurat Ul Ain, Mohamed Amine Chatti, William Kana Tsoplefack, Rawaa Alatrash, Shoeb Joarder

Comments Submitted to TORS. arXiv admin note: text overlap with arXiv:2501.12894

2605.01394 2026-05-05 cs.SE cs.AI

LiveFMBench: Unveiling the Power and Limits of Agentic Workflows in Specification Generation

Dong Xu, Jialun Cao, Guozhao Mo, Junjie Hu, Cheng Wen, Hongyu Lin, Xianpei Han, Shengchao Qin, Cong Tian, Shing-Chi Cheung, Le Sun, Yaojie Lu

2605.01392 2026-05-05 cs.SE cs.AI

Using LLMs in Software Design: An Empirical Study of GitHub and A Practitioner Survey

Yifei Wang, Ruiyin Li, Peng Liang, Yangxiao Cai, Zengyang Li, Mojtaba Shahin, Arif Ali Khan, Qiong Feng

Comments 29 pages, 8 images, 6 tables, Manuscript submitted to a Journal (2026)

2605.01367 2026-05-05 quant-ph cs.LG cs.SY eess.SY

From Characterization To Construction: Generative Quantum Circuit Synthesis from Gate Set Tomography Data

King Yiu Yu, Aritra Sarkar, Erbing Hua, Maximilian Rimbach-Russ, Ryoichi Ishihara, Sebastian Feld

Comments 19 pages, 3 figures

详情

英文摘要

High-fidelity circuit execution on noisy intermediate-scale quantum devices is bottlenecked by compilation pipelines that disregard complex, correlated noise. To address this, this methodology article proposes a quantum machine learning control (QMLC) framework for generative quantum circuit synthesis from gate-set tomography (GST) data that bypasses the traditional two-step pipeline of characterizing native quantum gates via GST followed by unitary decomposition algorithms. Instead, a generative concept space is directly learnt from GST data, enabling conditional synthesis of quantum circuits on a desired output distribution. Our approach tokenizes GST germ circuits and embeds them into a structured latent space using a curriculum-learning-motivated strategy, starting with short circuits and progressively incorporating longer ones with diverse output statistics. The embedded sequences are processed by a set-vision transformer with permutation-invariant pooling, producing k-seed vectors that represent the learned concept space of the quantum device. Aggregating data across multiple circuits makes this latent representation inherently context-aware, capturing the shared physical noise environment (e.g., crosstalk, drift) that isolated gate metrics miss. We propose an unconditional diffusion model to sample from the concept space. During inference, a user provides a target measurement distribution, and the model generates a corresponding circuit. To ensure fidelity and robustness, the output is denoised using a diffusion model that operates on the target conditional covariance matrix. This end-to-end framework is a step towards context-aware, hardware-native circuit synthesis directly from raw GST data, which offers a new paradigm for integrating quantum control and compilation. The QMLC framework is particularly suited for near-term quantum devices with complex calibration procedures.

URL PDF HTML ☆

赞 0 踩 0

2605.01363 2026-05-05 hep-ex cs.LG hep-ph stat.ME

Data-Driven, Geometry-Aware Optimal-Transport Calibration of Flavor Tagger

Yeonjoon Kim, Un-ki Yang

Comments 32 Pages, 12 Figures

2605.01352 2026-05-05 cs.OS cs.AI cs.DC

VUDA: Breaking CUDA-Vulkan Isolation for Spatial Sharing of Compute and Graphics on the Same GPU

Bin Xu, Pengfei Hu, Wenxin Zheng, Jinyu Gu, Haibo Chen

2605.01341 2026-05-05 cs.LO cs.AI

ABox Abduction for Inconsistent Knowledge Bases under Repair Semantics

Anselm Haak, Patrick Koopmann, Yasir Mahmood, Anni-Yasmin Turhan

2605.01335 2026-05-05 stat.ML cs.LG math.ST stat.TH

Mean Testing under Truncation beyond Gaussian

Yuhao Wang, Roberto Imbuzeiro Oliveira, Themis Gouleakis

2605.01319 2026-05-05 quant-ph cs.LG

Barren Plateaus as Destructive Interference: A Diagnostic Framework and Implications for Structured Ansatzes

Pilsung Kang

2605.01307 2026-05-05 eess.SP cs.AI cs.NI

Spectral- and Energy-efficient Multi-BS Multi-RIS Pinching-antenna Systems: A GNN-based Approach

Changpeng He, Yang Lu, Wei Chen, Bo Ai, Arumugam Nallanathan, Zhiguo Ding

2605.01306 2026-05-05 physics.optics cs.LG physics.app-ph

Machine Learning Enhanced Laser Spectroscopy for Multi-Species Gas Detection in Complex and Harsh Environments

Mohamed Sy

Comments PhD thesis

2605.01298 2026-05-05 cs.CR cs.CV

Checkerboard: A Simple, Effective, Efficient and Learning-free Clean Label Backdoor Attack with Low Poisoning Budget

Yi Yang, Jinyang Huang, Binbin Liu, Feng-Qi Cui, Xiaokang Zhou, Zhi Liu, Jie Zhang, Meng Li

2605.01282 2026-05-05 eess.IV cs.AI

A Target-Free Harmonization Method for MRI

Minjun Kim, Dong Ju Mun, Hwihun Jeong, Hangyeol Park, Haechang Lee, Se Young Chun, Jongho Lee

Comments 37 pages, 10 figures

2605.01280 2026-05-05 cs.DC cs.AI

Position: LLM Serving Needs Mathematical Optimization and Algorithmic Foundations, Not Just Heuristics

Zijie Zhou