arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2601.00911 2026-04-23 cs.CR cs.AI cs.ET cs.HC cs.LG

Device-Native Autonomous Agents for Privacy-Preserving Negotiations

Joyjit Roy, Samaresh Kumar Singh

Comments 9 pages, 6 figures, 9 tables. This version updates metadata after publication in IEEE Xplore

详情

DOI: 10.1109/SoutheastCon63549.2026.11476181
Journal ref: 2026 IEEE SoutheastCon, Huntsville, AL, USA, 2026

英文摘要

Automated negotiations in insurance and business-to-business (B2B) commerce encounter substantial challenges. Current systems force a trade-off between convenience and privacy by routing sensitive financial data through centralized servers, increasing security risks, and diminishing user trust. This study introduces a device-native autonomous Agentic AI system for privacy-preserving negotiations. The proposed system operates exclusively on user hardware, enabling real-time bargaining while maintaining sensitive constraints locally. It integrates zero-knowledge proofs to ensure privacy and employs distilled world models to support advanced on-device reasoning. The architecture incorporates six technical components within an Agentic AI workflow. Agents autonomously plan negotiation strategies, conduct secure multi-party bargaining, and generate cryptographic audit trails without exposing user data to external servers. The system is evaluated in insurance and B2B procurement scenarios across diverse device configurations. Results show an average success rate of 87 %, a 2.4x reduction in latency relative to cloud baselines, and strong privacy preservation through zero-knowledge proofs. User studies show 27 % higher trust scores when decision trails are available. These findings establish a foundation for trustworthy autonomous agents in privacy-sensitive financial domains.

URL PDF HTML ☆

赞 0 踩 0

2512.15808 2026-04-23 q-bio.QM cs.AI cs.CV cs.LG

Foundation Models in Biomedical Imaging: Turning Hype into Reality

Amgad Muneer, Kai Zhang, Ibraheem Hamdi, Rizwan Qureshi, Muhammad Waqas, Shereen Fouad, Hazrat Ali, Syed Muhammad Anwar, Jia Wu

Comments 9 figures and 3 tables

2512.12463 2026-04-23 stat.ML cs.LG math.ST stat.TH

Understanding Overparametrization in Survival Models through Interpolation

Yin Liu, Jianwen Cai, Didong Li

2511.17265 2026-04-23 cs.AR cs.AI cs.ET cs.PF

DISCA: A Digital In-memory Stochastic Computing Architecture Using A Compressed Bent-Pyramid Format

Shady Agwa, Yikang Shen, Shiwei Wang, Themis Prodromakis

Comments This work has been accepted for publication in the 2025 37th International Conference on Microelectronics (ICM)

详情

DOI: 10.1109/ICM66518.2025.11322444
Journal ref: 2025 37th International Conference on Microelectronics (ICM)

英文摘要

Nowadays, we are witnessing an Artificial Intelligence revolution that dominates the technology landscape in various application domains, such as healthcare, robotics, automotive, security, and defense. Massive-scale AI models, which mimic the human brain's functionality, typically feature millions and even billions of parameters through data-intensive matrix multiplication tasks. While conventional Von-Neumann architectures struggle with the memory wall and the end of Moore's Law, these AI applications are migrating rapidly towards the edge, such as in robotics and unmanned aerial vehicles for surveillance, thereby adding more constraints to the hardware budget of AI architectures at the edge. Although in-memory computing has been proposed as a promising solution for the memory wall, both analog and digital in-memory computing architectures suffer from substantial degradation of the proposed benefits due to various design limitations. We propose a new digital in-memory stochastic computing architecture, DISCA, utilizing a compressed version of the quasi-stochastic Bent-Pyramid data format. DISCA inherits the same computational simplicity of analog computing, while preserving the same scalability, productivity, and reliability of digital systems. Post-layout modeling results of DISCA show an energy efficiency of 3.59TOPS/W per bit at 500 MHz using a commercial 180 nm CMOS technology. Therefore, DISCA significantly improves the energy efficiency for matrix multiplication workloads by orders of magnitude if scaled and compared to its counterpart architectures.

URL PDF HTML ☆

赞 0 踩 0

2511.17113 2026-04-23 cs.CR cs.AI cs.LG

AutoGraphAD: Unsupervised network anomaly detection using Variational Graph Autoencoders

Georgios Anyfantis, Pere Barlet-Ros

Comments 6 pages, 5 figures

2511.15141 2026-04-23 cs.IR cs.AI

ItemRAG: Item-Based Retrieval-Augmented Generation for LLM-Based Recommendation

Sunwoo Kim, Geon Lee, Kyungho Kim, Jaemin Yoo, Kijung Shin

Comments Published as a conference paper at SIGIR 2026 (short)

2511.14311 2026-04-23 eess.SY cs.RO cs.SY

Multi-Timescale Model Predictive Control for Slow-Fast Systems

Lukas Schroth, Daniel Morton, Amon Lahr, Daniele Gammelli, Andrea Carron, Marco Pavone

2511.02849 2026-04-23 eess.SP cs.CV eess.IV

Benchmarking ResNet for Short-Term Hypoglycemia Classification with DiaData

Beyza Cinar, Maria Maleshkova

Comments 11 pages, 5 Tables, 4 Figures, BHI 2025 conference (JBHI special issue). References were corrected

2510.08465 2026-04-23 stat.ML cs.LG

Accumulated Aggregated D-Optimal Designs for Estimating Main Effects in Black-Box Models

Chih-Yu Chang, Ming-Chung Chang

2510.05786 2026-04-23 cs.GT cs.DM cs.LG math.CO

Möbius transforms and Shapley values for vector-valued functions on weighted directed acyclic multigraphs

Patrick Forré, Abel Jansma

Comments 50 pages, 2 figures

详情

英文摘要

Möbius inversion and Shapley values are two mathematical tools for characterizing and decomposing higher-order structure in complex systems. The former defines higher-order interactions as discrete derivatives over a partial order; the latter provides a principled way to attribute those interactions back to the `atomic' elements of the system. Both have found wide application, from combinatorics and cooperative game theory to machine learning and explainable AI. We generalize both tools simultaneously in two orthogonal directions: 1) from real-valued functions to functions valued in any abelian group (in particular, vector-valued functions), and 2) from partial orders and lattices to directed acyclic multigraphs (DAMGs) and weighted versions thereof. The classical axioms, linearity, efficiency, null player, and symmetry, which uniquely characterize Shapley values on lattices, are insufficient in this more general setting. We resolve this by introducing projection operators that recursively re-attribute higher-order synergies down to the roots of the graph, and by proposing two natural axioms: weak elements (coalitions with zero synergy can be removed without affecting any attribution) and flat hierarchy (on graphs with no intermediate hierarchy, attributions are distributed proportionally to edge counts). Together with linearity, these three axioms uniquely determine the Shapley values via a simple explicit formula, while automatically implying efficiency, null player, symmetry, and a novel projection property. The resulting framework recovers all existing lattice-based definitions as special cases, and naturally handles settings, such as games on non-lattice partial orders, which were previously out of reach. The extension to vector-valued functions and general DAMG-structured hierarchies opens new application areas in machine learning, natural language processing, and explainable AI.

URL PDF HTML ☆

赞 0 踩 0

2509.19367 2026-04-23 eess.SP cs.LG stat.ML

Low-Cost Sensor Fusion Framework for Organic Substance Classification and Quality Control Using Classification Methods

Borhan Uddin Chowdhury, Damian Valles, Md Raf E Ul Shougat

Comments Copyright 2025 IEEE. This is the author's version of the work accepted for publication in FMLDS 2025. The final version will be published by IEEE and available via DOI (to be inserted when available). Accepted at FMLDS 2025, to appear in IEEE Xplore. 8 pages, 17 figures, 3 tables

2509.16002 2026-04-23 quant-ph cs.LG

Scalable Quantum Reinforcement Learning on NISQ Devices with Dynamic-Circuit Qubit Reuse and Grover Optimization

Thet Htar Su, Shaswot Shresthamali, Masaaki Kondo

2509.08539 2026-04-23 cs.HC cs.LG

Motion-Based User Identification across XR and Metaverse Applications by Deep Classification and Similarity Learning

Lukas Schach, Christian Rack, Ryan P. McMahan, Marc Erich Latoschik

2509.02060 2026-04-23 q-bio.BM cs.LG

Morphology-Aware Peptide Discovery via Masked Conditional Generative Modeling

Nuno Costa, Julija Zavadlav

Comments 46 pages, 4 figures, 6 tables

2508.18948 2026-04-23 hep-th cond-mat.dis-nn cs.LG stat.ML

Gauge-covariant stochastic neural fields: Stability and finite-width effects

Rodrigo Carmo Terin

Comments 20 pages, 2 figures, 1 table. Accepted version for publication in Scientific Reports

2508.15411 2026-04-23 cs.SE cs.CL cs.LG cs.MA

Foundational Design Principles and Patterns for Building Robust and Adaptive GenAI-Native Systems

Frederik Vandeputte

2508.08822 2026-04-23 cs.AR cs.AI cs.ET cs.PF

OISMA: On-the-fly In-memory Stochastic Multiplication Architecture for Matrix-Multiplication Workloads

Shady Agwa, Yihan Pan, Georgios Papandroulidakis, Themis Prodromakis

Comments This work has been accepted for publication by the IEEE Journal on Exploratory Solid-State Computational Devices and Circuits

详情

DOI: 10.1109/JXCDC.2026.3680281
Journal ref: IEEE Journal on Exploratory Solid-State Computational Devices and Circuits 2026

英文摘要

Artificial intelligence (AI) models are currently driven by a significant upscaling of their complexity, with massive matrix-multiplication workloads representing the major computational bottleneck. In-memory computing (IMC) architectures are proposed to avoid the von Neumann bottleneck. However, both digital/binary-based and analog IMC architectures suffer from various limitations, which significantly degrade the performance and energy efficiency gains. This work proposes OISMA, an energy-efficient IMC architecture that utilizes the computational simplicity of a quasi-stochastic computing (SC) domain (bent-pyramid (BP) system) while keeping the same efficiency, scalability, and productivity of digital memories. OISMA converts normal memory read operations into in situ stochastic multiplication operations with a negligible cost. An accumulation periphery then accumulates the output multiplication bitstreams, achieving the matrix multiplication (MatMul) functionality. A 4-kB 1T1R OISMA array was implemented using a commercial 180-nm technology node and in-house resistive random-access memory (RRAM) technology. At 50 MHz, it achieves 0.789 TOPS/W and 3.98 GOPS/mm2 for energy and area efficiency, respectively, occupying an effective computing area of 0.804241 mm2. Scaling OISMA to 22-nm technology shows a significant improvement of two orders of magnitude in energy efficiency and one order of magnitude in area efficiency, compared to dense MatMul IMC architectures.

URL PDF HTML ☆

赞 0 踩 0

2508.07050 2026-04-23 cs.IR cs.AI cs.CL cs.LG

ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability

Wenhan Liu, Xinyu Ma, Weiwei Sun, Yutao Zhu, Yuchen Li, Dawei Yin, Zhicheng Dou

Comments 25 pages, accepted by ACL2026 main conference

2507.16433 2026-04-23 stat.ME cs.LG

Adaptive Multi-task Learning for Multi-sector Portfolio Optimization

Qingliang Fan, Ruike Wu, Yanrong Yang

2507.08540 2026-04-23 cs.CR cs.AI

White-Basilisk: A Hybrid Model for Code Vulnerability Detection

Ioannis Lamprou, Alexander Shevtsov, Ioannis Arapakis, Sotiris Ioannidis

2507.07800 2026-04-23 q-bio.QM cs.CV

A novel attention mechanism for noise-adaptive and robust segmentation of microtubules in microscopy images

Achraf Ait Laydi, Louis Cueff, Mewen Crespo, Yousef El Mourabit, Hélène Bouvrais

详情

英文摘要

Segmenting cytoskeletal filaments in microscopy images is essential for studying their roles in cellular processes. However, this task is highly challenging due to the fine, densely packed, and intertwined nature of these structures. Imaging limitations further complicate analysis. While deep learning has advanced segmentation of large, well-defined biological structures, its performance often degrades under such adverse conditions. Additional challenges include obtaining precise annotations for curvilinear structures and managing severe class imbalance during training. We introduce a novel noise-adaptive attention mechanism that extends the Squeeze-and-Excitation (SE) module to dynamically adjust to varying noise levels. Integrated into a U-Net decoder with residual encoder blocks, this yields ASE_Res_UNet, a lightweight yet high-performance model. We also developed a synthetic dataset generation strategy that ensures accurate annotations of fine filaments in noisy images. We systematically evaluated loss functions and metrics to mitigate class imbalance, ensuring robust performance assessment. ASE_Res_UNet effectively segmented microtubules in noisy synthetic images, outperforming its ablated variants. It also demonstrated superior segmentation compared to models with alternative attention mechanisms or distinct architectures, while requiring fewer parameters, making it efficient for resource-constrained environments. Evaluation on a newly curated real microscopy dataset and a recently reannotated dataset highlighted ASE_Res_UNet's effectiveness in segmenting microtubules beyond synthetic images. For these datasets, ASE_Res_UNet was competitive with a recent synthetic data-driven approach that shares two cytoskeleton pretrained models. Importantly, ASE_Res_UNet showed strong transferability to other curvilinear structures (blood vessels and nerves) across diverse imaging conditions.

URL PDF HTML ☆

赞 0 踩 0

2506.20910 2026-04-23 math.OC cs.LG stat.ML

Faster Fixed-Point Methods for Multichain MDPs

Matthew Zurek, Yudong Chen

2506.16658 2026-04-23 math.ST cs.LG stat.ML stat.TH

Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards

Wenlong Ji, Yihan Pan, Ruihao Zhu, Lihua Lei

详情

英文摘要

Multi-armed bandit (MAB) is a widely adopted framework for sequential decision-making under uncertainty. Traditional bandit algorithms rely solely on online data, which tends to be scarce as it must be gathered during the online phase when the arms are actively pulled. However, in many practical settings, rich auxiliary data, such as covariates of past users, is available prior to deploying any arms. We introduce a new setting for MAB where pre-trained machine learning (ML) models are applied to convert side information and historical data into \emph{surrogate rewards}. A prominent challenge of this setting is that the surrogate rewards may exhibit substantial bias, as true reward data is typically unavailable in the offline phase, forcing ML predictions to heavily rely on extrapolation. To address the issue, we propose the Machine Learning-Assisted Upper Confidence Bound (MLA-UCB) algorithm, which can be applied to any reward prediction model and any form of auxiliary data. When the predicted and true rewards are jointly Gaussian, it provably improves the cumulative regret, even in cases where the mean surrogate reward completely misaligns with the true mean rewards, and achieves the asymptotic optimality among a broad class of policies. Notably, our method requires no prior knowledge of the covariance matrix between true and surrogate rewards. We further extend the method to a batched reward MAB problem, where each arm pull yields a batch of observations and rewards may be non-Gaussian, and we derive computable confidence bounds and regret guarantees that improve upon classical UCB algorithms. Finally, extensive simulations with both Gaussian and ML-generated surrogates, together with real-world studies on language model selection and video recommendation, demonstrate consistent and often substantial regret reductions with moderate offline surrogate sample sizes and correlations.

URL PDF HTML ☆

赞 0 踩 0

2505.07849 2026-04-23 cs.SE cs.AI cs.IR

SweRank: Software Issue Localization with Code Ranking

Revanth Gangi Reddy, Tarun Suresh, JaeHyeok Doo, Ye Liu, Xuan Phi Nguyen, Yingbo Zhou, Semih Yavuz, Caiming Xiong, Heng Ji, Shafiq Joty

Comments ICLR 2026 Camera Ready Version

2504.19239 2026-04-23 quant-ph cs.LG

The effect of the number of parameters and the number of local feature patches on loss landscapes in distributed quantum neural networks

Yoshiaki Kawase

Comments 15 pages + Appendices

详情

英文摘要

Quantum neural networks hold promise for tackling computationally challenging tasks that are intractable for classical computers. However, their practical application is hindered by significant optimization challenges, arising from complex loss landscapes characterized by barren plateaus and numerous local minima. These problems become more severe as the number of parameters or qubits increases, hampering effective training. To mitigate these optimization challenges, particularly for classical data, we distribute overlapping local patches across multiple quantum neural networks, processing each patch with an independent quantum neural network, and aggregating their outputs for prediction. In this study, we investigate how the number of parameters and patches affects the loss landscape geometry of this distributed quantum neural network architecture via theoretical and empirical Hessian analyses and loss landscape visualization. Our results confirm that increasing the number of parameters tends to lead to deeper and sharper loss landscapes. Crucially, we theoretically derive and empirically demonstrate that increasing the number of patches significantly reduces the largest Hessian eigenvalue at minima. Furthermore, our analysis of the full Hessian eigenspectrum reveals a structure consisting of a bulk of near-zero eigenvalues and distinct outlier spikes corresponding to the number of classes, similar to classical deep learning models. These findings suggest that our distributed patch approach acts as a form of implicit structural regularization, promoting optimization stability and potentially enhancing generalization. Our study provides valuable insights into optimization challenges and highlights that the distributed patch approach is a promising strategy for developing more trainable and scalable quantum machine learning models for classical data tasks.

URL PDF HTML ☆

赞 0 踩 0

2503.23729 2026-04-23 math.NA cs.LG cs.NA

Integral regularization PINNs for evolution equations

Xiaodong Feng, Haojiong Shangguan, Tao Tang, Xiaoliang Wan

2503.03816 2026-04-23 astro-ph.GA cs.LG

The Optical and Infrared Are Connected

Christian K. Jespersen, Peter Melchior, David N. Spergel, Andy D. Goulding, ChangHoon Hahn, Kartheik G. Iyer

Comments Accepted to ApJ. 18 pages, 14 figures. 11 pages of Appendix

2501.03624 2026-04-23 cs.HC cs.CL

LLAMADRS: Evaluating Open-Source LLMs on Real Clinical Interviews--To Reason or Not to Reason?

Gaoussou Youssouf Kebe, Jeffrey M. Girard, Einat Liebenthal, Justin Baker, Fernando De la Torre, Louis-Philippe Morency

2412.18208 2026-04-23 quant-ph cs.LG

Quantum framework for Reinforcement Learning: Integrating Markov decision process, quantum arithmetic, and trajectory search

Thet Htar Su, Shaswot Shresthamali, Masaaki Kondo

2411.00585 2026-04-23 cs.CY cs.AI

Fairness Testing of Large Language Models in Role-Playing

Xinyue Li, Zhenpeng Chen, Jie M. Zhang, Ying Xiao, Tianlin Li, Weisong Sun, Yang Liu, Yiling Lou, Xuanzhe Liu

Comments Accepted by ACM International Conference on the Foundations of Software Engineering (FSE 2026)