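arXivDaily daily academic digest: synchronized with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.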

2604.16090 2026-04-20 cs.DC cs.AI

Robust Synchronisation for Federated Learning in The Face of Correlated Device Failure

Stefan Behfar, Richard Mortier

Abstract

Probabilistic Synchronous Parallel (PSP) is a technique in distributed learning systems that reduces synchronization bottlenecks by sampling a subset of participating nodes per round. In Federated Learning (FL), where edge devices are often unreliable due to factors including mobility, power constraints, and user activity, PSP helps improve system throughput. However, PSP has a key limitation: it assumes that device behavior is static and that different devices are independent. This can lead to unfair distributed synchronization, because highly available nodes dominate training while those that are often unavailable rarely participate, so their data may be missed. If both data distribution and node availability are correlated with the device, then both PSP and standard FL algorithms will suffer from persistent under-representation of certain classes or groups, resulting in inefficient or ineffective learning of certain features. We introduce Availability-Weighted PSP (AW-PSP), an extension to PSP that addresses the co-correlation of unfair sampling and data availability by dynamically adjusting node sampling probabilities using real-time availability predictions, historical behavior, and failure correlation metrics. A Markov-based availability predictor distinguishes transient from chronic failures, while a Distributed Hash Table (DHT) layer decentralizes metadata, including latency, freshness, and utility scores. We implement AW-PSP, and a trace-driven evaluation shows that it improves robustness to both independent and correlated failures, increases label coverage, and reduces fairness variance compared to standard PSP. AW-PSP thus provides an availability-aware and fairness-conscious node sampling protocol for FL deployments that will scale to large numbers of nodes even in heterogeneous and failure-prone environments.
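The abstract does not give AW-PSP's formulas, so the following is only a minimal sketch of the general idea of availability-weighted sampling, under stated assumptions. It assumes each node has a binary availability trace (1 = up, 0 = down), fits a two-state Markov chain to it, and divides the predicted availability by past participation so that rarely sampled nodes are not crowded out. All function names, the smoothing, and the weighting rule are hypothetical and are not taken from the paper.

```python
import random


def fit_two_state_markov(trace):
    """Estimate P(up -> up) and P(down -> up) from a 0/1 availability trace."""
    up_up = up_total = down_up = down_total = 0
    for prev, curr in zip(trace, trace[1:]):
        if prev == 1:
            up_total += 1
            up_up += curr
        else:
            down_total += 1
            down_up += curr
    # Laplace smoothing keeps the estimates defined for short traces.
    p_up_up = (up_up + 1) / (up_total + 2)
    p_down_up = (down_up + 1) / (down_total + 2)
    return p_up_up, p_down_up


def predict_available(trace):
    """Predicted probability that the node is up in the next round."""
    p_up_up, p_down_up = fit_two_state_markov(trace)
    return p_up_up if trace[-1] == 1 else p_down_up


def sampling_weights(traces, participation_counts):
    """Hypothetical rule: predicted availability / (1 + past participation)."""
    raw = {
        node: predict_available(trace) / (1 + participation_counts.get(node, 0))
        for node, trace in traces.items()
    }
    total = sum(raw.values())
    return {node: weight / total for node, weight in raw.items()}


def sample_nodes(weights, k, rng):
    """Draw up to k distinct nodes, each draw proportional to the weights."""
    pool = dict(weights)
    chosen = []
    for _ in range(min(k, len(pool))):
        nodes = list(pool)
        pick = rng.choices(nodes, weights=[pool[n] for n in nodes], k=1)[0]
        chosen.append(pick)
        del pool[pick]
    return chosen


traces = {"a": [1, 1, 1, 1], "b": [0, 0, 0, 0]}
weights = sampling_weights(traces, {"a": 3})
selected = sample_nodes(weights, k=1, rng=random.Random(0))
```

In this toy input, node "a" is always up but has already participated three times, while node "b" is usually down and has never participated, so the hypothetical rule gives them equal sampling weight. A real system would also need the failure-correlation and DHT metadata components the abstract mentions, which this sketch leaves out.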


2604.16061 2026-04-20 cs.DS cs.CY cs.LG

Constant-Factor Approximations for Doubly Constrained Fair k-Center, k-Median and k-Means

Nicole Funk, Annika Hennes, Johanna Hillebrand, Sarah Sturm

Comments 30 pages, 3 figures

2604.16058 2026-04-20 cs.SE cs.CL

LLMSniffer: Detecting LLM-Generated Code via GraphCodeBERT and Supervised Contrastive Learning

Mahir Labib Dihan, Abir Muhtasim

2604.16052 2026-04-20 math.OC cs.LG math.PR

A Wasserstein Geometric Framework for Hebbian Plasticity

Ulrich Tan

Comments Preprint. 75 pages including appendices and bibliography

2604.16047 2026-04-20 cs.HC cs.CY cs.LG

Driving Assistance System for Ambulances to Minimise the Vibrations in Patient Cabin

Abdulaziz Aldegheishem, Nabil Alrajeh, Lorena Parra, Oscar Romero, Jaime Lloret

Comments 19 pages, 14 figures, 10 tables

DOI: 10.3390/electronics11233965
Journal ref: Electronics, MDPI, 2022

Abstract

The ambulance service is the main means of transport for sick or injured people, and ambulances are subject to the same acceleration forces as regular vehicles. These accelerations, caused by the movement of the vehicle, impair the tasks performed by medical personnel, which can affect patient survival or recovery time. In this paper, we have trained, validated, and tested a system to assess driving in ambulance services. The proposed system is composed of a sensor node that measures the vehicle vibrations using an accelerometer. It also includes a GPS sensor, a battery, a display, and a speaker. When two possible routes reach the same destination point, the system compares the two routes based on previously classified data and calculates an index and a score. The index balances the possible routes in terms of time to reach the destination and the vibrations suffered in the patient cabin in order to recommend the route that minimises those vibrations. Three datasets are used to train, validate, and test the system. The classification model, based on an Artificial Neural Network (ANN), is trained with tagged data classified as low, medium, and high vibrations, and achieves 97% accuracy. The obtained model is then validated using data from three routes of another region. Finally, the system is tested in two new scenarios with two possible routes to reach the destination. The results indicate that the route with less vibration is preferred when the time difference between the two possible routes is low (less than 6%). Nonetheless, with the current weighting factors, the shortest route is preferred when the time difference between routes is higher than 20%, regardless of the higher vibrations on the shortest route.
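The abstract does not state how the index is computed or what the weighting factors are. As a rough illustration of how a weighted index can trade travel time against cabin vibration, the sketch below normalizes both quantities by the worst route and takes a weighted sum. The weights 0.8 and 0.2, the vibration scores, and the function names are assumptions chosen for illustration, not values from the paper.

```python
def route_index(time_min, vibration, max_time, max_vibration,
                w_time=0.8, w_vibration=0.2):
    """Weighted sum of normalized travel time and vibration (lower is better)."""
    return w_time * time_min / max_time + w_vibration * vibration / max_vibration


def recommend(routes, w_time=0.8, w_vibration=0.2):
    """Pick the route with the lowest index.

    routes maps a route name to (travel time in minutes, vibration score).
    Both quantities are assumed to be strictly positive.
    """
    max_time = max(t for t, _ in routes.values())
    max_vibration = max(v for _, v in routes.values())
    return min(
        routes,
        key=lambda name: route_index(*routes[name], max_time, max_vibration,
                                     w_time, w_vibration),
    )


# Small time difference (5%): the smoother route wins.
close_call = recommend({"short": (10.0, 0.8), "smooth": (10.5, 0.4)})
# Large time difference (40%): the shorter route wins despite more vibration.
big_gap = recommend({"short": (10.0, 0.8), "smooth": (14.0, 0.4)})
```

With these assumed weights the sketch shows the same qualitative pattern the abstract reports: the low-vibration route is chosen when travel times are close, and the shortest route is chosen when the time gap is large. The exact crossover point depends entirely on the weights.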


2604.11754 2026-04-20 eess.SY cs.RO cs.SY

Angle-based Localization and Rigidity Maintenance Control for Multi-Robot Networks

J. Francisco Presenza, Leonardo J. Colombo, Juan I. Giribet, Ignacio Mas

2603.03188 2026-04-20 stat.ML cs.LG

Scalable Posterior Uncertainty for Flexible Density-Based Clustering

Nicola Bariletto, Stephen G. Walker

2602.11327 2026-04-20 cs.CR cs.AI

Security Threat Modeling for Emerging AI-Agent Protocols: A Comparative Analysis of MCP, A2A, Agora, and ANP

Zeynab Anbiaee, Mahdi Rabbani, Mansur Mirani, Gunjan Piya, Igor Opushnyev, Ali Ghorbani, Sajjad Dadkhah

2602.07303 2026-04-20 cs.DB cs.AI cs.SE

KRONE: Scalable LLM-Augmented Log Anomaly Detection via Hierarchical Abstraction

Lei Ma, Jinyang Liu, Tieying Zhang, Peter M. VanNostrand, Dennis M. Hofmann, Lei Cao, Elke A. Rundensteiner, Jianjun Chen

Comments Accepted at ICDE 2026

2602.05523 2026-04-20 cs.SE cs.AI

Capture the Flags: Family-Based Evaluation of Agentic LLMs via Semantics-Preserving Transformations

Shahin Honarvar, Amber Gorzynski, James Lee-Jones, Harry Coppock, Marek Rei, Joseph Ryan, Alastair F. Donaldson

2409.01794 2026-04-20 stat.ME cs.LG stat.ML

Estimating Joint Interventional Distributions from Marginal Interventional Data

Sergio Hernan Garrido Mejia, Elke Kirschbaum, Armin Kekić, Bernhard Schölkopf, Atalanti Mastakouri

Comments Accepted at the Causal Reasoning and Learning (CLeaR) conference 2026

2403.18026 2026-04-20 eess.IV cs.LG q-bio.QM

Deep Learning-Enabled Modality Transfer Between Independent Microscopes for High-Throughput Imaging

Dominik Panek, Carina Rząca, Maksymilian Szczypior, Joanna Sorysz, Krzysztof Misztal, Zbigniew Baster, Zenon Rajfur

Comments 17 Pages, 5 Figures, 1 Table, 4 pages Supplementary Materials

Abstract

High-throughput biological imaging is often constrained by a trade-off between acquisition speed and image quality. Fast imaging modalities, such as wide-field fluorescence microscopy, enable large-scale data acquisition but suffer from reduced contrast and resolution, whereas high-resolution techniques, including confocal microscopy or single-molecule localization microscopy-based super-resolution techniques, provide superior image quality at the cost of throughput and instrument time. Here, we present a deep learning-based approach for modality transfer across independent microscopes, enabling the transformation of low-quality images acquired on fast systems into high-quality representations comparable to those obtained using advanced imaging platforms. To achieve this, we employ a generative adversarial network (GAN)-based model trained on paired datasets acquired on physically separate wide-field and confocal microscopes, demonstrating that image quality can be reliably transferred between independent instruments. Quantitative evaluation shows substantial improvement in structural similarity and signal fidelity, with median SSIM and PSNR of 0.94 and 31.87, respectively, compared to 0.83 and 21.48 for the original wide-field images. These results indicate that key structural features can be recovered with high accuracy. Importantly, this approach enables a workflow in which high-throughput imaging can be performed on fast, accessible microscopy systems while preserving the ability to computationally recover high-quality structural information. High-resolution microscopy can then be reserved for targeted validation, reducing acquisition time and improving overall experimental efficiency. Together, our results establish deep learning-enabled modality transfer as a practical strategy for bridging independent microscopy systems and supporting scalable, high-content imaging workflows.
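For readers unfamiliar with the reported metrics, PSNR is conventionally defined as 10·log10(MAX² / MSE), where MSE is the mean squared error between a reference image and a test image. The sketch below computes it for small images given as nested lists. The 8-bit dynamic range (`max_value=255`) is an assumption, since the abstract does not state the intensity range or units used. SSIM is more involved and is not shown here.

```python
import math


def mse(reference, test):
    """Mean squared error between two equally sized 2D images (nested lists)."""
    total = 0.0
    count = 0
    for ref_row, test_row in zip(reference, test):
        for r, t in zip(ref_row, test_row):
            total += (r - t) ** 2
            count += 1
    return total / count


def psnr(reference, test, max_value=255.0):
    """Peak signal-to-noise ratio in dB; infinite for identical images."""
    error = mse(reference, test)
    if error == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / error)


reference = [[0, 255], [128, 64]]
degraded = [[0, 245], [128, 64]]
value = psnr(reference, degraded)
```

Higher PSNR means the test image is closer to the reference, which is the direction of improvement the abstract reports for the transferred images relative to the original wide-field images.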


2604.16033 2026-04-20 eess.SY cs.AI cs.SY

Safe Deep Reinforcement Learning for Building Heating Control and Demand-side Flexibility

Colin Jüni, Mina Montazeri, Yi Guo, Federica Bellizio, Giovanni Sansavini, Philipp Heer

2604.16024 2026-04-20 cs.MA cs.CV

AstroVLM: Expert Multi-agent Collaborative Reasoning for Astronomical Imaging Quality Diagnosis

Yaohui Han, Tianshuo Wang, Zixi Zhao, Zhengchun Zhu, Shuo Ren, Yiru Wang, Rongliang Fu, Tinghuan Chen, Tsung-Yi Ho

2604.16015 2026-04-20 quant-ph cond-mat.stat-mech cs.LG

Discovering quantum phenomena with Interpretable Machine Learning

Paulin de Schoulepnikoff, Hendrik Poulsen Nautrup, Hans J. Briegel, Gorka Muñoz-Gil

2604.15990 2026-04-20 cs.CY cs.AI cs.CV cs.HC

From Vulnerable Data Subjects to Vulnerabilizing Data Practices: Navigating the Protection Paradox in AI-Based Analyses of Platformized Lives

Delfina S. Martinez Pandiani, Ella Streefkerk, Laurens Naudts, Paula Helm

Comments In The 2026 ACM Conference on Fairness, Accountability, and Transparency (FAccT '26), June 25-28, 2026, Montreal, QC, Canada. ACM, New York, NY, USA, 23 pages

DOI: 10.1145/3805689.3806735

Abstract

This paper traces a conceptual shift from understanding vulnerability as a static, essentialized property of data subjects to examining how it is actively enacted through data practices. Unlike reflexive ethical frameworks focused on missing or counter-data, we address the condition of abundance inherent to platformized life, a context where a near inexhaustible mass of data points already exists, shifting the ethical challenge to the researcher's choices in operating upon this existing mass. We argue that the ethical integrity of data science depends not just on who is studied, but on how technical pipelines transform "vulnerable" individuals into data subjects whose vulnerability can be further precarized. We develop this argument through an AI for Social Good (AI4SG) case: a journalist's request to use computer vision to quantify child presence in monetized YouTube 'family vlogs' for regulatory advocacy. This case reveals a "protection paradox": how data-driven efforts to protect vulnerable subjects can inadvertently impose new forms of computational exposure, reductionism, and extraction. Using this request as a point of departure, we perform a methodological deconstruction of the AI pipeline to show how granular technical decisions are ethically constitutive. We contribute a reflexive ethics protocol that translates these insights into a reflexive roadmap for research ethics surrounding platformized data subjects. Organized around four critical junctures (dataset design, operationalization, inference, and dissemination), the protocol identifies technical questions and ethical tensions where well-intentioned work can slide into renewed extraction or exposure. For every decision point, the protocol offers specific prompts to navigate four cross-cutting vulnerabilizing factors: exposure, monetization, narrative fixing, and algorithmic optimization. Rather than uncritically...


2604.15967 2026-04-20 cs.CR cs.CV

TwoHamsters: Benchmarking Multi-Concept Compositional Unsafety in Text-to-Image Models

Chaoshuo Zhang, Yibo Liang, Mengke Tian, Chenhao Lin, Zhengyu Zhao, Le Yang, Chong Zhang, Yang Zhang, Chao Shen

2604.15964 2026-04-20 eess.IV cs.CV cs.LG

Topology-Driven Fusion of nnU-Net and MedNeXt for Accurate Brain Tumor Segmentation on Sub-Saharan Africa Dataset

Prabin Bohara, Pralhad Kumar Shrestha, Arpan Rai, Usha Poudel Lamgade, Confidence Raymond, Dong Zhang, Aondona Lorumbu, Craig Jones, Mahesh Shakya, Bishesh Khanal, Pratibha Kulung

2604.15958 2026-04-20 cs.CR cs.CL

A Case Study on the Impact of Anonymization Along the RAG Pipeline

Andreea-Elena Bodea, Stephen Meisenbacher, Florian Matthes

Comments 7 pages, 1 figure, 6 tables. Accepted to IWSPA 2026

2604.15937 2026-04-20 cs.SI cs.AI cs.CL cs.CY cs.MA

Polarization by Default: Auditing Recommendation Bias in LLM-Based Content Curation

Nicolò Pagan, Christopher Barrie, Chris Andrew Bail, Petter Törnberg

2604.15882 2026-04-20 cs.IR cs.CL

JFinTEB: Japanese Financial Text Embedding Benchmark

Masahiro Suzuki, Hiroki Sakaji

Comments 5 pages. Accepted at SIGIR 2026 Resource Track

2604.15827 2026-04-20 cs.IR cs.CL

UsefulBench: Towards Decision-Useful Information as a Target for Information Retrieval

Tobias Schimanski, Stefanie Lewandowski, Christian Woerle, Nicola Reichenau, Yauheni Huryn, Markus Leippold

2604.15821 2026-04-20 cs.DC cs.LG

Breaking the Training Barrier of Billion-Parameter Universal Machine Learning Interatomic Potentials

Yuanchang Zhou, Hongyu Wang, Yiming Du, Yan Wang, Mingzhen Li, Siyu Hu, Xiangyu Zhang, Weijian Liu, Chen Wang, Zhuoqiang Guo, Long Wang, Jingde Bu, Yutong Lu, Guangming Tan, Weile Jia

Comments 11 pages, 8 figures

2604.15800 2026-04-20 cs.HC cs.AI cs.CL

From Intention to Text: AI-Supported Goal Setting in Academic Writing

Yueling Fan, Richard Lee Davis, Olga Viberg

Comments Accepted at AIED 2026

2604.15773 2026-04-20 cond-mat.stat-mech cs.AI stat.ME

Phase Transitions as the Breakdown of Statistical Indistinguishability

Taiyo Narita, Hideyuki Miyahara

2604.15728 2026-04-20 cs.CR cs.AI

Privacy-Preserving LLMs Routing

Xidong Wu, Yukuan Zhang, Yuqiong Ji, Reza Shirkavand, Qian Lou, Shangqian Gao

2604.15714 2026-04-20 cs.NE cs.LG cs.SY eess.SY

Neuromorphic Parameter Estimation for Power Converter Health Monitoring Using Spiking Neural Networks

Hyeongmeen Baik, Hamed Poursiami, Maryam Parsa, Jinia Roy

Comments 10 pages, 11 figures, 4 tables. Submitted to ICONS 2026

2604.15695 2026-04-20 cs.GT cs.AI

The Price of Paranoia: Robust Risk-Sensitive Cooperation in Non-Stationary Multi-Agent Reinforcement Learning

Deep Kumar Ganguly, Chandradithya S Jonnalagadda, Pratham Chintamani, Adithya Ananth

Comments Accepted to AAMAS ALA Workshop 2026

Abstract

Cooperative equilibria are fragile. When agents learn alongside each other rather than in a fixed environment, the process of learning destabilizes the cooperation they are trying to sustain: every gradient step an agent takes shifts the distribution of actions its partner will play, turning a cooperative partner into a source of stochastic noise precisely where the cooperation decision is most sensitive. We study how this co-learning noise propagates through the structure of coordination games, and find that the cooperative equilibrium, even when strongly Pareto-dominant, is exponentially unstable under standard risk-neutral learning, collapsing irreversibly once partner noise crosses the game's critical cooperation threshold. The natural response, applying distributional robustness to hedge against partner uncertainty, makes things strictly worse: risk-averse return objectives penalize the high-variance cooperative action relative to defection, widening the instability region rather than shrinking it. This paradox reveals a fundamental mismatch between the domain where robustness is applied and the domain where instability originates. We resolve this by showing that robustness should target the policy gradient update variance induced by partner uncertainty, not the return distribution. This distinction yields an algorithm whose gradient updates are modulated by an online measure of partner unpredictability, provably expanding the cooperation basin in symmetric coordination games. To unify the stability, sample complexity, and welfare consequences of this approach, we introduce the Price of Paranoia as the structural dual of the Price of Anarchy. Together with a novel Cooperation Window, it precisely characterizes how much welfare learning algorithms can recover under partner noise, pinning down the optimal degree of robustness as a closed-form balance between equilibrium stability and sample efficiency.
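The abstract describes gradient updates "modulated by an online measure of partner unpredictability" without giving the update rule. The sketch below is one hypothetical way such modulation could look, not the authors' algorithm: it tracks an exponentially weighted variance of the partner's observed cooperate/defect actions and shrinks the step size as that variance grows. The class name, the decay rate, and the `sensitivity` parameter are all assumptions made for illustration.

```python
class PartnerNoiseTracker:
    """Exponentially weighted mean and variance of a partner's binary actions."""

    def __init__(self, decay=0.9):
        self.decay = decay
        self.mean = 0.5  # uninformed prior on the cooperation rate
        self.var = 0.0

    def update(self, cooperated):
        x = 1.0 if cooperated else 0.0
        delta = x - self.mean
        self.mean += (1.0 - self.decay) * delta
        # Standard incremental form of an exponentially weighted variance.
        self.var = self.decay * (self.var + (1.0 - self.decay) * delta * delta)
        return self.var


def modulated_step(theta, grad, lr, partner_var, sensitivity=4.0):
    """Gradient ascent step whose size shrinks as partner variance grows."""
    scale = 1.0 / (1.0 + sensitivity * partner_var)
    return theta + lr * scale * grad


steady = PartnerNoiseTracker()
noisy = PartnerNoiseTracker()
for round_index in range(50):
    steady.update(True)                   # partner always cooperates
    noisy.update(round_index % 2 == 0)    # partner alternates

step_with_steady_partner = modulated_step(0.0, 1.0, 0.1, steady.var)
step_with_noisy_partner = modulated_step(0.0, 1.0, 0.1, noisy.var)
```

In this toy run the learner takes a nearly full step against the steady partner and a smaller step against the alternating one. The point is only that the damping acts on the update itself rather than on the return objective, which is the distinction the abstract draws.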


2604.15663 2026-04-20 cs.SE cs.AI

CodeMMR: Bridging Natural Language, Code, and Image for Unified Retrieval

Jiahui Geng, Qing Li, Fengyu Cai, Fakhri Karray

2604.15642 2026-04-20 cs.AR cs.AI

HYPERHEURIST: A Simulated Annealing-Based Control Framework for LLM-Driven Code Generation in Optimized Hardware Design

Shiva Ahir, Prajna Bhat, Alex Doboli

Comments 8 pages, 2 figures, 5 tables. Accepted at IJCNN 2026