arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2504.09114 2026-04-22 cs.LG

Deploying Large AI Models on Resource-Limited Devices with Split Federated Learning

Xianke Qiang, Hongda Liu, Xinran Zhang, Zheng Chang, Ying-Chang Liang

详情

DOI: 10.1109/TMC.2026.3681351

英文摘要

Large Artificial Intelligence Models (LAMs) powered by massive datasets, extensive parameter scales, and extensive computational resources, leading to significant transformations across various industries. Yet, their practical deployment on resource-limited mobile edge devices is hindered by critical challenges such as data privacy, constrained resources, and high overhead costs. Addressing this gap, this paper proposes a novel framework, named Quantized Split Federated Fine-Tuning Large AI Model (SFLAM). By partitioning the training load between edge devices and servers using a split learning paradigm, SFLAM can facilitate the operation of large models on devices and significantly lowers the memory requirements on edge devices. Additionally, SFLAM incorporates quantization management, power control, and bandwidth allocation strategies to enhance training efficiency while concurrently reducing energy consumption and communication latency. A theoretical analysis exploring the latency-energy trade-off is presented, and the framework's efficacy is validated via comprehensive simulations. The findings indicate that SFLAM achieves superior performance in terms of learning efficiency and scalability compared to conventional methods, thereby providing a valuable approach for enabling advanced AI services in resource-constrained scenarios.

URL PDF HTML ☆

赞 0 踩 0

2503.23439 2026-04-22 cs.CL cs.AI cs.LG cs.SD eess.AS

Speculative End-Turn Detector for Efficient Speech Chatbot Assistant

Hyunjong Ok, Suho Yoo, Jaeho Lee

Comments ACL 2026

2503.13304 2026-04-22 cs.LG

AutoNFS: Automatic Neural Feature Selection

Witold Wydmański, Marek Śmieja

2503.03023 2026-04-22 cs.LG quant-ph

Quantum Non-Linear Bandit Optimization

Zakaria Shams Siam, Chaowen Guan, Chong Liu

Comments Camera-ready version

2503.01605 2026-04-22 cs.CV

A Leaf-Level Dataset for Soybean-Cotton Detection and Segmentation

Thiago H. Segreto, Juliano Negri, Paulo H. Polegato, João Manoel Herrera Pinheiro, Ricardo V. Godoy, Marcelo Becker

2502.02779 2026-04-22 cs.CV cs.AI

3D Foundation Model for Generalizable Disease Detection in Head Computed Tomography

Weicheng Zhu, Haoxu Huang, Huanze Tang, Rushabh Musthyala, Boyang Yu, Long Chen, Emilio Vega, Thomas O'Donnell, Seena Dehkharghani, Jennifer A. Frontera, Arjun V. Masurkar, Kara Melmed, Narges Razavian

Comments Nature Biomedical Engineering (2026)

详情

DOI: 10.1038/s41551-026-01668-w

英文摘要

Head computed tomography (CT) imaging is a widely-used imaging modality with multitudes of medical indications, particularly in assessing pathology of the brain, skull, and cerebrovascular system. It is commonly the first-line imaging in neurologic emergencies given its rapidity of image acquisition, safety, cost, and ubiquity. Deep learning models may facilitate detection of a wide range of diseases. However, the scarcity of high-quality labels and annotations, particularly among less common conditions, significantly hinders the development of powerful models. To address this challenge, we introduce FM-CT: a Foundation Model for Head CT for generalizable disease detection, trained using self-supervised learning. Our approach pre-trains a deep learning model on a large, diverse dataset of 361,663 non-contrast 3D head CT scans without the need for manual annotations, enabling the model to learn robust, generalizable features. To investigate the potential of self-supervised learning in head CT, we employed both discrimination with self-distillation and masked image modeling, and we construct our model in 3D rather than at the slice level (2D) to exploit the structure of head CT scans more comprehensively and efficiently. The model's downstream classification performance is evaluated using internal and three external datasets, encompassing both in-distribution (ID) and out-of-distribution (OOD) data. Our results demonstrate that the self-supervised foundation model significantly improves performance on downstream diagnostic tasks compared to models trained from scratch and previous 3D CT foundation models on scarce annotated datasets. This work highlights the effectiveness of self-supervised learning in medical imaging and sets a new benchmark for head CT image analysis in 3D, enabling broader use of artificial intelligence for head CT-based diagnosis.

URL PDF HTML ☆

赞 0 踩 0

2501.04410 2026-04-22 cs.AI cs.HC cs.IR cs.LG

User Simulation in the Era of Generative AI: User Modeling, Synthetic Data Generation, and System Evaluation

Krisztian Balog, ChengXiang Zhai

2411.16312 2026-04-22 cs.CV

EPS: Efficient Patch Sampling for Video Overfitting in Deep Super-Resolution Model Training

Yiying Wei, Hadi Amirpour, Jong Hwan Ko, Christian Timmerer

2411.06837 2026-04-22 cs.CL

Persuasion with Large Language Models: A Survey of Empirical Evidence, Study Methodologies, and Ethical Implications

Sander Noels, Alexander Rogiers, Maarten Buyl, Tijl De Bie

Comments Main changes: - Slightly altered title & author ordering - New section detailing survey methodology - Expanded literature coverage and improved discussion of all references for clarity, precision & conciseness - Removed the "appealing to authority" subsection & integrated its content elsewhere - Overhauled the experimental design section - Significantly expanded success metrics discussion

2410.16431 2026-04-22 cs.AI

Conjuring Semantic Similarity

Tian Yu Liu, Stefano Soatto

Comments ICLR 2026

2407.11107 2026-04-22 cs.RO cs.LG

Latent Linear Quadratic Regulator for Robotic Control Tasks

Yuan Zhang, Shaohui Yang, Toshiyuki Ohtsuka, Colin Jones, Joschka Boedecker

Comments Accepted at L4DC 2026

2406.14294 2026-04-22 cs.SD cs.AI eess.AS

DASB - Discrete Audio and Speech Benchmark

Pooneh Mousavi, Jarod Duret, Darius Petermann, Artem Ploujnikov, Luca Della Libera, Anastasia Kuznetsova, Cem Subakan, Mirco Ravanelli

2405.14779 2026-04-22 cs.CL cs.LG

Smart Bilingual Focused Crawling of Parallel Documents

Cristian García-Romero, Miquel Esplà-Gomis, Felipe Sánchez-Martínez

Comments Pre-Cambridge University Press publication version

2304.02296 2026-04-22 cs.CV

Data Leakage Detection and De-duplication in Large Scale Geospatial Image Datasets

Yeshwanth Kumar Adimoolam, Charalambos Poullis, Melinos Averkiou

Comments 15 pages, 8 figures, 6 tables (Accepted as CVPR 2026 Oral)

2604.19336 2026-04-22 cs.LG math.OC

FedSEA: Achieving Benefit of Parallelization in Federated Online Learning

Harekrushna Sahu, Pratik Jawanpuria, Pranay Sharma

2604.19335 2026-04-22 cs.LG

When Active Learning Falls Short: An Empirical Study on Chemical Reaction Extraction

Simin Yu, Sufia Fathima

2604.19334 2026-04-22 cs.CV eess.IV

Silicon Aware Neural Networks

Sebastian Fieldhouse, Kea-Tiong Tang

2604.19324 2026-04-22 cs.CV cs.AI

PLaMo 2.1-VL Technical Report

Tommi Kerola, Yuya Masuda, Takashi Masuko, Toshiki Nakanishi, Daisuke Nishino, Kuniyuki Takahashi, Hanqin Wang, Yoshihiro Yamada

Comments 35 pages, 9 figreus

2604.19323 2026-04-22 cs.LG cs.CV

Concept Inconsistency in Dermoscopic Concept Bottleneck Models: A Rough-Set Analysis of the Derm7pt Dataset

Gonzalo Nápoles, Isel Grau, Yamisleydi Salgueiro

详情

英文摘要

Concept Bottleneck Models (CBMs) route predictions exclusively through a clinically grounded concept layer, binding interpretability to concept-label consistency. When a dataset contains concept-level inconsistencies, identical concept profiles mapped to conflicting diagnosis labels create an unresolvable bottleneck that imposes a hard ceiling on achievable accuracy. In this paper, we apply rough set theory to the Derm7pt dermoscopy benchmark and characterize the full extent and clinical structure of this inconsistency. Among 305 unique concept profiles formed by the 7 dermoscopic criteria of the 7-point melanoma checklist, 50 (16.4%) are inconsistent, spanning 306 images (30.3% of the dataset). This yields a theoretical accuracy ceiling of 92.1%, independent of backbone architecture or training strategy for CBMs that exclusively operate with hard concepts. In addition, we characterize the conflict-severity distribution, identify the clinical features most responsible for boundary ambiguity, and evaluate two filtering strategies with quantified effects on dataset composition and CBM interpretability. Symmetric removal of all boundary-region images yields Derm7pt+, a fully consistent benchmark subset of 705 images with perfect quality of classification and no hard accuracy ceiling. Building on this filtered dataset, we present a hard CBM evaluated across 19 backbone architectures from the EfficientNet, DenseNet, ResNet, and Wide ResNet families. Under symmetric filtering, explored for completeness, EfficientNet-B5 achieves the best label F1 score (0.85) and label accuracy (0.90) on the held-out test set, with a concept accuracy of 0.70. Under asymmetric filtering, EfficientNet-B7 leads across all four metrics, reaching a label F1 score of 0.82 and concept accuracy of 0.70. These results establish reproducible baselines for concept-consistent CBM evaluation on dermoscopic data.

URL PDF HTML ☆

赞 0 踩 0

2604.19321 2026-04-22 cs.LG cs.AI cs.CL cs.CV

RDP LoRA: Geometry-Driven Identification for Parameter-Efficient Adaptation in Large Language Models

Yusuf Çelebi, Yağız Asker, Özay Ezerceli, Mahmoud ElHussieni, Selva Taş, Reyhan Bayraktar, Fatma Betül Terzioğlu

2604.19318 2026-04-22 cs.CV

Multi-view Crowd Tracking Transformer with View-Ground Interactions Under Large Real-World Scenes

Qi Zhang, Jixuan Chen, Kaiyi Zhang, Xinquan Yu, Antoni B. Chan, Hui Huang

Comments CVPR 2026

2604.19314 2026-04-22 cs.CV cs.NA math.NA

Framelet-Based Blind Image Restoration with Minimax Concave Regularization

Heng Zhang, Reza Parvaz, Rui Yang

2604.19312 2026-04-22 cs.LG

On the Conditioning Consistency Gap in Conditional Neural Processes

Robin Young

2604.19301 2026-04-22 cs.AI cs.MA cs.NE

Large Language Models Exhibit Normative Conformity

Mikako Bito, Keita Nishimoto, Kimitaka Asatani, Ichiro Sakata

2604.19300 2026-04-22 cs.SD cs.AI

HalluAudio: A Comprehensive Benchmark for Hallucination Detection in Large Audio-Language Models

Feiyu Zhao, Yiming Chen, Wenhuan Lu, Daipeng Zhang, Xianghu Yue, Jianguo Wei

Comments Accepted to ACL 2026

2604.19299 2026-04-22 cs.CL cs.AI

Rethinking Scale: Deployment Trade-offs of Small Language Models under Agent Paradigms

Xinlin Wang, Mats Brorsson

2604.19296 2026-04-22 cs.LG

Debiased neural operators for estimating functionals

Konstantin Hess, Dennis Frauen, Niki Kilbertus, Stefan Feuerriegel

2604.19295 2026-04-22 cs.LG

TEMPO: Scaling Test-time Training for Large Reasoning Models

Qingyang Zhang, Xinke Kong, Haitao Wu, Qinghua Hu, Minghao Wu, Baosong Yang, Yu Cheng, Yun Luo, Ganqu Cui, Changqing Zhang

Comments Preprint

2604.19292 2026-04-22 cs.CL cs.AI

Location Not Found: Exposing Implicit Local and Global Biases in Multilingual LLMs

Guy Mor-Lan, Omer Goldman, Matan Eyal, Adi Mayrav Gilady, Sivan Eiger, Idan Szpektor, Avinatan Hassidim, Yossi Matias, Reut Tsarfaty

Comments ACL 2026 main conference

2604.19270 2026-04-22 cs.RO cs.HC

Warmth and Competence in the Swarm: Designing Effective Human-Robot Teams

Genki Miyauchi, Roderich Groß, Chaona Chen

Comments 15 pages, 4 figures, camera-ready version for ANTS 2026