arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2509.25667 2026-04-14 cs.LG cs.AI cs.HC

EEG-based AI-BCI Wheelchair Advancement: Hybrid Deep Learning with Motor Imagery for Brain Computer Interface

Bipul Thapa, Biplov Paneru, Bishwash Paneru, Khem Narayan Poudyal

详情

英文摘要

This paper presents an Artificial Intelligence (AI) integrated approach to Brain-Computer Interface (BCI)-based wheelchair development, utilizing a motor imagery right-left-hand movement mechanism for control. The system is designed to simulate wheelchair navigation based on motor imagery right and left-hand movements using electroencephalogram (EEG) data. A pre-filtered dataset, obtained from an open-source EEG repository, was segmented into arrays of 19x200 to capture the onset of hand movements. The data was acquired at a sampling frequency of 200Hz. The system integrates a Tkinter-based interface for simulating wheelchair movements, offering users a functional and intuitive control system. We propose a framework that uses Convolutional Neural Network-Transformer Hybrid Model, named CTHM, for motor imagery EEG classification. The model achieves a test accuracy of 91.73% compared with various machine learning baseline models, including XGBoost, EEGNet, and a transformer-based model. The CTHM achieved a mean accuracy of 90% through stratified cross-validation, showcasing the effectiveness of the CNN-Transformer hybrid architecture in BCI applications.

URL PDF HTML ☆

赞 0 踩 0

2509.22830 2026-04-14 cs.CL

ChatInject: Abusing Chat Templates for Prompt Injection in LLM Agents

Hwan Chang, Yonghyun Jun, Hwanhee Lee

Comments ICLR 2026

2509.21879 2026-04-14 cs.LG math.OC

Learning Aligned Stability in Neural ODEs Reconciling Accuracy with Robustness

Chaoyang Luo, Yan Zou, Nanjing Huang

2509.12694 2026-04-14 cs.LG cs.IT eess.SP math.IT

Soft Graph Transformer for MIMO Detection

Jiadong Hong, Lei Liu, Xinyu Bian, Wenjie Wang, Zhaoyang Zhang

Comments 5 pages with 3 figures and 2 tables, Accepted by IEEE ICASSP 2026

2508.09533 2026-04-14 cs.CV cs.AI

COXNet: Cross-Layer Fusion with Adaptive Alignment and Scale Integration for RGBT Tiny Object Detection

Peiran Peng, Tingfa Xu, Liqiang Song, Mengqi Zhu, Yuqiang Fang, Jianan Li

2508.08574 2026-04-14 cs.RO cs.MA

DeepFleet: Multi-Agent Foundation Models for Mobile Robots

Ameya Agaskar, Sriram Siva, William Pickering, Kyle O'Brien, Charles Kekeh, Alexandre Ormiga Galvao Barbosa, Ang Li, Brianna Gallo Sarker, Alicia Chua, Mayur Nemade, Charun Thattai, Jiaming Di, Isaac Iyengar, Ramya Dharoor, Dino Kirouani, Jimmy Erskine, Tamir Hegazy, Scott Niekum, Usman A. Khan, Federico Pecora, Joseph W. Durham

Comments 27 pages, 10 figures, 2 tables

2508.04436 2026-04-14 cs.RO cs.SY eess.SY

Reliable and Real-Time Highway Trajectory Planning via Hybrid Learning-Optimization Frameworks

Yujia Lu, Chong Wei, Lu Ma, Lounis Adouane

2507.13292 2026-04-14 cs.CV

DiffClean: Diffusion-based Makeup Removal for Accurate Age Estimation

Ekta Gavas, Sudipta Banerjee, Chinmay Hegde, Nasir Memon

Comments Revised version with minor changes and code release

2506.14493 2026-04-14 cs.CL cs.CR

LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops

Jiyuan Fu, Kaixun Jiang, Lingyi Hong, Jinglun Li, Haijing Guo, Dingkang Yang, Zhaoyu Chen, Wenqiang Zhang

Comments Accepted to ICLR 2026. Code is available at: https://github.com/fuhaha824/LingoLoop-Attack

2506.14170 2026-04-14 cs.CV cs.AI cs.ET

Progressive Multimodal Interaction Network for Reliable Quantification of Fish Feeding Intensity in Aquaculture

Shulong Zhang, Mingyuan Yao, Jiayin Zhao, Daoliang Li, Yingyi Chen, Haihua Wang

详情

英文摘要

Accurate quantification of fish feeding intensity is crucial for precision feeding in aquaculture, as it directly affects feed utilization and farming efficiency. Although multimodal fusion has proven to be an effective solution, existing methods often overlook the inconsistencies in responses and decision conflicts between different modalities, thus limiting the reliability of the quantification results. To address this issue, this paper proposes a Progressive Multimodal Interaction Network (PMIN) that integrates image, audio, and water-wave data for fish feeding intensity quantification. Specifically, a unified feature extraction framework is first constructed to map inputs from different modalities into a structurally consistent feature space, thereby reducing representational discrepancies across modalities. Then, an auxiliary-modality reinforcement primary-modality mechanism is designed to facilitate the fusion of cross-modal information, which is achieved through channel aware recalibration and dual-stage attention interaction. Furthermore, a decision fusion strategy based on adaptive evidence reasoning is introduced to jointly model the confidence, reliability, and conflicts of modality-specific outputs, so as to improve the stability and robustness of the final judgment. Experiments are conducted on a multimodal fish feeding intensity dataset containing 7089 samples. The results show that PMIN has an accuracy of 96.76%, while maintaining relatively low parameter count and computational cost, and its overall performance outperforms both homogeneous and heterogeneous comparison models. Ablation studies, comparative experiments, and real-world application results further validate the effectiveness and superiority of the proposed method. It can provide reliable support for automated feeding monitoring and precise feeding decisions in smart aquaculture.

URL PDF HTML ☆

赞 0 踩 0

2506.06248 2026-04-14 cs.LG

Lagrangian-based Equilibrium Propagation: generalisation to arbitrary boundary conditions & equivalence with Hamiltonian Echo Learning

Guillaume Pourcel, Debabrota Basu, Maxence Ernoult, Aditya Gilra

2506.02387 2026-04-14 cs.AI

VS-Bench: Evaluating VLMs for Strategic Abilities in Multi-Agent Environments

Zelai Xu, Zhexuan Xu, Xiangmin Yi, Huining Yuan, Mo Guang, Kaiwen Long, Xinlei Chen, Yi Wu, Chao Yu, Yu Wang

Comments Published at CVPR 2026 (Oral)

2505.24665 2026-04-14 cs.LG

Learning Geometry and Topology via Multi-Chart Flows

Hanlin Yu, Søren Hauberg, Marcelo Hartmann, Arto Klami, Georgios Arvanitidis

2505.17012 2026-04-14 cs.CV cs.AI

SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence

Haoning Wu, Xiao Huang, Yaohui Chen, Ya Zhang, Yanfeng Wang, Weidi Xie

Comments Accepted by CVPR 2026 (Highlight); Project Page: https://haoningwu3639.github.io/SpatialScore

2505.15489 2026-04-14 cs.CV cs.CL cs.MM

Seeing Through Deception: Uncovering Misleading Creator Intent in Multimodal News with Vision-Language Models

Jiaying Wu, Fanxiao Li, Zihang Fu, Min-Yen Kan, Bryan Hooi

Comments ICLR 2026

2505.09368 2026-04-14 cs.CV cs.LG

RobustSpring: Benchmarking Robustness to Image Corruptions for Optical Flow, Scene Flow and Stereo

Victor Oei, Jenny Schmalfuss, Lukas Mehl, Madlen Bartsch, Shashank Agnihotri, Margret Keuper, Andreas Bulling, Andrés Bruhn

2504.04099 2026-04-14 cs.CV cs.AI

TARAC: Mitigating Hallucination in LVLMs via Temporal Attention Real-time Accumulative Connection

Lei Jiang, Chunzhao Xie, Tongxuan Liu, Yuting Zeng, jinrong Guo, Yunheng Shen, Weizhe Huang, Jing Li, Xiaohua Xu

Comments 8 pages, 9 figures

2503.23514 2026-04-14 cs.CL cs.AI

If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs

Siqi Fan, Xiusheng Huang, Yiqun Yao, Xuezhi Fang, Kang Liu, Peng Han, Shuo Shang, Aixin Sun, Yequan Wang

2503.23001 2026-04-14 cs.LG cs.GT

Quotation-Based Data Retention Mechanism for Data Privacy in LLM-Empowered Network Services

Bin Han, Di Feng, Zexin Fang, Jie Wang, Hans D. Schotten

Comments Accepted by IEEE ICC 2026 WKSPS

2503.15481 2026-04-14 cs.RO cs.AI cs.LG

Learning to Play Piano in the Real World

Yves-Simon Zeulner, Simon Crämer, Sandeep Selvaraj, Roberto Calandra

2502.19731 2026-04-14 cs.CL

Preference Learning Unlocks LLMs' Psycho-Counseling Skills

Mian Zhang, Shaun M. Eack, Zhiyu Zoey Chen

Comments ACL 2026 Camera-Ready

2502.18026 2026-04-14 cs.LG cs.AI

ExPath: Targeted Pathway Inference for Biological Knowledge Bases via Graph Learning and Explanation

Rikuto Kotoge, Ziwei Yang, Zheng Chen, Yushun Dong, Yasuko Matsubara, Jimeng Sun, Yasushi Sakurai

Comments Accepted at AAAI 2026 (Main Technical Track)

2502.07432 2026-04-14 cs.LG

CapyMOA: Efficient Machine Learning for Data Streams and Online Continual Learning in Python

Heitor Murilo Gomes, Anton Lee, Nuwan Gunasekara, Yibin Sun, Guilherme Weigert Cassales, Justin Liu, Marco Heyden, Vitor Cerqueira, Maroua Bahri, Yun Sing Koh, Bernhard Pfahringer, Albert Bifet

2502.02189 2026-04-14 cs.LG

deCIFer: Crystal Structure Prediction from Powder Diffraction Data using Autoregressive Language Models

Frederik Lizak Johansen, Ulrik Friis-Jensen, Erik Bjørnager Dam, Kirsten Marie Ørnsbjerg Jensen, Rocío Mercado, Raghavendra Selvan

Comments 24 pages, 18 figures, 8 tables. v2: Figure 8 revision. v3: added benchmarks, text revisions. v4: accepted to TMLR (https://openreview.net/forum?id=LftFQ35l47)

2501.19227 2026-04-14 cs.CV cs.AI

Integrating Semi-Supervised and Active Learning for Semantic Segmentation

Wanli Ma, Oktay Karakus, Paul L. Rosin

2501.18490 2026-04-14 cs.RO cs.AI

Curriculum-based Sample Efficient Reinforcement Learning for Robust Stabilization of a Quadrotor

Fausto Mauricio Lagos Suarez, Akshit Saradagi, Vidya Sumathy, Shruti Kotpaliwar, George Nikolakopoulos

Comments 8 pages, 7 figures

2412.17574 2026-04-14 cs.CV cs.AI

HumanVBench: Probing Human-Centric Video Understanding in MLLMs with Automatically Synthesized Benchmarks

Ting Zhou, Daoyuan Chen, Qirui Jiao, Bolin Ding, Yaliang Li, Ying Shen

Comments Accepted as a conference paper at CVPR 2026

2412.10273 2026-04-14 cs.CV cs.LG

How to Spin an Object: First, Get the Shape Right

Rishabh Kabra, Drew A. Hudson, Sjoerd van Steenkiste, Joao Carreira, Niloy J. Mitra

2410.14383 2026-04-14 cs.RO

MARLIN: Multi-Agent Reinforcement Learning Guided by Language-Based Inter-Robot Negotiation

Toby Godfrey, William Hunt, Mohammad D. Soorati

Comments 15 pages, 8 figures, 1 table

2408.07587 2026-04-14 cs.LG cs.DC

FedQUIT: On-Device Federated Unlearning via a Quasi-Competent Virtual Teacher

Alessio Mora, Lorenzo Valerio, Paolo Bellavista, Andrea Passarella