arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.13757 2026-04-16 cs.AI cs.HC

Rethinking AI Hardware: A Three-Layer Cognitive Architecture for Autonomous Agents

Li Chen

Comments A system architecture paper with simulation-based evaluation

详情

英文摘要

The next generation of autonomous AI systems will be constrained not only by model capability, but by how intelligence is structured across heterogeneous hardware. Current paradigms -- cloud-centric AI, on-device inference, and edge-cloud pipelines -- treat planning, reasoning, and execution as a monolithic process, leading to unnecessary latency, energy consumption, and fragmented behavioral continuity. We introduce the Tri-Spirit Architecture, a three-layer cognitive framework that decomposes intelligence into planning (Super Layer), reasoning (Agent Layer), and execution (Reflex Layer), each mapped to distinct compute substrates and coordinated via an asynchronous message bus. We formalize the system with a parameterized routing policy, a habit-compilation mechanism that promotes repeated reasoning paths into zero-inference execution policies, a convergent memory model, and explicit safety constraints. We evaluate the architecture in a reproducible simulation of 2000 synthetic tasks against cloud-centric and edge-only baselines. Tri-Spirit reduces mean task latency by 75.6 percent and energy consumption by 71.1 percent, while decreasing LLM invocations by 30 percent and enabling 77.6 percent offline task completion. These results suggest that cognitive decomposition, rather than model scaling alone, is a primary driver of system-level efficiency in AI hardware.

URL PDF HTML ☆

赞 0 踩 0

2604.13756 2026-04-16 cs.CL cs.CV

MedRCube: A Multidimensional Framework for Fine-Grained and In-Depth Evaluation of MLLMs in Medical Imaging

Zhijie Bao, Fangke Chen, Licheng Bao, Chenhui Zhang, Wei Chen, Jiajie Peng, Zhongyu Wei

2604.13746 2026-04-16 cs.CV

ClipGStream: Clip-Stream Gaussian Splatting for Any Length and Any Motion Multi-View Dynamic Scene Reconstruction

Jie Liang, Jiahao Wu, Chao Wang, Jiayu Yang, Xiaoyun Zheng, Kaiqiang Xiong, Zhanke Wang, Jinbo Yan, Feng Gao, Ronggang Wang

Comments CVPR 2026, Project pages: https://liangjie1999.github.io/ClipGStreamWeb/

2604.13740 2026-04-16 cs.LG stat.ML

Online learning with noisy side observations

Tomáš Kocák, Gergely Neu, Michal Valko

Comments Published at International Conference on Artificial Intelligence and Statistics (AISTATS) 2016. 13 pages, 7 figures

2604.13739 2026-04-16 cs.LG stat.ML

Spectral Thompson sampling

Tomas Kocak, Michal Valko, Remi Munos, Shipra Agrawal

Comments Published at AAAI Conference on Artificial Intelligence (AAAI) 2014

2604.13731 2026-04-16 cs.CL

Doc-V*:Coarse-to-Fine Interactive Visual Reasoning for Multi-Page Document VQA

Yuanlei Zheng, Pei Fu, Hang Li, Ziyang Wang, Yuyi Zhang, Wenyu Ruan, Xiaojin Zhang, Zhongyu Wei, Zhenbo Luo, Jian Luan, Wei Chen, Xiang Bai

2604.13730 2026-04-16 cs.CV

ReConText3D: Replay-based Continual Text-to-3D Generation

Muhammad Ahmed Ullah Khan, Muhammad Haris Bin Amir, Didier Stricker, Muhammad Zeshan Afzal

Comments Accepted at CVPR Findings 2026

2604.13723 2026-04-16 cs.LG physics.comp-ph

Physics-Informed Neural Networks for Solving Derivative-Constrained PDEs

Kentaro Hoshisashi, Carolyn E Phelan, Paolo Barucca

Comments Phys. Rev. E - Accepted 14 April, 2026

2604.13722 2026-04-16 cs.CV

Granularity-Aware Transfer for Tree Instance Segmentation in Synthetic and Real Forests

Pankaj Deoli, Atef Tej, Anmol Ashri, Anandatirtha JS, Karsten Berns

2604.13715 2026-04-16 cs.SD cs.AI

Towards Fine-grained Temporal Perception: Post-Training Large Audio-Language Models with Audio-Side Time Prompt

Yanfeng Shi, Pengfei Cai, Jun Liu, Qing Gu, Nan Jiang, Lirong Dai, Ian McLoughlin, Yan Song

Comments Submitted to Interspeech 2026

2604.13713 2026-04-16 cs.CL

Learning the Cue or Learning the Word? Analyzing Generalization in Metaphor Detection for Verbs

Sinan Kurtyigit, Sabine Schulte im Walde, Alexander Fraser

2604.13706 2026-04-16 cs.CL

Co-FactChecker: A Framework for Human-AI Collaborative Claim Verification Using Large Reasoning Models

Dhruv Sahnan, Subhabrata Dutta, Tanmoy Chakraborty, Preslav Nakov, Iryna Gurevych

Comments 11 pages, 3 figures. Under review

2604.13705 2026-04-16 cs.CL cs.AI cs.GT cs.MA

Beyond Arrow's Impossibility: Fairness as an Emergent Property of Multi-Agent Collaboration

Sayan Kumar Chaki, Antoine Gourru, Julien Velcin

2604.13695 2026-04-16 cs.CV cs.AI

Med-CAM: Minimal Evidence for Explaining Medical Decision Making

Pirzada Suhail, Aditya Anand, Amit Sethi

2604.13694 2026-04-16 cs.AI

Weight Patching: Toward Source-Level Mechanistic Localization in LLMs

Chenghao Sun, Chengsheng Zhang, Guanzheng Qin, Rui Dai, Xinmei Tian

Comments 36 pages. Submitted to IEEE for possible publication

2604.13692 2026-04-16 cs.CL

Breaking the Generator Barrier: Disentangled Representation for Generalizable AI-Text Detection

Xiao Pu, Zepeng Cheng, Lin Yuan, Yu Wu, Xiuli Bi

2604.13688 2026-04-16 cs.CV cs.AI

Beyond Voxel 3D Editing: Learning from 3D Masks and Self-Constructed Data

Yizhao Xu, Hongyuan Zhu, Caiyun Liu, Tianfu Wang, Keyu Chen, Sicheng Xu, Jiaolong Yang, Nicholas Jing Yuan, Qi Zhang

2604.13686 2026-04-16 cs.CL cs.AI cs.DB

IndicDB -- Benchmarking Multilingual Text-to-SQL Capabilities in Indian Languages

Aviral Dawar, Roshan Karanth, Vikram Goyal, Dhruv Kumar

Comments Under Review

2604.13677 2026-04-16 cs.RO cs.SY eess.SY

Empirical Prediction of Pedestrian Comfort in Mobile Robot Pedestrian Encounters

Alireza Jafari, Hong-Son Nguyen, Yen-Chen Liu

Comments 9 pages, 4 figures, 7 tables

2604.13672 2026-04-16 cs.LG

Optimization with SpotOptim

Thomas Bartz-Beielstein

2604.13667 2026-04-16 cs.CV cs.ET

From Pixels to Nucleotides: End-to-End Token-Based Video Compression for DNA Storage

Cihan Ruan, Lebin Zhou, Bingqing Zhao, Rongduo Han, Qiming Yuan, Chenchen Zhu, Linyi Han, Liang Yang, Wei Wang, Wei Jiang, Nam Ling

2604.13658 2026-04-16 cs.LG

A Bayesian Framework for Uncertainty-Aware Explanations in Power Quality Disturbance Classification

Yinsong Chen, Samson S. Yu, Kashem M. Muttaqi

2604.13656 2026-04-16 cs.LG cs.AI math.ST stat.ML stat.TH

Ordinary Least Squares is a Special Case of Transformer

Xiaojun Tan, Yuchen Zhao

2604.13654 2026-04-16 cs.RO

Vision-and-Language Navigation for UAVs: Progress, Challenges, and a Research Roadmap

Hanxuan Chen, Jie Zheng, Siqi Yang, Tianle Zeng, Siwei Feng, Songsheng Cheng, Ruilong Ren, Hanzhong Guo, Shuai Yuan, Xiangyue Wang, Kangli Wang, Ji Pei

2604.13645 2026-04-16 cs.RO cs.AI cs.LG

A Mechanistic Analysis of Sim-and-Real Co-Training in Generative Robot Policies

Yu Lei, Minghuan Liu, Abhiram Maddukuri, Zhenyu Jiang, Yuke Zhu

Comments 24 pages, 18 figure. Project page: https://science-of-co-training.github.io/

2604.13634 2026-04-16 cs.CL cs.LG

Calibrated Speculative Decoding: Frequency-Guided Candidate Selection for Efficient Inference

Xuwen Zhou, Fangxin Liu, Chao Wang, Xiao Zheng, Hao Zheng, Min He, Li Jiang, Haibing Guan

Comments ACL 2026 Main Conference

2604.13633 2026-04-16 cs.CV cs.RO

ESCAPE: Episodic Spatial Memory and Adaptive Execution Policy for Long-Horizon Mobile Manipulation

Jingjing Qian, Zeyuan He, Chen Shi, Lei Xiao, Li Jiang

2604.13627 2026-04-16 cs.LG cs.CL

(How) Learning Rates Regulate Catastrophic Overtraining

Mark Rofin, Aditya Varre, Nicolas Flammarion

2604.13622 2026-04-16 cs.LG

Self-Organizing Maps with Optimized Latent Positions

Seiki Ubukata, Akira Notsu, Katsuhiro Honda

Comments 8 pages, 4 figures. Accepted for publication in the 2026 International Joint Conference on Neural Networks (IJCNN 2026), part of the 2026 IEEE World Congress on Computational Intelligence (WCCI 2026). This version is the author's accepted manuscript

2604.13620 2026-04-16 cs.CL cs.AI

Syn-TurnTurk: A Synthetic Dataset for Turn-Taking Prediction in Turkish Dialogues

Ahmet Tuğrul Bayrak, Mustafa Sertaç Türkel, Fatma Nur Korkmaz

Comments Accepted for publication in IEEE ICASI 2026