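arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.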

2506.05952 2026-05-05 cs.CV cs.AI

MOGO: Residual Quantized Hierarchical Causal Transformer for High-Quality and Real-Time 3D Human Motion Generation

Dongjie Fu, Tengjiao Sun, Pengcheng Fang, Xiaohao Cai, Hansung Kim

Comments 9 pages, 4 figures, conference

Abstract

Recent advances in transformer-based text-to-motion generation have led to impressive progress in synthesizing high-quality human motion. Nevertheless, jointly achieving high fidelity, streaming capability, real-time responsiveness, and scalability remains a fundamental challenge. In this paper, we propose MOGO (Motion Generation with One-pass), a novel autoregressive framework tailored for efficient and real-time 3D motion generation. MOGO comprises two key components: (1) MoSA-VQ, a motion scale-adaptive residual vector quantization module that hierarchically discretizes motion sequences with learnable scaling to produce compact yet expressive representations; and (2) RQHC-Transformer, a residual quantized hierarchical causal transformer that generates multi-layer motion tokens in a single forward pass, significantly reducing inference latency. To enhance semantic fidelity, we further introduce a text condition alignment mechanism that improves motion decoding under textual control. Extensive experiments on benchmark datasets including HumanML3D, KIT-ML, and CMP demonstrate that MOGO achieves competitive or superior generation quality compared to state-of-the-art transformer-based methods, while offering substantial improvements in real-time performance, streaming generation, and generalization under zero-shot settings.
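The residual vector quantization idea mentioned in the abstract can be illustrated with a minimal sketch. This is generic residual quantization under stated assumptions, not the paper's MoSA-VQ module: the codebooks are fixed toy values rather than learned, there is no learnable scaling, and the function names `nearest`, `rvq_encode`, and `rvq_decode` are hypothetical.

```python
def nearest(codebook, vec):
    """Return (index, codeword) of the entry closest to vec in squared L2 distance."""
    best = min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )
    return best, codebook[best]


def rvq_encode(vec, codebooks):
    """Quantize vec layer by layer; each layer encodes the residual left by earlier layers."""
    residual = list(vec)
    indices = []
    for codebook in codebooks:
        idx, code = nearest(codebook, residual)
        indices.append(idx)
        # Subtract the chosen codeword so the next layer sees only what is left.
        residual = [r - c for r, c in zip(residual, code)]
    return indices


def rvq_decode(indices, codebooks):
    """Reconstruct a vector by summing the selected codeword from every layer."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(indices, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


# Toy two-layer codebooks (assumed values): a coarse layer, then a finer one.
toy_codebooks = [
    [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]],
    [[0.0, 0.0], [0.25, 0.0], [0.0, 0.25]],
]
tokens = rvq_encode([1.2, 0.1], toy_codebooks)
print(tokens)                             # [1, 1]
print(rvq_decode(tokens, toy_codebooks))  # [1.25, 0.0]
```

Each layer contributes one token per input vector, so a stack of layers yields the kind of multi-layer token representation the abstract refers to.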

2506.03820 2026-05-05 cs.CL

Automatic Correction of Writing Anomalies in Hausa Texts

Ahmad Mustapha Wali, Sergiu Nisioi

Comments Accepted at ACL 2026

2505.20340 2026-05-05 cs.CL cs.AI

Latent Trajectory Dynamics in Large Language Models: A Manifold Evolution Framework with Empirical Validation

Yukun Zhang, Qi Dong, Mengkang Li

2505.19607 2026-05-05 cs.LG cs.AI

Contrastive Residual Energy Test-time Adaptation

Yewon Han, Seoyun Yang, Taesup Kim

2505.17370 2026-05-05 cs.LG cs.AI

Ellipsoidal Time Series Forecasting

Qilin Wang

Comments Accepted by ICML 2026. Public code at https://anonymous.4open.science/r/FernPaper-58B4

2505.16850 2026-05-05 cs.LG cs.CL cs.CV

ATR-Bench: A Federated Learning Benchmark for Adaptation, Trust, and Reasoning

Tajamul Ashraf, Mohammed Mohsen Peerzada, Moloud Abdar, Yutong Xie, Yuyin Zhou, Xiaofeng Liu, Iqra Altaf Gillani, Janibul Bashir

Comments This paper is withdrawn due to issues in attribution to related work and the fair attribution of benchmark results, which were not adequately addressed at the time of submission. These issues affect the experimental analysis and require substantial revision

2505.12546 2026-05-05 cs.CL cs.CY cs.LG

Extracting memorized pieces of (copyrighted) books from open-weight language models

A. Feder Cooper, Mark A. Lemley, Allison Casasola, Ahmed Ahmed, Aaron Gokaslan, Amy B. Cyphert, Christopher De Sa, Daniel E. Ho, Percy Liang

2505.02380 2026-05-05 cs.LG

EntroLLM: Entropy Encoded Weight Compression for Efficient Large Language Model Inference on Edge Devices

Arnab Sanyal, Gourav Datta, Prithwish Mukherjee, Sandeep P. Chinchali, Michael Orshansky

Comments 4 pages, 1 reference page

2504.20605 2026-05-05 cs.CL cs.AI cs.DL cs.LG

TF1-EN-3M: Three Million Synthetic Moral Fables for Training Small, Open Language Models

Mihai Nadas, Laura Diosan, Andrei Piscoran, Andreea Tomescu

Comments 18 pages, 6 tables, 1 figure. v2: revised evaluation with open-weight LLM judge panel, expanded citations

2504.02293 2026-05-05 cs.CL cs.AI

Breaking the Silence: A Dataset and Benchmark for Bangla Text-to-Gloss Translation

Sharif Mohammad Abdullah, Abhijit Paul, Shubhashis Roy Dipta, Zarif Masud, Shebuti Rayana, Ahmedul Kabir

2503.12001 2026-05-05 cs.CV

3D Gaussian Splatting against Moving Objects for High-Fidelity Street Scene Reconstruction

Peizhen Zheng, Dongjing Jiang, Qingchong Jiao, Redouane EL Bouchtaoui, Flynnwell Jianfei Zhang

2503.07557 2026-05-05 cs.RO

AutoSpatial: Visual-Language Reasoning for Social Robot Navigation through Efficient Spatial Reasoning Learning

Yangzhe Kong, Daeun Song, Jing Liang, Dinesh Manocha, Ziyu Yao, Xuesu Xiao

2502.16810 2026-05-05 cs.AI cs.CL cs.HC econ.GN q-fin.EC

AI Realtor: Towards Grounded Persuasive Language Generation for Automated Copywriting

Jibang Wu, Chenghao Yang, Yi Wu, Simon Mahns, Chaoqi Wang, Hao Zhu, Fei Fang, Haifeng Xu

2502.15311 2026-05-05 cs.CV cs.ET

A Comprehensive Review of Fish Feeding Behavior Analysis in Aquaculture: Tasks, Techniques, and Applications

Shulong Zhang, Daoliang Li, Jiayin Zhao, Mingyuan Yao, Yingyi Chen, Haihua Wang

Comments 37 pages, 8 figures

Abstract

Fish feeding behavior analysis is a key foundation for intelligent feeding and precision aquaculture management, and plays an important role in improving feed utilization efficiency, reducing production costs, and mitigating environmental burden. Existing reviews mainly focus on specific technical modalities or related applications in smart aquaculture, which makes it difficult to present the overall development of fish feeding behavior analysis in a comprehensive manner. To address these issues, this paper provides a thematic review of fish feeding behavior analysis in aquaculture, and systematically examines its task definition, technical support, and application status. First, from the task perspective, two core subtasks of fish feeding behavior analysis are clearly distinguished, and relevant behavioral characteristics and evaluation metrics are summarized. Second, from the technical perspective, the development trajectories of computer vision, acoustics, sensors, and multimodal fusion technologies are examined, and their advantages, limitations, and applicable scenarios are analyzed. On this basis, the application value of fish feeding behavior analysis in intelligent feeding and aquaculture management is further summarized. Finally, this paper discusses the challenges in robust perception under complex environments, generalization across fish species and farming scenarios, collaborative multimodal modeling and lightweight deployment, closed loop intelligent feeding, coordinated optimization of multiple tasks, and long-term production validation, and outlines future research directions. This review provides a reference for task standardization, technical selection, and engineering application in fish feeding behavior analysis, and offers insights into the development of smart aquaculture and sustainable aquaculture management.

2502.00204 2026-05-05 cs.LG cs.GT

Nearly-Optimal Bandit Learning in Stackelberg Games with Side Information

Maria-Florina Balcan, Martino Bernasconi, Matteo Castiglioni, Andrea Celli, Keegan Harris, Zhiwei Steven Wu

Comments Accepted to ICLR 2026

2501.00112 2026-05-05 cs.RO

QuadPiPS: A Perception-informed Footstep Planner for Quadrupeds With Semantic Affordance Prediction

Max Asselmeier, Ye Zhao, Patricio A. Vela

Comments Under review. Project site: https://quadpips.github.io/

2409.14500 2026-05-05 cs.LG cs.AI

GraphLand: Evaluating Graph Machine Learning Models on Diverse Industrial Data

Gleb Bazhenov, Oleg Platonov, Liudmila Prokhorenkova

Comments Accepted at NeurIPS 2025 (Datasets & Benchmarks Track)

2407.11933 2026-05-05 cs.LG

Fairness-Aware Multi-Group Target Detection in Online Discussion

Soumyajit Gupta, Maria De-Arteaga, Matthew Lease

2407.06150 2026-05-05 cs.CV

PanDORA: Casual HDR Radiance Acquisition of Indoor Scenes for Image-based Lighting

Mohammad Reza Karimi Dastjerdi, Dominique Tanguay-Gaudreau, Frédéric Fortier-Chouinard, Yannick Hold-Geoffroy, Nima Kalantari, Jean-François Lalonde

Comments 10 pages, 11 figures

2402.05284 2026-05-05 cs.LG

Analyzing Adversarial Inputs in Deep Reinforcement Learning

Davide Corsi, Guy Amir, Guy Katz, Alessandro Farinelli

Comments Accepted to AISoLA 2025

2311.10320 2026-05-05 cs.CV eess.IV

Boosting Multimodal Remote Sensing Image Classification with Transformer-based Heterogeneously Salient Graph Representation

Jiaqi Yang, Bo Du, Rong Liu, Zhu Mao, Liangpei Zhang

2306.04498 2026-05-05 cs.LG cs.CY cs.DC

Near-Optimal Privacy-Preserving Learning for Max-Min Fair Multi-Agent Bandits

Amir Leshem

Comments 17 pages, 3 figures

2301.08719 2026-05-05 cs.AI physics.med-ph

The stochastic digital human is now enrolling for in silico imaging trials -- Methods and tools for generating digital cohorts

A Badano, M Lago, E Sizikova, JG Delfino, S Guan, MA Anastasio, B Sahiner

2007.02392 2026-05-05 cs.LG cs.DS math.ST stat.CO stat.ML stat.TH

Efficient Parameter Estimation of Truncated Boolean Product Distributions

Dimitris Fotakis, Alkis Kalavasis, Christos Tzamos

Comments 33rd Conference on Learning Theory (COLT 2020)

1910.09876 2026-05-05 cs.LG stat.ML

Neural Network Training with Approximate Logarithmic Computations

Arnab Sanyal, Peter A. Beerel, Keith M. Chugg

2605.01656 2026-05-05 q-bio.NC cs.AI cs.LG

From Cortical Synchronous Rhythm to Brain Inspired Learning Mechanism: An Oscillatory Spiking Neural Network with Time-Delayed Coordination

Tingting Dan, Guorong Wu

Comments 19 pages, 6 figures

2605.01655 2026-05-05 math.CA cs.LG

Exact Loop Controllers for ReLU Realization of Homogeneous Curve Refinements

Boldsaikhan Bolorkhuu, Tsogtgerel Gantumur

Comments 39 pages, 6 figures

2605.01628 2026-05-05 stat.ML cs.LG math.ST stat.TH

Self-Normalized Martingales and Uniform Regret Bounds for Linear Regression

Fan Chen, Jian Qian, Alexander Rakhlin, Nikita Zhivotovskiy

Abstract

Self-normalized martingale inequalities lie at the heart of confidence ellipsoids for online least squares and, more broadly, many bandit and reinforcement-learning results. Yet existing vector and scalar results typically rely on bounded covariates and an explicit regularization matrix, producing bounds that are \emph{not scale-invariant}: although the self-normalized quantity is scale-invariant by definition, its standard upper bounds are not. We characterize when scale-invariant upper bounds on self-normalized martingales are possible. Without further assumptions, we prove that nontrivial scale-invariant bounds exist only in dimension $d=1$; moreover, in $d=1$ we obtain $O(\log T)$ scale-invariant self-normalized bounds without any assumptions on the covariates. In contrast, for $d>1$ we show that no nontrivial scale-invariant bound can hold in full generality. We then connect this dichotomy to \emph{doubly-uniform} regret in online linear regression (i.e., regret bounds that are simultaneously independent of the covariate scale and the comparator norm) and use it to resolve the open question of Gaillard, Gerchinovitz, Huard, and Stoltz, \emph{``Uniform regret bounds over $\mathbb{R}^d$ for the sequential linear regression problem with the square loss''} (ALT 2019): in $d=1$ we give an explicit algorithm with $O(\log T)$ doubly-uniform regret, whereas for $d>1$ sublinear doubly-uniform regret is impossible. Finally, under a natural \emph{smoothness} condition (bounded Radon--Nikodym derivatives of the conditional covariate laws with respect to a fixed base measure), we recover sublinear regret for $d>1$ without bounded covariates and derive a self-normalized concentration inequality free of the usual regularization penalties, yielding arguably a first natural scale-invariant bound for adaptive, non-i.i.d. vector martingales.
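The scale-invariance point in the abstract can be checked numerically in dimension $d=1$. The sketch below is an illustration under assumptions, not the paper's construction: it takes the self-normalized quantity to be $S_T^2 / (V_T + \lambda)$ with $S_T = \sum_t x_t \varepsilon_t$ and $V_T = \sum_t x_t^2$, and the function name `self_normalized` and the regularizer `lam` are hypothetical.

```python
import math
import random


def self_normalized(xs, eps, lam=0.0):
    """Return S^2 / (V + lam), where S = sum(x_t * eps_t) and V = sum(x_t^2)."""
    s = sum(x * e for x, e in zip(xs, eps))
    v = sum(x * x for x in xs)
    return s * s / (v + lam)


random.seed(0)
T = 1000
xs = [random.gauss(0.0, 1.0) for _ in range(T)]    # covariates
eps = [random.choice([-1.0, 1.0]) for _ in range(T)]  # bounded noise
scaled = [10.0 * x for x in xs]                     # same covariates, rescaled

# Without regularization, rescaling the covariates leaves the quantity unchanged.
print(math.isclose(self_normalized(xs, eps), self_normalized(scaled, eps)))

# With a fixed regularizer lam > 0, the same rescaling changes the value,
# so a bound stated in terms of lam depends on the covariate scale.
print(self_normalized(xs, eps, lam=1.0), self_normalized(scaled, eps, lam=1.0))
```

Rescaling multiplies $S_T^2$ and $V_T$ by the same factor, which is why the unregularized ratio is unchanged while the regularized one is not.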

2605.01611 2026-05-05 cs.CY cs.AI cs.LG

The Case for ESM3 as a General-Purpose AI Model with Systemic Risk Under the EU AI Act

Taro Qureshi, Jacob Griffith, Koen Holtman, Marcel Mir Teijeiro, Ze Shen Chin, Rokas Gipiškis

Comments 8 pages, 1 figure, Technical AI Safety Conference

2605.01610 2026-05-05 cs.HC cs.AI

Less Interaction But More Explanation: A Communication Perspective on Agentic AI Interfaces

Eunchae Jang, S. Shyam Sundar