arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2604.10074 2026-04-14 cs.LG

Transformers Learn the Optimal DDPM Denoiser for Multi-Token GMMs

Hongkang Li, Hancheng Min, Rene Vidal

2604.10073 2026-04-14 cs.LG cs.AI

Graph-RHO: Critical-path-aware Heterogeneous Graph Network for Long-Horizon Flexible Job-Shop Scheduling

Yujie Li, Jiuniu Wang, Mugen Peng, Guangzuo Li, Wenjia Xu

Comments 8 pages, 3 figures; Accepted by IJCNN 2026

详情

英文摘要

Long-horizon Flexible Job-Shop Scheduling~(FJSP) presents a formidable combinatorial challenge due to complex, interdependent decisions spanning extended time horizons. While learning-based Rolling Horizon Optimization~(RHO) has emerged as a promising paradigm to accelerate solving by identifying and fixing invariant operations, its effectiveness is hindered by the structural complexity of FJSP. Existing methods often fail to capture intricate graph-structured dependencies and ignore the asymmetric costs of prediction errors, in which misclassifying critical-path operations is significantly more detrimental than misclassifying non-critical ones. Furthermore, dynamic shifts in predictive confidence during the rolling process make static pruning thresholds inadequate. To address these limitations, we propose Graph-RHO, a novel critical-path-aware graph-based RHO framework. First, we introduce a topology-aware heterogeneous graph network that encodes subproblems as operation-machine graphs with multi-relational edges, leveraging edge-feature-aware message passing to predict operation stability. Second, we incorporate a critical-path-aware mechanism that injects inductive biases during training to distinguish highly sensitive bottleneck operations from robust ones. Third, we devise an adaptive thresholding strategy that dynamically calibrates decision boundaries based on online uncertainty estimation to align model predictions with the solver's search space. Extensive experiments on standard benchmarks demonstrate that \mbox{Graph-RHO} establishes a new state of the art in solution quality and computational efficiency. Remarkably, it exhibits exceptional zero-shot generalization, reducing solve time by over 30\% on large-scale instances (2000 operations) while achieving superior solution quality. Our code is available \href{https://github.com/IntelliSensing/Graph-RHO}{here}.

URL PDF HTML ☆

赞 0 踩 0

2604.10071 2026-04-14 cs.CV

Spotlight and Shadow: Attention-Guided Dual-Anchor Introspective Decoding for MLLM Hallucination Mitigation

Yebo Wu, Han Jin, Zhijiang Guo, Li Li

Comments Accepted for Findings of ACL 2026

2604.10065 2026-04-14 cs.CL cs.AI cs.SD eess.AS

ASPIRin: Action Space Projection for Interactivity-Optimized Reinforcement Learning in Full-Duplex Speech Language Models

Chi-Yuan Hsiao, Ke-Han Lu, Yu-Kuan Fu, Guan-Ting Lin, Hsiao-Tsung Hung, Hung-yi Lee

2604.10064 2026-04-14 cs.CV

On The Application of Linear Attention in Multimodal Transformers

Armin Gerami, Seyedehanita Madani, Ramani Duraiswami

Comments Workshop on Any-to-Any Multimodal Learning (Any2Any), CVPR 2026

2604.10058 2026-04-14 cs.RO cs.CG

A Ray Intersection Algorithm for Fast Growth Distance Computation Between Convex Sets

Akshay Thirugnanam, Koushil Sreenath

Comments 14 pages, 7 figures

2604.10057 2026-04-14 cs.RO

Natural Gradient Gaussian Approximation Filter on Lie Groups for Robot State Estimation

Tianyi Zhang, Wenhan Cao, Chang Liu, Yao Lyu, Shengbo Eben Li

2604.10056 2026-04-14 cs.CV

U$^{2}$Flow: Uncertainty-Aware Unsupervised Optical Flow Estimation

Xunpei Sun, Wenwei Lin, Yi Chang, Gang Chen

Comments Accepted as an oral presentation at CVPR 2026

2604.10054 2026-04-14 cs.LG cs.SD

Cross-Validated Cross-Channel Self-Attention and Denoising for Automatic Modulation Classification

Prakash Suman, Yanzhen Qu

2604.10044 2026-04-14 cs.AI

LoopGuard: Breaking Self-Reinforcing Attention Loops via Dynamic KV Cache Intervention

Dongjie Xu, Hao Wu, Weijie Shi, Yue Cui, Yuanjun Liu, Jiawei Li, Haolun Ma, An Liu, Jia Zhu, Jiajie Xu

2604.10040 2026-04-14 cs.CV

Intra-finger Variability of Diffusion-based Latent Fingerprint Generation

Noor Hussein, Anil K. Jain, Karthik Nandakumar

Comments Accepted at the 2nd Workshop on Foundation and Generative Models in Biometrics (FoundGen-Bio), held in conjunction with CVPR 2026

2604.10039 2026-04-14 cs.CV

Counting to Four is still a Chore for VLMs

Duy Le Dinh Anh, Patrick Amadeus Irawan, Tuan Van Vo

2604.10035 2026-04-14 cs.CL cs.AI

Computational Implementation of a Model of Category-Theoretic Metaphor Comprehension

Fumitaka Iwaki, Miho Fuyama, Hayato Saigo, Tatsuji Takahashi

Comments 7 pages, 8 figures, CogSci member abstract

2604.10034 2026-04-14 cs.AI

AI Achieves a Perfect LSAT Score

Bonmu Ku

2604.10032 2026-04-14 cs.LG cs.AI

Closed-Form Concept Erasure via Double Projections

Chi Zhang, Jingpu Cheng, Zhixian Wang, Ping Liu

2604.10031 2026-04-14 cs.CL cs.AI

CoSToM:Causal-oriented Steering for Intrinsic Theory-of-Mind Alignment in Large Language Models

Mengfan Li, Xuanhua Shi, Yang Deng

Comments Accepted to ACL 2026 (Main Conference)

2604.10030 2026-04-14 cs.CV

Prompt Relay: Inference-Time Temporal Control for Multi-Event Video Generation

Gordon Chen, Ziqi Huang, Ziwei Liu

2604.10024 2026-04-14 cs.CV cs.AI cs.LG

LVSum: A Benchmark for Timestamp-Aware Long Video Summarization

Alkesh Patel, Melis Ozyildirim, Ying-Chang Cheng, Ganesh Nagarajan

Comments 25 pages, 5 tables, 3 figures

2604.10023 2026-04-14 cs.CV cs.AI

FREE-Switch: Frequency-based Dynamic LoRA Switch for Style Transfer

Shenghe Zheng, Minyu Zhang, Tianhao Liu, Hongzhi Wang

Comments CVPR Findings 2026

2604.10022 2026-04-14 cs.CL

Weird Generalization is Weirdly Brittle

Miriam Wanner, Hannah Collison, William Jurayj, Benjamin Van Durme, Mark Dredze, William Walden

2604.10021 2026-04-14 cs.SD cs.LG

Masked Contrastive Pre-Training Improves Music Audio Key Detection

Ori Yonay, Tracy Hammond, Tianbao Yang

Comments Code and models available at github.com/echo-cipher/keymyna

2604.10017 2026-04-14 cs.CV

What and Where to Adapt: Structure-Semantics Co-Tuning for Machine Vision Compression via Synergistic Adapters

Shaobo Liu, Haobo Xiong, Kai Liu, Yuna Lin

Comments Accepted by the IEEE/CVF Conference on Computer Vision and Pattern Recognition Findings, 2026

2604.10014 2026-04-14 cs.CV cs.AI cs.CL

Demographic and Linguistic Bias Evaluation in Omnimodal Language Models

Alaa Elobaid

Comments Accepted at ICPR 2026. Full paper with complete appendix (31 pages total)

2604.10009 2026-04-14 cs.LG cs.CV cs.RO

Towards Multi-Source Domain Generalization for Sleep Staging with Noisy Labels

Kening Wang, Di Wen, Yufan Chen, Ruiping Liu, Junwei Zheng, Jiale Wei, Kailun Yang, Rainer Stiefelhagen, Kunyu Peng

Comments The benchmark and code will be made publicly available at https://github.com/KNWang970918/FF-TRUST.git

2604.10000 2026-04-14 cs.CV

SwinTextUNet: Integrating CLIP-Based Text Guidance into Swin Transformer U-Nets for Medical Image Segmentation

Ashfak Yeafi, Parthaw Goswami, Md Khairul Islam, Ashifa Islam Shamme

2604.09999 2026-04-14 cs.CV

GIF: A Conditional Multimodal Generative Framework for IR Drop Imaging in Chip Layouts

Kiran Thorat, Nicole Meng, Mostafa Karami, Caiwen Ding, Yingjie Lao, Zhijie Jerry Shi

2604.09996 2026-04-14 cs.CV

A Comparative Study of Modern Object Detectors for Robust Apple Detection in Orchard Imagery

Mohammed Asad, Ajai Kumar Gautam, Priyanshu Dhiman, Rishi Raj Prajapati

Comments Accepted at ICICV 2026; 8 pages, 4 figures

2604.09993 2026-04-14 cs.RO math.OC

GPU-Accelerated Continuous-Time Successive Convexification for Contact-Implicit Legged Locomotion

Samuel C. Buckner, Purnanand Elango

Comments Accepted to IEEE ICRA 2026

2604.09991 2026-04-14 cs.CV

Revisiting the Scale Loss Function and Gaussian-Shape Convolution for Infrared Small Target Detection

Hao Li, Man Fung Zhuo

2604.09990 2026-04-14 cs.CV

Gait Recognition with Temporal Kolmogorov-Arnold Networks

Mohammed Asad, Dinesh Kumar Vishwakarma

Comments 10 pages, 4 figures

AI 大模型

视觉与机器人

科学与医疗