arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.22496 2026-04-27 cs.LG

Deep Learning for Model Calibration in Simulation of Itaconic Acid Production

Daria Fokina, Marco Baldan, Constantin Romankiewicz, Wolfgang Laudensack, Roland Ulber, Michael Bortz

详情

英文摘要

In this study, deep learning is used to estimate kinetic parameters for modeling itaconic acid production based on real batch experiments conducted at different agitation speeds and reactor scales. Two deep learning strategies, namely direct deep learning (DDL) and generative conditional flow matching (CFM) are compared and benchmarked against nonlinear regression as a reference method. Compared with DDL, CFM consistently yields more accurate results. The concentration profiles predicted by CFM closely match those obtained from nonlinear regression, whereas DDL results in larger deviations. Similar behavior is observed in the scale-up experiments, where the CFM model again generalizes better and is more robust than the direct approach. These findings demonstrate that CFM can reliably predict system behavior across different operating conditions and scales, offering a flexible and data-efficient framework for parameter estimation in dynamic bioprocess models.

URL PDF HTML ☆

赞 0 踩 0

2604.22479 2026-04-27 cs.CV eess.IV

Improving Driver Drowsiness Detection via Personalized EAR/MAR Thresholds and CNN-Based Classification

Gökdeniz Ersoy, Mehmet Alper Tatar, Eray Tonbul, Serap Kırbız

2604.22464 2026-04-27 cs.LG

Towards Adaptive Continual Model Merging via Manifold-Aware Expert Evolution

Haiyun Qiu, Xingyu Wu, Kay Chen Tan

2604.22455 2026-04-27 cs.AI

On the Hybrid Nature of ABPMS Process Frames and its Implications on Automated Process Discovery

Anti Alman, Izack Cohen, Avigdor Gal, Fabrizio Maria Maggi, Marco Montali

2604.22452 2026-04-27 cs.AI cs.CL cs.LG

Superminds Test: Actively Evaluating Collective Intelligence of Agent Society via Probing Agents

Xirui Li, Ming Li, Yunze Xiao, Ryan Wong, Dianqi Li, Timothy Baldwin, Tianyi Zhou

2604.22446 2026-04-27 cs.AI

From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company

Zhengxu Yu, Yu Fu, Zhiyuan He, Yuxuan Huang, Lee Ka Yiu, Meng Fang, Weilin Luo, Jun Wang

Comments 33 pages,13 figures

详情

英文摘要

Individual agent capabilities have advanced rapidly through modular skills and tool integrations, yet multi-agent systems remain constrained by fixed team structures, tightly coupled coordination logic, and session-bound learning. We argue that this reflects a deeper absence: a principled organisational layer that governs how a workforce of agents is assembled, governed, and improved over time, decoupled from what individual agents know. To fill this gap, we introduce \emph{OneManCompany (OMC)}, a framework that elevates multi-agent systems to the organisational level. OMC encapsulates skills, tools, and runtime configurations into portable agent identities called \emph{Talents}, orchestrated through typed organisational interfaces that abstract over heterogeneous backends. A community-driven \emph{Talent Market} enables on-demand recruitment, allowing the organisation to close capability gaps and reconfigure itself dynamically during execution. Organisational decision-making is operationalised through an \emph{Explore-Execute-Review} ($\text{E}^2$R) tree search, which unifies planning, execution, and evaluation in a single hierarchical loop: tasks are decomposed top-down into accountable units and execution outcomes are aggregated bottom-up to drive systematic review and refinement. This loop provides formal guarantees on termination and deadlock freedom while mirroring the feedback mechanisms of human enterprises. Together, these contributions transform multi-agent systems from static, pre-configured pipelines into self-organising and self-improving AI organisations capable of adapting to open-ended tasks across diverse domains. Empirical evaluation on PRDBench shows that OMC achieves an $84.67\%$ success rate, surpassing the state of the art by $15.48$ percentage points, with cross-domain case studies further demonstrating its generality.

URL PDF HTML ☆

赞 0 踩 0

2604.22442 2026-04-27 cs.LG cs.NE

HubRouter: A Pluggable Sub-Quadratic Routing Primitive for Hybrid Sequence Models

Abhinaba Basu

2604.22439 2026-04-27 cs.CV

NRGS: Neural Regularization for Robust 3D Semantic Gaussian Splatting

Zaiyan Yang, Xinpeng Liu, Heng Guo, Jinglei Shi, Zhanyu Ma, Fumio Okura

2604.22436 2026-04-27 cs.AI cs.IR cs.MA

AgentSearchBench: A Benchmark for AI Agent Search in the Wild

Bin Wu, Arastun Mammadli, Xiaoyu Zhang, Emine Yilmaz

2604.22433 2026-04-27 cs.LG

Beyond Land Surface Temperature: Explainable Spatial Machine Learning Reveals Urban Morphology Effects on Human-Centric Heat Stress

Yuan Wang, Shengao Yi, Xiaojiang Li, Pengyuan Liu, Zhiwei Yang, Ronita Bardhan, Rudi Stouffs

详情

英文摘要

Heat exposure connects the built environment and public health, directly shaping the livability and sustainability of urban areas. Understanding the spatial heterogeneity of heat exposure and its drivers is vital for climate-adaptive urban planning. However, most planning-oriented studies rely on land surface temperature (LST), and whether LST adequately represents human heat exposure and how it differs from physiologically relevant heat stress remains insufficiently examined. Here, adopting Landsat-retrieved 30-m LST and GPU-accelerated 1-m universal thermal climate index (UTCI) in Singapore, this study establishes a comprehensive "Modeling-Comparing-Assessing" framework to systematically evaluate the spatial and mechanistic discrepancies between the two metrics. We further investigate pronounced non-stationary and threshold-based quantitative relationships of the two metrics with urban factors by employing a novel geographically weighted XGBoost (GW-XGBoost) and generalized additive model (GAM) workflow. Our results demonstrate notable discrepancies in spatial patterns of LST and UTCI, along with substantial spatial heterogeneity in how 2D and 3D urban factors impact these two thermal metrics, as revealed by explainable GW-XGBoost models (global out-of-bag R2 = 0.855 for LST and 0.905 for UTCI, respectively). Crucially, spatially explicit SHAP interprets that sky view factor plays a central role in explaining UTCI variability but exhibits a comparatively marginal independent contribution to LST, indicating that LST inadequately captures shading-driven and radiative processes governing actual human heat stress. Notably, SHAP-GAM analysis indicates that higher albedo is associated with increased UTCI. These novel findings provide evidence for integrating physiologically relevant thermal indices to inform targeted heat risk management and climate-adaptive urban planning.

URL PDF HTML ☆

赞 0 踩 0

2604.22428 2026-04-27 cs.AI

CognitiveTwin: Robust Multi-Modal Digital Twins for Predicting Cognitive Decline in Alzheimer's Disease

Bulent Soykan, Gulsah Hancerliogullari Koksalmis, Hsin-Hsiung Huang, Laura J. Brattain

Comments 18 pages, 6 figures

2604.22416 2026-04-27 cs.LG cs.AI

From Local to Cluster: A Unified Framework for Causal Discovery with Latent Variables

Zongyu Li

2604.22413 2026-04-27 cs.LG cs.AI

Distance-Misaligned Training in Graph Transformers and Adaptive Graph-Aware Control

Qinhan Hou, Jing Tang

Comments Accepted by Graph Signal Processing Workshop 2026 as an extended abstract

2604.22411 2026-04-27 cs.AI cs.CL cs.LG

Introducing Background Temperature to Characterise Hidden Randomness in Large Language Models

Alberto Messina, Stefano Scotta

2604.22407 2026-04-27 cs.LG cs.AI

Hidden Failure Modes of Gradient Modification under Adam in Continual Learning, and Adaptive Decoupled Moment Routing as a Repair

Yuelin Hu, Zhenbo Yu, Zhengxue Cheng, Wei Liu, Li Song

Comments 28 pages, 5 figures, preprint

2604.22405 2026-04-27 cs.LG cs.NA math.NA

Robust Fuzzy local k-plane clustering with mixture distance of hinge loss and L1 norm

Junjun Huang, Xiliang Lu, Xuelin Xie, Jerry Zhijian Yang

2604.22390 2026-04-27 cs.CV

Region Matters: Efficient and Reliable Region-Aware Visual Place Recognition

Shunpeng Chen, Yukun Song, Changwei Wang, Rongtao Xu, Kexue Fu, Longxiang Gao, Li Guo, Ruisheng Wang, Shibiao Xu

Comments 25 pages, 13 figures, 10 tables, 1 algorithm

2604.22388 2026-04-27 cs.CV

HFS-TriNet: A Three-Branch Collaborative Feature Learning Network for Prostate Cancer Classification from TRUS Videos

Xu Lu, Qianhong Peng, Qihao Zhou, Shaopeng Liu, Xiuqin Ye, Chuan Yang, Yuan Yuan

2604.22379 2026-04-27 cs.CV

Efficient Diffusion Distillation via Embedding Loss

Jincheng Ying, Yitao Chen, Li Wenlin, Minghui Xu, Yinhao Xiao

详情

英文摘要

Recent advances in distilling expensive diffusion models into efficient few-step generators show significant promise. However, these methods typically demand substantial computational resources and extended training periods, limiting accessibility for resource-constrained researchers, and existing supplementary loss functions have notable limitations. Regression loss requires pre-generating large datasets before training and limits the student model to the teacher's performance, while GAN-based losses suffer from training instability and require careful tuning. In this paper, we propose Embedding Loss (EL), a novel supplementary loss function that complements existing diffusion distillation methods to enhance generation quality and accelerate training with smaller batch sizes. Leveraging feature embeddings from a diverse set of randomly initialized networks, EL effectively aligns the feature distributions between the distilled few-step generator and the original data. By computing Maximum Mean Discrepancy (MMD) in the embedded feature space, EL ensures robust distribution matching, thereby preserving sample fidelity and diversity during distillation. Within distribution matching distillation frameworks, EL demonstrates strong empirical performance for one-step generators. On the CIFAR-10 dataset, our approach achieves state-of-the-art FID values of 1.475 for unconditional generation and 1.380 for conditional generation. Beyond CIFAR-10, we further validate EL across multiple benchmarks and distillation methods, including ImageNet, AFHQ-v2, and FFHQ datasets, using DMD, DI, and CM distillation frameworks, demonstrating consistent improvements over existing one-step distillation methods. Our method also reduces training iterations by up to 80%, offering a more practical and scalable solution for deploying diffusion-based generative models in resource-constrained environments.

URL PDF HTML ☆

赞 0 踩 0

2604.22378 2026-04-27 cs.RO

Adaptive vs. Static Robot-to-Human Handover: A Study on Orientation and Approach Direction

Federico Biagi, Dario Onfiani, Simone Silenzi, Cristina Iani, Luigi Biagiotti

2604.22374 2026-04-27 cs.CL

Selective Contrastive Learning For Gloss Free Sign Language Translation

Changhao Lai, Rui Zhao, Xuewen Zhong, Jinsong Su, Yidong Chen

Comments Accepted by ACL 2026 as the main conference

2604.22367 2026-04-27 cs.CL cs.AI

CNSL-bench: Benchmarking the Sign Language Understanding Capabilities of MLLMs on Chinese National Sign Language

Rui Zhao, Xuewen Zhong, Xiaoyun Zheng, Jinsong Su, Yidong Chen

Comments Accepted as the Main Conference at ACL 2026

2604.22363 2026-04-27 cs.RO cs.AI

LeHome: A Simulation Environment for Deformable Object Manipulation in Household Scenarios

Zeyi Li, Yushi Yang, Shawn Xie, Kyle Xu, Tianxing Chen, Yuran Wang, Zhenhao Shen, Yan Shen, Yue Chen, Wenjun Li, Yukun Zheng, Chaorui Zhang, Siyi Lin, Fei Teng, Hongjun Yang, Ming Chen, Steve Xie, Ruihai Wu

Comments ICRA2026 Accepted

2604.22360 2026-04-27 cs.LG

Revisiting Neural Activation Coverage for Uncertainty Estimation

Benedikt Franke, Nils Förster, Frank Köster, Asja Fischer, Markus Lange, Arne Raulf

Comments Published in 34th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2026

2604.22355 2026-04-27 cs.LG math.OC stat.ML

SOC-ICNN: From Polyhedral to Conic Geometry for Learning Convex Surrogate Functions

Kang Liu, Jianchen Hu

Comments 28 pages and no figure

2604.22354 2026-04-27 cs.CV

One Shot Learning for Edge Detection on Point Clouds

Zhikun Tu, Yuhe Zhang, Yiou Jia, Kang Li, Daniel Cohen-Or

Comments 17 pages, 14 figures. Published in IEEE Transactions on Visualization and Computer Graphics

2604.22350 2026-04-27 cs.CV

PoseFM: Relative Camera Pose Estimation Through Flow Matching

Dominik Kuczkowski, Laura Ruotsalainen

2604.22348 2026-04-27 cs.LG

A Nationwide Japanese Medical Claims Foundation Model: Balancing Model Scaling and Task-Specific Computational Efficiency

Nanae Aratake, Taisei Tosaki, Yuji Okamoto, Eiichiro Uchino, Masaki Nakamura, Nobutomo Matsui, Akiko Hatakama, Yasushi Okuno

Comments 14 pages, 5 figures, 3 tables

2604.22345 2026-04-27 cs.CL

Preference Heads in Large Language Models: A Mechanistic Framework for Interpretable Personalization

Weixu Zhang, Ye Yuan, Changjiang Han, Yuxing Tian, Zipeng Sun, Linfeng Du, Jikun Kang, Hong Kang, Xue Liu, Haolun Wu

Comments Accepted at ACL 2026

2604.22337 2026-04-27 cs.LG

TabSCM: A practical Framework for Generating Realistic Tabular Data

Sven Jacob, Bardh Prenkaj, Weijia Shao, Gjergji Kasneci