arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.20026 2026-04-23 cs.CV

Investigation of cardinality classification for bacterial colony counting using explainable artificial intelligence

Minghua Zheng, Na Helian, Peter C. R. Lane, Yi Sun, Allen Donald

Comments 54 pages, 48 figures

详情

英文摘要

Automatic bacterial colony counting is a highly sought-after technology in modern biological laboratories because it eliminates manual counting effort. Previous work has observed that MicrobiaNet, currently the best-performing cardinality classification model for colony counting, has difficulty distinguishing colonies of three or more individuals. However, it is unclear if this is due to properties of the data together with inherent characteristics of the MicrobiaNet model. By analysing MicrobiaNet with explainable artificial intelligence (XAI), we demonstrate that XAI can provide insights into how data properties constrain cardinality classification performance in colony counting. Our results show that high visual similarity across classes is the key issue hindering further performance improvement, revising prior assertions about MicrobiaNet. These findings suggest future work should focus on models that explicitly incorporate visual similarity or explore density estimation approaches, with broader implications for neural network classifiers trained on imbalanced datasets.

URL PDF HTML ☆

赞 0 踩 0

2604.20021 2026-04-23 cs.LG cs.CL

Continuous Semantic Caching for Low-Cost LLM Serving

Baran Atalar, Xutong Liu, Jinhang Zuo, Siwei Wang, Wei Chen, Carlee Joe-Wong

2604.20019 2026-04-23 cs.LG

Multi-Objective Reinforcement Learning for Generating Covalent Inhibitor Candidates

Renee Gil

2604.20017 2026-04-23 cs.RO

Strain in Sound: Soft Corrugated Tube for Local Strain Sensing with Acoustic Resonance

Michael Chun, Ananya Nukala, Tae Myung Huh

Comments 2025 IEEE 8th International Conference on Soft Robotics (RoboSoft). IEEE, 2025

2604.20012 2026-04-23 cs.CV cs.AI cs.CL

EmbodiedMidtrain: Bridging the Gap between Vision-Language Models and Vision-Language-Action Models via Mid-training

Yiyang Du, Zhanqiu Guo, Xin Ye, Liu Ren, Chenyan Xiong

2604.20006 2026-04-23 cs.CL

From Recall to Forgetting: Benchmarking Long-Term Memory for Personalized Agents

Md Nayem Uddin, Kumar Shubham, Eduardo Blanco, Chitta Baral, Gengyu Wang

Comments Accepted to ACL 2026 Findings

2604.20000 2026-04-23 cs.CV

RareSpot+: A Benchmark, Model, and Active Learning Framework for Small and Rare Wildlife in Aerial Imagery

Bowen Zhang, Jesse T. Boulerice, Charvi Mendiratta, Nikhil Kuniyil, Satish Kumar, Hila Shamon, B. S. Manjunath

2604.19999 2026-04-23 cs.CV

Optimizing Data Augmentation for Real-Time Small UAV Detection: A Lightweight Context-Aware Approach

Amir Zamani, Zeinab Abedini

Comments Accepted for presentation at the 34th International Conference on Electrical Engineering (ICEE 2026)

2604.19998 2026-04-23 cs.AI

What Makes a Good AI Review? Concern-Level Diagnostics for AI Peer Review

Ming Jin

详情

英文摘要

Evaluating AI-generated reviews by verdict agreement is widely recognized as insufficient, yet current alternatives rarely audit which concerns a system identifies, how it prioritizes them, or whether those priorities align with the review rationale that shaped the final assessment. We propose concern alignment, a diagnostic framework that evaluates AI reviews at the concern level rather than only at the verdict level. The framework's core data structure is the match graph, a bipartite alignment between official and AI-generated concerns annotated with match type, severity, and post-rebuttal treatment. From this artifact we derive an evaluation ladder that moves from binary accuracy to concern detection, verdict-stratified behavior, decision-aware calibration, and rebuttal-aware decomposition. In a pilot study of four public AI review systems evaluated in six configurations, concern-level analysis suggests that detection alone does not determine review quality; calibration is often the binding constraint. Systems detect non-trivial fractions of official concerns yet most mark 25--55% of concerns on accepted papers as decisive, where, under our operationalization, no official concern on accepted papers was treated as a decisive blocker. Identical overall verdict accuracy can conceal reject-heavy behavior versus low-recall profiles, and low full-review false decisive rates can partly reflect concern dilution rather than calibrated prioritization. Most systems do not emit a native accept/reject, and inferring it from review tone is method-sensitive, reinforcing the need for concern-level diagnostics that remain stable across inference choices. The contribution is a reusable evaluation framework for auditing which concerns AI reviewers identify, how they weight them, and whether those priorities align with the review rationale that informed the paper's final assessment.

URL PDF HTML ☆

赞 0 踩 0

2604.19989 2026-04-23 cs.CV

Online CS-based SAR Edge-Mapping

Conor Flynn, Radoslav Ivanov, Birsen Yazici

Comments SPIE Defense and Commercial Sensing 2026, Algorithms for Synthetic Aperture Radar Imagery XXXIII

2604.19980 2026-04-23 cs.RO cs.SY eess.SY

Efficient Reinforcement Learning using Linear Koopman Dynamics for Nonlinear Robotic Systems

Wenjian Hao, Yuxuan Fang, Zehui Lu, Shaoshuai Mou

2604.19979 2026-04-23 cs.LG cs.CE cs.CV

Fast Amortized Fitting of Scientific Signals Across Time and Ensembles via Transferable Neural Fields

Sophia Zorek, Kushal Vyas, Yuhao Liu, David Lenz, Tom Peterka, Guha Balakrishnan

2604.19976 2026-04-23 cs.CV

Lucky High Dynamic Range Smartphone Imaging

Baiang Li, Ruyu Yan, Ethan Tseng, Zhoutong Zhang, Adam Finkelstein, Jiawen Chen, Felix Heide

Comments 13 pages, 12 figures

2604.19974 2026-04-23 cs.LG cs.CL

Are LLM Uncertainty and Correctness Encoded by the Same Features? A Functional Dissociation via Sparse Autoencoders

Het Patel, Tiejin Chen, Hua Wei, Evangelos E. Papalexakis, Jia Chen

2604.19966 2026-04-23 cs.CV cs.AI cs.LG cs.RO

DistortBench: Benchmarking Vision Language Models on Image Distortion Identification

Divyanshu Goyal, Akhil Eppa, Vanya Bannihatti Kumar

2604.19954 2026-04-23 cs.CV

Camera Control for Text-to-Image Generation via Learning Viewpoint Tokens

Xinxuan Lu, Charless Fowlkes, Alexander C. Berg

2604.19946 2026-04-23 cs.RO

SL(C)AMma: Simultaneous Localisation, (Calibration) and Mapping With a Magnetometer Array

Thomas Edridge, Manon Kok

Comments 10 pages, 8 figures, 1 table, python implementation available at https://github.com/Tedridge/SL-C-AMma, experimental data available at https://hendeby.gitlab-pages.liu.se/research/magdata/

2604.19945 2026-04-23 cs.CV

Visual Reasoning through Tool-supervised Reinforcement Learning

Qihua Dong, Gozde Sahin, Pei Wang, Zhaowei Cai, Robik Shrestha, Hao Yang, Davide Modolo

Comments Accepted to CVPR 2026 Findings. 17 pages

2604.19943 2026-04-23 cs.CL

Structured Disagreement in Health-Literacy Annotation: Epistemic Stability, Conceptual Difficulty, and Agreement-Stratified Inference

Olga Kellert, Sriya Kondury, Candice Koo, Nemika Tyagi, Steffen Eikenberry

Comments 8 pages, 5 figures

2604.19937 2026-04-23 cs.CV cs.AI

Infection-Reasoner: A Compact Vision-Language Model for Wound Infection Classification with Evidence-Grounded Clinical Reasoning

Palawat Busaranuvong, Reza Saadati Fard, Emmanuel Agu, Deepak Kumar, Shefalika Gautam, Bengisu Tulu, Diane Strong

2604.19936 2026-04-23 cs.LG cs.AI

Generalization and Membership Inference Attack a Practical Perspective

Fateme Rahmani, Mahdi Jafari Siavoshani, Mohammad Hossein Rohban

2604.19926 2026-04-23 cs.AI

CreativeGame:Toward Mechanic-Aware Creative Game Generation

Hongnan Ma, Han Wang, Shenglin Wang, Tieyue Yin, Yiwei Shi, Yucong Huang, Yingtian Zou, Muning Wen, Mengyue Yang

详情

英文摘要

Large language models can generate plausible game code, but turning this capability into \emph{iterative creative improvement} remains difficult. In practice, single-shot generation often produces brittle runtime behavior, weak accumulation of experience across versions, and creativity scores that are too subjective to serve as reliable optimization signals. A further limitation is that mechanics are frequently treated only as post-hoc descriptions, rather than as explicit objects that can be planned, tracked, preserved, and evaluated during generation. This report presents \textbf{CreativeGame}, a multi-agent system for iterative HTML5 game generation that addresses these issues through four coupled ideas: a proxy reward centered on programmatic signals rather than pure LLM judgment; lineage-scoped memory for cross-version experience accumulation; runtime validation integrated into both repair and reward; and a mechanic-guided planning loop in which retrieved mechanic knowledge is converted into an explicit mechanic plan before code generation begins. The goal is not merely to produce a playable artifact in one step, but to support interpretable version-to-version evolution. The current system contains 71 stored lineages, 88 saved nodes, and a 774-entry global mechanic archive, implemented in 6{,}181 lines of Python together with inspection and visualization tooling. The system is therefore substantial enough to support architectural analysis, reward inspection, and real lineage-level case studies rather than only prompt-level demos. A real 4-generation lineage shows that mechanic-level innovation can emerge in later versions and can be inspected directly through version-to-version records. The central contribution is therefore not only game generation, but a concrete pipeline for observing progressive evolution through explicit mechanic change.

URL PDF HTML ☆

赞 0 踩 0

2604.19921 2026-04-23 cs.CL

Commonsense Knowledge with Negation: A Resource to Enhance Negation Understanding

Zijie Wang, MohammadHossein Rezaei, Farzana Rashid, Eduardo Blanco

Comments Accepted at Findings of ACL 2026

2604.19907 2026-04-23 cs.CV

SceneOrchestra: Efficient Agentic 3D Scene Synthesis via Full Tool-Call Trajectory Generation

Yun He, Kelin Yu, Matthias Zwicker

详情

英文摘要

Recent agentic frameworks for 3D scene synthesis have advanced realism and diversity by integrating heterogeneous generation and editing tools. These tools are organized into workflows orchestrated by an off-the-shelf LLM. Current approaches typically adopt an execute-review-reflect loop: at each step, the orchestrator executes a tool, renders intermediate results for review, and then decides on the tool and its parameters for the next step. However, this design has two key limitations. First, next-step tool selection and parameter configuration are driven by heuristic rules, which can lead to suboptimal execution flows, unnecessary tool invocations, degraded output quality, and increased runtime. Second, rendering and reviewing intermediate results after each step introduces additional latency. To address these issues, we propose SceneOrchestra, a trainable orchestration framework that optimizes the tool-call execution flow and eliminates the step-by-step review loop, improving both efficiency and output quality. SceneOrchestra consists of an orchestrator and a discriminator, which we fine-tune with a two-phase training strategy. In the first phase, the orchestrator learns context-aware tool selection and complete tool-call trajectory generation, while the discriminator is trained to assess the quality of full trajectories, enabling it to select the best trajectory from multiple candidates. In the second phase, we perform interleaved training, where the discriminator adapts to the orchestrator's evolving trajectory distribution and distills its discriminative capability back into the orchestrator. At inference, we only use the orchestrator to generate and execute full tool-call trajectories from instructions, without requiring the discriminator. Extensive experiments show that our method achieves state-of-the-art scene quality while reducing runtime compared to previous work.

URL PDF HTML ☆

赞 0 踩 0

2604.19903 2026-04-23 cs.LG cs.AI cs.CY

A Multi-Plant Machine Learning Framework for Emission Prediction, Forecasting, and Control in Cement Manufacturing

Sheikh Junaid Fayaz, Nestor D. Montiel-Bohorquez, Wilson Ricardo Leal da Silva, Shashank Bishnoi, Matteo Romano, Manuele Gatti, N. M. Anoop Krishnan

2604.19902 2026-04-23 cs.CV cs.AI cs.LG

MMCORE: MultiModal COnnection with Representation Aligned Latent Embeddings

Zijie Li, Yichun Shi, Jingxiang Sun, Ye Wang, Yixuan Huang, Zhiyao Guo, Xiaochen Lian, Peihao Zhu, Yu Tian, Zhonghua Zhai, Peng Wang

2604.19895 2026-04-23 cs.AI

Learning When Not to Decide: A Framework for Overcoming Factual Presumptuousness in AI Adjudication

Mohamed Afane, Emily Robitschek, Derek Ouyang, Daniel E. Ho

2604.19888 2026-04-23 cs.CV

SGAP-Gaze: Scene Grid Attention Based Point-of-Gaze Estimation Network for Driver Gaze

Pavan Kumar Sharma, Pranamesh Chakraborty

2604.19887 2026-04-23 cs.CL cs.AI

Depression Risk Assessment in Social Media via Large Language Models

Giorgia Gulino, Manuel Petrucci

2604.19884 2026-04-23 cs.CL cs.AI cs.LG

From Signal Degradation to Computation Collapse: Uncovering the Two Failure Modes of LLM Quantization

Chenxi Zhou, Pengfei Cao, Jiang Li, Bohan Yu, Jinyu Ye, Jun Zhao, Kang Liu

Comments Accepted to Findings of ACL 2026