arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2510.07248 2026-04-21 cs.CL

Don't Adapt Small Language Models for Tools; Adapt Tool Schemas to the Models

Jonggeun Lee, Woojung Song, Jongwook Han, Haesung Pyun, Yohan Jo

Comments Accepted at ACL 2026 (Main)

2510.05336 2026-04-21 cs.CL cs.AI

WeatherArchive-Bench: Benchmarking Retrieval-Augmented Reasoning for Historical Weather Archives

Yongan Yu, Xianda Du, Qingchen Hu, Jiahao Liang, Jingwei Ni, Dan Qiang, Kaiyu Huang, Grant McKenzie, Renee Sieber, Fengran Mo

Comments accepted to the Resource Track of SIGIR 2026

2510.02798 2026-04-21 cs.LG cs.AI

OptunaHub: A Platform for Black-Box Optimization

Yoshihiko Ozaki, Shuhei Watanabe, Toshihiko Yanase

Comments Submitted to Journal of machine learning research

2510.02370 2026-04-21 cs.CL cs.AI

How Training Data Shapes the Use of Parametric and In-Context Knowledge in Language Models

Minsung Kim, Dong-Kyum Kim, Jea Kwon, Nakyeong Yang, Kyomin Jung, Meeyoung Cha

Comments 16 pages

2510.02001 2026-04-21 cs.CV cs.AI

Generating Findings for Jaw Cysts in Dental Panoramic Radiographs Using a GPT-Based VLM: A Preliminary Study on Building a Two-Stage Self-Correction Loop with Structured Output (SLSO) Framework

Nanaka Hosokawa, Ryo Takahashi, Tomoya Kitano, Yukihiro Iida, Chisako Muramatsu, Tatsuro Hayashi, Yuta Seino, Xiangrong Zhou, Takeshi Hara, Akitoshi Katsumata, Hiroshi Fujita

Comments Revised manuscript; supplementary materials added. Published in Diagnostics

Journal ref Diagnostics 2026, 16, 1096

详情

DOI: 10.3390/diagnostics16071096

英文摘要

Vision-language models (VLMs) such as GPT (Generative Pre-Trained Transformer) have shown potential for medical image interpretation; however, challenges remain in generating reliable radiological findings in clinical practice, as exemplified by dental pathologies. This study proposes a Self-correction Loop with Structured Output (SLSO) framework as an integrated processing methodology to enhance the accuracy and reliability of AI-generated findings for jaw cysts in dental panoramic radiographs. Dental panoramic radiographs with jaw cysts were used to implement a 10-step integrated processing framework incorporating image analysis, structured data generation, tooth number extraction, consistency checking, and iterative regeneration. The framework functioned as an external validation mechanism for GPT outputs. Performance was compared against the conventional Chain-of-Thought (CoT) method across seven evaluation items: transparency, internal structure, borders, root resorption, tooth movement, relationships with other structures, and tooth number. The SLSO framework improved output accuracy for multiple items compared to the CoT method, with the most notable improvements observed in tooth number identification, tooth movement detection, and root resorption assessment. In successful cases, consistently structured outputs were achieved after up to five regenerations. The framework enforced explicit negative finding descriptions and suppressed hallucinations, although accurate identification of extensive lesions spanning multiple teeth remained limited. This investigation established the feasibility of the proposed integrated processing methodology and provided a foundation for future validation studies with larger, more diverse datasets.

URL PDF HTML ☆

赞 0 踩 0

2509.26278 2026-04-21 cs.CV cs.CL

ProfVLM: A lightweight video-language model for multi-view proficiency estimation

Edoardo Bianchi, Jacopo Staiano, Antonio Liotta

Journal ref Computer Vision and Image Understanding, Volume 268, 2026

2509.25944 2026-04-21 cs.AI

NuRisk: A Visual Question Answering Dataset for Agent-Level Risk Assessment in Autonomous Driving

Yuan Gao, Mattia Piccinini, Roberto Brusnicki, Yuchen Zhang, Johannes Betz

Comments 2026 IEEE International Conference on Robotics and Automation (ICRA)

2509.25699 2026-04-21 cs.CV

AIM-CoT: Active Information-driven Multimodal Chain-of-Thought for Vision-Language Reasoning

Xiping Li, Jianghong Ma

Comments Accepted by ACL 2026 Main Conference. 30 pages, 6 figures

2509.25210 2026-04-21 cs.LG cs.AI physics.ao-ph

STCast: Adaptive Boundary Alignment for Global and Regional Weather Forecasting

Hao Chen, Tao Han, Jie Zhang, Song Guo, Lei Bai

Comments This paper has already been accepted by CVPR 2026 (Highlight)

2509.24328 2026-04-21 cs.CL

Speculative Verification: Exploiting Information Gain to Refine Speculative Decoding

Sungkyun Kim, Jaemin Kim, Dogyung Yoon, Jiho Shin, Junyeol Lee, Jiwon Seo

Comments 16 pages, 8 figures, accepted to ACL 2026 Findings

2509.23542 2026-04-21 cs.CL cs.AI cs.LG

On the Shelf Life of Fine-Tuned LLM-Judges: Future-Proofing, Backward-Compatibility, and Question Generalization

Janvijay Singh, Austin Xu, Yilun Zhou, Yefan Zhou, Dilek Hakkani-Tur, Shafiq Joty

Comments Updated after ICLR 2026 Acceptance; 29 pages;

2509.22297 2026-04-21 cs.AI

Large Language Models as Nondeterministic Causal Models

Sander Beckers

Comments Accepted at KR 2026

2509.19979 2026-04-21 cs.CV

CamPVG: Camera-Controlled Panoramic Video Generation with Epipolar-Aware Diffusion

Chenhao Ji, Chaohui Yu, Junyao Gao, Fan Wang, Cairong Zhao

Comments SIGGRAPH Asia 2025

2509.18611 2026-04-21 cs.LG cs.AI

Flow marching for a generative PDE foundation model

Zituo Chen, Sili Deng

Comments This work has been substantially expanded and superseded by arXiv:2602.11229

2509.16538 2026-04-21 cs.CV cs.CL

VC-Inspector: Advancing Reference-free Evaluation of Video Captions with Factual Analysis

Shubhashis Roy Dipta, Tz-Ying Wu, Subarna Tripathi

Comments Accepted at ACL 2026 (Main)

2509.15974 2026-04-21 cs.CL cs.AI cs.LG

BEFT: Bias-Efficient Fine-Tuning of Language Models in Low-Data Regimes

Baichuan Huang, Ananth Balashankar, Amir Aminifar

2509.15651 2026-04-21 cs.LG cs.AI

Toward Efficient Influence Function: Dropout as a Compression Tool

Yuchen Zhang, Mohammad Mohammadi Amiri

Journal ref Transactions on Machine Learning Research, 02/2026

2509.11983 2026-04-21 cs.LG math.OC

Low-rank Orthogonalization for Large-scale Matrix Optimization with Applications to Foundation Model Training

Chuan He, Zhanwang Deng, Zhaosong Lu

Comments 20 pages, add numerical comparison with Galore and SOAP

2509.11612 2026-04-21 cs.LG

Topology Structure Optimization of Reservoirs Using GLMY Homology

Yu Chen, Shengwei Wang, Hongwei Lin

2509.04334 2026-04-21 cs.CV

GeoArena: Evaluating Open-World Geographic Reasoning in Large Vision-Language Models

Pengyue Jia, Yingyi Zhang, Xiangyu Zhao, Sharon Li

Comments ACL 2026 Main

2509.01082 2026-04-21 cs.LG cs.PL

RefineStat: Efficient Exploration for Probabilistic Program Synthesis

Madhav Kanda, Shubham Ugare, Sasa Misailovic

Comments RefineStat constrains LM decoding with statistical validity checks and uses diagnostic-guided resampling (priors/likelihoods) to transform small LMs' drafts into correct, reliable probabilistic programs that can match or surpass closed-source models

Journal ref ICLR 2026 (Oral)

2509.00789 2026-04-21 cs.CV

CogDriver: Integrating Cognitive Inertia for Temporally Coherent Planning in Autonomous Driving

Pei Liu, Qingtian Ning, Xinyan Lu, Haipeng Liu, Weiliang Ma, Dangen She, Peng Jia, Xianpeng Lang, Jun Ma

2508.19564 2026-04-21 cs.LG cs.AI

Bi-LoRA: Efficient Sharpness-Aware Minimization for Fine-Tuning Large-Scale Models

Yuhang Liu, Tao Li, Zhehao Huang, Zuopeng Yang, Xiaolin Huang

Comments 32 pages,ICLR 2026

2508.17458 2026-04-21 cs.CL

Evaluating the Impact of Verbal Multiword Expressions on Machine Translation

Linfeng Liu, Saptarshi Ghosh, Tianyu Jiang

Comments ACL 2026, 29 pages, 10 figures, Code URL: https://github.com/cincynlp/vmwe-mt-eval

2508.17434 2026-04-21 cs.CV

TinySR: Pruning Diffusion for Real-World Image Super-Resolution

Linwei Dong, Qingnan Fan, Yuhang Yu, Qi Zhang, Jinwei Chen, Yawei Luo, Changqing Zou

2508.17394 2026-04-21 cs.CV

LVLM-Aware Multimodal Retrieval for RAG-Based Medical Diagnosis with General-Purpose Models

Nir Mazor, Tom Hope

2508.16464 2026-04-21 cs.CL

What makes an entity salient in discourse?

Amir Zeldes, Jessica Lin

Comments To appear in Corpus Linguistics and Linguistic Theory

2508.15815 2026-04-21 cs.CL cs.AI cs.HC

User-Assistant Bias in LLMs

Xu Pan, Jingxuan Fan, Zidi Xiong, Ely Hahami, Jorin Overwiening, Ziqian Xie

2508.14461 2026-04-21 cs.CV

Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering

Shanlin Sun, Yifan Wang, Hanwen Zhang, Yifeng Xiong, Qin Ren, Ruogu Fang, Xiaohui Xie, Chenyu You

Comments Accepted by ICCV 2025

2508.12782 2026-04-21 cs.AI

HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds

Petr Anokhin, Roman Khalikov, Stefan Rebrikov, Viktor Volkov, Artyom Sorokin, Vincent Bissonnette

Comments Code is available at https://github.com/stefanrer/HeroBench

AI 大模型

视觉与机器人

科学与医疗