arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.18715 2026-04-22 cs.CL cs.AI

Characterizing AlphaEarth Embedding Geometry for Agentic Environmental Reasoning

Mashrekur Rahman, Samuel J. Barrett, Christina Last

详情

英文摘要

Earth observation foundation models encode land surface information into dense embedding vectors, yet the geometric structure of these representations and its implications for downstream reasoning remain underexplored. We characterize the manifold geometry of Google AlphaEarth's 64-dimensional embeddings across 12.1 million Continental United States samples (2017--2023) and develop an agentic system that leverages this geometric understanding for environmental reasoning. The manifold is non-Euclidean: effective dimensionality is 13.3 (participation ratio) from 64 raw dimensions, with local intrinsic dimensionality of approximately 10. Tangent spaces rotate substantially, with 84\% of locations exceeding 60\textdegree{} and local-global alignment (mean$|\cosθ| = 0.17$) approaching the random baseline of 0.125. Supervised linear probes indicate that concept directions rotate across the manifold, and compositional vector arithmetic using both PCA-derived and probe-derived directions yields poor precision. Retrieval instead produces physically coherent results, with local geometry predicting retrieval coherence ($R^2 = 0.32$). Building on this characterization, we introduce an agentic system with nine specialized tools that decomposes environmental queries into reasoning chains over a FAISS-indexed embedding database. A five-condition ablation (120 queries, three complexity tiers) shows that embedding retrieval dominates response quality ($μ= 3.79 \pm 0.90$ vs.\ $3.03 \pm 0.77$ parametric-only; scale 1--5), with peak performance on multi-step comparisons ($μ= 4.28 \pm 0.43$). A cross-model benchmark show that geometric tools reduce Sonnet 4.5's score by 0.12 points but improve Opus 4.6's by 0.07, with Opus achieving higher geometric grounding (3.38 vs.\ 2.64), suggesting that the value of geometric characterization scales with the reasoning capability of the consuming model.

URL PDF HTML ☆

赞 0 踩 0

2604.18713 2026-04-22 cs.CV

Align then Refine: Text-Guided 3D Prostate Lesion Segmentation

Cuiling Sun, Linkai Peng, Adam Murphy, Elif Keles, Hiten D. Patel, Ashley Ross, Frank Miller, Baris Turkbey, Andrea Mia Bejar, Halil Ertugrul Aktas, Gorkem Durak, Ulas Bagci

Comments Accepted to EMBC 2026

2604.18712 2026-04-22 cs.CL

Probing for Reading Times

Eleftheria Tsipidi, Samuel Kiegeland, Francesco Ignazio Re, Tianyang Xu, Mario Giulianelli, Karolina Stanczak, Ryan Cotterell

Comments ACL 2026 (main conference)

2604.18665 2026-04-22 cs.SD

APRVOS: 1st Place Winner of 5th PVUW MeViS-Audio Track

Deshui Miao, Yameng Gu, Chao Yang, Xin Li, Haijun Zhang, Ming-Hsuan Yang

2604.18645 2026-04-22 cs.AI

On Solving the Multiple Variable Gapped Longest Common Subsequence Problem

Marko Djukanović, Nikola Balaban, Christian Blum, Aleksandar Kartelj, Sašo Džeroski, Žiga Zebec

2604.18639 2026-04-22 cs.LG cs.AI

Easy Samples Are All You Need: Self-Evolving LLMs via Data-Efficient Reinforcement Learning

Zhiyin Yu, Bo Zhang, Qibin Hou, Zhonghai Wu, Xiao Luo, Lei Bai

Comments Accepted to Findings of ACL 2026

2604.18636 2026-04-22 cs.SD cs.LG

Virtual boundary integral neural network for three-dimensional exterior acoustic problems

Jiahao Li, Qiang Xi, Ilia Marchevskiy, Zhuojia Fu

2604.18632 2026-04-22 cs.CV stat.AP

StomaD2: An All-in-One System for Intelligent Stomatal Phenotype Analysis via Diffusion-Based Restoration Detection Network

Quanling Zhao, Meng'en Qin, Yanfeng Sun, Yuan Miao, Xiaohui Yang

2604.18631 2026-04-22 cs.SD

Towards Revised Tempo Indications for Beethoven's Piano and Cello Sonatas: Czerny, Moscheles, Kolisch, and Recorded Practice 1930-2012

Ignasi Sole

2604.18630 2026-04-22 cs.SD

A Complementary Visualisation Suite for Empirical Performance Analysis: Tempographs, Histograms, Ridgeline Plots, Stacked Bar Charts, and Combination Charts Applied to Beethoven's Piano and Cello Sonatas

Ignasi Sole

详情

英文摘要

The choice of visualisation in empirical performance analysis is not a neutral presentation decision but an analytical one: different graphical forms reveal different features of the same dataset, and reliance on any single type systematically conceals what the others expose. This paper presents and argues for a suite of five complementary visualisation tools; tempographs, histograms with spline-smoothed probability density functions, ridgeline plots, stacked bar charts, and combination charts. These are applied to bar-level beats-per-minute data from recordings of Beethoven's five piano and cello sonatas (Op.~5 Nos.~1 and~2; Op.~69; Op.~102 Nos.~1 and~2) spanning 1930--2012. Each tool is described formally, its analytical properties characterised, its implementation detailed in working Python and MATLAB code, and its specific contribution demonstrated on a worked example using two recordings of Op.~5 No.~1 (Casals/Horszowski 1930--39 and Isserlis/Levin 2012) separated by eight decades. A five-panel composite figure applies all five tools to the same two recordings simultaneously, making the complementarity argument concrete: the tempograph reveals moment-to-moment structural parallels invisible in aggregate statistics; the spline-smoothed histogram exposes bimodality and secondary peaks suppressed by binning artefacts; the ridgeline plot positions both recordings within the full distributional space; the stacked bar chart shows divergent sectional pacing concealed by identical movement means; and the combination chart integrates mean tempo, variability, and historical reference marks in a single view. The spline-CDF smoothing method, applied to histogram data via cubic spline interpolation with zero-slope boundary conditions, is presented as a novel contribution to the performance analysis toolkit. Full implementation code is publicly available.

URL PDF HTML ☆

赞 0 踩 0

2604.18627 2026-04-22 cs.CV cs.RO

Vision-Based Human Awareness Estimation for Enhanced Safety and Efficiency of AMRs in Industrial Warehouses

Maximilian Haug, Christian Stippel, Lukas Pscherer, Benjamin Schwendinger, Ralph Hoch, Angel Gaydarov, Sebastian Schlund, Thilo Sauter

Comments 5 pages, 2 figures

2604.18623 2026-04-22 cs.CV

Can We Build Scene Graphs, Not Classify Them? FlowSG: Progressive Image-Conditioned Scene Graph Generation with Flow Matching

Xin Hu, Ke Qin, Wen Yin, Yuan-Fang Li, Ming Li, Tao He

2604.18592 2026-04-22 cs.CL cs.AI

Two-dimensional early exit optimisation of LLM inference

Jan Hůla, David Adamczyk, Tomáš Filip, Martin Pavlíček, Petr Sosík

2604.18566 2026-04-22 cs.AI cs.HC cs.LG

Benchmarking System Dynamics AI Assistants: Cloud Versus Local LLMs on CLD Extraction and Discussion

Terry Leitch

2604.18564 2026-04-22 cs.CV

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

Haoyu Wu, Jiwen Yu, Yingtian Zou, Xihui Liu

Comments 15 pages, 10 figures

2604.18509 2026-04-22 cs.CL

MASS-RAG: Multi-Agent Synthesis Retrieval-Augmented Generation

Xingchen Xiao, Heyan Huang, Runheng Liu, Jincheng Xie

Comments ACL 2026 Findings, 19 pages

2604.18177 2026-04-22 cs.CL cs.AI

STaD: Scaffolded Task Design for Identifying Compositional Skill Gaps in LLMs

Sungeun An, Swanand Ravindra Kadhe, Shailja Thakur, Chad DeLuca, Hima Patel

Comments 9 pages, 3 figures, 3 tables, ACL Findings 2026

2604.18067 2026-04-22 cs.LG

Towards Real-Time ECG and EMG Modeling on $μ$NPUs

Josh Millar, Ashok Samraj Thangarajan, Soumyajit Chatterjee, Hamed Haddadi

2604.17821 2026-04-22 cs.AI

WebUncertainty: Dual-Level Uncertainty Driven Planning and Reasoning For Autonomous Web Agent

Lingfeng Zhang, Yongan Sun, Jinpeng Hu, Hui Ma, Yang Ying, Kuien Liu, Zenglin Shi, Meng Wang

2604.17797 2026-04-22 cs.CV

Weakly-Supervised Referring Video Object Segmentation through Text Supervision

Miaojing Shi, Jun Huang, Zijie Yue, Hanli Wang

Comments Accepted by CVPR 2026 Findings

2604.17789 2026-04-22 cs.CV cs.AI cs.CL

DuQuant++: Fine-grained Rotation Enhances Microscaling FP4 Quantization

Haokun Lin, Xinle Jia, Haobo Xu, Bingchen Yao, Xianglong Guo, Yichen Wu, Zhichao Lu, Ying Wei, Qingfu Zhang, Zhenan Sun

Comments Technical Report

2604.17736 2026-04-22 cs.CV

IncreFA: Breaking the Static Wall of Generative Model Attribution

Haotian Qin, Dongliang Chang, Yueying Gao, Yuexuan Tan, Lei Chen, Zhanyu Ma

Comments Accepted to CVPR 2026, with appendix

2604.17458 2026-04-22 cs.AI

EHRAG: Bridging Semantic Gaps in Lightweight GraphRAG via Hybrid Hypergraph Construction and Retrieval

Yifan Song, Xingjian Tao, Zhicheng Yang, Yihong Luo, Jing Tang

Comments Accepted by Findings of ACL2026

2604.17299 2026-04-22 cs.CL cs.AI

Cat-DPO: Category-Adaptive Safety Alignment

Tiankai Yang, Yi Nian, Xinyuan Li, Ruiyao Xu, Kaize Ding, Yue Zhao

Comments 23 pages, 6 figures

2604.17257 2026-04-22 cs.CL cs.AI

REZE: Representation Regularization for Domain-adaptive Text Embedding Pre-finetuning

Seungmin Lee, Jeonghwan Lee, Hyunkuk Lim, Sejoon Kim, Mingi Sung

Comments ACL 2026 Main

2604.17141 2026-04-22 cs.CL

SciImpact: A Multi-Dimensional, Multi-Field Benchmark for Scientific Impact Prediction

Hangxiao Zhu, Yuyu Zhang, Ping Nie, Yu Zhang

2604.16937 2026-04-22 cs.CL

No One Fits All: From Fixed Prompting to Learned Routing in Multilingual LLMs

Wei-Chi Wu, Sheng-Lun Wei, Hen-Hsen Huang, Hsin-Hsi Chen

Comments Accepted as a short findings paper at ACL 2026

2604.16763 2026-04-22 cs.LG

LLM-Extracted Covariates for Clinical Causal Inference: Rethinking Integration Strategies

Lei Liu, Jialin Chen, Kathy Macropol

2604.16535 2026-04-22 cs.LG cs.AI

SCATR: Simple Calibrated Test-Time Ranking

Divya Shyamal, Marta Knežević, Lan Tran, Chanakya Ekbote, Vijay Lingam, Paul Pu Liang

2604.16487 2026-04-22 cs.CV cs.AI

Geometry-Aware CLIP Retrieval via Local Cross-Modal Alignment and Steering

Nirmalendu Prakash, Narmeen Fatimah Oozeer, Xin Su, Phillip Howard, Shaan Shah, Zoe Wanying He, Shuang Wu, Shivam Raval, Roy Ka-Wei Lee, Meenakshi Khosla, Amir Abdullah