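arXivDaily: a daily academic digest synced with the full arXiv feed, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, and electrical engineering and systems.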

2604.18940 2026-04-22 cs.CV cs.RO

Localization-Guided Foreground Augmentation in Autonomous Driving

Jiawei Yong, Deyuan Qu, Qi Chen, Kentaro Oguchi, Shintaro Fukushima

Abstract

Autonomous driving systems often degrade under adverse visibility conditions, such as rain, nighttime, or snow, where online scene geometry (e.g., lane dividers, road boundaries, and pedestrian crossings) becomes sparse or fragmented. While high-definition (HD) maps can provide missing structural context, they are costly to construct and maintain at scale. We propose Localization-Guided Foreground Augmentation (LG-FA), a lightweight and plug-and-play inference module that enhances foreground perception by enriching geometric context online. LG-FA: (i) incrementally constructs a sparse global vector layer from per-frame Bird's-Eye View (BEV) predictions; (ii) estimates ego pose via class-constrained geometric alignment, jointly improving localization and completing missing local topology; and (iii) reprojects the augmented foreground into a unified global frame to improve per-frame predictions. Experiments on challenging nuScenes sequences demonstrate that LG-FA improves the geometric completeness and temporal stability of BEV representations, reduces localization error, and produces globally consistent lane and topology reconstructions. The module can be seamlessly integrated into existing BEV-based perception systems without backbone modification. By providing a reliable geometric context prior, LG-FA enhances temporal consistency and supplies stable structural support for downstream modules such as tracking and decision-making.
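
Step (ii) of the abstract, pose estimation by class-constrained geometric alignment, might be pictured with the minimal sketch below. It is not the authors' estimator: it assumes 2D points tagged with a class label, matches each per-frame point to the nearest global point of the same class, and solves one closed-form rigid alignment. The function name `class_constrained_align` and the `(class_label, x, y)` tuple layout are hypothetical, chosen only for illustration.

```python
import math


def class_constrained_align(local_pts, global_pts):
    """Estimate a 2D rigid transform (theta, tx, ty) mapping local points onto global ones.

    Each point is a (class_label, x, y) tuple. A local point may only be
    matched to a global point that carries the same class label.
    """
    pairs = []
    for label, x, y in local_pts:
        same_class = [(gx, gy) for g_label, gx, gy in global_pts if g_label == label]
        if not same_class:
            continue  # no global element of this class to match against
        nearest = min(same_class, key=lambda g: (g[0] - x) ** 2 + (g[1] - y) ** 2)
        pairs.append(((x, y), nearest))
    if len(pairs) < 2:
        raise ValueError("need at least two matched points")

    # Centroids of the matched local and global points.
    n = len(pairs)
    lcx = sum(a[0] for a, _ in pairs) / n
    lcy = sum(a[1] for a, _ in pairs) / n
    gcx = sum(b[0] for _, b in pairs) / n
    gcy = sum(b[1] for _, b in pairs) / n

    # Closed-form least-squares rotation for 2D point pairs.
    s_cos = 0.0
    s_sin = 0.0
    for (ax, ay), (bx, by) in pairs:
        ax, ay, bx, by = ax - lcx, ay - lcy, bx - gcx, by - gcy
        s_cos += ax * bx + ay * by
        s_sin += ax * by - ay * bx
    theta = math.atan2(s_sin, s_cos)

    # Translation that carries the rotated local centroid onto the global centroid.
    c, s = math.cos(theta), math.sin(theta)
    tx = gcx - (c * lcx - s * lcy)
    ty = gcy - (s * lcx + c * lcy)
    return theta, tx, ty
```

A practical aligner would likely repeat the match-and-solve step until the pose stops changing and would reject outlier matches. The sketch shows a single step, which is exact only when the initial nearest-neighbour matches are already correct.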

2604.18939 2026-04-22 cs.LG

TabEmb: Joint Semantic-Structure Embedding for Table Annotation

Ehsan Hoseinzade, Ke Wang, Anandharaju Durai Raju

2604.18936 2026-04-22 cs.LG cs.AI hep-ph hep-th

Fine-Tuning Small Reasoning Models for Quantum Field Theory

Nathaniel S. Woodward, Zhiqi Gao, Yurii Kvasiuk, Kendrick M. Smith, Frederic Sala, Moritz Münchmeyer

2604.18934 2026-04-22 cs.AI

AutomationBench

Daniel Shepard, Robin Salimans

2604.18933 2026-04-22 cs.RO cs.AI

Gated Memory Policy

Yihuai Gao, Jinyun Liu, Shuang Li, Shuran Song

2604.18932 2026-04-22 cs.SD cs.AI

Tadabur: A Large-Scale Quran Audio Dataset

Faisal Alherran

Comments Project page: https://fherran.github.io/tadabur/

2604.18914 2026-04-22 cs.CL cs.AI cs.LG

MORPHOGEN: A Multilingual Benchmark for Evaluating Gender-Aware Morphological Generation

Mehul Agarwal, Aditya Aggarwal, Arnav Goel, Medha Hira, Anubha Gupta

Comments 25 pages, accepted to ACL 2026 (Main)

2604.18913 2026-04-22 cs.CL

LogosKG: Hardware-Optimized Scalable and Interpretable Knowledge Graph Retrieval

He Cheng, Yifu Wu, Saksham Khatwani, Maya Kruse, Dmitriy Dligach, Timothy A. Miller, Majid Afshar, Yanjun Gao

Comments Accepted to the ACL 2026 Main Conference. 9 pages

2604.18912 2026-04-22 cs.LG stat.ME

Collaborative Contextual Bayesian Optimization

Chih-Yu Chang, Qiyuan Chen, Tianhan Gao, David Fenning, Chinedum Okwudire, Neil Dasgupta, Wei Lu, Raed Al Kontar

2604.18907 2026-04-22 cs.LG cs.AI

Gradient-Based Program Synthesis with Neurally Interpreted Languages

Matthew V. Macfarlane, Clément Bonnet, Herke van Hoof, Levi H. S. Lelis

Comments 26 pages, The International Conference on Learning Representations (ICLR)

Abstract

A central challenge in program induction has long been the trade-off between symbolic and neural approaches. Symbolic methods offer compositional generalisation and data efficiency, yet their scalability is constrained by formalisms such as domain-specific languages (DSLs), which are labour-intensive to create and may not transfer to new domains. In contrast, neural networks flexibly learn from data but tend to generalise poorly in compositional and out-of-distribution settings. We bridge this divide with an instance of a Latent Adaptation Network architecture named Neural Language Interpreter (NLI), which learns its own discrete, symbolic-like programming language end-to-end. NLI autonomously discovers a vocabulary of primitive operations and uses a novel differentiable neural executor to interpret variable-length sequences of these primitives. This allows NLI to represent programs that are not bound to a constant number of computation steps, enabling it to solve more complex problems than those seen during training. To make these discrete, compositional program structures amenable to gradient-based optimisation, we employ the Gumbel-Softmax relaxation, enabling the entire model to be trained end-to-end. Crucially, this same differentiability enables powerful test-time adaptation. At inference, NLI's program inductor provides an initial program guess. This guess is then refined via gradient descent through the neural executor, enabling efficient search for the neural program that best explains the given data. We demonstrate that NLI outperforms in-context learning, test-time training, and continuous latent program networks on tasks that require combinatorial generalisation and rapid adaptation to unseen tasks. Our results establish a new path toward models that combine the compositionality of discrete languages with the gradient-based search and end-to-end learning of neural networks.
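
The Gumbel-Softmax relaxation named in the abstract can be illustrated with the minimal, dependency-free sketch below. It is not the paper's code: it only shows how a discrete choice over primitives is replaced by a smooth probability vector. The function name `gumbel_softmax` and the `uniform` parameter (a source of Uniform(0, 1) samples) are assumptions made for this example.

```python
import math
import random


def gumbel_softmax(logits, temperature=1.0, uniform=random.random):
    """Return a relaxed one-hot sample over len(logits) categories."""
    noisy = []
    for logit in logits:
        # Clamp the uniform draw so both logarithms below stay finite.
        u = min(max(uniform(), 1e-12), 1.0 - 1e-12)
        gumbel = -math.log(-math.log(u))  # Gumbel(0, 1) noise
        noisy.append((logit + gumbel) / temperature)

    # Numerically stable softmax over the perturbed, temperature-scaled logits.
    top = max(noisy)
    exps = [math.exp(v - top) for v in noisy]
    total = sum(exps)
    return [e / total for e in exps]
```

Lower temperatures push the output toward a one-hot vector, while higher temperatures smooth it toward uniform. In an automatic-differentiation framework, the same computation lets gradients flow through the sampling step into the logits.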

2604.18900 2026-04-22 cs.RO

Thrust Regulation Through Wing Linkage Modulation on the Aerobat Platform: Piezoelectric Slip-Stick Actuated Regulator Development

Luca Ciampaglia

2604.18897 2026-04-22 cs.CL cs.LG

Less Is More: Cognitive Load and the Single-Prompt Ceiling in LLM Mathematical Reasoning

Manuel Israel Cazares

Comments Companion repository: https://github.com/israelcazares/sair-prompt-engineering | Zenodo DOI: 10.5281/zenodo.19598433 | v15: final Contributor Network data (n=52, competition close April 20, 2026)

2604.18892 2026-04-22 cs.CL

Prioritizing the Best: Incentivizing Reliable Multimodal Reasoning by Rewarding Beyond Answer Correctness

Mengzhao Jia, Zhihan Zhang, Meng Jiang

2604.18889 2026-04-22 cs.LG

AC-SINDy: Compositional Sparse Identification of Nonlinear Dynamics

Peter Racioppo

2604.18887 2026-04-22 cs.RO cs.SY eess.SY

HALO: Hybrid Auto-encoded Locomotion with Learned Latent Dynamics, Poincaré Maps, and Regions of Attraction

Blake Werner, Sergio A. Esteban, Massimiliano De Sa, Max H. Cohen, Aaron D. Ames

Comments 20 pages, 8 figures

2604.18882 2026-04-22 cs.AI cs.LO cs.PL

Formally Verified Patent Analysis via Dependent Type Theory: Machine-Checkable Certificates from a Hybrid AI + Lean 4 Pipeline

George Koomullil

Comments 100 pages, 8 figures, 9 tables, 6 algorithms

2604.18881 2026-04-22 cs.CV cs.AI

A Proxy Consistency Loss for Grounded Fusion of Earth Observation and Location Encoders

Zhongying Wang, Kevin Lane, Levi Cai, Morteza Karimzadeh, Esther Rolf

Comments Accepted to EarthVision 2026 (CVPR Workshop). 13 pages total (10 pages main paper + 3 pages supplementary material), 5 main figures

2604.18880 2026-04-22 cs.CL cs.AI

Where Fake Citations Are Made: Tracing Field-Level Hallucination to Specific Neurons in LLMs

Yuefei Chen, Yihao Quan, Xiaodong Lin, Ruixiang Tang

2604.18874 2026-04-22 cs.AI

How Adversarial Environments Mislead Agentic AI?

Zhonghao Zhan, Huichi Zhou, Zhenhao Li, Peiyuan Jing, Krinos Li, Hamed Haddadi

Comments Accepted to Findings of the Association for Computational Linguistics: ACL 2026

2604.18873 2026-04-22 cs.AI

From Natural Language to Executable Narsese: A Neuro-Symbolic Benchmark and Pipeline for Reasoning with NARS

Mina Gabriel, Pei Wang

Comments 14 pages. Submitted to AGI-26

2604.18868 2026-04-22 cs.LG

Subgraph Concept Networks: Concept Levels in Graph Classification

Lucie Charlotte Magister, Alexander Norcliffe, Iulia Duta, Pietro Lio

2604.18867 2026-04-22 cs.CV cs.AI cs.LG

Hierarchically Robust Zero-shot Vision-language Models

Junhao Dong, Yifei Zhang, Hao Zhu, Yew-Soon Ong, Piotr Koniusz

Comments This paper is accepted by CVPR'26

2604.18864 2026-04-22 cs.LG stat.ML

ParamBoost: Gradient Boosted Piecewise Cubic Polynomials

Nicolas Salvadé, Tim Hillel

2604.18857 2026-04-22 cs.LG cs.CV

Task Switching Without Forgetting via Proximal Decoupling

Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger, William A. P. Smith, Yue Lu

Comments Submitted to IEEE TPAMI January 2026

2604.18856 2026-04-22 cs.CV

ConvVitMamba: Efficient Multiscale Convolution, Transformer, and Mamba-Based Sequence modelling for Hyperspectral Image Classification

Mohammed Q. Alkhatib

Comments Pre-print Accepted for Publication in International Journal of Remote Sensing

2604.18853 2026-04-22 cs.CV

DDF2Pol: A Dual-Domain Feature Fusion Network for PolSAR Image Classification

Mohammed Q. Alkhatib

Comments Pre-print Accepted for Publication in Pattern Recognition Letters

2604.18842 2026-04-22 cs.CV

Multi-Domain Learning with Global Expert Mapping

Pourya Shamsolmoali, Masoumeh Zareapoor, Huiyu Zhou, Oscar Mendez, Dacheng Tao, Xuelong Li

Comments Submitted to IEEE TPAMI in August 2025

2604.18839 2026-04-22 cs.LG cs.AI

One Step Forward and K Steps Back: Better Reasoning with Denoising Recursion Models

Chris Cameron, Wangzheng Wang, Nikita Ivanov, Ashmita Bhattacharyya, Didier Chételat, Yingxue Zhang

2604.18838 2026-04-22 cs.AI quant-ph

Quantum inspired qubit qutrit neural networks for real time financial forecasting

Kanishk Bakshi, Kathiravan Srinivasan

Comments 16 pages, 7 figures. Published in Scientific Reports (2025)

2604.18835 2026-04-22 cs.CL cs.AI cs.LG

Semantic Needles in Document Haystacks: Sensitivity Testing of LLM-as-a-Judge Similarity Scoring

Sinan G. Aksoy, Alexandra A. Sabrio, Erik VonKaenel, Lee Burke

Comments 15 pages, 8 figures