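arXivDaily daily academic digest: synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and more.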

2512.03730 2026-04-13 cs.CV cs.AI

Out-of-the-box: Black-box Causal Attacks on Object Detectors

Melane Navaratnarajah, David A. Kelly, Hana Chockler

Comments 14 pages, 12 pages of appendices

Abstract

Adversarial perturbations are a useful way to expose vulnerabilities in object detectors. Existing perturbation methods are frequently white-box and architecture-specific, and use a loss function. More importantly, while they are often successful, it is rarely clear why they work. Insights into the mechanism of this success would allow developers to understand and analyze these attacks, as well as fine-tune the model to prevent them. This paper presents BlackCAtt, a black-box algorithm and tool, which uses minimal, causally sufficient pixel sets to construct explainable, imperceptible, reproducible, architecture-agnostic attacks on object detectors. We evaluate BlackCAtt on standard benchmarks and compare it to other black-box adversarial attack methods. When BlackCAtt has access only to the position and label of a bounding box, it produces attacks that are comparable to or better than those produced by other black-box methods. When BlackCAtt has access to the model confidence as well, it can work as a meta-algorithm, improving the ability of standard black-box techniques to construct smaller, less perceptible attacks. As BlackCAtt attacks manipulate causes only, the attacks become fully explainable. We compare the performance of BlackCAtt with other black-box attack methods and show that targeting causal pixels leads to smaller and less perceptible attacks. For example, when using BlackCAtt with SquareAttack, it reduces the average distance ($L_0$ norm) of the attack from the original input from $0.987$ to $0.072$, while maintaining a similar success rate. We perform ablation studies on the BlackCAtt algorithm and analyze the effect of different components on its performance.
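
To make the $L_0$ figures in the abstract more concrete, the sketch below computes a normalized $L_0$ distance, read here as the fraction of pixel positions that differ between an original and a perturbed image. Normalizing by the pixel count, the nested-list image layout, and the name `normalized_l0` are all assumptions made for illustration; the paper's exact definition may differ.

```python
def normalized_l0(original, perturbed):
    """Return the fraction of pixel positions whose values differ.

    Both images are lists of rows with the same shape (a hypothetical
    layout chosen only to keep the sketch dependency-free).
    """
    total = 0
    changed = 0
    for row_a, row_b in zip(original, perturbed):
        for a, b in zip(row_a, row_b):
            total += 1
            if a != b:
                changed += 1
    # Guard against empty images to avoid division by zero.
    return changed / total if total else 0.0


# Toy 2x4 image in which exactly one of eight pixels is perturbed.
original = [[0, 0, 0, 0], [0, 0, 0, 0]]
perturbed = [[0, 9, 0, 0], [0, 0, 0, 0]]
distance = normalized_l0(original, perturbed)
print(distance)
```

Under this reading, a value near 1 would mean almost every pixel was changed, while a value near 0 would mean the perturbation touches only a small part of the image.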

2511.20151 2026-04-13 cs.CV

A Compact Hybrid Convolution--Frequency State Space Network for Learned Image Compression

Haodong Pan, Hao Wei, Yusong Wang, Nanning Zheng, Caigui Jiang

Comments 20 pages, 11 figures

2511.04256 2026-04-13 cs.CL

SSPO: Subsentence-level Policy Optimization

Kun Yang, Zikang Chen, Yanmeng Wang, Zhigen Li, Ning Cheng, Shaojun Wang, Jing Xiao

2511.01383 2026-04-13 cs.RO

CaRLi-V: Camera-RADAR-LiDAR Point-Wise 3D Velocity Estimation

Landson Guo, Andres M. Diaz Aguilar, William Talbot, Turcan Tuna, Marco Hutter, Cesar Cadena

2510.23636 2026-04-13 cs.LG cs.AI cs.CL

LLM4Delay: Flight Delay Prediction via Cross-Modality Adaptation of Large Language Models and Aircraft Trajectory Representation

Thaweerath Phisannupawong, Joshua Julian Damanik, Han-Lim Choi

Comments Preprint submitted to IEEE Transactions on Intelligent Transportation Systems (T-ITS) for possible publication

2510.10181 2026-04-13 cs.RO cs.AI cs.CV

Dejavu: Towards Experience Feedback Learning for Embodied Intelligence

Shaokai Wu, Yanbiao Ji, Qiuchang Li, Zhiyi Zhang, Qichen He, Wenyuan Xie, Guodong Zhang, Bayram Bayramli, Yue Ding, Hongtao Lu

2510.01767 2026-04-13 cs.CV

LoBE-GS: Load-Balanced and Efficient 3D Gaussian Splatting for Large-Scale Scene Reconstruction

Sheng-Hsiang Hung, Ting-Yu Yen, Wei-Fang Sun, Simon See, Shih-Hsuan Hung, Hung-Kuo Chu

2508.13792 2026-04-13 cs.CV

VisionLaw: Inferring Interpretable Intrinsic Dynamics from Visual Observations via Bilevel Optimization

Jiajing Lin, Shu Jiang, Qingyuan Zeng, Zhenzhong Wang, Min Jiang

Comments Accepted by ICLR 2026; Project Page: https://github.com/JiajingLin/VisionLaw

2508.06869 2026-04-13 cs.CV cs.AI

VSI: Visual Subtitle Integration for Keyframe Selection to enhance Long Video Understanding

Jianxiang He, Meisheng Hong, Jungang Li, Weiyu Guo, Xuming Hu, Hui Xiong

Comments Accepted to CVPR 2026 Findings, 10 pages

2508.06656 2026-04-13 cs.CV

ClusterMark: Towards Robust Watermarking for Autoregressive Image Generators with Visual Token Clustering

Denis Lukovnikov, Andreas Müller, Erwin Quiring, Asja Fischer

Comments CVPR 2026

2507.20185 2026-04-13 cs.CL

SessionIntentBench: A Multi-task Inter-session Intention-shift Modeling Benchmark for E-commerce Customer Behavior Understanding

Yuqi Yang, Weiqi Wang, Baixuan Xu, Wei Fan, Qing Zong, Chunkit Chan, Zheye Deng, Xin Liu, Yifan Gao, Changlong Yu, Chen Luo, Yang Li, Zheng Li, Qingyu Yin, Bing Yin, Yangqiu Song

Comments Findings of ACL 2026

2506.17788 2026-04-13 cs.AI cs.CL cs.LG cs.MA

Bayesian Social Deduction with Graph-Informed Language Models

Shahab Rahimirad, Guven Gergerli, Lucia Romero, Angela Qian, Matthew Lyle Olson, Simon Stepputtis, Joseph Campbell

Comments Accepted to ACL 2026 main conference

2506.04676 2026-04-13 cs.CV cs.AI cs.LG cs.MA

Gen-n-Val: Agentic Image Data Generation and Validation

Jing-En Huang, I-Sheng Fang, Tzuhsuan Huang, Yu-Lun Liu, Chih-Yu Wang, Jun-Cheng Chen

Comments Accepted to the CVPR 2026 Findings track

2505.23808 2026-04-13 cs.CL cs.AI

DenseLoRA: Dense Low-Rank Adaptation of Large Language Models

Lin Mu, Xiaoyu Wang, Li Ni, Yang Li, Zhize Wu, Peiquan Jin, Yiwen Zhang

2505.21472 2026-04-13 cs.CV cs.CL cs.MM

Mitigating Hallucination in Large Vision-Language Models via Adaptive Attention Calibration

Mehrdad Fazli, Bowen Wei, Ahmet Sari, Ziwei Zhu

2503.00035 2026-04-13 cs.CL cs.AI cs.LG

Constraining Sequential Model Editing with Editing Anchor Compression

Hao-Xiang Xu, Jun-Yu Ma, Zhen-Hua Ling, Ningyu Zhang, Jia-Chen Gu

Comments Accepted by NAACL 2025 Findings

2502.13718 2026-04-13 cs.CL

MSMO-ABSA: Multi-Scale and Multi-Objective Optimization for Cross-Lingual Aspect-Based Sentiment Analysis

Chengyan Wu, Bolei Ma, Ningyuan Deng, Yanqing He, Yun Xue, Xiaoyong Liu

Comments ACL 2026

2502.13388 2026-04-13 cs.AI

Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

Xiaojie Xu, Zongyuan Li, Chang Lu, Runnan Qi, Yanan Ni, Lumin Jiang, Xiangbei Liu, Xuebo Zhang, Yongchun Fang, Kuihua Huang, Xian Guo, Zhanghua Wu, Zhenya Li

2502.06809 2026-04-13 cs.LG cs.AI cs.CL

Neurons Speak in Ranges: Breaking Free from Discrete Neuronal Attribution

Muhammad Umair Haider, Hammad Rizwan, Hassan Sajjad, Peizhong Ju, A. B. Siddique

2410.08559 2026-04-13 cs.LG cs.AI

Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive Architecture

Sehun Kim

Comments ECG segmentation experiments are added. Comparisons with recent ECG foundation models are added

2604.09282 2026-04-13 cs.RO cs.CV

Characterizing Lidar Range-Measurement Ambiguity due to Multiple Returns

Jason H. Rife, Yifan Li

Comments Proceedings of the 38th International Technical Meeting of the Satellite Division of The Institute of Navigation (ION GNSS+ 2025), Baltimore, Maryland, September 2025, pp. 1949-1963

2604.09276 2026-04-13 cs.LG

Distributed Online Convex Optimization with Compressed Communication: Optimal Regret and Applications

Sifan Yang, Dan-Yue Li, Lijun Zhang

2604.09271 2026-04-13 cs.LG

The causal relation between off-street parking and electric vehicle adoption in Scotland

Bernardino D'Amico, Achille Fonzone, Emma Hart

Abstract

The transition to electric mobility hinges on maximising aggregate adoption while also facilitating equitable access. This study examines whether the 'charging divide' between households with and without off-street parking reflects a genuine infrastructure constraint or a by-product of socio-economic disparity. Moving beyond conventional predictive models, we apply a probabilistic causal framework to a nationally representative dataset of Scottish households, enabling estimation of policy interventions while explicitly neutralising the confounding effect of other causal factors. The results reveal a structural hierarchy in the EV adoption process. Private off-street parking functions as a conversion catalyst: enabling access to home-charging increases the probability of EV ownership from 3.3% to 5.6% (a 70% relative, 2.3 percentage point absolute increase). However, this effect primarily accelerates households already economically positioned to purchase an EV rather than recruiting new entrants. By contrast, household income operates as the fundamental affordability ceiling. A causal contrast between lower- and higher-income strata shows a reduction in market non-participation by 23.1 percentage points, identifying financial capacity as the principal gatekeeper to entering the EV transition funnel. Crucially, the analysis demonstrates that standard observational models overstate the isolated effect of off-street parking infrastructure. The apparent effect emerges from selection bias: higher-income households are disproportionately likely to possess both private parking and the means to purchase EVs. These findings support a dual-track policy strategy: lowering the affordability ceiling for non-participants through financial instruments, while addressing EV home-charging access for the 'latent intent' cohort in high-density urban contexts.
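
The abstract reports the parking effect both as an absolute and as a relative change. The short sketch below reproduces that arithmetic from the two quoted probabilities (3.3% and 5.6%). The helper name `adoption_change` is hypothetical and is not taken from the paper.

```python
def adoption_change(before_pct, after_pct):
    """Return (absolute change in percentage points, relative change in %)."""
    absolute_pp = after_pct - before_pct
    relative_pct = 100.0 * absolute_pp / before_pct
    return absolute_pp, relative_pct


# Probabilities of EV ownership quoted in the abstract, in percent.
absolute_pp, relative_pct = adoption_change(3.3, 5.6)
print(round(absolute_pp, 1), round(relative_pct, 1))
```

The absolute difference is 2.3 percentage points, and 2.3 / 3.3 is roughly 0.697, which the abstract rounds to a 70% relative increase.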

2604.09265 2026-04-13 cs.CL

EthicMind: A Risk-Aware Framework for Ethical-Emotional Alignment in Multi-Turn Dialogue

Jiawen Deng, Wei Li, Wentao Zhang, Ziyun Jiao, Fuji Ren

Comments 18 pages, Accepted to the ACL 2026 Main Conference

2604.09260 2026-04-13 cs.CV cs.GR cs.LG

Beyond Segmentation: Structurally Informed Facade Parsing from Imperfect Images

Maciej Janicki, Aleksander Plocharski, Przemyslaw Musialski

Comments 4 pages, 4 figures, EUROGRAPHICS 2026 Short Paper

2604.09253 2026-04-13 cs.CV cs.AI

Mosaic: Multimodal Jailbreak against Closed-Source VLMs via Multi-View Ensemble Optimization

Yuqin Lan, Gen Li, Yuanze Hu, Weihao Shen, Zhaoxin Fan, Faguo Wu, Xiao Zhang, Laurence T. Yang, Zhiming Zheng

Comments 14 pages, 9 figures

2604.09246 2026-04-13 cs.SD cs.AI

DDSP-QbE++: Improving Speech Quality for Speech Anonymisation for Atypical Speech

Suhita Ghosh, Yamini Sinha, Sebastian Stober

Comments Accepted at the CHI workshop (Speech AI For All) 2026

2604.09240 2026-04-13 cs.LG

DiffHLS: Differential Learning for High-Level Synthesis QoR Prediction with GNNs and LLM Code Embeddings

Zedong Peng, Zeju Li, Qiang Xu, Jieru Zhao

2604.09237 2026-04-13 cs.CL

ScheMatiQ: From Research Question to Structured Data through Interactive Schema Discovery

Shahar Levy, Eliya Habba, Reshef Mintz, Barak Raveh, Renana Keydar, Gabriel Stanovsky

2604.09234 2026-04-13 cs.LG cs.AI cs.NE

Statistical Properties of the King Wen Sequence: An Anti-Habituation Structure That Does Not Improve Neural Network Training

Augustin Chan

Comments 9 pages, 8 tables, negative results paper. Code and data: https://doi.org/10.5281/zenodo.14679537