arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.09544 2026-04-13 cs.CL cs.AI cs.LG

Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism

Hadas Orgad, Boyi Wei, Kaden Zheng, Martin Wattenberg, Peter Henderson, Seraphina Goldfarb-Tarrant, Yonatan Belinkov

详情

英文摘要

Large language models (LLMs) undergo alignment training to avoid harmful behaviors, yet the resulting safeguards remain brittle: jailbreaks routinely bypass them, and fine-tuning on narrow domains can induce ``emergent misalignment'' that generalizes broadly. Whether this brittleness reflects a fundamental lack of coherent internal organization for harmfulness remains unclear. Here we use targeted weight pruning as a causal intervention to probe the internal organization of harmfulness in LLMs. We find that harmful content generation depends on a compact set of weights that are general across harm types and distinct from benign capabilities. Aligned models exhibit a greater compression of harm generation weights than unaligned counterparts, indicating that alignment reshapes harmful representations internally--despite the brittleness of safety guardrails at the surface level. This compression explains emergent misalignment: if weights of harmful capabilities are compressed, fine-tuning that engages these weights in one domain can trigger broad misalignment. Consistent with this, pruning harm generation weights in a narrow domain substantially reduces emergent misalignment. Notably, LLMs harmful generation capability is dissociated from how they recognize and explain such content. Together, these results reveal a coherent internal structure for harmfulness in LLMs that may serve as a foundation for more principled approaches to safety.

URL PDF HTML ☆

赞 0 踩 0

2604.09537 2026-04-13 cs.CL cs.AI cs.IR cs.LG

Case-Grounded Evidence Verification: A Framework for Constructing Evidence-Sensitive Supervision

Soroosh Tayebi Arasteh, Mehdi Joodaki, Mahshad Lotfinia, Sven Nebelung, Daniel Truhn

2604.09535 2026-04-13 cs.CV

EgoTL: Egocentric Think-Aloud Chains for Long-Horizon Tasks

Lulin Liu, Dayou Li, Yiqing Liang, Sicong Jiang, Hitesh Vijay, Hezhen Hu, Xuhai Xu, Zirui Liu, Srinivas Shakkottai, Manling Li, Zhiwen Fan

Comments https://ego-tl.github.io/

2604.09532 2026-04-13 cs.CV cs.AI

Seeing is Believing: Robust Vision-Guided Cross-Modal Prompt Learning under Label Noise

Zibin Geng, Xuefeng Jiang, Jia Li, Zheng Li, Tian Wen, Lvhua Wu, Sheng Sun, Yuwei Wang, Min Liu

2604.09531 2026-04-13 cs.CV cs.AI cs.CL

VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images

Guanyu Zhou, Yida Yin, Wenhao Chai, Shengbang Tong, Xingyu Fu, Zhuang Liu

Comments Project Page: https://zlab-princeton.github.io/VisionFoundry/

2604.09529 2026-04-13 cs.CV cs.AI cs.CL

VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning

Wenyi Xiao, Xinchi Xu, Leilei Gan

Comments 24 pages, ACL 2026 Main. Repository: https://github.com/Mr-Loevan/VL-Calibration

2604.09527 2026-04-13 cs.CV cs.AI cs.LG

Envisioning the Future, One Step at a Time

Stefan Andreas Baumann, Jannik Wiese, Tommaso Martorella, Mahdi M. Kalayeh, Björn Ommer

Comments CVPR 2026. For code and models, see http://compvis.github.io/myriad

2604.09523 2026-04-13 cs.LG cs.MA

Event-Driven Temporal Graph Networks for Asynchronous Multi-Agent Cyber Defense in NetForge_RL

Igor Jankowski

Comments 26 pages, 14 figures, 5 tables

2604.09514 2026-04-13 cs.CL cs.HC

Many Ways to Be Fake: Benchmarking Fake News Detection Under Strategy-Driven AI Generation

Xinyu Wang, Sai Koneru, Wenbo Zhang, Wenliang Zheng, Saksham Ranjan, Sarah Rajtmajer

2604.09512 2026-04-13 cs.LG physics.optics

Integrated electro-optic attention nonlinearities for transformers

Luis Mickeler, Kai Lion, Alfonso Nardi, Jost Kellner, Pierre Didier, Bhavin J. Shastri, Niao He, Rachel Grange

2604.09511 2026-04-13 cs.CV

RIRF: Reasoning Image Restoration Framework

Wending Yan, Rongkai Zhang, Kaihua Tang, Yu Cheng, Qiankun Liu

2604.09508 2026-04-13 cs.CV cs.AI

VISOR: Agentic Visual Retrieval-Augmented Generation via Iterative Search and Over-horizon Reasoning

Yucheng Shen, Jiulong Wu, Jizhou Huang, Dawei Yin, Lingyong Yan, Min Cao

2604.09501 2026-04-13 cs.CL

You Can't Fight in Here! This is BBS!

Richard Futrell, Kyle Mahowald

Comments Accepted at Behavioral and Brain Sciences as a response to the commentaries to the accepted target article "How Linguistics Learned to Stop Worrying and Love the Language Models", whose preprint appears here: arXiv:2501.17047

2604.09499 2026-04-13 cs.RO

Physics-Informed Reinforcement Learning of Spatial Density Velocity Potentials for Map-Free Racing

Shathushan Sivashangaran, Apoorva Khairnar, Sepideh Gohari, Vihaan Dutta, Azim Eskandarian

2604.09497 2026-04-13 cs.CL cs.AI

BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation

Hippolyte Gisserot-Boukhlef, Nicolas Boizard, Emmanuel Malherbe, Céline Hudelot, Pierre Colombo

2604.09494 2026-04-13 cs.CL cs.AI cs.IR cs.LG

RecaLLM: Addressing the Lost-in-Thought Phenomenon with Explicit In-Context Retrieval

Kyle Whitecross, Negin Rahimi

Comments Code, data, and models available at https://github.com/kswhitecross/RecaLLM

2604.09480 2026-04-13 cs.CV

Online3R: Online Learning for Consistent Sequential Reconstruction Based on Geometry Foundation Model

Shunkai Zhou, Zike Yan, Fei Xue, Dong Wu, Yuchen Deng, Hongbin Zha

2604.09478 2026-04-13 cs.CV cs.RO

Incremental Semantics-Aided Meshing from LiDAR-Inertial Odometry and RGB Direct Label Transfer

Muhammad Affan, Ville Lehtola, George Vosselman

Comments 8 pages, 5 figures, 2 tables. Accepted in ISPRS Archives 2026

2604.09474 2026-04-13 cs.RO cs.AI

SafeMind: A Risk-Aware Differentiable Control Framework for Adaptive and Safe Quadruped Locomotion

Zukun Zhang, Kai Shu, Mingqiao Mo

2604.09473 2026-04-13 cs.CV

Realizing Immersive Volumetric Video: A Multimodal Framework for 6-DoF VR Engagement

Zhengxian Yang, Shengqi Wang, Shi Pan, Hongshuai Li, Haoxiang Wang, Lin Li, Guanjun Li, Zhengqi Wen, Borong Lin, Jianhua Tao, Tao Yu

Comments Journal extension of CVPR 2025. See also arXiv:2503.14359 . Project page and code: https://github.com/Metaverse-AI-Lab-THU/ImViD

详情

英文摘要

Fully immersive experiences that tightly integrate 6-DoF visual and auditory interaction are essential for virtual and augmented reality. While such experiences can be achieved through computer-generated content, constructing them directly from real-world captured videos remains largely unexplored. We introduce Immersive Volumetric Videos, a new volumetric media format designed to provide large 6-DoF interaction spaces, audiovisual feedback, and high-resolution, high-frame-rate dynamic content. To support IVV construction, we present ImViD, a multi-view, multi-modal dataset built upon a space-oriented capture philosophy. Our custom capture rig enables synchronized multi-view video-audio acquisition during motion, facilitating efficient capture of complex indoor and outdoor scenes with rich foreground--background interactions and challenging dynamics. The dataset provides 5K-resolution videos at 60 FPS with durations of 1-5 minutes, offering richer spatial, temporal, and multimodal coverage than existing benchmarks. Leveraging this dataset, we develop a dynamic light field reconstruction framework built upon a Gaussian-based spatio-temporal representation, incorporating flow-guided sparse initialization, joint camera temporal calibration, and multi-term spatio-temporal supervision for robust and accurate modeling of complex motion. We further propose, to our knowledge, the first method for sound field reconstruction from such multi-view audiovisual data. Together, these components form a unified pipeline for immersive volumetric video production. Extensive benchmarks and immersive VR experiments demonstrate that our pipeline generates high-quality, temporally stable audiovisual volumetric content with large 6-DoF interaction spaces. This work provides both a foundational definition and a practical construction methodology for immersive volumetric videos.

URL PDF HTML ☆

赞 0 踩 0

2604.09470 2026-04-13 cs.CL

Agentic Jackal: Live Execution and Semantic Value Grounding for Text-to-JQL

Vishnu Murali, Anmol Gulati, Elias Lumer, Kevin Frank, Sindy Campagna, Vamse Kumar Subbiah

2604.09466 2026-04-13 cs.CL

Across the Levels of Analysis: Explaining Predictive Processing in Humans Requires More Than Machine-Estimated Probabilities

Sathvik Nair, Colin Phillips

Comments 9 pages, Behavioral & Brain Sciences Commentary on Futrell & Mahowald (forthcoming)

2604.09462 2026-04-13 cs.RO

Adaptor: Advancing Assistive Teleoperation with Few-Shot Learning and Cross-Operator Generalization

Yu Liu, Yihang Yin, Tianlv Huang, Fei Yan, Yuan Xu, Weinan Hong, Wei Han, Yue Cao, Xiangyu Chen, Zipei Fan, Xuan Song

Comments Accepted to the 2026 IEEE International Conference on Robotics and Automation (ICRA 2026)

2604.09455 2026-04-13 cs.AI

E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning

Weiyang Guo, Zesheng Shi, Liye Zhao, Jiayuan Ma, Zeen Zhu, Junxian He, Min Zhang, Jing Li

Comments 22 pages 10 figures, published in acl2026

2604.09452 2026-04-13 cs.LG cs.AI

SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning

Maksim Anisimov, Francesco Belardinelli, Matthew Wicker

Comments Code available at: https://github.com/maxanisimov/provably-safe-policy-updates

2604.09445 2026-04-13 cs.CV

AsymLoc: Towards Asymmetric Feature Matching for Efficient Visual Localization

Mohammad Omama, Gabriele Berton, Eric Foxlin, Yelin Kim

2604.09442 2026-04-13 cs.CL

UIPress: Bringing Optical Token Compression to UI-to-Code Generation

Dasen Dai, Shuoqi Li, Ronghao Chen, Huacan Wang, Biao Wu, Qizhen Lan

Comments 10 pages, 3 figures

2604.09437 2026-04-13 cs.LG

AdaCubic: An Adaptive Cubic Regularization Optimizer for Deep Learning

Ioannis Tsingalis, Constantine Kotropoulos, Corentin Briat

2604.09436 2026-04-13 cs.CV

SCoRe: Clean Image Generation from Diffusion Models Trained on Noisy Images

Yuta Matsuzaki, Seiichi Uchida, Shumpei Takezaki

Comments Accepted at IJCNN2026

2604.09431 2026-04-13 cs.RO

Musculoskeletal Motion Imitation for Learning Personalized Exoskeleton Control Policy in Impaired Gait

Itak Choi, Ilseung Park, Eni Halilaj, Inseung Kang

Comments 9 pages, 7 figures