arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2509.14225 2026-05-08 cs.LG stat.ML

Defending Diffusion Models Against Membership Inference Attacks via Higher-Order Langevin Dynamics

Benjamin Sterling, Yousef El-Laham, Mónica F. Bugallo

Comments 11 pages, 4 figures

详情

英文摘要

Recent advances in generative artificial intelligence applications have raised new data security concerns. This paper focuses on defending diffusion models against membership inference attacks. This type of attack occurs when the attacker can determine if a certain data point was used to train the model. Although diffusion models are intrinsically more resistant to membership inference attacks than other generative models, they are still susceptible. The defense proposed here utilizes critically-damped higher-order Langevin dynamics, which introduces several auxiliary variables and a joint diffusion process along these variables. The idea is that the presence of auxiliary variables mixes external randomness that helps to corrupt sensitive input data earlier on in the diffusion process. This concept is theoretically investigated and validated on a toy dataset and a speech dataset using the Area Under the Receiver Operating Characteristic (AUROC) curves and the FID metric.

URL PDF HTML ☆

赞 0 踩 0

2508.09193 2026-05-08 cs.LG cs.AI

Multi-Objective Instruction-Aware Representation Learning in Procedural Content Generation RL

Sung-Hyun Kim, Geum-Hwan Hwang, In-Chang Baek, Seo-Young Lee, Kyung-Joong Kim

Comments 9 pages, 4 figures

2508.06412 2026-05-08 cs.LG cs.CL

Sample-efficient LLM Optimization with Reset Replay

Zichuan Liu, Jinyu Wang, Lei Song, Jiang Bian

2507.00480 2026-05-08 cs.LG stat.ML

Posterior Inference in Latent Space for Scalable Constrained Black-box Optimization

Kiyoung Om, Kyuil Sim, Taeyoung Yun, Hyeongyu Kang, Jinkyoo Park

Comments 25 pages, 14 figures, 6 tables. Equal contribution by Kiyoung Om, Kyuil Sim, and Taeyoung Yun

2506.20616 2026-05-08 cs.CV

Shape2Animal: Creative Animal Generation from Natural Silhouettes

Quoc-Duy Tran, Anh-Tuan Vo, Dinh-Khoi Vo, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

2506.14123 2026-05-08 cs.CL cs.FL cs.LG

Sampling from Your Language Model One Byte at a Time

Jonathan Hayase, Alisa Liu, Noah A. Smith, Sewoong Oh

Comments 28 pages, 9 figures

2506.11563 2026-05-08 cs.LG cs.AI

A Survey of Personalized Federated Foundation Models for Privacy-Preserving Recommendation

Zhiwei Li, Guodong Long, Chunxu Zhang, Honglei Zhang, Jing Jiang, Chengqi Zhang

Comments 10 pages, 6 figures, conference, position paper

2506.06816 2026-05-08 cs.CL cs.CY cs.HC

How do datasets, developers, and models affect biases in a low-resourced language?: The Case of the Bengali Language

Dipto Das, Shion Guha, Bryan Semaan

2506.01665 2026-05-08 cs.LG cs.AI cs.RO

Leveraging Analytic Gradients in Provably Safe Reinforcement Learning

Tim Walter, Hannah Markgraf, Jonathan Külz, Matthias Althoff

Comments 21 pages, 10 figures

2505.21938 2026-05-08 cs.LG cs.AI cs.CR

Practical Adversarial Attacks on Stochastic Bandits via Fake Data Injection

Qirun Zeng, Eric He, Richard Hoffmann, Xuchuang Wang, Jinhang Zuo

2505.20825 2026-05-08 cs.CL

Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation

Yuhao Wang, Ruiyang Ren, Yucheng Wang, Wayne Xin Zhao, Jing Liu, Hua Wu, Haifeng Wang

2505.20628 2026-05-08 cs.LG math.OC

Position: Adopt Constraints Over Fixed Penalties in Deep Learning

Juan Ramirez, Meraj Hashemizadeh, Simon Lacoste-Julien

Comments Code available at https://github.com/merajhashemi/constraints-vs-penalties

2505.18875 2026-05-08 cs.CV

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

Shuo Yang, Haocheng Xi, Yilong Zhao, Muyang Li, Jintao Zhang, Han Cai, Yujun Lin, Xiuyu Li, Chenfeng Xu, Jianfei Chen, Song Han, Kurt Keutzer, Ion Stoica

2505.18842 2026-05-08 cs.CL cs.CV

v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning

Jiwan Chung, Junhyeok Kim, Siyeol Kim, Jaeyoung Lee, Min Soo Kim, Youngjae Yu

2505.16791 2026-05-08 cs.LG cs.AI

Cohort-Based Active Modality Acquisition

Tillmann Rheude, Roland Eils, Benjamin Wild

2505.16516 2026-05-08 cs.LG cs.AI

Amortized Linear-time Exact Shapley Value for Product-Kernel Methods

Majid Mohammadi, Siu Lun Chau, Krikamol Muandet

2505.13674 2026-05-08 cs.RO

Risk-Averse Traversal of Graphs with Stochastic and Correlated Edge Costs for Safe Global Planetary Mobility

Olivier Lamarre, Jonathan Kelly

Comments Published in the Autonomous Robots journal

详情

DOI: 10.1007/s10514-025-10240-5
Journal ref: Autonomous Robots (AURO), Vol. 50, No. 2, Mar. 2026

英文摘要

In robotic planetary surface exploration, strategic mobility planning is an important task that involves finding candidate long-distance routes on orbital maps and identifying segments with uncertain traversability. Then, expert human operators establish safe, adaptive traverse plans based on the actual navigation difficulties encountered in these uncertain areas. In this paper, we formalize this challenge as a new, risk-averse variant of the Canadian Traveller Problem (CTP) tailored to global planetary mobility. The objective is to find a traverse policy minimizing a conditional value-at-risk (CVaR) criterion, which is a risk measure with an intuitive interpretation. We propose a novel search algorithm that finds exact CVaR-optimal policies. Our approach leverages well-established optimal AND-OR search techniques intended for (risk-agnostic) expectation minimization and extends these methods to the risk-averse domain. We validate our approach through simulated long-distance planetary surface traverses; we employ real orbital maps of the Martian surface to construct problem instances and use terrain maps to express traversal probabilities in uncertain regions. Our results illustrate different adaptive decision-making schemes depending on the level of risk aversion. Additionally, our problem setup allows accounting for traversability correlations between similar areas of the environment. In such a case, we empirically demonstrate how information-seeking detours can mitigate risk.

URL PDF HTML ☆

赞 0 踩 0

2504.19455 2026-05-08 cs.CV

Masked Language Prompting for Generative Data Augmentation in Few-shot Fashion Style Recognition

Yuki Hirakawa, Ryotaro Shimizu

2502.20650 2026-05-08 cs.CV cs.CR

Gungnir: Exploiting Stylistic Features in Images for Backdoor Attacks on Diffusion Models

Lei Zhang, Yu Pan, Bingrong Dai, Lin Wang

2502.18816 2026-05-08 cs.CV

Grad-ECLIP: Gradient-based Visual and Textual Explanations for CLIP

Chenyang Zhao, Kun Wang, Janet H. Hsiao, Antoni B. Chan

2502.16022 2026-05-08 cs.CL

Enhancing LLMs for Identifying and Prioritizing Important Medical Jargons from Electronic Health Record Notes Utilizing Data Augmentation

Won Seok Jang, Sharmin Sultana, Zonghai Yao, Hieu Tran, Zhichao Yang, Sunjae Kwon, Hong Yu

Comments 21pages, 5 figures, 4 tables

2502.12717 2026-05-08 cs.LG math.CO math.RT

Learning the symmetric group: large from small

Max Petschack, Alexandr Garbali, Jan de Gier

Comments 15 pages, 8 figures

2501.02721 2026-05-08 cs.LG

Learning Stochastic Nonlinear Dynamics with Embedded Latent Transfer Operators

Naichang Ke, Ryogo Tanaka, Yoshinobu Kawahara

2412.15689 2026-05-08 cs.CV

DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization

Zihan Ding, Chi Jin, Difan Liu, Haitian Zheng, Krishna Kumar Singh, Qiang Zhang, Yan Kang, Zhe Lin, Yuchen Liu

2411.19182 2026-05-08 cs.CV cs.AI

SOWing Information: Cultivating Contextual Coherence with MLLMs in Image Generation

Yuhan Pei, Ruoyu Wang, Yongqi Yang, Ye Zhu, Olga Russakovsky, Yu Wu

Comments Project page: https://pyh-129.github.io/SOW/

2411.13549 2026-05-08 cs.CV

KFC-W: Generating 3D-Consistent Videos from Unposed Internet Photos

Gene Chou, Kai Zhang, Sai Bi, Hao Tan, Zexiang Xu, Fujun Luan, Bharath Hariharan, Noah Snavely

Comments project page: https://genechou.com/kfcw/

2409.00417 2026-05-08 cs.LG stat.ME

Learning linear acyclic causal model including Gaussian noise using ancestral relationships

Ming Cai, Penggang Gao, Hisayuki Hara

Comments 30 pages, 6 figures

2407.18128 2026-05-08 cs.CV eess.IV

Estimating Earthquake Magnitude in Sentinel-1 Imagery via Ranking

Daniele Rege Cambrin, Isaac Corley, Paolo Garza, Peyman Najafirad

Comments Accepted to ECML-PKDD 2024 MACLEAN Workshop

2404.02534 2026-05-08 cs.CL cs.AI

ANGOFA: Leveraging OFA Embedding Initialization and Synthetic Data for Angolan Language Model

Osvaldo Luamba Quinjica, David Ifeoluwa Adelani

Comments Accepted at AfricaNLP 2024

2401.04560 2026-05-08 cs.CV

Phase-shifted remote photoplethysmography for estimating heart rate and blood pressure from facial video

Gyutae Hwang, Sang Jun Lee

Comments 13 pages, 10 figures

详情

DOI: 10.1016/j.measurement.2026.121240
Journal ref: Measurement 2026

英文摘要

Human health can be critically affected by cardiovascular diseases, such as hypertension, arrhythmias, and stroke. Heart rate and blood pressure are important biometric information for the monitoring of cardiovascular system and early diagnosis of cardiovascular diseases. Existing methods for estimating the heart rate are based on electrocardiography and photoplethyomography, which require contacting the sensor to the skin surface. Moreover, catheter and cuff-based methods for measuring blood pressure cause inconvenience and have limited applicability. Therefore, in this thesis, we propose a vision-based method for estimating the heart rate and blood pressure. This thesis proposes a 2-stage deep learning framework consisting of a dual remote photoplethysmography network (DRP-Net) and bounded blood pressure network (BBP-Net). In the first stage, DRP-Net infers remote photoplethysmography (rPPG) signals for the acral and facial regions, and these phase-shifted rPPG signals are utilized to estimate the heart rate. In the second stage, BBP-Net integrates temporal features and analyzes phase discrepancy between the acral and facial rPPG signals to estimate SBP and DBP values. To improve the accuracy of estimating the heart rate, we employed a data augmentation method based on a frame interpolation model. Moreover, we designed BBP-Net to infer blood pressure within a predefined range by incorporating a scaled sigmoid function. Our method resulted in estimating the heart rate with the mean absolute error (MAE) of 1.78 BPM, reducing the MAE by 34.31 % compared to the recent method, on the MMSE-HR dataset. The MAE for estimating the systolic blood pressure (SBP) and diastolic blood pressure (DBP) were 10.19 mmHg and 7.09 mmHg. On the V4V dataset, the MAE for the heart rate, SBP, and DBP were 3.83 BPM, 13.64 mmHg, and 9.4 mmHg, respectively.

URL PDF HTML ☆

赞 0 踩 0