arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.15347 2026-04-20 cs.HC cs.AI cs.IR cs.MA

SocialWise: LLM-Agentic Conversation Therapy for Individuals with Autism Spectrum Disorder to Enhance Communication Skills

Albert Tang

详情

DOI: 10.1145/3716553.3757091

英文摘要

Autism Spectrum Disorder (ASD) affects more than 75 million people worldwide. However, scalable support for practicing everyday conversation is scarce: Low-cost activities such as story reading yield limited improvement. At the same time, effective role-play therapy demands expensive, in-person sessions with specialists. SocialWise bridges this gap through a browser-based application that pairs LLM conversational agents with a therapeutic retrieval augmented generation (RAG) knowledge base. Users select a scenario (e.g., ordering food, joining a group), interact by text or voice, and receive instant, structured feedback on tone, engagement, and alternative phrasing. The SocialWise prototype, implemented with Streamlit, LangChain, and ChromaDB, runs on any computer with internet access, and demonstrates how recent advances in LLM can provide evidence-based, on-demand communication coaching for individuals with ASD.

URL PDF HTML ☆

赞 0 踩 0

2604.15344 2026-04-20 cs.HC cs.AI cs.IR cs.LG

To LLM, or Not to LLM: How Designers and Developers Navigate LLMs as Tools or Teammates

Varad Vishwarupe, Ivan Flechais, Nigel Shadbolt, Marina Jirotka

Comments 6 pages, 2 figures, 1 table

2604.15341 2026-04-20 cs.HC cs.AI

MRGEN: A Conceptual Framework for LLM-Powered Mixed Reality Authoring Tools for Education

Mohammed Oussama Seddini, Mohamed Ez-Zaouia, Ngoc Luyen Le, Iza Marfisi

2604.15339 2026-04-20 cs.HC cs.AI cs.RO

Uncertainty, Vagueness, and Ambiguity in Human-Robot Interaction: Why Conceptualization Matters

Xiaowen Sun, Cornelius Weber, Matthias Kerzel, Josua Spisak, Stefan Wermter

Comments Accepted to InterAI@HRI'26

2604.15336 2026-04-20 cs.HC cs.AI

Facial-Expression-Aware Prompting for Empathetic LLM Tutoring

Shuangquan Feng, Laura Fleig, Ruisen Tu, Philip Chi, Edmund Bu, Melinda Ozel, Junhua Ma, Teng Fei, Virginia R. de Sa

详情

英文摘要

Large language models (LLMs) enable increasingly capable tutoring-style conversational agents, yet effective tutoring requires sensitivity to learners' affective and cognitive states beyond text alone. Facial expressions provide immediate and practical cues of confusion, frustration, or engagement, but remain underexplored in LLM-driven tutoring. We investigate whether facial-expression-aware signals can improve empathetic tutoring responses through prompt-level integration, without end-to-end retraining. We build a scalable simulated tutoring environment where a student agent exhibits diverse facial behaviors from a large unlabeled facial expression video dataset, and compare four tutor variants: a text-only LLM baseline, a multimodal baseline using a random facial frame, and two Action Unit estimation model (AUM)-based methods that either inject textual AU descriptions or select a peak-expression frame for visual grounding. Across 960 multi-turn conversations spanning three tutor backbones (GPT-5.1, Claude Ops 4.5, and Gemini 2.5 Pro), we evaluate targeted pairwise comparisons with five human raters and an exhaustive AI evaluator. AU-based conditioning consistently improves empathetic responsiveness to facial expressions across all tutor backbones, while AUM-guided peak-frame selection outperforms random-frame visual input. Textual AU abstraction and peak-frame visual injection show model-dependent advantages. Control analyses show that this improvement does not come at the expense of worse pedagogical clarity or responsiveness to textual cues. Finally, AI-human agreement is highest on facial-expression-grounded empathy, supporting scalable AI evaluation for this dimension. Overall, our results show that lightweight, structured facial expression representations can meaningfully enhance empathy in LLM-based tutoring systems with minimal overhead.

URL PDF HTML ☆

赞 0 踩 0

2604.15335 2026-04-20 cs.HC cs.AI

A Comparative Study on the Impact of Traditional Learning and Interactive Learning on Students' Academic Performance and Emotional Well-Being

Siva Raja Sindiramutty

Comments 29 pages, 5 figures,

2604.15334 2026-04-20 cs.HC cs.AI

Beyond Passive Viewing: A Pilot Study of a Hybrid Learning Platform Augmenting Video Lectures with Conversational AI

Mohammed Abraar, Raj Abhijit Dandekar, Rajat Dandekar, Sreedath Panat

2604.15332 2026-04-20 cs.HC cs.AI cs.CV cs.SE

Automating Crash Diagram Generation Using Vision-Language Models: A Case Study on Multi-Lane Roundabouts

Xiao Lu, Hao Zhen, Jidong J. Yang

Comments 16 pages, 5 figures, 3 tables

2604.15331 2026-04-20 cs.HC cs.AI cs.CY

How people use Copilot for Health

Beatriz Costa-Gomes, Pavel Tolmachev, Eloise Taysom, Viknesh Sounderajah, Hannah Richardson, Philipp Schoenegger, Xiaoxuan Liu, Matthew M Nour, Seth Spielman, Samuel F. Way, Yash Shah, Michael Bhaskar, Harsha Nori, Christopher Kelly, Peter Hames, Bay Gross, Mustafa Suleyman, Dominic King

Comments 12 pages, 7 figures

2604.15329 2026-04-20 cs.HC cs.AI cs.CL

Evaluating LLMs as Human Surrogates in Controlled Experiments

Adnan Hoq, Tim Weninger

2604.15327 2026-04-20 cs.HC cs.AI

Eco-Bee: A Personalised Multi-Modal Agent for Advancing Student Climate Awareness and Sustainable Behaviour in Campus Ecosystems

Caleb Adu, Neil Kapadia, Binhe Liu, Jonathan Randall, Sruthi Viswanathan

2604.15325 2026-04-20 cs.HC cs.ET cs.RO

NEFFY 2.0: A Breathing Companion Robot: User-Centered Design and Findings from a Study with Ukrainian Refugees

Ilona Buchem, Jessica Kazubski, Charly Goerke

Comments 5 pages, 1 figure, 1st ACM/IEEE International Conference on Human-Robot Interaction

详情

DOI: 10.1145/3776734.3794450

英文摘要

This paper presents the design of NEFFY 2.0, a social robot designed as a haptic slow-paced breathing companion for stress reduction, and reports findings from a mixed-methods user study with 14 refugees from Ukraine. Developed through a user-centered design process, NEFFY 2.0 builds on NEFFY 1.0 and integrates embodiment and multi-sensory interaction to provide low-threshold, accessible guidance of slow-paced breathing for stress relief, which may be particularly valuable for individuals experiencing prolonged periods of anxiety. To evaluate effectiveness, an experimental comparison of a robot-assisted breathing intervention versus an audio-only condition was conducted. Measures included subjective ratings and physiological indicators, such as heart rate (HR), heart rate variability (HRV) using RMSSD parameter, respiratory rate (RR), and galvanic skin response (GSR), alongside qualitative data from interviews exploring user experience and perceived support. Qualitative findings showed that NEFFY 2.0 was perceived as intuitive, calming and supportive. Survey results showed a substantially larger effect in significant reduction of perceived stress in the NEFFY 2.0 condition compared to audio-only. Physiological data reveled mixed results combined with large inter-personal variability. Three patterns of breathing practice with NEFFY 2.0 were identified using k-means clustering. Despite the small sample size, this study makes a novel contribution by providing empirical evidence of stress reduction in a vulnerable population through a direct comparison of robot-assisted and non-robot conditions. The findings position NEFFY 2.0 as a promising low-threshold tool that supports stress relief and contributes to the vision of HRI empowering society.

URL PDF HTML ☆

赞 0 踩 0

2604.15324 2026-04-20 cs.HC cs.AI cs.CY

Struggle Premium : How Human Effort and Imperfection Drive Perceived Value in the Age of AI

Nazneen Sultana, Mst Rafia Islam, Md. Tanvir Hossain, Azmine Toushik Wasi

Comments Short Paper. In Review. 12 Pages

2604.15322 2026-04-20 cs.HC cs.CL cs.LG

Acoustic and Facial Markers of Perceived Conversational Success in Spontaneous Speech

Thanushi Withanage, Elizabeth Redcay, Carol Espy-Wilson

Comments Accepted for presentation at ICASSP 2026

2604.15316 2026-04-20 cs.HC cs.AI

Anthropomorphism and Trust in Human-Large Language Model interactions

Akila Kadambi, Ylenia D'Elia, Tanishka Shah, Iulia Comsa, Alison Lentz, Katie Siri-Ngammuang, Tara Buechler, Jonas Kaplan, Antonio Damasio, Srini Narayanan, Lisa Aziz-Zadeh

2604.15314 2026-04-20 cs.HC cs.AI

Modeling of ASD/TD Children's Behaviors in Interaction with a Virtual Social Robot During a Music Education Program Using Deep Neural Networks

Armin Tandiseh, Morteza Memari, Alireza Taheri

Comments 22 pages, 5 figures

2604.15214 2026-04-20 quant-ph cs.LG

Optimal algorithmic complexity of inference in quantum kernel methods

Elies Gil-Fuster, Seongwook Shin, Sofiene Jerbi, Jens Eisert, Maximilian J. Kramer

Comments 26 pages (13+13), 4 figures, comments welcome

详情

英文摘要

Quantum kernel methods are among the leading candidates for achieving quantum advantage in supervised learning. A key bottleneck is the cost of inference: evaluating a trained model on new data requires estimating a weighted sum $\sum_{i=1}^N α_i k(x,x_i)$ of $N$ kernel values to additive precision $\varepsilon$, where $α$ is the vector of trained coefficients. The standard approach estimates each term independently via sampling, yielding a query complexity of $O(N\lVertα\rVert_2^2/\varepsilon^2)$. In this work, we identify two independent axes for improvement: (1) How individual kernel values are estimated (sampling versus quantum amplitude estimation), and (2) how the sum is approximated (term-by-term versus via a single observable), and systematically analyze all combinations thereof. The query-optimal combination, encoding the full inference sum as the expectation value of a single observable and applying quantum amplitude estimation, achieves a query complexity of $O(\lVertα\rVert_1/\varepsilon)$, removing the dependence on $N$ from the query count and yielding a quadratic improvement in both $\lVertα\rVert_1$ and $\varepsilon$. We prove a matching lower bound of $Ω(\lVertα\rVert_1/\varepsilon)$, establishing query-optimality of our approach up to logarithmic factors. Beyond query complexity, we also analyze how these improvements translate into gate costs and show that the query-optimal strategy is not always optimal in practice from the perspective of gate complexity. Our results provide both a query-optimal algorithm and a practically optimal choice of strategy depending on hardware capabilities, along with a complete landscape of intermediate methods to guide practitioners. All algorithms require only amplitude estimation as a subroutine and are thus natural candidates for early-fault-tolerant implementations.

URL PDF HTML ☆

赞 0 踩 0

2604.14460 2026-04-20 cs.HC cs.LG

Bias in Surface Electromyography Features across a Demographically Diverse Cohort

Aditi Agrawal, Celine John Philip, Giancarlo K. Sagastume, Marcus A. Battraw, Wilsaan M. Joiner, Jonathon S. Schofield, Lee M. Miller, Richard S. Whittle

Comments 17 pages, 4 Figures

2604.14334 2026-04-20 q-bio.QM cs.AI

Mamba-SSM with LLM Reasoning for Feature Selection: Faithfulness-Aware Biomarker Discovery

Pushpa Kumar Balan, Aijing Feng

Comments 9 pages, 4 figures. Accepted at ICLR 2026 Workshop on Logical Reasoning of Large Language Models

2604.14309 2026-04-20 cs.IT cs.AI eess.SP math.IT

Aerial Multi-Functional RIS in Fluid Antennas-Aided Full-Duplex Networks: A Self-Optimized Hybrid Deep Reinforcement Learning Approach

Li-Hsiang Shen, Yu-Quan Zheng

详情

英文摘要

To address high data traffic demands of sixth-generation (6G) networks, this paper proposes a novel architecture that integrates autonomous aerial vehicles (AAVs) and multi-functional reconfigurable intelligent surfaces (MF-RISs) as AM-RIS in fluid antenna (FA)-assisted full-duplex (FD) networks. The AM-RIS provides hybrid functionalities, including signal reflection, amplification, and energy harvesting (EH), potentially improving both signal coverage and sustainability. Meanwhile, FA facilitates fine-grained spatial adaptability at FD-enabled base station (BS), which complements residual self-interference (SI) suppression. We aim at maximizing the overall energy efficiency (EE) by jointly optimizing transmit DL beamforming at BS, UL user power, configuration of AM-RIS, and positions of the FA and AM-RIS. Owing to the hybrid continuous-discrete parameters and high dimensionality of the intractable problem, we have conceived a self-optimized multi-agent hybrid deep reinforcement learning (DRL) framework (SOHRL), which integrates multi-agent deep Q-networks (DQN) and multi-agent proximal policy optimization (PPO), respectively handling discrete and continuous actions. To enhance self-adaptability, an attention-driven state representation and meta-level hyperparameter optimization are incorporated, enabling multi-agents to autonomously adjust learning hyperparameters. Simulation results validate the effectiveness of the proposed AM-RIS-enabled FA-aided FD networks empowered by SOHRL algorithm. The results reveal that SOHRL outperforms benchmarks of the case without attention mechanism and conventional hybrid/multi-agent/standalone DRL. Moreover, AM-RIS in FD achieves the highest EE compared to half-duplex, conventional rigid antenna arrays, partial EH, and conventional RIS without amplification, highlighting its potential as a compelling solution for EE-aware wireless networks.

URL PDF HTML ☆

赞 0 踩 0

2604.10577 2026-04-20 cs.CR cs.AI

The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents

Xuwei Ding, Skylar Zhai, Linxin Song, Jiate Li, Taiwei Shi, Nicholas Meade, Siva Reddy, Jian Kang, Jieyu Zhao

Comments 63 pages

2604.10126 2026-04-20 cs.SE cs.AI

MR-Coupler: Automated Metamorphic Test Generation via Functional Coupling Analysis

Congying Xu, Hengcheng Zhu, Songqiang Chen, Jiarong Wu, Valerio Terragni, Shing-Chi Cheung

Comments Note: Accepted by ACM International Conference on the Foundations of Software Engineering (FSE) 2026

2603.21735 2026-04-20 cs.HC cs.AI

Cognitive Agency Surrender: Defending Epistemic Sovereignty via Scaffolded AI Friction

Kuangzhe Xu, Yu Shen, Longjie Yan, Yinghui Ren

Comments 26 pages, 4 figure (one in appendix). This is a preprint of a perspective article

2603.19339 2026-04-20 cs.IR cs.AI cs.CL

Spectral Tempering for Embedding Compression in Dense Passage Retrieval

Yongkang Li, Panagiotis Eustratiadis, Evangelos Kanoulas

Comments This paper has been accepted as a short paper at SIGIR 2026

2602.13088 2026-04-20 cs.CY cs.AI

Puppets or partners? Governing cyborg propaganda in the digital public square

Jonas R. Kunst, Kinga Bierwiaczonek, Meeyoung Cha, Omid V. Ebrahimi, Marc Fawcett-Atkinson, Asbjørn Følstad, Anton Gollwitzer, Nils Köbis, Gary Marcus, Jon Roozenbeek, Daniel Thilo Schroeder, Jay J. Van Bavel, Sander van der Linden, Rory White, Live Leonhardsen Wilhelmsen

Comments 38 pages

2602.06105 2026-04-20 stat.ML cs.LG math.AG

Robustness Verification of Polynomial Neural Networks

Yulia Alexandr, Hao Duan, Guido Montúfar

2602.00052 2026-04-20 cs.IR cs.AI cs.CL cs.LG

AI-assisted Protocol Information Extraction For Improved Accuracy and Efficiency in Clinical Trial Workflows

Ramtin Babaeipour, François Charest, Madison Wright

Comments Updated to accepted manuscript. Published in Journal of Biomedical Informatics, Volume 179, July 2026, 105036

2512.05717 2026-04-20 physics.chem-ph cond-mat.mtrl-sci cs.LG

Comparing the latent features of universal machine-learning interatomic potentials

Sofiia Chorna, Davide Tisi, Cesare Malosso, Wei Bin How, Michele Ceriotti, Sanggyu Chong

2510.24058 2026-04-20 eess.SP cs.AI cs.LG

PULSE: Privileged Knowledge Transfer from Rich to Deployable Sensors for Embodied Multi-Sensory Learning

Zihan Zhao, Kaushik Pendiyala, Masood Mortazavi, Ning Yan

Comments v2: Accepted at the CVPR 2026 Workshop on Sense of Space. 8 pages main content + references + appendix

2510.09689 2026-04-20 cs.CR cs.AI

When Search Goes Wrong: Red-Teaming Web-Augmented Large Language Models

Haoran Ou, Kangjie Chen, Xingshuo Han, Gelei Deng, Jie Zhang, Han Qiu, Tianwei Zhang