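arXivDaily: a daily academic digest synced with the full arXiv feed, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and more.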

2605.23069 2026-05-25 cs.CL

DFKI-MLT at SemEval-2026 TASK 7: Steering Multilingual Models Towards Cultural Knowledge

Yusser Al Ghussin, Daniil Gurgurov, Yasser Hamidullah, Josef van Genabith, Cristina España-Bonet, Simon Ostermann

Affiliations: German Research Center for Artificial Intelligence (DFKI GmbH); Saarland Informatics Campus; Barcelona Supercomputing Center (BSC-CNS)

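AI summary: This study addresses the gaps in cultural knowledge of multilingual large language models by proposing an activation-steering approach that extracts language vectors from the parallel FLORES corpus and adapts multilingual models at inference time. The team entered both the multiple-choice and short-answer tracks of SemEval-2026 Task 7, achieving 86.96% accuracy and 7th place in the multiple-choice track. The analysis shows that the effect of activation steering varies across languages and layers, suggesting that prompt design and activation steering should be optimized together for culturally aware tasks.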

Comments Accepted to The 20th International Workshop on Semantic Evaluation at ACL 2026

Abstract

Large language models (LLMs) are increasingly used across diverse linguistic and cultural contexts, yet their cultural knowledge remains uneven across regions and languages. We present the DFKI-MLT system for SemEval-2026 Task 7 on cultural awareness, where we apply activation steering to multilingual LLMs using language vectors extracted from parallel FLORES data. Our method performs inference-time adaptation by adding language-specific steering vectors to the residual stream at a selected transformer layer, without any parameter updates. We participated in both the short-answer (SAQ) and multiple-choice (MCQ) tracks; however, only our MCQ submission received an official score. In the official MCQ track, we achieved 86.96% accuracy, ranking 7th out of 17 teams. To better understand system behavior, we conduct post-hoc analyses on the shared-task MCQ and SAQ settings. These analyses show that activation steering yields modest and heterogeneous improvements on cultural reasoning: gains are strongly layer-sensitive, vary substantially across language-region pairs, with some configurations even degrading performance, and interact with prompt formulation, comparing generic and culturally conditioned prompts. Our findings suggest that prompt design and activation steering should be jointly optimized for culturally aware multilingual inference.
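
The inference-time adaptation described in this abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the DFKI-MLT implementation: it assumes a language vector is the difference between the mean hidden states of target-language and source-language sentences from a parallel corpus, and that steering adds a scaled copy of that vector to one layer's hidden state. The function names and the `alpha` strength parameter are hypothetical.

```python
from typing import List

Vector = List[float]


def mean_vector(rows: List[Vector]) -> Vector:
    """Average a list of equal-length activation vectors, dimension by dimension."""
    return [sum(column) / len(rows) for column in zip(*rows)]


def language_vector(target_acts: List[Vector], source_acts: List[Vector]) -> Vector:
    """Assumed extraction rule: mean target-language activation minus mean source-language activation."""
    target_mean = mean_vector(target_acts)
    source_mean = mean_vector(source_acts)
    return [t - s for t, s in zip(target_mean, source_mean)]


def steer(hidden: Vector, vector: Vector, alpha: float = 1.0) -> Vector:
    """Add the scaled steering vector to a residual-stream state; no model weights change."""
    return [h + alpha * v for h, v in zip(hidden, vector)]


# Toy activations standing in for hidden states collected at one layer.
target = [[1.0, 2.0], [3.0, 4.0]]
source = [[0.0, 0.0], [2.0, 2.0]]
vec = language_vector(target, source)
steered = steer([0.5, 0.5], vec, alpha=2.0)
```

In a real model the same addition would be applied inside the forward pass at the chosen layer; the abstract's finding that gains are strongly layer-sensitive suggests both the layer and the strength need tuning per language-region pair.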

2605.23068 2026-05-25 cs.CV

Millimeter-wave Imaging for Anthropometric Body Measurement

Miriam Senne, Benjamin D. Killeen, Christoph Baur, Nassir Navab, Azade Farshad

Affiliations: Chair for Computer Aided Medical Procedures; Technical University of Munich; Rohde & Schwarz GmbH & Co. KG; Munich Center for Machine Learning; ELLIS Unit Helsinki, Dept. Computer Science, Aalto University

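AI summary: This study proposes a contactless method for measuring human body shape with millimeter-wave radar, aiming to overcome the privacy, efficiency, and applicability limitations of conventional measurement tools. Using an optimization-based framework, the method recovers 3D human shape from millimeter-wave point clouds and extracts a comprehensive set of anthropometric measurements. Its core contribution is a vertex-weighting strategy that, combined with a parametric body model (SMPL), gives robust surface alignment and noise suppression, enabling a fast, privacy-preserving measurement workflow that requires neither undressing nor cameras and suits clinical risk assessment across diverse populations.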

Abstract

Body shape and circumferences are clinically informative biomarkers for risk stratification, including measures such as waist to hip ratio, limb and trunk girths, yet conventional tools such as manual tape measures and optical scanners often require undressing and sustained poses. These demands slow workflows, compromise dignity, and exclude many older adults and people with limited mobility. To make measurement fast and contactless, we leverage millimeter-wave (mmWave) radar, which preserves privacy and operates through typical clothing, enabling quick full-body acquisition. In this work, we present a new optimization-based framework to recover 3D human shape and extract a comprehensive set of anthropometric measurements from volumetric mmWave data. Our method introduces a weighted registration pipeline that fits a parametric body model (SMPL) directly to the noisy mmWave point cloud. The core of our contribution is a vertex-weighting strategy that modulates a Chamfer energy function for reliable surface alignment and noise elimination. We further stabilize the fit by incorporating a foot-ground plane constraint and pose priors, optimizing directly for the SMPL parameters. Together, these components enable a fast, privacy preserving workflow that delivers high fidelity body shape and measurements through clothing without cameras or disrobing and with minimal cooperation, supporting frequent risk oriented assessments in clinics and care facilities for patients of all ages and mobility levels.
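
A vertex-weighted Chamfer energy of the kind mentioned in this abstract might look like the sketch below. The abstract does not give the exact form, so this is an assumption: per-vertex weights scale the model-to-cloud term (so unreliable vertices pull less on the fit), while the cloud-to-model term is left unweighted. All names are hypothetical, and the brute-force nearest-neighbour search is for illustration only.

```python
from typing import List, Sequence

Point = Sequence[float]


def sq_dist(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def weighted_chamfer(vertices: List[Point], weights: List[float], cloud: List[Point]) -> float:
    """Two-sided Chamfer energy with per-vertex weights on the model-to-cloud term."""
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("at least one vertex weight must be positive")
    # Each model vertex is pulled toward its nearest cloud point, scaled by its weight.
    model_to_cloud = sum(
        w * min(sq_dist(v, p) for p in cloud) for v, w in zip(vertices, weights)
    ) / total_weight
    # Each cloud point is pulled toward its nearest model vertex (unweighted here).
    cloud_to_model = sum(min(sq_dist(p, v) for v in vertices) for p in cloud) / len(cloud)
    return model_to_cloud + cloud_to_model


vertices = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
cloud = [(0.0, 0.0, 0.0)]
uniform = weighted_chamfer(vertices, [1.0, 1.0], cloud)
downweighted = weighted_chamfer(vertices, [1.0, 0.0], cloud)
```

Setting the second vertex's weight to zero removes its contribution, which is one simple way a weighting scheme can keep poorly observed regions from distorting the alignment.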

2605.23061 2026-05-25 cs.LG cs.AI math.OC stat.ML

The TIME Machine: On The Power of Motion for Efficient Perception

Mantas Skackauskas, Xinyue Hao, Laura Sevilla-Lara

Affiliations: School of Informatics, University of Edinburgh

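AI summary: This paper proposes a video representation learning method that treats motion as the central modality, aiming to address the limitations of existing video models in temporal understanding and training cost. By representing the motion in a video as point tracks and training a masked autoencoder in a self-supervised manner, the model learns more efficient and finer-grained video representations. The method needs no language annotations, greatly reduces training data requirements, and performs on par with current state-of-the-art models across multiple tasks, pointing toward more efficient and more temporally aware video models.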

Abstract

Video representation learning has seen tremendous progress in recent years. This has been driven by many factors, including the scale of training and the success of visual models trained contrastively with language. While these factors have pushed the boundaries of what video models can do, they also introduce their own set of limitations: first, scaling video models can reach prohibitive costs and second, learning from language restricts the range of concepts that can be learned to those in captions. As a result, video models still struggle with temporal understanding. In this paper we propose a novel approach that uses motion as the central modality for video representation. In particular, given the motion in a video in the form of point-tracks, we use a masked-autoencoder to mask some of the tracks and train the autoencoder to reconstruct the missing tracks. This allows us to learn a representation in a self-supervised manner. We show that using motion to represent videos actually addresses both of the core limitations of video technology. First, it allows us to massively reduce the scale of training data, as motion is inherently appearance-independent and hence needs fewer examples to generalize well. Second, motion allows us to bypass the language-dependent training paradigm, learning better fine-grained concepts. The result is an embedding that we call TIME (Temporally Informed Motion Embedding), a representation trained exclusively on synthetic motion data. We test this embedding on a wide set of tasks in a zero-shot manner. We observe that without bells and whistles, performance is on par with state-of-the-art models using up to 4 orders of magnitude less training data. This is a stepping stone towards a new paradigm of video models that are both more temporally aware as well as more scalable.
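
The masking-and-reconstruction objective described in this abstract can be illustrated without a neural network. The sketch below only shows the bookkeeping: which tracks are hidden, and how a reconstruction loss restricted to the hidden tracks is computed. The mask ratio, the squared-error loss, and the function names are assumptions for illustration; the abstract does not specify them.

```python
import random
from typing import List, Tuple

Track = List[Tuple[float, float]]  # one (x, y) position per frame


def split_tracks(num_tracks: int, mask_ratio: float, seed: int = 0) -> Tuple[List[int], List[int]]:
    """Randomly choose which track indices stay visible and which are masked."""
    rng = random.Random(seed)
    indices = list(range(num_tracks))
    rng.shuffle(indices)
    num_masked = int(round(mask_ratio * num_tracks))
    return sorted(indices[num_masked:]), sorted(indices[:num_masked])


def masked_reconstruction_loss(predicted: List[Track], target: List[Track], masked: List[int]) -> float:
    """Mean squared position error, computed only over the masked tracks."""
    total, count = 0.0, 0
    for i in masked:
        for (px, py), (tx, ty) in zip(predicted[i], target[i]):
            total += (px - tx) ** 2 + (py - ty) ** 2
            count += 1
    return total / count if count else 0.0


visible, masked = split_tracks(num_tracks=4, mask_ratio=0.5, seed=0)
target = [[(0.0, 0.0), (1.0, 1.0)] for _ in range(4)]
predicted = [[(0.0, 0.0), (1.0, 2.0)] for _ in range(4)]
loss = masked_reconstruction_loss(predicted, target, masked=[0])
```

An encoder would see only the `visible` tracks and a decoder would predict the `masked` ones; the loss ignores visible tracks so the model cannot score well by copying its input.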

2605.23043 2026-05-25 cs.CL stat.ML

HawkesLLM: Semantic Uncertainty Propagation in Agentic Text Simulation

Zewei Deng, Tinghan Ye, Liyan Xie

Affiliations: Department of Industrial and Systems Engineering, University of Minnesota; H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology

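AI summary: This paper proposes the HawkesLLM framework to address the accumulation of semantic uncertainty over time in agentic text-simulation systems. The method separates temporal influence modeling from text generation: a multivariate Hawkes process models the activation relationships between nodes, and a language model generates new content from a compact memory selected by the temporal model. Experiments on a GDELT news-cascade case study show that HawkesLLM improves late-stage semantic alignment under a limited prompt-memory budget.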

Comments 10 pages, 4 figures, Accepted at the ICML 2026 Workshop on Statistical Frameworks for Uncertainty in Agentic Systems

Abstract

Agentic text-simulation systems write in sequence, with each item becoming possible context for later steps. That makes uncertainty path-dependent: an early ambiguity can affect later outputs. This paper studies this problem with HawkesLLM, a framework that separates temporal influence modeling from text generation. We represent the cascade as a network whose nodes are text-generating agents. A multivariate Hawkes process models how these nodes activate over time and which earlier node outputs should influence later prompts. A language model then writes each new event from the compact memory selected by this temporal model. We evaluate the framework on a held-out Global Database of Events, Language, and Tone (GDELT) news-cascade case study. The diagnostics track semantic alignment with local held-out references and separate local drift from global drift. In this setting, HawkesLLM improves late-stage semantic alignment under a compact prompt-memory budget.
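
To make the temporal model concrete, the sketch below computes a multivariate Hawkes intensity and uses each past event's contribution to pick a compact memory for one node. It assumes an exponential kernel and a simple top-k selection rule; neither is stated in the abstract, and the names `mu`, `alpha`, `beta`, and `select_memory` are hypothetical.

```python
import math
from typing import List, Tuple

Event = Tuple[float, int]  # (timestamp, index of the node that produced the event)


def contribution(event: Event, node: int, t: float, alpha: List[List[float]], beta: float) -> float:
    """Excitation that one past event adds to `node` at time t (exponential kernel, assumed)."""
    t_k, source = event
    if t_k >= t:
        return 0.0
    return alpha[node][source] * beta * math.exp(-beta * (t - t_k))


def intensity(node: int, t: float, history: List[Event], mu: List[float],
              alpha: List[List[float]], beta: float) -> float:
    """Baseline rate plus the summed excitation from all earlier events."""
    return mu[node] + sum(contribution(e, node, t, alpha, beta) for e in history)


def select_memory(node: int, t: float, history: List[Event],
                  alpha: List[List[float]], beta: float, budget: int) -> List[int]:
    """Indices of the `budget` past events with the largest influence on `node` at time t."""
    ranked = sorted(range(len(history)),
                    key=lambda k: contribution(history[k], node, t, alpha, beta),
                    reverse=True)
    return sorted(ranked[:budget])


mu = [0.1, 0.2]
alpha = [[0.0, 0.5], [0.0, 0.0]]  # node 1 excites node 0; nothing else interacts
beta = 1.0
history = [(0.0, 1), (0.5, 1), (0.9, 0)]
rate = intensity(0, 1.0, history[:1], mu, alpha, beta)
memory = select_memory(0, 1.0, history, alpha, beta, budget=1)
```

The texts attached to the selected events would then form the prompt for the language model, which is how a fixed prompt-memory budget can still favor the most influential earlier outputs.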

2605.23040 2026-05-25 cs.LG

World Machine: Towards Generative World Modeling for Time-Series

Elton Cardoso do Nascimento, Alexandre da Silva Simões, Esther Luna Colombini, Ricardo Ribeiro Gudwin, Paula Dornhofer Paro Costa

Affiliations: Universidade Estadual de Campinas (UNICAMP); Universidade Estadual Paulista (UNESP)

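AI summary: This paper proposes World Machine, a generative world-modeling architecture for time-series data that aims at predictive understanding and controllable simulation of an environment. The architecture builds on the Transformer and introduces a latent-state mechanism that adapts to observations and contexts of varying length, improving compute and memory efficiency over a conventional Transformer. Experiments on the synthetic Toy1D dataset validate the feasibility of the approach and show its distinctive advantages over a conventional Transformer as well as the contribution of each training component.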

2605.23019 2026-05-25 cs.LG

CoMoGen: COntrollable MOtion Dynamics and Interactions with Mask-Guided Video GENeration

Adil Meric, Lin Geng Foo, Mert Kiray, Benjamin Busam, Rishabh Dabral, Christian Theobalt

Affiliations: Technical University of Munich; Max Planck Institute for Informatics, Saarland Informatics Campus; Munich Center for Machine Learning (MCML); Obsphera

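AI summary: This paper proposes CoMoGen, a controllable video generation framework that generates videos with realistic interactive dynamics conditioned on an input image and a binary mask sequence. The method introduces a lightweight MaskAdapter module that encodes the mask sequence into a residual signal and injects it into the Multi Modal Diffusion Transformer (MMDiT) through a cosine-weighted schedule. Fine-tuning only the MMDiT layers responsible for motion generation with Low-Rank Adaptation (LoRA) focuses the adaptation on motion-critical components and lowers computational cost. Experiments show that CoMoGen outperforms existing methods in motion fidelity and perceptual realism, reaching state-of-the-art performance.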

Abstract

We present CoMoGen, a controllable video generation framework that generates realistic interactive dynamics from a single binary mask sequence conditioned on an input image. CoMoGen introduces a lightweight MaskAdapter that encodes binary mask sequences into a latent residual signal, injected into the Multi Modal Diffusion Transformer (MMDiT) model through a cosine-weighted schedule. Unlike the hierarchical coarse-to-fine design of UNet architectures, MMDiT operates as a sequence of uniform transformer blocks, making it difficult to identify which layers are responsible for the motion generation. Therefore, we propose a novel way to determine "Motion Layers" operating in the attention space of MMDiT. We fine-tune the model by using Low-Rank Adaptation (LoRA) to the Motion Layers, without requiring any architecture change in the MMDiT. This selective adaptation enables our method to focus on motion-critical components, yielding reduced computational cost. Despite its simplicity, CoMoGen enables precise subject motion and plausible interactions with surrounding humans, objects, and scenes. Comprehensive experiments on different datasets show that CoMoGen consistently outperforms prior controllable video generation methods and achieves state-of-the-art performance in motion fidelity and perceptual realism. Project page: mericadil.github.io/CoMoGen.
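
One plausible reading of a cosine-weighted injection schedule is sketched below. The abstract does not say what the schedule is indexed by (layers or denoising steps) or in which direction it decays, so the `progress` variable in [0, 1] and the decay from 1 to 0 are assumptions, as are the function names.

```python
import math
from typing import List


def cosine_weight(progress: float) -> float:
    """Half-cosine weight: 1 at progress 0, falling smoothly to 0 at progress 1."""
    clipped = min(max(progress, 0.0), 1.0)
    return 0.5 * (1.0 + math.cos(math.pi * clipped))


def inject(hidden: List[float], residual: List[float], progress: float) -> List[float]:
    """Add the adapter's residual signal to the hidden state, scaled by the schedule."""
    weight = cosine_weight(progress)
    return [h + weight * r for h, r in zip(hidden, residual)]


start = cosine_weight(0.0)
middle = cosine_weight(0.5)
end = cosine_weight(1.0)
mixed = inject([1.0, 2.0], [10.0, 10.0], progress=0.5)
```

A smooth schedule like this lets the mask signal dominate where it is most useful and fade elsewhere, instead of being added at full strength everywhere.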

2605.22993 2026-05-25 cs.CL cs.AI

A Proactive Multi-Agent Dialogue Framework for Assessing Social Language Disorder Traits in Autism

Chuanbo Hu, Minglei Yin, Bin Liu, Wenqi Li, Lynn K. Paul, Shuo Wang, Xin Li

Affiliations: Department of Computer Science; University at Albany; Department of Management Information System; West Virginia University; Department of Radiology; Washington University in St. Louis; Humanities and Social Sciences

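AI summary: This study proposes TPA, a proactive multi-agent dialogue framework for assessing Social Language Disorder (SLD) traits in autism spectrum disorder. A doctor agent proactively selects targeted questioning strategies to systematically surface latent language-disorder traits in patient dialogue, improving diagnostic efficiency. Experiments show that TPA outperforms existing baselines on several key metrics and substantially raises SLD trait coverage and diagnostic efficiency, offering support for AI-assisted clinical screening.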

Abstract

Characteristic linguistic behaviors associated with Social Language Disorder (SLD) in autism spectrum disorder, including echoic repetition, pronoun displacement, and stereotyped media quoting, are largely absent from spontaneous conversation and only emerge under specific conversational conditions. In structured clinical assessments, this latency means that questioning strategy selection is a critical yet underappreciated determinant of how much diagnostic information a conversation yields. Whether large language models (LLMs) can be guided to proactively select questioning strategies that systematically surface these latent traits remains largely unexplored. Here we present TPA (Think, Plan, Ask), a proactive multi-agent dialogue framework applied to the language assessment component of the Autism Diagnostic Observation Schedule Module 4 (ADOS-2), in which a doctor agent explicitly reasons about which traits remain unobserved before selecting a clinically grounded strategy and generating a targeted question. A patient agent grounded in real ADOS-2 clinical data enables reproducible evaluation without real patient participation, validated across three independent experiments confirming adequate fidelity to real patient language. Evaluated on 484 episodes from 35 patients, TPA outperforms six competitive dialogue planning baselines across all primary metrics, achieving 82.1% SLD trait coverage, 16.6% higher than automated replay of real clinical dialogues conducted by trained clinicians (65.5%), with substantially greater per-turn diagnostic efficiency (AUCC: 0.628 vs. 0.458, absolute gain +0.170). These results demonstrate that proactive questioning strategy selection substantially improves the efficiency of automated SLD trait assessment, with direct implications for scalable AI-assisted clinical screening.
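
The reported numbers are easier to read with the arithmetic spelled out: the 16.6% coverage gain is a difference in percentage points (82.1 - 65.5), and the AUCC gain is 0.628 - 0.458 = 0.170. The sketch below shows one way a per-turn coverage curve and its normalized area could be computed. The abstract does not define AUCC, so treating it as the mean of the cumulative coverage curve is an assumption, and the trait labels are hypothetical shorthand.

```python
from typing import List, Set


def cumulative_coverage(turn_traits: List[Set[str]], target_traits: Set[str]) -> List[float]:
    """Fraction of target traits observed at least once, after each dialogue turn."""
    seen: Set[str] = set()
    curve: List[float] = []
    for traits in turn_traits:
        seen |= traits & target_traits
        curve.append(len(seen) / len(target_traits))
    return curve


def normalized_area(curve: List[float]) -> float:
    """Assumed AUCC-style summary: mean height of the cumulative coverage curve."""
    return sum(curve) / len(curve)


targets = {"echo", "pronoun", "quote", "other"}
turns = [{"echo"}, set(), {"pronoun", "quote"}, {"echo"}]
curve = cumulative_coverage(turns, targets)
area = normalized_area(curve)

# Gaps implied by the figures reported in the abstract.
coverage_gap_points = round(82.1 - 65.5, 1)
aucc_gap = round(0.628 - 0.458, 3)
```

Under this reading, a dialogue that surfaces traits early scores a larger area than one that reaches the same final coverage late, which is why a per-turn efficiency measure complements final coverage.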

2605.22991 2026-05-25 cs.RO

Worse than Random: The Importance of a Baseline for Unsupervised Feature Selection

Muhammad Rajabinasab, Michael E. Houle, Oussama Chelly, Arthur Zimek

Affiliations: University of Southern Denmark; New Jersey Institute of Technology; Oratio Technologies

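AI summary: This paper examines how unsupervised feature selection methods are evaluated, noting that most current methods are not compared against random feature selection, which makes their actual contribution hard to gauge. The authors argue that random feature selection should serve as the evaluation baseline and show experimentally that many state-of-the-art methods fall short of random selection in both performance and efficiency. The study therefore stresses that new unsupervised feature selection methods must be benchmarked against random selection to establish that they work and offer real improvement.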

Comments Preprint submitted to Elsevier Pattern Recognition Letters
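
A random-selection baseline of the kind the summary calls for is cheap to set up. The sketch below draws several random feature subsets of a fixed size and averages a user-supplied score over them, so a proposed method can be compared against that average. The function names, the number of repeats, and the `score_fn` interface are assumptions for illustration, not the authors' protocol.

```python
import random
from typing import Callable, List


def random_subsets(num_features: int, k: int, repeats: int, seed: int = 0) -> List[List[int]]:
    """Draw `repeats` random subsets of k distinct feature indices, reproducibly."""
    rng = random.Random(seed)
    return [sorted(rng.sample(range(num_features), k)) for _ in range(repeats)]


def random_baseline_score(score_fn: Callable[[List[int]], float], num_features: int,
                          k: int, repeats: int = 10, seed: int = 0) -> float:
    """Mean downstream score over random subsets; a selection method should beat this."""
    subsets = random_subsets(num_features, k, repeats, seed)
    return sum(score_fn(subset) for subset in subsets) / len(subsets)


subsets = random_subsets(num_features=20, k=5, repeats=3, seed=1)
# Placeholder score; in practice this would train and evaluate a downstream model.
baseline = random_baseline_score(lambda subset: float(len(subset)), num_features=20, k=5)
```

Averaging over several seeds matters because a single random subset can be lucky or unlucky; reporting the spread alongside the mean gives a fairer comparison.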

DFKI-MLT at SemEval-2026 TASK 7: Steering Multilingual Models Towards Cultural Knowledge

RoboSurg-VQA: A Multimodal Benchmark for Surgical Segmentation-Aware Visual Question Answering

What Training Data Teaches RL Memory Agents: An Empirical Study of Curriculum Effects in Memory-Augmented QA

Dithering Defense: Adversarial Robustness of Vision Foundation Models via Multi-Level Floyd-Steinberg Dithering

Millimeter-wave Imaging for Anthropometric Body Measurement

Anytime Training with Schedule-Free Spectral Optimization

ModeSwitch-LLM: A Lightweight Phase-Aware Controller for Cross-Mode LLM Inference on a Single GPU

Model Collapse as Cultural Evolution

DreamerNLplus: Interpretable Modeling of Mental Health Dynamics from Social Media Timelines using Hybrid Rule-Based and RAG Methods

The TIME Machine: On The Power of Motion for Efficient Perception

HawkesLLM: Semantic Uncertainty Propagation in Agentic Text Simulation

Steered Generation via Gradient-Based Optimization on Sparse Query Features

Do Language Models Know What Not to Say? Causal Evidence for Statistical Preemption in LLMs

Open Multimodal Datasets and Open-Source Software for Data-Driven Modeling of Multiphase Transport and Thermal Systems

Multilingual Steering by Design: Multilingual Sparse Autoencoders and Principled Layer Selection

Sparse Autoencoders Map Brain-LLM Alignment onto Cortical Semantic Topography

Uncovering the Latent Potential of Deep Intermediate Representations

Brain-LLM Alignment Tracks Training Data, Not Typology

RADAR: Relative Angular Divergence Across Representations

PIMbot: A Self-Adaptive Attack Framework for Adversarial Manipulation of Multi-Robot Reinforcement Learning

World Machine: Towards Generative World Modeling for Time-Series

PACE: Two-Timescale Self-Evolution for Small Language Model Agents

Smoothed Elicitation Complexity for Approximate $Γ$-calibration of Discrete Classification Tasks

Scene Reconstruction as Mapping Priors for 3D Detection

CoMoGen: COntrollable MOtion Dynamics and Interactions with Mask-Guided Video GENeration

A Proactive Multi-Agent Dialogue Framework for Assessing Social Language Disorder Traits in Autism

Verified Task-Space Motion Planning Under Joint-Space Constraints

Test-Time Training Undermines Safety Guardrails

Memorization Dynamics of Fill-in-the-Middle Pretraining

Worse than Random: The Importance of a Baseline for Unsupervised Feature Selection