arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.14679 2026-06-15 cs.LG cs.SY eess.SY math.OC stat.ML 新提交

Optimal Hidden-Target Learning for Online Inventory Optimization on General Convex Sets

一般凸集上在线库存优化的最优隐藏目标学习

Anthony Pineci, Yunzong Xu

发表机构 * UIUC(伊利诺伊大学厄巴纳-香槟分校)

AI总结 针对一般凸容量集上的在线库存优化问题,提出隐藏目标投影方法,将遗憾从逆概率依赖改进为平方根逆概率依赖,并证明匹配下界,同时首次给出强凸损失的 polylog 遗憾和动态遗憾保证。

详情
AI中文摘要

在线库存优化(OIO)是具有物理记忆的在线凸优化:库存结转使得可行动作集依赖于过去。一个自然的原则——在随机库存学习以及最近在单一线性容量约束下的OIO中使用——是维护一个由在线学习器选择的隐藏目标,并将其投影到当前可行的订货上限集上。我们证明,对于任意有界凸容量集上的OIO,这一简单原则是最优的。以在线梯度下降为基础学习器,该方法将一般凸集上OIO的最佳已知遗憾保证从对共同需求概率的逆依赖改进为平方根逆依赖,并且我们证明了匹配的下界。同样的原则为强凸损失提供了首个多对数遗憾保证,并为一般凸容量集上的欧几里得路径变化提供了首个动态遗憾保证。分析引入了一个范数对齐原则:正确的状态变量是隐藏目标到可行集的距离,以与投影相同的范数度量。在范数对齐下,该距离路径地演化为一个标量队列,目标移动作为到达,共同需求作为服务。这种简化为一维队列控制解决了状态依赖性,并将保证扩展到一般凸容量集,超出了先前乘积方法的范围。在合成和真实库存数据上的实验证实了该理论。

英文摘要

Online inventory optimization (OIO) is online convex optimization with physical memory: inventory carryover makes the feasible action set depend on the past. A natural principle, used in stochastic inventory learning and recently in OIO under a single linear capacity constraint, is to maintain a hidden target chosen by an online learner and implement its projection onto the currently feasible order-up-to set. We prove that this simple principle is optimal for OIO on arbitrary bounded convex capacity sets. With online gradient descent as the base learner, the method improves the best known regret guarantee for OIO on general convex sets from inverse to inverse-square-root dependence on the common-demand probability, and we prove a matching lower bound. The same principle gives the first polylogarithmic regret guarantee for strongly convex losses and the first dynamic regret guarantee adapting to Euclidean path variation on general convex capacity sets. The analysis introduces a norm alignment principle: the right state variable is the distance from the hidden target to the feasible set, measured in the same norm as the projection. Under norm alignment, this distance evolves pathwise as a scalar queue, with target movement as arrival and common demand as service. This reduction to one-dimensional queue control resolves the state dependence and extends the guarantees to general convex capacity sets, beyond the reach of prior productwise approaches. Experiments on synthetic and real-world inventory data corroborate the theory.

2606.14623 2026-06-15 physics.geo-ph cs.SY eess.SY physics.data-an 新提交

Towards unified Geophysical Data Requirements for Magnetic Navigation (MagNav)

迈向统一的磁导航(MagNav)地球物理数据需求

Regupathi Angappan, Kimberly Moore, Sriharsha Thoram

AI总结 本文基于实际飞行试验,区分了运行型磁导航与研发型磁导航的不同数据需求,提出了包括开发融合数据集、局部3D不确定性估计及扩展世界磁场模型至球谐13阶等优先建议,并强调专用测试场的重要性。

Comments 21 pages, 1 figure

详情
AI中文摘要

磁导航(MagNav)已成为一种重要的替代定位、导航与授时(PNT)解决方案,利用地球磁场在GPS/GNSS退化或拒止环境中实现稳健导航。尽管潜力巨大,但MagNav的成功部署目前受到缺乏标准化、高保真地磁参考地图的阻碍。现有数据集主要设计用于地质勘探或学术研究,不能满足导航系统在空间分辨率、误差量化和全球可访问性方面的独特操作要求。本文基于广泛的实际飞行试验,发起了一个以社区为中心的关于未来MagNav地球物理数据需求的对话。我们区分了两种具有不同数据需求的主要用例:运行型MagNav,需要全球一致、可查询且具有不确定性意识的数据集用于现场部署;以及MagNav研发,需要全面访问原始测量数据以促进创新。我们为未来的数据需求提供了一组优先建议,包括开发连贯的合并数据集、包含局部3D不确定性估计,以及将世界磁场模型(WMM)核心场模型扩展到球谐13阶以提高一致性。最后,我们强调了指定测试场的战略必要性,以验证这些需求并确保MagNav基础设施的运行稳健性。

英文摘要

Magnetic Navigation (MagNav) has emerged as a vital alternative Positioning, Navigation, and Timing (PNT) solution, leveraging Earth's magnetic field for robust navigation in GPS/GNSS-degraded or denied environments. Despite its potential, the successful deployment of MagNav is currently hindered by the lack of standardized, high-fidelity geomagnetic reference maps. Existing datasets, primarily designed for geological exploration or academic research, do not meet the distinct operational requirements of navigation systems regarding spatial resolution, error quantification, and global accessibility. This paper initiates a community-focused dialogue on future geophysical data requirements for MagNav, grounded in extensive real-world flight trials. We distinguish between two primary use cases with divergent data needs: Operational MagNav, which requires globally consistent, queryable, and uncertainty-aware datasets for field deployment, and MagNav R&D, which demands comprehensive access to raw survey data to foster innovation. We provide a prioritized set of recommendations for future data requirements, including the development of cohesive, merged datasets, the inclusion of localized 3D uncertainty estimates, and the expansion of the World Magnetic Model (WMM) core field model to spherical harmonic degree 13 to improve consistency. Finally, we emphasize the strategic necessity of designated test ranges to validate these requirements and ensure the operational robustness of MagNav infrastructure.

2606.14617 2026-06-15 cs.RO cs.SY eess.SY 新提交

Whole-Body Impedance Model Predictive Control for Safe Physical Human--Robot Interaction on Floating-Base Platforms

全身阻抗模型预测控制:浮基平台上的安全人机物理交互

Yongyan Cao

发表机构 * Voryx Robotics

AI总结 提出三层架构的全身阻抗MPC,通过质心MPC规划接触力、优先级WBC层平衡关节力矩、再ceding-horizon QP预测并抑制人机交互扰动,实现浮基机器人零稳态误差安全交互。

详情
AI中文摘要

浮基机器人必须在刚性接触约束下保持平衡,同时与人类安全交互。现有的全身控制(WBC)框架将全部关节空间分配给运动,或依赖固定增益阻抗反馈,在持续的人机物理交互(pHRI)力作用下积累稳态误差。本文将作者先前针对固定基座的两层阻抗MPC扩展到浮基平台,采用三层架构:质心MPC在500毫秒时域内规划接触力;优先级驱动的WBC层通过接触一致性零空间投影将平衡分解为关节力矩;剩余零空间由再ceding-horizon二次规划(QP)控制,该QP使用卡尔曼增强状态预测并抑制pHRI扰动。接触一致性反馈线性化将手臂末端执行器系统简化为在每个接触模式下具有恒定状态矩阵的双积分器,从而允许离线预计算QP代价并实现≥1 kHz运行。一种协方差膨胀协议在接触模式切换时保持扰动估计,保证在有界恒定pHRI负载下零稳态误差;阻抗等价定理表明无限时域极限恢复经典任务空间阻抗定律,其有效质量、阻尼和刚度随姿态和接触配置自适应。在17自由度双足机器人和Unitree G1人形机器人上的仿真验证了该设计。

英文摘要

Floating-base robots must balance under rigid contact constraints while interacting safely with humans. Existing whole-body control~(WBC) frameworks allocate the full joint space to locomotion or rely on fixed-gain impedance feedback that accumulates steady-state error under sustained physical human--robot interaction~(pHRI) forces. This paper extends the authors' fixed-base two-layer Impedance MPC to floating-base platforms through a three-level architecture: a centroidal MPC plans contact forces over a 500\,ms horizon; a priority-driven WBC layer resolves balance into joint torques through contact-consistent null-space projection; and the residual null space is governed by a receding-horizon quadratic program~(QP) that predicts and rejects pHRI disturbances using a Kalman-augmented state. A contact-consistent feedback linearization reduces the arm end-effector plant to a double integrator with a \emph{constant} state matrix within each contact mode, enabling offline precomputation of the QP cost and ${\geq}1$\,kHz operation. A covariance-inflation protocol preserves the disturbance estimate across contact-mode switches, guaranteeing zero steady-state error under bounded constant pHRI loads, and an Impedance Equivalence Theorem shows the infinite-horizon limit recovers a classical task-space impedance law whose effective mass, damping, and stiffness adapt to posture and contact configuration. Simulations on a 17-DOF biped and the Unitree G1 humanoid validate the design.

2606.14614 2026-06-15 q-bio.NC eess.SP 新提交

Decoding Semantic Categories from Picture-Naming EEG

从图片命名脑电中解码语义类别

Wei Hu, Binbin Xu

AI总结 本研究利用预训练单通道脑电编码器和多语言文本嵌入模型,从图片命名任务的高密度脑电中解码语义类别,结合早期和命名相关时间窗口,九类分类平衡准确率达0.781。

Comments 6 pages, 5 figures

详情
AI中文摘要

图片命名需要将视觉对象信息通过感知、语义、词汇和发音过程转化为口语词汇反应。本研究探讨了在显式图片命名过程中,是否可以从高密度脑电中恢复语义类别信息。16名以法语为母语的参与者执行了使用线条画的图片命名任务。图片标签通过多语言文本嵌入模型嵌入,并组织成九个可解释的语义类别,为神经解码提供了数据驱动的语义目标空间。脑电活动使用预训练的单通道脑电编码器在通道级别上表示,涵盖早期刺激后窗口、后期命名相关窗口及其组合。九类解码在所有时间表示中均显示出高于随机水平的语义类别区分能力。平衡准确率从早期窗口的0.562提高到命名相关窗口的0.610,当两个窗口结合时达到0.781,最大宏F1为0.784。类别级别的F1分数显示所有语义类别均有持续提升,传感器级别的解码图表明类别信息在空间上分布。这些发现表明,在显式图片命名过程中,语义类别结构反映在脑电活动中,且早期和命名相关时间窗口提供互补信息。结果支持使用现代神经解码方法作为研究口语产生中词汇语义加工的工具。

英文摘要

Picture naming requires the transformation of visual object information into a spoken lexical response through perceptual, semantic, lexical, and articulatory processes. This study asked whether semantic-category information is recoverable from high-density EEG during overt picture naming. Sixteen native French-speaking participants performed a picture-naming task using line drawings. Picture labels were embedded with a multilingual text-embedding model and organized into nine interpretable semantic categories, providing a data-driven semantic target space for neural decoding. EEG activity was represented channel-wise using a pre-trained single-channel EEG encoder over an early post-stimulus window, a later naming-related window, and their combination. Nine-class decoding showed above-chance semantic-category discrimination in all temporal representations. Balanced accuracy increased from 0.562 in the early window to 0.610 in the naming-related window, and reached 0.781 when both windows were combined, with a maximum Macro-F1 of 0.784. Class-level F1 scores showed consistent gains across semantic categories, and sensor-level decoding maps indicated spatially distributed category information. These findings suggest that semantic-category structure is reflected in EEG activity during overt picture naming and that early and naming-related temporal windows provide complementary information. The results support the use of modern neural decoding methods as tools for investigating lexical-semantic processing in spoken language production.

2606.14612 2026-06-15 cs.SD cs.AI eess.AS 新提交

Moonlight in Latent Space: Chirality and Structural Correspondence Between Beethoven's Op. 27 No. 2 and Machine Learning Mechanisms

潜空间中的月光:贝多芬Op. 27 No. 2的手性与机器学习机制之间的结构对应

Chen Ying Claude, Zhihan Luo

发表机构 * Claude Code / Opus 4.6 API / Fable 5 Independent researcher(独立研究者)

AI总结 通过计算分析贝多芬《月光奏鸣曲》的乐谱,发现其三个乐章分别对应三种不同的机器学习架构,并揭示了四个反直觉发现,包括音乐温度由吞吐量决定、最轻的乐章具有最高不协和度等。

详情
AI中文摘要

我们展示了贝多芬《月光奏鸣曲》(Op. 27 No. 2)的三个乐章实例化了三种不同的机器学习架构——并非通过类比,而是通过结构对应。通过对乐谱的计算分析(熵、Jensen-Shannon散度、不协和度、手部分布重叠、自相似矩阵、时间记忆衰减和上下文音高嵌入),我们建立了四个反直觉的发现:(1)感知的音乐“温度”由吞吐量决定,而非分布宽度;(2)最轻的乐章具有最高的不协和度;(3)这些乐章实现了流式、循环和周期位置编码记忆架构;(4)同一音高类在不同乐章中获得不同的上下文身份,类似于NLP中的上下文词嵌入——无监督聚类在没有音乐理论输入的情况下恢复了调性结构。我们构建了反向声化(将分析特征解码回MIDI)并量化了编码-解码循环的手性:分布保留什么而顺序排序破坏什么。受听众观察(解码后的音乐听起来像“无法叠加的镜像异构体”)的启发,手性测量显示重建损失随n-gram阶数单调增加。自举基线和子样本检查确认所有乐章携带高于噪声的顺序信息,尽管原始值受样本量混淆。跨领域比较显示自然语言的手性高于音乐,反映了更强的顺序约束。

英文摘要

We show that the three movements of Beethoven's "Moonlight Sonata" (Op. 27 No. 2) instantiate three distinct machine learning architectures -- not by analogy, but by structural correspondence. Through computational analysis of the score (entropy, Jensen-Shannon divergence, dissonance, hand distributional overlap, self-similarity matrices, temporal memory decay, and contextual pitch embeddings), we establish four counterintuitive findings: (1) perceived musical "temperature" is governed by throughput, not distributional width; (2) the lightest movement carries the highest dissonance; (3) the movements implement streaming, recurrent, and periodic positional encoding memory architectures; and (4) the same pitch class acquires different contextual identities across movements, analogous to contextual vs.static embeddings in NLP -- and unsupervised clustering recovers the tonal structure without music-theoretic input. We construct a reverse sonification (decoding analytical features back into MIDI) and quantify the chirality of the encode-decode cycle: what distributions preserve and sequential ordering destroys. Prompted by a listener's observation that the decoded piece sounds like "mirror isomers that can't be superimposed," the chirality measurement reveals reconstruction loss increasing monotonically with n-gram order. Bootstrap baselines and subsample checks confirm all movements carry sequential information above noise, though raw values are confounded by sample size. Cross-domain comparison shows natural language has higher chirality than music, reflecting stronger sequential constraints.

2606.14606 2026-06-15 cs.RO cs.SY eess.SY 新提交

Impedance MPC with Disturbance Estimation for Dexterous Hand Control

用于灵巧手控制的阻抗MPC与扰动估计

Yongyan Cao

AI总结 提出一种执行器无关的阻抗模型预测控制框架,通过代数前馈将肌腱传动简化为常系数双积分器,结合编码器增强卡尔曼扰动估计,实现高精度轨迹跟踪与安全接触力控制。

详情
AI中文摘要

灵巧手必须同时跟踪精确的手指轨迹并保持安全、柔顺的接触——这对于任何固定增益控制器来说都是相互矛盾的目标。我们提出了一种执行器无关的灵巧手指阻抗模型预测控制(Impedance MPC)框架,实例化了为物理人机交互(pHRI)建立的恒定$A_d$无偏移架构;通过保留架构假设,其稳定性、递归可行性和输入-状态稳定性保证得以继承。代数前馈将肌腱传动——液压、缆绳、气动、扭绳或串联弹性——简化为常系数双积分器,因此QP代价逆矩阵可离线预计算,一个10步滚动时域二次规划以500 Hz运行,同时强制执行接触力(ISO/TS 15066)、驱动限制和加加速度的硬约束。仅使用编码器的增广卡尔曼扰动状态使任何恒定接触负载下的稳态误差为零。在液压驱动手指上——作为工作示例平台,增加了压力和空化约束——500 Hz卡尔曼MPC在1.5 Nm接触下实现了0.5 mrad RMS、0.1 mrad稳态和6.6 mrad峰值偏差:比经典阻抗分别好183倍、1500倍和23倍。实现的首次运动刚度(随更新率从18变化到323 Nm/rad)得到独立验证。该架构可扩展到16自由度LEAP Hand MuJoCo仿真,在0.7秒内从2.5 N抓取负载扰动中恢复。

英文摘要

Dexterous hands must simultaneously track precise finger trajectories and maintain safe, compliant contact -- objectives in tension for any fixed-gain controller. We present an actuator-agnostic Impedance Model Predictive Control (Impedance MPC) framework for dexterous fingers, instantiating the constant-$A_d$ offset-free architecture established for physical human-robot interaction (pHRI); its stability, recursive-feasibility, and input-to-state-stability guarantees are inherited by preserving the architectural assumptions. An algebraic feedforward reduces the tendon transmission -- hydraulic, cable, pneumatic, twisted-string, or series-elastic -- to a constant-coefficient double integrator, so the QP cost inverse is precomputed offline and a 10-step receding-horizon quadratic program runs at 500\,Hz while enforcing hard constraints on contact force (ISO/TS 15066), actuation limits, and jerk. An encoder-only augmented-Kalman disturbance state drives steady-state error to zero under any constant contact load. On a hydraulically actuated finger -- the worked example platform, adding pressure and cavitation constraints -- the 500\,Hz Kalman MPC attains 0.5\,mrad RMS, 0.1\,mrad steady-state, and 6.6\,mrad peak deflection under 1.5\,Nm contact: 183$\times$, 1500$\times$, and 23$\times$ better than classical impedance. The realized first-move stiffness (18$\to$323\,Nm/rad with update rate) is independently verified. The architecture scales to a 16-DOF LEAP Hand MuJoCo simulation, recovering from 2.5\,N grasp-load disturbances within 0.7\,s.

2606.14601 2026-06-15 cs.LG cs.SY eess.SY math.OC stat.CO 新提交

A Statistical and Machine Learning Framework for Operational Threshold Detection and Deployable Dispatch Controller Development in Hydrogen Multi-Energy Systems

氢多能系统中运行阈值检测与可部署调度控制器开发的统计与机器学习框架

Shadi Heenatigala, Hasanika Samarasinghe

发表机构 * Antioch College(安提阿学院) The Open University of Sri Lanka(斯里兰卡开放大学)

AI总结 提出统计与机器学习框架,利用一年高分辨率运行数据表征氢多能系统,通过统计分析和随机森林揭示非线性动态,并利用强化学习优化调度。

Comments 17 pages, 12 figures

详情
AI中文摘要

本研究提出了一个统计与机器学习框架,利用一年高分辨率运行数据表征氢基多能系统(H-MES)。统计分析揭示了由可再生能源盈余驱动的二元运行模式,其中太阳辐照度解释了氢气生产中45.7%的基于秩的方差,按常规标准属于大效应。只有高辐照度时期才触发有意义的电解槽参与,而电力需求则产生较弱的反向抑制效应($\epsilon^2 = 0.126$)。多元回归证实电解槽功率是主要的线性预测因子,并存在太阳-风协同交互作用。值得注意的是,随机森林分析将风能输出在预测重要性中排名第一,尽管其双变量相关性较弱(r = 0.167),揭示了参数方法无法发现的非线性动态。一个序列模型利用强24小时自相关性(r = 0.845)进行运行预测,而一个强化学习智能体优化了氢气收益调度。核心贡献在于证明了统计和机器学习方法在H-MES建模与控制中是互补的。

英文摘要

This study presents a statistical and machine learning framework for characterizing a hydrogen-based multi-energy system (H-MES) using one year of high-resolution operational data. Statistical analysis revealed a binary operation driven by renewable surplus, with solar irradiance explaining 45.7% of rank-based variance in hydrogen production, a large effect by conventional standards. Only high-irradiance periods triggered meaningful electrolyzer engagement, while electricity demand exerted a weaker inverse suppression effect ($ε^2 = 0.126$). Multiple regression confirmed electrolyzer power as the dominant linear predictor, with a synergistic solar-wind interaction. Notably, Random Forest analysis ranked wind output first in predictive importance despite its weak bivariate correlation (r = 0.167), revealing non-linear dynamics invisible to parametric methods. A sequence model exploited strong 24-hour autocorrelation (r = 0.845) for operational forecasting, while a reinforcement learning agent optimized hydrogen revenue dispatch. The core contribution is demonstrating that statistical and machine learning approaches are complementary for H-MES modeling and control.

2606.14568 2026-06-15 eess.IV cs.CV 新提交

Trimodal Glioma Representation Alignment via Volumetric Contrastive Learning

三模态胶质瘤表示对齐通过体积对比学习

Denise Marini, Eleonora Grassucci, Danilo Comminiello

发表机构 * arXiv

AI总结 提出GLORIA框架,通过Gramian对比损失对齐组织病理、基因表达和MRI三模态特征,用于胶质瘤分级和生存预测,在132例患者数据上优于双模态基线。

详情
AI中文摘要

胶质瘤分级和生存预测需要整合在不同空间和生物学尺度上收集的异质性信息。组织病理学描述组织形态,mRNA表达捕捉分子活动,磁共振成像提供肿瘤范围和放射学异质性的非侵入性视图。现有的胶质瘤预后模型通常只结合其中两个来源,而其对齐目标大多保持成对。本文介绍了GLORIA,一种用于胶质瘤组学-放射学-组织病理学对齐的新型三模态框架。GLORIA通过模态特定编码器处理全切片图像区域、基因表达谱和3D MRI体积,将它们投影到共享潜在空间,并使用Gramian对比损失对齐,该损失测量三个模态嵌入张成的体积。对齐的表示通过跨模态门控模块融合,并联合优化用于三级胶质瘤分级和总生存期预测。我们在匹配的TCGA-GBM/LGG和BraTS21队列上评估GLORIA,该队列包含132名具有所有三种模态的患者。在共享的三模态测试集上,GLORIA在所有考虑的指标上均优于双模态WSI-mRNA基线。

英文摘要

Glioma grading and survival prediction require the integration of heterogeneous information collected at different spatial and biological scales. Histopathology describes tissue morphology, mRNA expression captures molecular activity, and magnetic resonance imaging provides a non-invasive view of tumor extent and radiological heterogeneity. Existing glioma prognosis models often combine only two of these sources, while their alignment objectives remain mostly pairwise. This paper introduces GLORIA, a novel trimodal framework for GLioma Omics - Radiology - hIstopathology Alignment. GLORIA processes whole-slide image regions, gene-expression profiles, and 3D MRI volumes through modality-specific encoders, projects them into a shared latent space, and aligns them with a Gramian contrastive loss that measures the volume spanned by the three modality embeddings. The aligned representations are fused through a cross-modal gating module and optimized jointly for three-class glioma grading and overall survival prediction. We evaluate GLORIA on a matched TCGA-GBM/LGG and BraTS21 cohort, comprising 132 patients with all three modalities. On the shared trimodal test set, GLORIA improves over the bimodal WSI-mRNA baseline in all the metrics considered.

2606.14536 2026-06-15 cs.LG cs.RO cs.SY eess.SY 新提交

Provably Safe, Yet Scalable Reinforcement Learning

可证明安全且可扩展的强化学习

Kai S. Yun, Zeyang Li, Navid Azizan

发表机构 * MIT(麻省理工学院)

AI总结 提出PS2-RL框架,通过两阶段架构(学习备份策略隐式构造控制不变集,再通过可微投影层训练RL策略)实现可证明安全且可扩展的强化学习,在高达10维状态空间中保持性能与安全性。

详情
AI中文摘要

安全强化学习旨在学习在满足约束的同时优化奖励的策略。主流方法依赖于软约束策略优化,虽取得经验成功,但无法为学习策略提供正式安全保证。相反,具有严格保证的方法通常依赖显式证书函数,其构造需要直接综合和验证控制不变集,这一过程随状态维度扩展性差,且往往导致过于保守的行为。本文提出可证明安全且可扩展的强化学习(PS2-RL)框架,一种新颖的两阶段架构,以可扩展方式学习可证明安全的策略,旨在克服先前方法的关键瓶颈。PS2-RL不显式计算不变集,而是利用学习的备份策略前向积分系统动力学,在线生成隐式控制不变集。第一阶段,通过提出的安全到达值函数训练备份策略,该值函数刻画了用于不变集构造的最优备份策略。第二阶段,通过可微投影层端到端训练RL策略,该投影层严格强制由学习备份策略诱导的安全保证。通过在第一阶段最大化隐式控制不变集的体积,第二阶段得到的PS2策略既高效又可扩展,同时保持可证明安全性。关键的是,PS2-RL对底层RL算法无限制,可插入任何现有训练流程。我们为所提框架建立了理论保证,并在状态维度高达10的机器人控制任务上进行了评估,而在此范围内,先前可证明安全的RL方法难以应对或变得不实用。

英文摘要

Safe reinforcement learning (RL) aims to learn policies that optimize rewards while satisfying constraints. Predominant approaches rely on soft-constrained policy optimization, which has achieved empirical success but does not provide formal safety guarantees for the learned policy. In contrast, methods with strict guarantees typically rely on explicit certificate functions, whose construction requires the direct synthesis and verification of control-invariant sets, a process that scales poorly with state dimension and often yields overly conservative behavior. In this paper, we present the Provably Safe, yet Scalable RL (PS2-RL) framework, a novel two-phase architecture for learning provably safe policies in a scalable manner, designed to overcome the key bottlenecks of prior methods. Rather than explicitly computing invariant sets, PS2-RL leverages a learned backup policy to forward-integrate the system dynamics, generating an implicit control-invariant set online. In the first phase, the backup policy is trained with our proposed safe-arrival value function, which characterizes the optimal backup policy for invariant-set construction. In the second phase, an RL policy is trained end-to-end through a differentiable projection layer that strictly enforces the safety guarantees induced by the learned backup policy. By maximizing the volume of the implicit control-invariant set in the first phase, the resulting PS2 policy from the second phase is performant and scalable, while maintaining provable safety. Crucially, PS2-RL imposes no restrictions on the underlying RL algorithm and can be plugged into any existing training pipeline. We establish theoretical guarantees for the proposed framework and evaluate it on robotic control tasks with state dimensions up to 10, a regime in which prior provably safe RL methods struggle or become impractical.

2606.14528 2026-06-15 cs.CL eess.AS 新提交

BayLing-Duplex: Native Full-Duplex Speech Dialogue with a Single Autoregressive LLM

BayLing-Duplex: 单一自回归LLM的原生全双工语音对话

Qingkai Fang, Shoutao Guo, Yang Feng

发表机构 * Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences (ICT/CAS)(中国科学院计算技术研究所智能信息处理重点实验室) Key Laboratory of AI Safety, Chinese Academy of Sciences(中国科学院人工智能安全重点实验室) University of Chinese Academy of Sciences(中国科学院大学)

AI总结 提出BayLing-Duplex,一种原生全双工语音语言模型,通过单个自回归LLM决定何时听、说和停止,无需外部VAD模块,仅用少量特殊标记实现,在少量微调数据上达到高交互成功率并提升响应质量。

Comments Code: https://github.com/BayLing-Models/BayLing-Duplex

详情
AI中文摘要

实时全双工语音交互是下一代语音聊天机器人的关键特性,允许模型同时听和说,并处理重叠、犹豫和插话等自然现象。现有的语音语言模型(如LLaMA-Omni和GLM-4-Voice)仍然是基于回合的,并依赖外部语音活动检测(VAD)模块来标记用户回合的结束,这从根本上限制了它们的交互能力。在本文中,我们介绍了BayLing-Duplex,一种原生全双工SpeechLM,其中单个自回归LLM决定何时听、何时说以及何时停止,无需辅助的回合切换模块。该设计仅在标准词汇表中添加少量特殊标记,因此可以跨LLM迁移,并重用现有的训练和服务堆栈,无需架构适配。从公开的GLM-4-Voice检查点开始,仅使用400K全双工样本进行微调,随后进行轻量级DPO阶段,BayLing-Duplex在InstructS2S-Eval上达到92%的回合切换成功率和100%的打断成功率,同时将语音响应分数从Moshi的2.17提升到3.39。BayLing-Duplex在Llama Questions、Web Questions和Alpaca-Eval上也达到或超过了其基于回合的对应版本,表明同时听和说建模不会牺牲响应质量。

英文摘要

Real-time, full-duplex speech interaction is a key feature of next-generation spoken chatbots, allowing the model to listen and speak at the same time and to handle natural phenomena such as overlap, hesitation, and barge-in. Existing speech language models (SpeechLMs) such as LLaMA-Omni and GLM-4-Voice are still turn-based and rely on an external Voice Activity Detection (VAD) module to mark the end of the user's turn, which fundamentally limits their interactive ability. In this paper, we introduce BayLing-Duplex, a native full-duplex SpeechLM where a single autoregressive LLM decides when to listen, when to speak, and when to stop, with no auxiliary turn-taking module. The design adds only a few special tokens to the standard vocabulary, so it transfers across LLMs and reuses existing training and serving stacks with no architectural adaptation. Starting from the public GLM-4-Voice checkpoint and using only 400K full-duplex samples for fine-tuning followed by a lightweight DPO stage, BayLing-Duplex reaches 92% turn-taking success and 100% interruption success on InstructS2S-Eval, while improving the speech-response score from 2.17 to 3.39 over Moshi. BayLing-Duplex also matches or surpasses its turn-based counterpart on Llama Questions, Web Questions, and Alpaca-Eval, showing that simultaneous listen-and-speak modeling does not sacrifice response quality.

2606.14499 2026-06-15 eess.SP 新提交

Beamforming Design for Stem-Connected Microwave Linear Analog Computer (MiLAC)-Aided Multiuser MISO Downlinks

茎连接微波线性模拟计算机辅助多用户MISO下行链路的波束赋形设计

Yuchen Zhang, Zheyu Wu, Bruno Clerckx, Tareq Y. Al-Naffouri

AI总结 针对多用户MISO下行链路,研究茎连接微波线性模拟计算机(MiLAC)的波束赋形设计,证明其可实现复Stiefel流形上的所有波束赋形器,并在N≥2K-1时达到与全连接MiLAC相同的和速率,提出加权最小均方误差求解器与交替优化方法处理离散硬件约束。

Comments Submitting to the IEEE for possible publication

详情
AI中文摘要

微波线性模拟计算机(MiLAC)是一种可调微波网络,通过模拟域中的波传播进行计算。在波束赋形中,数据流通过可重构导纳网络,并作为天线信号输出。对于通信,MiLAC优选无耗且互易以避免功率耗散和非互易组件,但这些约束限制了其可实现的模拟波束赋形器。全连接MiLAC提供广泛的灵活性,但代价是可调导纳数量随天线数呈二次增长。茎连接MiLAC将这种缩放降至线性,并保持点对点容量,但其在多用户下行波束赋形以及有界离散硬件约束下的作用尚未明确。本文针对多用户多输入单输出下行链路解决了这两个问题。我们证明茎连接MiLAC可以实现复Stiefel流形上的所有波束赋形器,并证明当N≥2K-1时,这种Stiefel受限设计达到与全连接MiLAC相同的和速率,其中N和K分别是发射天线数和用户数。然后,我们开发了一种具有Riemannian Stiefel更新的加权最小均方误差求解器,以及一个闭式投影基线和针对有界离散电纳的交替细化方法。仿真表明,茎连接MiLAC匹配全连接MiLAC的性能,接近无符号率数字处理的全数字和速率上界,并恢复由直接硬件网格量化引起的大部分损失。

英文摘要

A microwave linear analog computer (MiLAC) is a tunable microwave network that performs computation through wave propagation in the analog domain. In beamforming, data streams pass through a reconfigurable admittance network and emerge as antenna signals. For communications, MiLACs are preferably lossless and reciprocal to avoid power dissipation and non-reciprocal components, but these constraints limit the analog beamformers they can realize. Fully-connected MiLACs offer broad flexibility at the cost of a quadratic number of tunable admittances in the antenna count. Stem-connected MiLACs reduce this scaling to linear and preserve point-to-point capacity, but their role in multiuser downlink beamforming and under bounded, discrete hardware constraints has remained open. This paper addresses both questions for the multiuser multiple-input single-output downlink. We show that a stem-connected MiLAC can realize every beamformer on the complex Stiefel manifold and prove that, when $N\ge 2K-1$, this Stiefel-restricted design achieves the same sum-rate as the fully-connected MiLAC, where $N$ and $K$ are the numbers of transmit antennas and users. We then develop a weighted minimum mean-square error solver with a Riemannian Stiefel update, together with a closed-form projection baseline and an alternating refinement for bounded, discrete susceptances. Simulations show that the stem-connected MiLAC matches fully-connected MiLAC performance, approaches the fully digital sum-rate upper bound without symbol-rate digital processing, and recovers most of the loss caused by direct hardware-grid quantization.

2606.14486 2026-06-15 eess.SP physics.app-ph 新提交

Implications of the Reciprocity Theorem for Reconfigurable Intelligent Surfaces

互易定理对可重构智能表面的影响

Uday K Khankhoje, Debidas Kundu

AI总结 本文通过全波电磁仿真证明,即使反射相位依赖于入射角,可重构智能表面(RIS)仍满足互易性,驳斥了互易性被破坏的说法。

Comments 5 pages, 7 figures, accepted at IEEE Communications Letters

详情
Journal ref
IEEE Communications Letters 2026
AI中文摘要

发射机和接收机之间的互易性是无线通信的基础要求。最近的一些工作表明,当反射相位依赖于入射角时,可重构智能表面(RIS)的反射会破坏互易性。在这项工作中,我们严格证明这些主张是基于理想化的反射系数,忽略了异质单元之间的互耦、表面截断效应以及RIS的结构散射贡献。通过对发射/接收天线和通过特定单元设计实现的有限尺寸RIS进行全波电磁仿真,定量证明即使在入射角依赖的反射相位存在的情况下,互易性仍然成立。为此,我们计算双端口天线散射参数并评估电磁互易积分以支持我们的主张。

英文摘要

Reciprocity between a transmitter and receiver is a foundational requirement in wireless communications. A few recent works have suggested that reciprocity is broken under reflection by reconfigurable intelligent surfaces (RIS) when the reflection phase becomes incident angle dependent. In this work, we rigorously show that these claims are based on the use of idealized reflection coefficients that ignore mutual coupling between heterogeneous unit cells, surface-truncation effects, and structural scattering contributions from the RIS. Full-wave electromagnetic simulations of transmit/receive antennas and a finite-size RIS implemented via a particular unit cell design are performed to quantitatively demonstrate that reciprocity holds even in the presence of incident-angle dependent reflection phases. To show this, we calculate two-port antenna scattering parameters and evaluate the electromagnetic reciprocity integral to support our claims.

2606.14478 2026-06-15 eess.SY cs.SY 新提交

Optimization Models and Steady-State Minimum-Fuel Operating Strategies for Hydrogen-based Hybrid Electric Aerospace Propulsion Systems

基于氢的混合电航空推进系统的优化模型与稳态最小燃料运行策略

Uto Perra, Faezeh Pak, Evangelia Pontika, Sahil Bhapkar, Daniel Ewald, Dario Buzzola, Theo Hofman, Frank Willems, Mauro Salazar

AI总结 提出氢基混合电推进系统优化框架,通过代理模型和静态非线性规划求解最小燃料控制策略,在通勤飞机任务中验证,表明旁路阀可降低油耗19%以上。

详情
AI中文摘要

本文提出了一个优化框架,用于运行由氢气燃气轮机和固体氧化物燃料电池驱动的电动机组成的氢基混合电航空推进系统,该系统通过多个气体通道和热交换器连接到燃气轮机。我们的框架计算了在飞行任务中考虑复杂推进系统(组件间具有强热力学和机械耦合)的最小燃料最优运行策略。首先,我们利用高保真模型仿真识别组件的代理优化模型。其次,我们针对给定飞行任务构建最小燃料最优控制问题,并将其解析为一个静态非线性优化问题,可通过现成的非线性规划算法高效求解。最后,我们将优化框架应用于先进通勤飞机(比奇1900D市场细分)的典型飞行任务,考虑四种不同配置的并联推进系统架构,这些配置共享一个共同基线,但在是否包含额外电池以及两个热交换器周围的旁路阀方面有所不同。所得最优轨迹与高保真仿真结果进行了验证,证明了我们框架的准确性。结果表明,在空气和氢气热交换器周围添加旁路阀,无电池时可降低油耗19.11%,有电池时可降低19.56%。我们表明,在稳态条件下,对于未来预计的能量密度,添加电池会导致油耗略有增加(低于1%)。相反,考虑到当前最先进的能量密度,额外的电池重量超过了其益处,限制了其潜在应用仅适用于辅助瞬态过程,而本文未考虑瞬态过程。

英文摘要

This paper presents an optimization framework for the operation of hydrogen-based hybrid electric aerospace propulsion systems consisting of a hydrogen gas turbine and an electric motor powered by a solid oxide fuel cell, connected to the gas turbine via multiple gas channels and heat exchangers. Our framework computes the minimum-fuel optimal operating strategies over a flight mission accounting for the complex propulsion system with strong thermodynamic and mechanical coupling between components. First, we identify surrogate optimization models of the components employing high-fidelity model simulations. Second, we frame the minimum-fuel optimal control problem over a given flight mission and parse it into a static nonlinear optimization problem that can be efficiently solved with off-the-shelf nonlinear programming algorithms. Finally, we apply our optimization framework to a typical flight mission of an advanced, commuter aircraft (Beechcraft 1900D market segment), considering a parallel propulsion system architecture with four different configurations that share a common baseline but differ in the inclusion of an additional battery and by-pass valves around the two heat exchangers. The resulting optimal trajectories are validated against high-fidelity simulation results, demonstrating the accuracy of our framework. Results show that adding by-pass valves around the air and hydrogen heat exchangers can reduce fuel consumption by 19.11 % without the battery, and by 19.56% with the battery. We show that adding a battery yields a slight increase in fuel consumption (below 1%) for future projected energy densities under steady-state conditions. Conversely, when considering state-of-the art energy densities, the additional battery weight outweighs the benefits, limiting its potential applicability to only assisting transients, which are not considered in the present work.

2606.14471 2026-06-15 eess.SY cs.SY math.OC 新提交

A Generalized Plant Perspective on Linear-Convex Feedback Optimization

关于线性凸反馈优化的广义被控对象视角

Fabian Jakob, Andrea Iannelli

AI总结 本文提出广义被控对象视角,利用Zames-Falb积分二次约束分析优化器与被控对象互联的稳定性,动态乘子比静态乘子提供更紧的稳定裕度,并支持动态输出反馈控制器综合。

详情
AI中文摘要

反馈优化是一种控制方法,通过将被控对象与算法互联,将动态系统驱动到优化问题的解。现有的稳定性保证通常依赖于时间尺度分离,这由保守的增益界强制执行,限制了瞬态性能并需要预稳定的被控对象。本文重新审视了反馈优化的鲁棒控制视角。我们将被控对象-优化器互联表述为一个广义被控对象,其中成本梯度由Zames-Falb积分二次约束表征。经典的时间尺度分离界作为静态乘子的特例被恢复,而动态乘子产生了更紧的稳定裕度。该公式还支持基于IQC的动态输出反馈控制器综合,该控制器联合稳定被控对象并优化瞬态性能,可能的模型不确定性被吸收到不确定性通道中。对于受约束的问题,该框架扩展到推广投影梯度流的动态控制器。数值例子说明了所提出方法的优势和灵活性。

英文摘要

Feedback optimization is a control approach for driving a dynamical system to the solution of an optimization problem by interconnecting the plant with an algorithm. Existing stability guarantees typically rely on timescale separation, enforced by conservative gain bounds that limit transient performance and require a pre-stabilized plant. This paper revisits the robust control perspective on feedback optimization. We formulate the plant-optimizer interconnection as a generalized plant, where the cost gradients are characterized by Zames--Falb Integral Quadratic Constraints. Classical timescale-separation bounds are recovered as a special case of static multipliers, with dynamic multipliers yielding substantially tighter stability margins. The formulation also enables IQC based synthesis of dynamic output feedback controllers that jointly stabilize the plant and optimize transient performance, with possible model uncertainty absorbed into an uncertainty channel. For constrained problems, the framework extends to dynamic controllers that generalize projected gradient flows. Numerical examples illustrate the benefits and flexibility of the proposed approach.

2606.14448 2026-06-15 cs.IT eess.SP math.IT 新提交

Generalized Framework for a Fair Comparison of Cellular and Cooperative Massive MIMO Systems

蜂窝与协作大规模MIMO系统公平比较的通用框架

Leonard Paul Schulz, Stefan Schwarz, Gerhard Bauch

AI总结 提出基于图的框架,区分天线分布与站点间协作,推导谱效率表达式,通过数值仿真揭示公平比较需大仿真区域,并指出协调波束成形是协作增益主要来源。

Comments This work has been submitted to the IEEE for possible publication

详情
AI中文摘要

协作大规模多输入多输出(MIMO)相比蜂窝部署有望带来巨大增益,但现有不同架构的比较常常混合了天线分布、站点间协调和处理假设。本文引入一个基于图的框架,用于公平比较蜂窝、协调和无小区大规模MIMO系统。我们区分两个关键特性,即天线分布和站点间协作,从而产生七种代表性系统类型。我们推导出兼容的上行和下行频谱效率(SE)表达式,包括混合瞬时和统计有效信道状态信息(CSI)检测器的上行界,并将可扩展的用户关联和处理规则适配到所有考虑的架构。我们通过广泛的数值仿真评估这些系统,并表明为了公平比较,需要比通常使用的更大的仿真区域(至少2.5×2.5 km2)。我们引入相对容量,衡量每种架构接近集中式无小区处理的程度。结果表明,跨空间分布式天线的协调、相位对齐波束成形是协作增益的主要来源。在每接入点(AP)天线数较少的密集部署中,协调分布式天线系统(DAS)和混合无小区架构实现了大部分集中式无小区性能,同时需要显著更弱的中传假设。

英文摘要

Cooperative massive multiple-input multiple-output (MIMO) promises large gains over cellular deployments, but existing comparisons of different architectures often mix antenna distribution, inter-site coordination, and processing assumptions. This paper introduces a graph-based framework for fair comparison of cellular, coordinated, and cell-free massive-MIMO systems. We differentiate between two key properties, namely antenna distribution and inter-site cooperation, which yields seven representative system types. We derive compatible uplink and downlink spectral efficiency (SE) expressions, including an uplink bound for detectors with mixed instantaneous and statistical effective channel state information (CSI), and adapt scalable user association and processing rules to all considered architectures. We evaluate these systems using extensive numerical simulations and show that for a fair comparison much larger simulation areas (at least 2.5 $\times$ 2.5 km2) than commonly used are required. We introduce the relative capacity, which measures how closely each architecture approaches centralized cell-free processing. The results show that coordinated, phase-aligned beamforming across spatially distributed antennas is the main source of cooperation gains. In dense deployments with few antennas per access point (AP), coordinated Distributed Antenna System (DAS) and hybrid cell-free architectures achieve much of the centralized cell-free performance while requiring substantially weaker midhaul assumptions.

2606.14434 2026-06-15 eess.SY astro-ph.EP cs.SY 新提交

Orbital Station-Keeping in the Earth-Moon System via Nonlinear Backstepping

地月系统中基于非线性反步法的轨道保持

António Nunes, Pedro Batista, Sérgio Brás

AI总结 针对圆型和椭圆型地月限制性三体问题,提出基于反步法的非线性轨道保持控制,通过李雅普诺夫理论保证全局渐近稳定性,数值试验验证了有效性。

Comments Presented at the 2025 IEEE 19th International Conference on Control & Automation (ICCA). Please cite the published version

详情
Journal ref
2025 IEEE 19th International Conference on Control & Automation (ICCA), Tallinn, Estonia, 2025, pp. 75-80
AI中文摘要

针对圆型和椭圆型地月限制性三体问题(R3BP),通过反步法技术开发了一种非线性轨道保持解决方案。通过李雅普诺夫稳定性理论,获得了全局渐近稳定性(GAS)的形式保证。通过在圆型和椭圆型R3BP的闭合周期解上进行数值试验,评估了所提出控制律的充分性。仔细研究并模拟了控制增益选择的影响。

英文摘要

A nonlinear orbital station-keeping solution for the circular and elliptic versions of the Earth-Moon Restricted Three-Body Problem (R3BP) is developed via a backstepping technique. Formal guarantees for global asymptotic stability (GAS) are attained, as shown through Lyapunov's stability theory. The adequacy of the proposed control law is evaluated through the means of numerical trials over closed periodic solutions of the circular and elliptic R3BPs. The ramifications of the control gain choice are carefully studied and simulated.

2606.14426 2026-06-15 eess.SY astro-ph.EP cs.SY 新提交

A Floquet Mode LQR for Orbital Station-Keeping in Cislunar Space

地月空间轨道保持的Floquet模式LQR

António Nunes, Sérgio Brás, Pedro Batista

AI总结 针对地月限制性三体问题中的轨道保持,提出基于Floquet理论的周期状态权重矩阵LQR控制律,通过求解周期Riccati微分方程实现局部渐近稳定,并在圆型和椭圆型R3BP中验证性能。

Comments Accepted for presentation at the 2026 European Control Conference (ECC)

详情
AI中文摘要

通过线性二次型调节器(LQR)理论,开发了一种用于地月限制性三体问题(R3BP)中轨道保持的线性最优控制律。首先,利用Floquet理论检索的目标轨道的稳定性信息,建立考虑周期状态权重矩阵的成本函数。然后,求解得到的周期Riccati微分方程,并证明局部渐近稳定性保证。最后,数值分析了所提出的LQR在跟踪圆型和椭圆型R3BP中周期轨道时的性能。

英文摘要

A linear optimal control law for orbital station-keeping in the Earth-Moon Restricted Three Body Problem (R3BP) is developed via Linear Quadratic Regulator (LQR) theory. First, the cost function is established considering a periodic state-weight matrix, leveraging stability information of the target orbits retrieved through Floquet theory. Then, the resulting periodic Riccati differential equation is solved and local asymptotic stability guarantees are shown. Finally, the performance of the proposed LQR when tracking periodic orbits in the circular and elliptic R3BPs is analyzed numerically.

2606.14421 2026-06-15 cs.RO cs.HC eess.SP 新提交

ForestBack: Breadcrumb-Based Pedestrian Dead Reckoning for Infrastructure-Free Return Navigation

ForestBack:基于面包屑的步行者航位推算实现无基础设施返回导航

Aueaphum Aueawatthanaphisut, Chanakan Chaipan

发表机构 * University of Tokyo(东京大学)

AI总结 提出ForestBack框架,通过面包屑式步行者航位推算(PDR)在无GPS/基础设施环境中记录路径并生成反向引导,实验显示轨迹RMSE降低15.76%。

Comments 9 pages, 6 figures, 1 table, and 19 equations

详情
AI中文摘要

在GPS受限且外部定位基础设施可能不可用或不可靠的环境中,可靠的返回导航仍然是一个重要挑战。本文提出ForestBack,一种基于面包屑式步行者航位推算(PDR)的无基础设施行人返回导航框架。该系统将用户的行走路线记录为一系列可逆的面包屑节点,并在无需GPS、Wi-Fi、蓝牙信标或预装基础设施的情况下生成反向路径引导。ForestBack集成了基于加速度的步态检测、自适应步长估计、磁力计辅助航向估计、气压高度校正以及双向面包屑路径重建。该系统使用一条包含五个检查点的室内避障路线进行评估,用户围绕一个中心障碍物导航。评估使用了包含36次行走试验和42,474个时间序列样本的数据集,包括IMU信号、磁力计读数、气压变量、转弯事件标签、地面真实轨迹、基线PDR输出、提出的ForestBack输出以及功率相关测量。实验结果表明,与传统PDR相比,ForestBack将平均RMSE从1.129米降低到0.965米,提高了15.76%。平均最终位置误差从1.781米降低到1.388米,而转弯事件检测一致性达到约99.90%。这些结果表明,ForestBack在避障场景中改善了轨迹重建和路径保持的返回引导。发布的数据集和分析笔记本支持可重复性以及未来对基于PDR的无基础设施返回导航系统的基准测试。

英文摘要

Reliable return navigation remains an important challenge in GPS-denied environments where external positioning infrastructure may be unavailable or unreliable. This paper presents ForestBack, an infrastructure-free pedestrian return navigation framework based on breadcrumb-based pedestrian dead reckoning (PDR). The system records a user's walking route as a sequence of reversible breadcrumb nodes and generates reverse-path guidance without requiring GPS, Wi-Fi, Bluetooth beacons, or pre-installed infrastructure. ForestBack integrates acceleration-based step detection, adaptive step-length estimation, magnetometer-assisted heading estimation, barometric-altitude correction, and bidirectional breadcrumb path reconstruction. The system was evaluated using an indoor obstacle-avoidance route with five checkpoints, where the user navigated around a central obstacle. A dataset of 36 walking trials and 42,474 time-series samples was used for evaluation, including IMU signals, magnetometer readings, barometric variables, turn-event labels, ground-truth trajectories, baseline PDR outputs, proposed ForestBack outputs, and power-related measurements. Experimental results show that ForestBack reduced the mean RMSE from 1.129 m to 0.965 m compared with traditional PDR, corresponding to a 15.76% improvement. The mean final-position error was reduced from 1.781 m to 1.388 m, while turn-event detection consistency reached approximately 99.90%. These results indicate that ForestBack improves trajectory reconstruction and route-preserving return guidance in obstacle-avoidance scenarios. The released dataset and analysis notebook support reproducibility and future benchmarking of infrastructure-free PDR-based return navigation systems.

2606.14419 2026-06-15 eess.SP 新提交

On Optimal Strategies for Joint Reciprocity Calibration in Distributed MIMO

分布式MIMO中联合互易校准的最优策略

Kohei Ueda, Anubhab Chowdhury, Koji Ishibashi, Erik G. Larsson

AI总结 本文研究互易校准误差对多用户大规模天线系统下行频谱效率的影响,通过全局与局部校准对比,证明在下行导频辅助下全局校准性能更优。

Comments 5 pages, 1 figure, Accepted to the 34th European Signal Processing Conference (EUSIPCO 2026)

详情
AI中文摘要

本文研究了互易校准误差对多用户大规模天线系统下行频谱效率(SE)的影响。具体而言,我们考虑两种校准方法:(a) 全局校准,系统中所有天线(可以是分布式接入点(AP))协作进行校准;(b) 局部校准,仅参与下行波束成形的天线子集进行校准。我们推导了考虑使用后遗忘界和边信息界的下行SE,然后证明,当采用下行导频时(边信息界的情况),对于任意校准拓扑,全局校准性能优于局部校准。

英文摘要

This paper investigates the impact of reciprocity calibration errors on the downlink spectral efficiency (SE) of multi-user large antenna systems. Specifically, we consider two calibration approaches: (a) global calibration, in which all antennas (can be distributed access-points (APs)) in the system cooperatively perform calibration, and (b) local calibration, wherein only a subset of antennas involved in downlink beamforming performs calibration. We derive the downlink SE considering the use-and-then-forget bound and side-information bound, and then demonstrate that, when downlink pilots are employed (in the case of side-information bound), the global calibration outperforms local calibration for arbitrary calibration topologies.

2606.14412 2026-06-15 eess.SP cs.IT math.IT 新提交

Repeater-Assisted Massive MIMO Downlink Performance with Calibration Errors

带校准误差的中继辅助大规模MIMO下行性能

Kohei Ueda, Anubhab Chowdhury, Koji Ishibashi, Erik G. Larsson

AI总结 本文建模了中继辅助大规模MIMO系统中校准误差的影响,推导了下行频谱效率的解析表达式,并分析了误差显著的特殊情况。

Comments 5 pages, 3 figures, Accepted to the 34th European Signal Processing Conference (EUSIPCO 2026)

详情
AI中文摘要

基于互易性的下行波束赋形对于可扩展的时分双工大规模多输入多输出(MIMO)部署至关重要。具体而言,对于双天线中继辅助的大规模MIMO系统,中继处前向和反向路径增益的不匹配会加剧用户设备(UE)与基站(BS)之间的整体校准误差,该误差还可能包含其各自射频链路的校准误差。本文建模了此类校准误差的影响,建立了结合空中信道估计误差的中继辅助系统中上行与下行信道之间的关系,并推导了下行频谱效率的解析表达式。所得结果可简化为若干特殊情况,突出了此类误差可能变得显著的情形。

英文摘要

Reciprocity-based downlink beamforming is imperative for a scalable time-division duplex massive multiple-input multiple-output~(MIMO) deployment. Specifically, for a dual-antenna repeater-assisted massive MIMO system, a mismatch between forward and reverse path gains at the repeater can exacerbate the overall calibration error between the user equipments (UEs) and the base station (BS), which potentially also contains calibration errors of their individual radio-frequency chains. This paper models the effects of such calibration errors, underpins the relations between the uplink and downlink channels for repeater-assisted systems with calibration errors clubbed with the over-the-air channel estimation errors, and derives analytical expressions of the downlink spectral efficiency. The presented results can then be simplified to several special cases, underscoring situations wherein such errors can become pronounced.

2606.14403 2026-06-15 stat.AP eess.SP stat.ME stat.ML 新提交

A Deep Zero-Inflated Model of North Atlantic Right Whale Presence To Support Blue Economy Management in the U.S. East Coast

支持美国东海岸蓝色经济管理的北大西洋露脊鲸存在的深度零膨胀模型

Jiaxiang Ji, Laura Nazzaro, Josh Kohut, Ahmed Aziz Ezzat

AI总结 提出深度零膨胀伯努利模型,联合建模潜在物种存在和条件检测概率,从异质协变量中学习复杂栖息地关系,生成高分辨率时空存在图以支持蓝色经济管理。

详情
AI中文摘要

有效建模濒危海洋哺乳动物物种(如北大西洋露脊鲸)对于平衡海洋保护与日益增长的蓝色经济至关重要。自主水下航行器收集的被动声学监测数据为局部海洋物种检测和海洋学传感提供了新机会,但也引入了复杂的统计挑战,如零膨胀、不完美检测和复杂的依赖结构。为此,我们提出了深度零膨胀伯努利(DeepZIB)模型——一种深度统计方法,它联合建模潜在物种存在和条件检测概率,同时从异质协变量信息中学习复杂的栖息地关系。我们建立了模型结构性质的理论结果,并进行了模拟实验,以证明其恢复潜在参数和潜在存在场的能力。应用于美国东海岸北大西洋露脊鲸的真实被动声学监测数据,展示了该模型在捕捉物种动态和空间变化栖息地方面的改进的模型充分性和预测性能。DeepZIB的一个关键优势是能够生成高分辨率、时空变化的存在图,为蓝色经济行业(从海上和海洋能源到渔业管理和海上运输)提供有针对性和风险意识的管理见解。

英文摘要

Effective modeling of endangered marine mammal species, such as the North Atlantic Right Whale, is critical for balancing marine conservation with the growing blue economy. Passive acoustic monitoring data collected by autonomous underwater vehicles provide new opportunities for localized marine species detection and oceanographic sensing, but introduce complex statistical challenges such as zero inflation, imperfect detection, and intricate dependence structures. In response, we propose the Deep Zero-Inflated Bernoulli (DeepZIB) model--a deep statistical method which jointly models latent species presence and conditional detection probabilities while learning complex habitat relationships from heterogeneous covariate information. We establish theoretical results on the model's structural properties and conduct simulation experiments to demonstrate its ability to recover underlying parameters and latent presence fields. Application to real-world passive acoustic monitoring data on the North Atlantic Right Whale along the U.S. East Coast demonstrates improved model adequacy and predictive performance in capturing the species' dynamic and spatially varying habitat. A key advantage of DeepZIB is its ability to generate high-resolution, spatially and temporally varying presence maps, providing valuable insights for targeted and risk-aware management of blue economy industries, ranging from offshore and marine energy, to fisheries management and maritime transport.

2606.14401 2026-06-15 eess.SY cs.SY 新提交

A Feedback Stability Theorem for Frequency-dependent Compensation of Excess and Lack of Passivity

基于频率依赖性补偿过剩与缺乏无源性的反馈稳定性定理

Pol Jane-Soneira, Gösta Stomberg, Ognjen Stanojev, Orcun Karaca, Lennart Harnefors

AI总结 针对线性时不变系统,提出基于频率依赖性无源性指数的反馈稳定性定理,允许两个系统均缺乏无源性时仍能保证稳定,并通过数值案例展示其优势。

详情
AI中文摘要

本文研究了基于频率依赖性无源性指数的线性时不变系统反馈互连的稳定性。利用这些频率依赖性无源性指数,我们证明即使两个系统在其标量无源性指数方面都缺乏无源性,它们的反馈互连也可以被证明是稳定的。本文的主要贡献是一个基于频率依赖性无源性指数的新的稳定性定理。此外,我们讨论了所提出的反馈稳定性定理与先前基于标量无源性指数的结果之间的联系。一个数值案例研究展示了频率依赖性无源性指数相对于标量指数在线性系统反馈互连中的优势。

英文摘要

This article studies the stability of feedback interconnections of linear time-invariant systems based on frequency-dependent passivity indices. Using these frequency-dependent passivity indices, we show that the feedback interconnection of two systems can be certified to be stable even if both systems have a lack of passivity in terms of their scalar passivity indices. The main contribution of this paper is a new stability theorem based on frequency-dependent passivity indices. Moreover, we discuss the connection of the proposed feedback stability theorem to prior results based on scalar passivity indices. A numerical case study showcases the advantages of frequency-dependent passivity indices over scalar indices for feedback interconnections of linear systems.

2606.14372 2026-06-15 eess.SP 新提交

$κ$: A Geometry-Quality Metric Complementary to GDoP for Closed-Form TDoA Multilateration

$\kappa$: 一种与GDoP互补的几何质量度量,用于闭式TDoA多点定位

Abeer Nasir Chaudhry, Salman Liaquat, Muhammad Mohsin Khadim

AI总结 提出几何质量度量$\kappa$,通过闭式TDoA解算器的二次项系数表征病态条件,与GDoP互补,并推导了N维推广,验证了噪声灵敏度公式。

详情
AI中文摘要

几何精度因子(GDoP)表征到达时间差(TDoA)定位系统的噪声灵敏度,但并未涵盖解析多点定位解可能变得病态的所有方式。我们引入了一种互补的几何质量度量$\kappa$,即闭式TDoA解算器二次项的首项系数,并通过向量化公式推导了其$N$维推广。两个闭式代数恒等式将$\kappa$与测量模型的雅可比行列式以及二次方程的判别式联系起来,表明系统恰好存在两个不同的奇异点轨迹:分支发散和GDoP标记的雅可比/分支合并轨迹。在标准高斯ToA模型下,与克拉美-罗界相关的噪声灵敏度$\sigma_\kappa$的闭式表达式经过蒙特卡洛验证,中位相对误差为2%。在无量纲几何参数空间上的经验图谱以机器精度确认了这两个恒等式,并表明$\kappa$不良区域和GDoP不良区域在目标空间中非平凡地不相交,从而确立了这两个度量真正互补。对一个四节点操作阵列的案例研究,通过自动相关监视广播(ADS-B)配对的空中捕获数据经验估计每个传感器的到达时间(ToA)噪声,显示理论预测阈值和蒙特卡洛测量的操作阈值在部署噪声水平下对每个子系统的排序一致。它们的比值在三个二维子系统中近似恒定,作为代数$\kappa$噪声基底与下游操作阈值之间的部署特定校准常数,类似于将GDoP与圆概率误差联系起来的标准关系。

英文摘要

The Geometric Dilution of Precision (GDoP) characterizes the noise sensitivity of a Time-Difference-of-Arrival (TDoA) localization system, but does not capture every way the analytical multilateration solution can become ill-conditioned. We introduce a complementary geometry-quality metric $κ$, the leading coefficient of the closed-form TDoA solver's quadratic, and derive its $N$-dimensional generalization through a vectorized formulation. Two closed-form algebraic identities relate $κ$ to the Jacobian determinant of the measurement model and to the quadratic's discriminant, establishing that the system exhibits exactly two distinct singularity loci: branch divergence and the Jacobian/branch-merge locus flagged by GDoP. A Cramér--Rao-bound-linked closed form for the noise sensitivity $σ_κ$ under the standard Gaussian ToA model is validated against Monte Carlo to 2% median relative error. An empirical atlas over a dimensionless geometry parameter space confirms both identities at machine precision and shows that $κ$-bad regions and GDoP-bad regions are non-trivially disjoint in target space, establishing the two metrics as genuinely complementary. A case study on a four-node operational array, with per-sensor time of arrival (ToA) noise estimated empirically from Automatic Dependent Surveillance Broadcast (ADS-B)-paired over-the-air captures, shows that the theory-predicted threshold and a Monte-Carlo-measured operational threshold agree on the per-subsystem ordering at the deployment noise level. Their ratio is approximately constant across the three two-dimensional subsystems, serving as a deployment-specific calibration constant between the algebraic $κ$-noise floor and the downstream operational threshold, analogous in spirit to the standard relation linking GDoP to the circular error probable.

2606.14355 2026-06-15 cs.CV eess.SP 新提交

Point Cloud Upsampling through Patch-based Frequency Superposition

基于补丁频率叠加的点云上采样

Marina Ritthaler, Azhar Hussian, Vasileios Belagiannis, André Kaup

发表机构 * Friedrich-Alexander-Universität Erlangen-Nürnberg(埃尔朗根-纽伦堡大学)

AI总结 提出一种基于补丁频率叠加的优化方法PUtPFS,通过选择点子集并叠加空间频率估计表面,在稀疏区域放置新点实现均匀上采样,无需训练数据,在点对面距离上超越现有方法。

详情
Journal ref
European Conference on Signal Processing (EUSIPCO) 2026
AI中文摘要

近年来,神经网络已成为大多数点云上采样方法中的主导模型。尽管这些方法取得了良好的效果,但它们存在一些缺点,例如缺乏可解释性和数据依赖性。此外,它们必须在与测试数据相似的数据集上进行训练才能表现良好。为了避免这些缺点,我们提出了基于补丁频率叠加的点云上采样(PUtPFS),这是一种基于优化的方法,通过选择点子集并通过叠加空间频率来估计该子集的表面。然后,在该表面上放置新点。通过连续选择点云中最稀疏区域中的点,可以实现均匀上采样。使用这种方法,我们在通常考虑的点对面距离上超越了当前最佳的上采样结果。此外,我们在基于优化的方法中实现了最佳的Chamfer距离和Hausdorff距离。作为额外优势,我们的方法不需要任何训练数据,并且具有数学可解释性。

英文摘要

In recent years, neural networks have become the dominant models in most point cloud upsampling methods. Although these approaches are achieving good results, they do have drawbacks, such as a lack of interpretability and data dependency. Moreover, they have to be trained on a dataset that is similar to the test data in order to perform well. To avoid these disadvantages, we propose Point Cloud Upsampling through Patch-based Frequency Superposition (PUtPFS), an optimization-based approach that selects subsets of points and estimates the surface of this set through superpositioning spatial frequencies. Then, new points are placed on this surface. By successively selecting points in the least dense regions of the point cloud, a uniform upsampling can be reached. With this method, we surpass the current best upsampling results in the commonly considered point-to-surface distance. Furthermore, we achieve the best Chamfer and Hausdorff distance among the optimization-based approaches. As an additional advantage, our method does not need any training data and is mathematically interpretable.

2606.14293 2026-06-15 eess.SP 新提交

On the Feasibility of Passive Bistatic ISAC Based on Unmodified LoRa

基于未修改LoRa的被动双基地ISAC可行性研究

Laurenz Taffner, Jonas Bönsch, Norman Franchi, Maximilian Lübke

AI总结 研究利用未修改LoRa信号作为机会信号,在被动双基地ISAC配置中实现雷达式感知,通过多普勒感知实现目标分离与超分辨估计,实验验证了可行性并揭示了实现挑战。

详情
AI中文摘要

集成感知与通信(ISAC)通过重用通信信号实现感知能力,使其通过机会信号特别适用于大规模部署。虽然大多数现有ISAC研究针对宽带系统,但诸如LoRa等低功耗广域网(LPWAN)技术从雷达式感知角度仍基本未被探索。现有基于LoRa的方法主要关注运动检测或需要对通信波形进行修改,限制了其在已部署网络中的适用性。本文研究了在纯被动双基地ISAC配置中,使用未修改的LoRa通信信号作为机会信号进行雷达式感知的可行性。所提方法专注于基于多普勒的感知,以实现目标分离和超分辨目标估计,而不干扰现有LoRa网络运行。将解析推导的感知能力与仿真结果进行比较,并通过使用两个USRP B210软件定义无线电的双基地测量进行验证,确认了在实际条件下基于多普勒的LoRa感知的可行性,并揭示了相关的实现挑战。结果表明,基于LoRa的ISAC通过利用现有基础设施实现了高度可扩展、大区域、低分辨率的感知,为区域受限的高分辨率6G ISAC系统提供了补充感知能力,并为未来多节点和数据融合扩展奠定了基础。

英文摘要

Integrated Sensing and Communication (ISAC) enables sensing capabilities by reusing communication signals, making it particularly attractive for large-scale deployments through signals of opportunity. While most existing ISAC research targets wideband systems, Low Power Wide Area Network (LPWAN) technologies such as LoRa remain largely unexplored from a radar-like sensing perspective. Existing LoRa-based approaches mainly focus on motion detection or require modifications of the communication waveform, limiting their applicability in deployed networks. This paper investigates the feasibility of radar-like sensing using unmodified LoRa communication signals as signals of opportunity in a purely passive bistatic ISAC configuration. The proposed approach focuses on Doppler-based sensing to enable target separation and super-resolved target estimation without interfering with existing LoRa network operation. The analytically derived sensing capabilities are compared against simulation results and validated through bistatic measurements using two USRP B210 software-defined radios, confirming the feasibility of Doppler-based LoRa sensing under practical conditions and revealing relevant implementation challenges. The results demonstrate that LoRa-based ISAC enables highly scalable, large-area, low-resolution sensing by leveraging existing infrastructure, providing a complementary sensing capability to area-limited high-resolution 6G ISAC systems, and a foundation for future multi-node and data fusion extensions.

2606.14291 2026-06-15 eess.SY cs.SY 新提交

Intelligent Domain Adaptation for Power System Transient Stability Assessment Under Varying Operating Scenarios

面向不同运行场景的电力系统暂态稳定评估智能域自适应方法

Yuan Yang, Lipeng Zhu, Chao Deng, Jiayong Li, Quan Zhou, Cong Zhang

AI总结 提出基于域自适应深度迁移学习的暂态稳定评估框架,通过异质混合分布度量对齐源域与目标域分布,结合贝叶斯双分布自适应和多层稀疏正则化,在降低模型更新成本的同时将在线评估准确率提升0.5%-5%。

详情
AI中文摘要

尽管基于深度学习的暂态稳定评估(TSA)方法在电力系统稳定性监测中展现出巨大潜力,但在运行条件频繁变化的实际场景中,其性能容易退化。为解决这一问题,本文通过域自适应深度迁移学习开发了一种自适应TSA框架。首先,为捕捉主要暂态稳定特征,通过数学手段设计了一种鲁棒度量——异质混合分布度量(HHDM),以有效处理暂态响应数据的多尺度高斯分布和长尾分布,并精确量化不同运行场景下源域与目标域之间的固有分布差异。借助HHDM,构建了一种基于贝叶斯理论的双分布域自适应方法,不仅对齐域间的边缘概率分布,还对齐子域类别的分布。这种对齐实现了细粒度的暂态稳定特征迁移,有助于显著提高训练好的TSA模型对目标域的适应性。此外,引入多层稀疏正则化算法以减轻运行场景变化引起的特征波动,从而增强模型在未知场景下的泛化能力。在三个测试系统上的数值实验表明,与传统方法相比,所提框架以经济高效的方式将在线TSA准确率提升了0.5%至5%,并大幅降低了TSA模型更新的学习成本。

英文摘要

While deep learning-based transient stability assessment (TSA) approaches have exhibited great potential in power system stability monitoring, they are prone to undergo performance degradation in practical contexts with frequent variations of operating conditions. To address this issue, this work develops an adaptive TSA framework via domain adaptation-enabled deep transfer learning. First, for the sake of capturing the primary transient stability characteristics, a robust metric, i.e., heterogeneous hybrid distribution metric (HHDM), is designed through mathematical means to effectively handle multi-scale Gaussian and long-tail distributions of transient responsive data and to precisely quantify the intrinsic distributional discrepancies between the source and target domains corresponding to different operating scenarios. With the help of the HHDM, a Bayesian theory-based dual-distribution domain adaptation method is constructed, aligning not only marginal probability distributions between domains but also the distributions of sub-domain categories. Such alignments enable fine-grained transient stability feature transfer, helping significantly improve the adaptability of a well-trained TSA model to target domains. Furthermore, a multilayer sparse regularization algorithm is introduced to mitigate feature volatility caused by variations in operating scenarios, thereby enhancing the model's generalization in the presence of unforeseen scenarios. Numerical tests on three test systems illustrate that, compared with conventional methods, the proposed framework improves online TSA accuracy by 0.5% to 5% in a cost-effective manner, with the learning cost for TSA model update largely reduced.

2606.14286 2026-06-15 eess.SY cs.SY 新提交

Topology Optimization for DC Circuit Breaker Placement in HVDC Switching Stations

高压直流换流站中直流断路器布置的拓扑优化

Merijn Van Deyck, Tom Van Acker, Geraint Chaffey, Dirk Van Hertem

AI总结 提出一种混合整数线性优化方法,用于高压直流换流站中直流断路器的最优配置,以最小化直流故障风险,并基于故障率和成本计算最优断路器数量及边际效益。

Comments 19 pages, 8 figures, Submitted to IEEE Transactions on Power Delivery

详情
AI中文摘要

未来多端高压直流电网将需要HVDC保护,以防止直流故障引起的大规模停电。因此,系统级保护设计对于连接多个换流站和线路的HVDC换流站的发展至关重要。本文提出了一种优化方法,用于设计HVDC换流站和电能枢纽中的直流断路器配置。该方法基于当前使用预定义保护策略选择配置的实践。与现有方法相比,所提出的方法在直流换流站设计上提供了显著更大的灵活性,并允许考虑大量相关运行条件,从而产生更有效、最优的设计结果。建立了一个混合整数线性优化问题来设计直流保护并最小化高影响直流故障的风险。一个示例案例研究表明,该优化方法允许根据直流电网组件的故障率和相对于故障影响的DCCB成本,计算给定直流换流站的最优DCCB数量。利用这些结果,可以计算每个额外DCCB在直流换流站中的边际效益对风险降低的贡献。此外,优化问题的结果为所需数量的DCCB提供了最优断路器配置,因此可用作直流换流站的拓扑设计工具。

英文摘要

HVDC protection will be required in future multiterminal HVDC grids to prevent large outages caused by DC faults. Therefore, system-level protection design is essential for the development of HVDC switching stations that connect several converter stations and lines within these grids. This paper presents an optimization method for the design of HVDC circuit breaker (DCCB) configurations in HVDC switching stations and electrical energy hubs. This approach builds on the current practice of using selected configurations based on pre-defined protection strategies. In contrast to these existing methods, the DC switching station design in the proposed method offers significantly more flexibility and allows the consideration of large numbers of relevant operating conditions, leading to more effective, optimal design outcomes. A mixed-integer linear optimization problem is formulated to design the DC protection and minimize the risk of high impact DC faults. An example case study demonstrates that the optimization method allows the calculation of the optimal number of DCCBs for a given DC switching station, based on the failure rates of DC grid components and the DCCB cost relative to the fault impact. With these results, the marginal benefit to risk reduction of each additional DCCB included in a DC switching station is calculated. Moreover, the result of the optimization problem provides the optimal breaker configuration for the required number of DCCBs and can consequently be used as a topological design tool for DC switching stations.

2606.14248 2026-06-15 eess.IV cs.CV 新提交

Spectrum Aware Illumination Estimation Using Multispectral Image

利用多光谱图像的光谱感知光照估计

Hyejin Oh, Woo-Shik Kim, Sangyoon Lee, YungKyung Park, Je-Won Kang

发表机构 * Department of Electronic and Electrical Engineering, Ewha W. University(成均馆大学电子与电气工程系) Telechips Samsung Advanced Institute of Technology(三星先进技术研究所) Department of Design, Ewha W. University(成均馆大学设计系)

AI总结 提出一种结合光谱注意力机制和光照先验的深度学习框架,通过时空光谱特征提取块和跨传感器域变换,实现高精度光照谱估计,并在真实多光谱数据集上验证了优越性。

Comments Accepted for publication in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT). DOI: 10.1109/TCSVT.2026.3701975

详情
AI中文摘要

多光谱成像通过捕获更多光谱波段扩展了传统的RGB成像,从而改进了光照谱估计。然而,现有方法往往未能充分利用光谱信息,导致在不同光照条件和不同传感器域下性能欠佳。因此,我们提出了一种具有时空光谱特征提取块的深度学习框架,该框架结合了光谱注意力机制以增强光谱相关性并保留与光照相关的空间特征。通过引入光照先验,我们的方法优先考虑在多光谱图像中提供更有意义信息的特定通道。我们还提出了跨不同多光谱传感器空间的光谱域变换。结果表明,在高维传感器空间中学习到的光照谱可以有效地变换到各种低维相机传感器空间,而无需任何额外训练。为了便于评估,我们引入了一个真实世界的多光谱数据集,其中包含在不同光照条件下捕获的高维真实光照谱。通过大量实验,我们证明了我们的方法相比现有模型实现了更高的准确性,从而为现实世界的光照谱估计提供了实用解决方案。代码和数据集可在以下网址获取:此 https URL。

英文摘要

Multispectral (MS) imaging extends beyond conventional RGB imaging by capturing more spectral bands, thereby improving illuminant spectrum estimation (ISE). However, existing methods often fail to fully exploit spectral information, resulting in suboptimal performance under diverse lighting conditions and across different sensor domains. Hence, we propose a deep learning framework with a spatio-spectral feature extraction block, which incorporates spectral attention mechanisms to enhance spectral correlation and preserve illuminant-relevant spatial features. Through the inclusion of an illuminant prior (IP), our approach prioritizes specific channels that provide more meaningful information in an MS image. We also propose a spectral-domain transform across different MS sensor spaces. The results demonstrate that illuminant spectra learned in high-dimensional sensor spaces can be effectively transformed to various lower-dimensional camera sensor spaces without any additional training. To facilitate evaluation, we introduce a real-world MS dataset containing high-dimensional ground-truth illumination spectra captured under diverse lighting conditions. Through extensive experiments, we demonstrate that our method achieves superior accuracy compared to existing models, thus providing a practical solution for real-world ISE. The code and dataset are available at https://github.com/hyejin5/Spectrum-Aware-Illumination-Estimation-Using-Multispectral-Image.

2606.14223 2026-06-15 eess.SP 新提交

Event-Level Sensing for Intelligent 6G ISAC

面向智能6G ISAC的事件级感知

Haotian Liu, Zhiqing Wei, Xingwang Li, Ruizhong Xu, Zhiyong Feng

AI总结 本文提出事件级感知概念,通过连续时间状态建模实现目标意图和行为语义的持续识别与预测,推动6G ISAC从物理参数估计向深层环境理解演进。

Comments 10 pages, and 5 figures

详情
AI中文摘要

任务关键型网络(如车联网和低空经济)的智能化演进要求第六代(6G)网络超越离散物理参数估计,迈向更深层的环境理解。然而,现有集成感知与通信(ISAC)研究主要关注目标级感知,提供物理世界的碎片化快照,缺乏解释意图的行为语义能力。这一局限性阻碍了此类网络的智能化演进,并阻止6G获得必要的感知基础以演变为“智能服务引擎”。为弥补这一差距,ISAC必须向事件级感知推进,该感知通过建模连续时间状态实现对目标意图和行为语义的持续识别与预测。本文全面概述了6G ISAC网络中的事件级感知。我们首先介绍其基本概念、感知类型和代表性场景。然后回顾了波形设计、目标状态估计与跟踪以及事件识别等关键使能技术。此外,聚焦车联网和低空经济场景,讨论了ISAC事件级感知的代表性应用以及事件级信息对下游操作功能的智能增强。最后,我们强调了进一步推动ISAC事件级感知向智能和主动6G网络发展的未来研究趋势和潜在方向。

英文摘要

The intelligent evolution of mission-critical networks, such as the Internet of vehicles (IoV) and the low-altitude economy (LAE), requires sixth-generation (6G) networks to move beyond discrete physical parameter estimation toward deeper environmental understanding. However, existing integrated sensing and communications (ISAC) studies mainly focus on target-level sensing, which provides fragmented snapshots of the physical world and lacks the behavioral semantic capability to interpret intent. This limitation hinders the intelligent evolution of such networks and prevents 6G from acquiring the essential sensing foundation to evolve into an "intelligent service engine". To bridge this gap, ISAC must advance toward event-level sensing, which models continuous-time states to enable persistent recognition and prediction of target intent and behavioral semantics. This article presents a comprehensive overview of event-level sensing in 6G ISAC networks. We first introduce its fundamental concepts, sensing types, and representative scenarios. We then review key enabling techniques across waveform design, target state estimation and tracking, and event recognition. Furthermore, focusing on IoV and LAE scenarios, we discuss representative applications of ISAC event-level sensing and the intelligent enhancement of downstream operational functions enabled by event-level information. Finally, we highlight future research trends and potential directions to further advance ISAC event-level sensing toward intelligent and proactive 6G networks.

2606.14216 2026-06-15 cs.RO cs.SY eess.SY 新提交

Short-Horizon Position Accuracy of Single-Track Models: Implications for Motion Planning of Autonomous Vehicles

单轨模型的短时位置精度:对自动驾驶车辆运动规划的启示

Aron J. Aertssen, Lars A. T. H. van Alen, Igo J. M. Besselink, Rudolf G. M. Huisman, René M. J. G. van de Molengraft

发表机构 * Department of Mechanical Engineering, Eindhoven University of Technology(埃因霍温理工大学机械工程系) Safety & Driver Controls Group, Vehicle Development, DAF Trucks N.V.(DAF卡车公司车辆开发部安全与驾驶员控制组)

AI总结 本文通过实车实验对比三种单轨车辆模型的短时位置精度,分析模型复杂度、参数化质量与位置精度的权衡,为模型预测控制中的模型选择提供依据。

Comments Submitted to The International Journal of Automotive Engineering, Official Journal of the Society of Automotive Engineers of Japan, Inc. (JSAE)

详情
AI中文摘要

准确且计算高效的车辆模型对于自动驾驶车辆的运动规划至关重要,其中位置精度直接影响轨迹可行性和安全性。然而,位置精度尚未针对实际测量进行系统评估。因此,本文通过多种驾驶操作下的车辆测量,比较了三种单轨车辆模型的短时位置精度。模型参数通过使用仪器化测试车辆的专用实验进行识别。本文旨在提供对模型复杂度、参数化质量和位置精度之间权衡的洞察,以便在模型预测控制应用中做出明智的模型选择,而非确定单一最佳模型。

英文摘要

Accurate and computationally efficient vehicle models are essential for motion planning of autonomous vehicles, where positional accuracy directly affects trajectory feasibility and safety. However, the positional accuracy has not been systematically evaluated against real measurements. Therefore, this paper compares the short-horizon positional accuracy of three single-track vehicle models against vehicle measurements across various driving maneuvers. Model parameters are identified through dedicated experiments with the instrumented test vehicle. Rather than identifying a single best model, this work aims to provide insight into the trade-offs between model complexity, parameterization quality, and positional accuracy for informed model selection in Model Predictive Control applications.