arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.05160 2026-06-04 cs.RO 版本更新

GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors

GRAIL: 从3D资产和视频先验生成人形机器人全身操作

Tianyi Xie, Haotian Zhang, Jinhyung Park, Zi Wang, Bowen Wen, Jiefeng Li, Xueting Li, Qingwei Ben, Haoyang Weng, Yufei Ye, David Minor, Tingwu Wang, Chenfanfu Jiang, Sanja Fidler, Jan Kautz, Linxi Fan, Yuke Zhu, Zhengyi Luo, Umar Iqbal, Ye Yuan

发表机构 * NVIDIA ； UCLA（加州大学洛杉矶分校）

AI总结提出GRAIL全虚拟生成管线，利用3D资产和视频基础模型先验合成人机交互演示，无需物理搭建或遥操作，实现人形机器人全身操作策略的模拟到现实迁移。

Comments Project page: https://research.nvidia.com/labs/dair/grail/

详情

AI中文摘要

扩展人形机器人全身操作需要跨多样物体、全身运动和场景几何的机器人兼容演示，但遥操作和动作捕捉难以规模化，因为每次采集都依赖于物理设置、仪器化演员和机器人操作。我们提出GRAIL，一个在部署前完全保持虚拟的数字生成管线：它组合3D资产、模拟器就绪场景和视频基础模型（VFM）的先验，以合成交互，无需重建物理环境或遥操作机器人。GRAIL并非重建无约束的野外视频，而是从完全指定的3D配置开始，其中物体几何、相机参数、度量尺度、环境深度和机器人比例的角色在视频生成前已知，并在重建过程中重复使用。这种特权设置更好地约束了4D恢复，允许基于模型的物体跟踪、人体运动估计和交互感知优化，以重建度量的4D人-物交互（HOI）轨迹，减少了深度模糊和形态不匹配。我们将恢复的运动重定向到人形机器人，并训练互补的任务通用跟踪器：用于操作的对象感知潜在适配器和用于地形穿越的场景感知跟踪器。GRAIL生成了超过20,000个序列，涵盖拾取、物体操作、坐姿和地形穿越。仅使用GRAIL生成的数据，我们通过模拟到现实管线训练自我中心视觉策略，并将其部署在Unitree G1人形机器人上，在多样物体拾取上实现了84%的真实世界成功率，在爬楼梯上实现了90%的成功率。

英文摘要

Scaling humanoid loco-manipulation requires robot-compatible demonstrations across diverse objects, whole-body motions, and scene geometries, but teleoperation and motion capture are difficult to scale because each collection depends on physical setups, instrumented actors, and robot operation. We present GRAIL, a digital generation pipeline that remains fully virtual until deployment: it composes 3D assets, simulator-ready scenes, and priors from video foundation models (VFMs) to synthesize interactions without rebuilding physical environments or teleoperating the robot. Rather than reconstructing unconstrained in-the-wild videos, GRAIL starts from fully specified 3D configurations in which object geometry, camera parameters, metric scale, environment depth, and a robot-proportioned character are known before video generation and reused during reconstruction. This privileged setup better conditions 4D recovery, allowing model-based object tracking, human motion estimation, and interaction-aware optimization to reconstruct metric 4D human-object interaction (HOI) trajectories with reduced depth ambiguity and morphology mismatch. We retarget the recovered motions to a humanoid robot and train complementary task-general trackers: an object-aware latent adaptor for manipulation and a scene-aware tracker for terrain traversal. GRAIL produces over 20,000 sequences spanning pick-up, object manipulation, sitting, and terrain traversal. Using only GRAIL-generated data, we train egocentric visual policies through a sim-to-real pipeline and deploy them on a Unitree G1 humanoid, achieving 84\% real-world success on diverse object pick-up and 90\% success on stair-climbing.

URL PDF HTML ☆

赞 0 踩 0

2606.05159 2026-06-04 cs.RO 版本更新

X4Val: Learning Neural Surrogates for Variance-Reduced Policy Evaluation

X4Val: 学习方差缩减策略评估的神经代理模型

Rachel Luo, Michael Watson, Apoorva Sharma, Heng Yang, Han Qi, Edward Schmerling, Sushant Veer, Boris Ivanovic, Marco Pavone

发表机构 * NVIDIA Research（NVIDIA研究院）； Harvard University（哈佛大学）； Stanford University（斯坦福大学）

AI总结提出X4Val框架，通过嵌入多域数据并学习可迁移预测器，结合控制变量估计器实现无配对样本下的方差缩减，在自动驾驶和机器人操作任务中方差降低达38.4%。

详情

AI中文摘要

对基于学习的机器人系统进行严格评估是部署的必要前提。然而，真实世界的测试数据收集成本高昂；此外，在典型的迭代开发环境中，从最新策略收集的数据规模必然有限。这促使我们利用异构数据源（包括仿真、历史策略日志以及从相关平台或环境收集的数据）的评估方法。虽然此类辅助数据丰富且廉价，但它们通常不能直接代表真实世界的结果——例如，仿真中的性能可能与真实世界中的性能存在显著差异——这使得它们在高置信度性能估计中的原则性使用具有挑战性。在本文中，我们介绍了X4Val，一个在存在非配对、多域数据的情况下进行方差缩减的真实世界指标估计的通用框架。X4Val将来自真实域和辅助域的样本嵌入到一个共享表示空间中，并学习一个可迁移的真实世界指标预测器；然后将这个学习到的预测器纳入控制变量估计器，即使在无配对样本的情况下也能实现方差缩减。我们提供了理论分析，并在自动驾驶和真实世界机器人操作任务上进行了实证评估，在这些领域中，X4Val实现了高达38.4%的方差缩减，并表现出相对于强基线的持续改进。这些结果表明，非配对的异构数据可以被利用来显著提高严格机器人系统验证的样本效率。

英文摘要

Rigorous evaluation of learning-based robotic systems is an essential prerequisite for deployment. However, real-world test data is expensive to gather; moreover, in a typical iterative development context, data gathered from the latest policy is necessarily limited in scale. This motivates evaluation methodologies that make use of heterogeneous data sources, including simulation, historical policy logs, and data collected from related platforms or environments. While such auxiliary data are abundant and inexpensive, they are generally not directly representative of real-world outcomes -- for example, performance in simulation may differ substantially from performance in the real world -- making their principled use for high-confidence performance estimation challenging. In this paper, we introduce X4Val, a general framework for variance-reduced real-world metric estimation in the presence of non-paired, multi-domain data. X4Val embeds samples from real and auxiliary domains into a shared representation space and learns a transferable predictor of real-world metrics; this learned predictor is then incorporated into a control-variates estimator, enabling variance reduction even when paired samples are unavailable. We provide theoretical analysis and empirical evaluations on autonomous driving and real-world robot manipulation tasks, domains across which X4Val achieves up to 38.4% variance reduction and demonstrates consistent improvements over strong baselines. These results show that non-paired, heterogeneous data can be leveraged to substantially improve the sample efficiency of rigorous robotic system validation.

URL PDF HTML ☆

赞 0 踩 0

2606.05143 2026-06-04 cs.RO 版本更新

HORIZON: Recoverability-Governed Curriculum for Physical-Domain Scaling

HORIZON: 基于可恢复性的物理域缩放课程

Chenhao Bai, Liqin Lu, Kaijun Wang, Hui Chen, Jin-Chuan Shi, Yuyang Liu, Hao Chen, Chunhua Shen

发表机构 * Zhejiang University, State Key Lab of CAD & CG（浙江大学，计算机辅助设计与图形学国家重点实验室）； Zhejiang University of Technology（浙江工业大学）

AI总结针对机器人策略在物理域缩放中的可学习性问题，提出基于可恢复性的前沿课程HORIZON，通过回滚和边界细化逐步扩展物理域，实验揭示了物理域扩展的三个规律。

Comments 16 pages, 9 figures

详情

AI中文摘要

扩展鲁棒的机器人策略需要的不仅仅是更广泛的随机化，因为物理域经验必须在整个训练过程中保持有序和可学习。我们研究了策略何时能从更难的物理中受益，并确定可恢复性是在策略物理域缩放中的核心约束。在在策略训练中，新的动态仅当它们足够接近当前策略以生成纠正性的在策略数据时才有用，而不是将轨迹崩溃为不可恢复的失败。使用四足运动作为具身泛化的物理要求高的基准，我们引入了HORIZON，一种检查点前沿课程，仅在当前策略的可恢复边界内扩展物理域。HORIZON使用回滚和边界细化来管理每个扩展步骤，将固定随机化转变为物理域增长的持续过程。实验揭示了物理域扩展的三个规律。首先，直接域扩展在物理轴上是非均匀的，并且通常在没有阶段排序的情况下不可学习。其次，域组合是非单调的，在紧凑核心之外添加更多域可能会稀释可恢复的联合样本并降低整体鲁棒性。第三，孤立专家的离线蒸馏不能替代在策略课程生成的联合交互。这些结果共同将物理域泛化框架为具身控制的持续增长问题，以可恢复性作为在策略扩展的组织原则。

英文摘要

Scaling robust robot policies requires more than broader randomization, because physical-domain experience must remain organized and learnable throughout training. We study when a policy can benefit from harder physics and identify recoverability as a central constraint in on-policy physical-domain scaling. In on-policy training, new dynamics are useful only insofar as they remain close enough to the current policy to generate corrective on-policy data, rather than collapsing rollouts into unrecoverable failures. Using quadruped locomotion as a physically demanding benchmark for embodied generalization, we introduce HORIZON, a checkpointed frontier curriculum that expands physical domains only within the current policy's recoverable boundary. HORIZON uses rollback and boundary refinement to govern each expansion step, turning fixed randomization into a continual process of physical-domain growth. Experiments reveal three regularities of physical-domain expansion. First, direct domain widening is uneven across physical axes and often unlearnable without staged ordering. Second, domain composition is non-monotonic, and adding more domains beyond a compact core can dilute recoverable joint samples and reduce overall robustness. Third, offline distillation of isolated experts cannot substitute for the joint interaction generated by on-policy curriculum. Together, these results frame physical-domain generalization as a continual growth problem for embodied control, with recoverability as the organizing principle for on-policy expansion.

URL PDF HTML ☆

赞 0 踩 0

2606.05015 2026-06-04 cs.RO 版本更新

Generalization of World Models under Environmental Variability for Vision-based Quadrotor Navigation

环境变异性下基于视觉的四旋翼导航的世界模型泛化

Luca Zanatta, Grzegorz Malczyk, Kostas Alexis

发表机构 * Norwegian University of Science and Technology（挪威科学与技术大学）

AI总结通过基于视觉的四旋翼导航测试，研究世界模型在不同环境随机性下的鲁棒性，发现自监督预训练阶段的泛化能力是模拟到现实迁移的强预测因子，并识别出离散潜在大小和训练序列长度是关键因素。

详情

AI中文摘要

世界模型，即学习预测环境演化的生成模型，已成为样本高效机器人学习的有前景工具。然而，它们对环境变异性的鲁棒性仍知之甚少。为解决这一问题，我们以基于视觉的四旋翼导航为测试平台进行系统研究，在不同环境随机性水平下训练基于DreamerV3的世界模型，并通过跨环境验证（涵盖自监督学习预训练和强化学习微调）在所有水平上评估它们。然后，我们将所有世界模型及相关导航策略部署到真实四旋翼上，在未见环境中进行测试，包括一次开环运行，其中模型仅接收2.5秒的真实感官输入，之后所有传感器被切断，系统完全依靠想象导航穿越12米距离。结果表明，自监督预训练阶段的世界模型鲁棒性是模拟到现实迁移的强预测因子：在跨环境自监督验证中泛化良好的每个模型都成功部署到真实世界，通过窄至0.67米的间隙，而在模拟策略评估中占主导地位的模型却在真实平台上失败。我们进一步识别出(a)离散潜在大小和(b)训练序列长度是控制世界模型质量的主要因素。

英文摘要

World models, learned generative models that predict how an environment evolves, have become a promising tool for sample-efficient robot learning. Yet how robust they are to environmental variability remains poorly understood. To address this, we conduct a systematic study using vision-based quadrotor navigation as a testbed problem, training DreamerV3-based world models under varying levels of environmental randomness and evaluating them across all levels through cross-environment validation, spanning both Self-Supervised Learning (SSL) pretraining and Reinforcement Learning (RL) fine-tuning. We then deploy all world models and associated navigation policies on a real quadrotor in unseen environments, including an open-loop run where the model receives just 2.5s of real sensory input before all sensors are cut off, leaving the system to navigate entirely in imagination over a 12m traverse. Our results show that world model robustness during SSL pretraining is a strong predictor of sim-to-real transfer: every model that generalized well in cross-environment SSL validation deployed successfully in the real world, passing through gaps as narrow as 0.67m, whereas the model that dominated simulation policy evaluation failed on the real platform. We further identify (a) the discrete latent size and (b) the training-sequence length as the dominant factors governing world model quality.

URL PDF HTML ☆

赞 0 踩 0

2606.05011 2026-06-04 cs.CV cs.RO 版本更新

CIPER: A Unified Framework for Cross-view Image-retrieval and Pose-estimation

CIPER: 跨视图图像检索与姿态估计的统一框架

Yurim Jeon, Dongseong Seo, Seung-Woo Seo

发表机构 * Seoul National University（首尔国立大学）

AI总结提出CIPER框架，通过共享Transformer编码器和任务特定令牌联合进行城市级跨视图检索与精确3自由度姿态估计，实现互惠特征学习。

Comments 16 pages, 5 figures

详情

AI中文摘要

跨视图地理定位通过将地面图像与航拍图像数据库匹配来估计其地理位置。现有方法要么通过大规模检索，要么通过精确姿态估计来处理，但无法兼顾：基于检索的方法能够进行广域搜索，但牺牲了定位精度；而姿态估计方法仅在狭窄的搜索空间内实现高精度。简单级联这些流程会导致误差传播和特征表示不一致。我们将跨视图地理定位形式化为一个统一问题，要求同时进行城市级检索和精确的3自由度姿态估计。我们提出CIPER（跨视图图像检索与姿态估计变换器），这是一种单一架构，通过互惠特征学习联合执行两项任务。CIPER使用共享的Transformer编码器和任务特定令牌，将全局检索特征与空间定位线索分离。为了弥合地面和航拍视图之间的大领域差距，我们引入了一个双向Transformer姿态解码器，该解码器使用地面特征作为空间查询进行双向交叉注意力。一种集合预测策略进一步在统一的多任务目标下实现稳定的3自由度回归。在VIGOR、KITTI和Ford Multi-AV上的实验表明，特别是在有限的视野和任意方向条件下，性能具有竞争力。代码可在https://github.com/yurimjeon1892/CIPER获取。

英文摘要

Cross-view geo-localization estimates the geographic location of a ground image by matching it against an aerial image database. Existing methods tackle this through either large-scale retrieval or precise pose estimation, but not both: retrieval-based methods enable wide-area search at the cost of localization accuracy, while pose estimation methods achieve high precision within only a narrow search space. Naively cascading these pipelines introduces error propagation and inconsistent feature representations. We formulate cross-view geo-localization as a unified problem requiring simultaneous city-scale retrieval and precise 3-DoF pose estimation. We propose CIPER (Cross-view Image-retrieval and Pose-estimation transformER), a single architecture that jointly performs both tasks through mutually beneficial feature learning. CIPER uses a shared transformer encoder with task-specific tokens to disentangle global retrieval features from spatial localization cues. To bridge the large domain gap between ground and aerial views, we introduce a two-way transformer pose decoder that uses ground features as spatial queries for bidirectional cross-attention. A set prediction strategy further enables stable 3-DoF regression under a unified multi-task objective. Experiments on VIGOR, KITTI, and Ford Multi-AV demonstrate competitive performance, especially under limited field-of-view and arbitrary orientation conditions. Code is available at https://github.com/yurimjeon1892/CIPER.

URL PDF HTML ☆

赞 0 踩 0

2606.04989 2026-06-04 cs.HC cs.RO 版本更新

What Can Eye Gaze Teach Us About Real-World Cycling? Insights From the Oxford RobotCycle Project

眼动能教会我们关于真实世界骑行的什么？来自牛津RobotCycle项目的见解

Benjamin Hardin, Efimia Panagiotaki, Daniele De Martini, Lars Kunze

发表机构 * University of Oxford（牛津大学）； University of the West of England（西英格兰大学）

AI总结本研究利用可穿戴眼动追踪眼镜，通过分析不同环境（如自行车道、汽车道和共享公交车道）和事件（如超车和行人）下的眼动模式，揭示了骑行中感知危险的潜意识差异，并评估了眼动追踪在估计骑行压力和认知负荷方面的潜力。

详情

AI中文摘要

尽管对骑行情境的身体危险已有较多了解，但对骑行的感知危险知之甚少。此外，危险感知可能在潜意识层面被过滤，因此难以自我报告。为此，这些潜意识感知可以通过眼动等生理指标揭示。本文探讨了英国牛津骑行的感知安全性，并研究了可穿戴眼动追踪眼镜在不同环境和事件下产生关于感知差异见解的能力。本文发现，在自行车道、汽车道和共享公交车道之间，眼动模式发生变化，代表了每种车道类型的不同认知挑战。本文表明，不同交叉路口的眼动模式显著不同，这可能对骑行者的压力有影响。最后，与无事件骑行相比，在超车和道路行人等事件发生时，眼动模式存在差异。本文总结了使用可穿戴眼动追踪器估计压力和骑行者工作量的优点和局限性。

英文摘要

Although much is known about the physical danger of cycling situations, less is understood about the perceived danger of cycling. Furthermore, perception of danger may be filtered at a subconscious level and therefore difficult for one to self-report. To this end, these subconscious perceptions can be revealed through physiological metrics such as eye gaze. This paper explores the perceived safety of cycling in Oxford, United Kingdom and explores the ability of wearable eye tracking glasses to produce insights about the differences in perception under different environments and events. This paper finds that eye gaze patterns change between using bike lanes, car lanes and shared bus lanes, representing different cognitive challenges of each lane type. This paper presents that different intersections have significantly different eye gaze patterns which may have implications for cyclist stress. Finally, eye gaze patterns differ in the presence of events such as passes and pedestrians in the road compared to when cycling with no events. This paper draws conclusions on the benefits and limitations of using wearable eye trackers to estimate stress and cyclist workload.

URL PDF HTML ☆

赞 0 踩 0

2606.04968 2026-06-04 cs.RO 版本更新

M3imic: 学习用于多模态运动模仿的通用全身控制器

Zuxing Lu, Ziang Zheng, Yao Lyu, Jingyu Liu, Feihong Zhang, Song Lu, Xin Yuan, Changyin Sun, Xingxing Zuo, Shengbo Eben Li

发表机构 * School of Automation, Southeast University（东南大学自动化学院）； School of Vehicle and Mobility, Tsinghua University（清华大学车辆与移动性学院）； Department of Robotics, Mohamed Bin Zayed University of Artificial Intelligence（马尔代夫比兹亚德大学人工智能学院机器人系）

AI总结提出M3imic框架，通过模态特定编码器将异构运动参考模态（机器人关节角度、人体姿态轨迹、末端执行器位姿）映射到共享潜在空间，并利用大规模强化学习训练单一策略，实现无需模态特定重训练的sim-to-real迁移。

详情

AI中文摘要

构建通用全身控制器对于使人形机器人在广泛的下游任务（包括 locomotion 和 loco-manipulation）中具备多样化的运动能力至关重要。不同任务依赖于不同的运动参考模态：locomotion 主要依赖于协调的机器人关节轨迹，而 manipulation 则需要精确的末端执行器轨迹跟踪。现有方法常常忽视密集的机器人关节角度与稀疏的末端执行器位姿之间的表示不匹配问题。为解决这一问题，我们提出了 Multi-Modal Mimic (M3imic)，一个通用的多模态全身控制框架，它使用模态特定编码器将异构运动参考模态（包括机器人关节角度、人体姿态轨迹和末端执行器位姿）映射到共享潜在空间，从而统一这些模态。利用模拟器中的大规模强化学习，我们训练了一个单一策略，该策略能够在无需模态特定重训练的情况下实现跨多种运动参考模态的 sim-to-real 迁移。在 Unitree G1 机器人上进行了广泛的仿真和真实世界实验以评估所提出的框架。在仿真中，该策略在未见过的测试数据集上达到了 98.42% 的峰值成功率，展示了其卓越的泛化能力。代码可在 https://github.com/Renforce-Dynamics/MultiModalWBC 获取。

英文摘要

Building a general-purpose whole-body controller is essential for enabling diverse motion capabilities in humanoid robots across a wide range of downstream tasks, including locomotion and loco-manipulation. Different tasks rely on distinct motion reference modalities: locomotion primarily depends on coordinated robot joint trajectories, whereas manipulation requires precise end-effector trajectory tracking. Existing methods often overlook the representational mismatch between dense robot joint angles and sparse end-effector poses. To address this, we propose Multi-Modal Mimic (M3imic), a versatile multi-modal whole-body control framework that unifies heterogeneous motion reference modalities, including robot joint angles, human pose trajectories, and end-effector poses, using modality-specific encoders to map them into a shared latent space. Leveraging large-scale reinforcement learning in the simulator, we train a single policy that achieves sim-to-real transfer across multiple motion reference modalities without modality-specific retraining. Extensive simulation and real-world experiments on the Unitree G1 robot are conducted to evaluate the proposed framework. In simulation, the policy achieves a peak success rate of 98.42\% on an unseen test dataset, demonstrating its exceptional generalization capability. The code is available at https://github.com/Renforce-Dynamics/MultiModalWBC

URL PDF HTML ☆

赞 0 踩 0

2606.04825 2026-06-04 cs.RO 版本更新

HapTile: A Haptic-Informed Vision-Tactile-Language-Action Dataset for Contact-Rich Imitation Learning

HapTile: 用于接触丰富模仿学习的触觉感知视觉-触觉-语言-动作数据集

Amirhosein Alian, Yongqiang Zhao, Shiyi Gu, Xuyang Zhang, Zhuo Chen, Christopher E. Mower, Haitham Bou-Ammar, Shan Luo

发表机构 * King’s College London, UK（伦敦国王学院）； Huawei, Noah’s Ark Lab, UK（华为、诺亚实验室）； University College London, UK（伦敦大学学院）

AI总结提出HapTile数据集，通过集成指尖触觉反馈和操作员触觉感知，为接触丰富的机器人操作任务提供视觉-触觉-语言-动作联合数据，并验证其在策略学习中的有效性。

详情

AI中文摘要

尽管触觉感知对于可靠操作至关重要，但大多数现有的视觉-语言-动作（VLA）数据集仍然仅基于视觉，而那些确实包含触觉信息的数据集通常缺乏任务多样性、语言条件和动作轨迹的联合组合。此外，现有的遥操作流程很少为操作员提供触觉反馈，尽管触觉反馈在演示质量和操作稳定性中具有公认的作用。在这项工作中，我们提出了HapTile，一个接触基础的视觉触觉操作数据集，它通过嵌入两个层次的物理交互感知超越了仅视觉轨迹数据集：机器人末端执行器上的指尖触觉反馈，以及遥操作侧的触觉感知演示。数据收集平台将触觉反馈直接集成到遥操作控制器中，使操作员能够实时感知接触交互。它基于一个标准且可复现的机器人系统构建，该系统配备了定制设计的指尖触觉传感器。该数据集涵盖了日常操作任务，包括拾取与放置、折叠、按压、堆叠以及其他常规活动，这些任务涉及广泛的接触丰富技能。每个任务都配有语言指令，用于根据操作目标对策略进行条件化，同时还有同步的视觉触觉观察和动作轨迹。此外，我们使用两个基线模型对接触丰富的策略学习进行了基准研究，以评估所提出的接触基础数据集的有效性。数据集和更多详细信息可在我们的网站上获取：haptile-dataset.github.io。

英文摘要

Despite the importance of tactile sensing for reliable manipulation, most existing Vision-Language-Action (VLA) datasets remain vision-only, and those that do incorporate tactile information typically lack the joint combination of task diversity, language conditioning, and action trajectories. Furthermore, existing teleoperation pipelines rarely provide haptic feedback to the operator, despite its established role in demonstration quality and manipulation stability. In this work, we present HapTile, a contact-grounded visuotactile manipulation dataset that advances beyond vision-only trajectory datasets by embedding physical interaction sensing at two levels: fingertip tactile feedback at the robot end-effector, and haptic-informed demonstrations at the teleoperator side. The data collection platform integrates haptic feedback directly into the teleoperation controller, enabling the operator to perceive contact interactions in real time. It is built around a standard and reproducible robotic system equipped with custom-designed fingertip tactile sensors. The dataset comprises everyday manipulation tasks spanning a broad range of contact-rich skills, including pick-and-place, folding, pressing, stacking, and other routine activities. Each task is paired with language instructions that condition the policy on the manipulation objective, together with synchronized visuotactile observations and action trajectories. In addition, we provide a benchmarking study on contact-rich policy learning using two baseline models to evaluate the effectiveness of the proposed contact-grounded dataset. The dataset and additional details are available on our website: haptile-dataset.github.io.

URL PDF HTML ☆

赞 0 踩 0

2606.04818 2026-06-04 cs.RO 版本更新

Real-World Deployment of a 5G-Connected Edge-Controlled Aerial Robot in Industrial Subterranean Mines

工业地下矿井中5G连接边缘控制空中机器人的实际部署

Achilleas Santi Seisa, Emanuele Pagliari, Gerasimos Damigos, Elias Small, George Nikolakopoulos

发表机构 * Robotics and AI Group, Department of Computer, Electrical and Space Engineering, Luleå University of Technology（机器人与人工智能小组，计算机、电气与空间工程系，吕勒奥技术大学）

AI总结本文首次在实际工业地下矿井中部署了由边缘卸载控制器控制的5G连接自主飞行空中机器人，采用模型预测控制器（MPC）生成平滑无碰撞路径，展示了边缘控制机器人系统在时间关键、安全高效未来部署中的潜力。

Comments 6 pages, 8 figures, MED 2026

详情

AI中文摘要

本文介绍了首次由边缘卸载控制器控制的5G连接空中机器人的实际自主飞行，旨在弥合受控设置与实际设置之间的差距。该机器人在一个活跃的工业地下矿井中运行，而高层控制器部署在附近的基于Kubernetes的边缘集群中。机器人与边缘之间的通信通过5G新无线电（NR）独立组网（SA）网络实现。所选的控制器是模型预测控制器（MPC），它生成控制动作，使机器人能够在采矿环境中无缝导航。人类操作员为空中机器人选择航点，MPC生成平滑、无碰撞的路径以自主执行。所提出的基于5G边缘的闭环系统在实际工业环境中进行了评估，展示了边缘控制机器人系统在时间关键、安全高效的未来部署中的潜力。

英文摘要

This article presents the first real-world autonomous flight of a 5G-connected aerial robot controlled by an edge-offloaded controller, and aims to bridge the gap between controlled and factual setups. The robot operates within an active industrial subterranean mine, while the high-level controller is deployed in a nearby Kubernetes-based edge cluster. Communication between the robot and the edge is enabled via a 5G New Radio (NR) Standalone (SA) network. The chosen controller is a Model Predictive Controller (MPC), which generates control actions to allow the robot to navigate seamlessly through the mining environment. A human operator selects waypoints for the aerial robot, and the MPC generates smooth, collision-free paths for autonomous executions. The proposed 5G edge-based closed-loop system is evaluated in a real industrial setting and demonstrates the potential of edge-controlled robotic systems toward time-critical, safe and efficient future deployments.

URL PDF HTML ☆

赞 0 踩 0

2606.04788 2026-06-04 cs.CV cs.RO 版本更新

3DThinkVLA：通过3D思维引导的协同训练赋予视觉-语言-动作模型潜在3D先验

Jiaxin Shi, Xidong Zhang, Fucai Zhu, Zhe Li, Siyu Zhu, Weihao Yuan

发表机构 * Shanghai Jiao Tong University（上海交通大学）； Harbin Institute of Technology（哈尔滨工业大学）； Nanyang Technological University（南洋理工大学）； Fudan University（复旦大学）； Nanjing University（南京大学）； Daimon Robotics（达梦机器人）； Great Bay University（大亚大学）

AI总结提出3D思维引导的协同训练框架，通过解耦3D几何感知与空间推理并在不同特征层次注入，使VLA模型在动作预测中隐式进行3D空间推理，无需3D传感器或外部模型，在多个基准上达到最优性能。

详情

AI中文摘要

我们提出了一种3D思维引导的协同训练框架，使视觉-语言-动作（VLA）模型能够在动作预测过程中隐式地进行3D空间推理。我们的核心见解是，3D几何感知和3D空间推理是两种不同的能力，可以在不同的特征层次上解耦并注入。在训练过程中，三个紧密耦合的组件主要在潜在空间中协同工作：（1）为了获得几何先验，一个潜在3D几何感知模块将中间视觉特征与3D基础模型对齐，在不修改VLM骨干架构的情况下获取低级几何线索。（2）作为补充，一个在线3D推理蒸馏模块通过共享推理锚点令牌缓解提示引发的推理差距。在3D VLM协同训练期间，该锚点作为第一个输出令牌发出，以稳健地编码空间先验。在VLA训练期间，它作为插入在任务指令和动作指令之间的输入令牌，将高级空间思维从显式教师推理提示转移到学生动作提示，无需链式思维文本生成。（3）然后，这些解耦的几何和推理特征通过空间增强的动作集成统一起来，该集成将它们作为分层空间条件共同注入到动作查询令牌中，以防止动作捷径。在部署时，我们的方法仅保留其轻量级适配器以执行隐式3D推理，丢弃用于监督的3D基础模型和教师分支。因此，它纯粹在2D图像上运行，无需3D传感器、外部模型或显式文本生成，同时防止预训练VLM的灾难性遗忘，在LIBERO、LIBERO-PLUS、SimplerEnv和真实世界操作任务上实现了最先进的性能。

英文摘要

We propose a 3D-thinking-guided co-training framework that enables vision-language-action (VLA) models to perform 3D spatial reasoning implicitly during action prediction. Our core insight is that 3D geometry perception and 3D spatial reasoning are distinct capabilities that can be disentangled and injected at different feature hierarchies. During training, three tightly coupled components work in concert primarily within the latent space: (1) To gain geometric priors, a latent 3D geometry perception module aligns intermediate visual features with a 3D foundation model, acquiring low-level geometric cues without architectural modifications to the VLM backbone. (2) Complementing this, an online 3D reasoning distillation module mitigates the prompt-induced reasoning gap via a shared reasoning anchor token. During 3D VLM co-training, this anchor is emitted as the first output token to robustly encode spatial priors. During VLA training, it serves as an input token inserted between the task and action instructions, transferring high-level spatial thinking from explicit teacher reasoning prompts to student action prompts without chain-of-thought text generation. (3) These disentangled geometric and reasoning features are then united by a spatially augmented action integration, which jointly injects them into the action-query tokens as hierarchical spatial conditions to prevent action shortcuts. At deployment, our method retains only its lightweight adapters to perform implicit 3D reasoning, discarding the 3D foundation model and the teacher branch used for supervision. Consequently, it operates purely on 2D images without 3D sensors, external models, or explicit text generation while preventing catastrophic forgetting of the pretrained VLM, achieving state-of-the-art performance on LIBERO, LIBERO-PLUS, SimplerEnv, and real-world manipulation tasks.

URL PDF HTML ☆

赞 0 踩 0

2606.04361 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.DS math.OC 版本更新

When Freshness Is Not Enough: Distribution-Aware Age of Information for Networked LQR Control

当新鲜度不足时：面向网络化LQR控制的分布感知信息年龄

Abdullah Y. Etcibasi, C. Emre Koksal, Eylem Ekici

发表机构 * Department of Electrical and Computer Engineering, The Ohio State University（电气与计算机工程系，俄亥俄州立大学）

AI总结本文研究网络化控制系统中，仅最小化平均信息年龄（AoI）不足以优化LQR跟踪性能，需考虑调度间隔的完整分布（包括高阶矩和指数矩）。

详情

AI中文摘要

信息年龄（AoI）已成为无线更新系统设计的核心指标，尤其是在新鲜测量支持跟踪、估计和控制的场景中。尽管其广泛应用，但将平均AoI或峰值AoI作为闭环性能的替代指标通常基于直觉而非控制理论推导。本文探讨了最小化平均AoI是否对网络化控制系统最优。对于具有延迟间歇更新的标量线性时不变系统，我们证明，在状态无关调度策略下，无限时域LQR跟踪问题可简化为对调度间隔分布的优化。所得目标函数依赖于调度过程的高阶统计矩，在不稳定或相关情况下还依赖于指数矩，而非仅依赖于其均值。因此，具有相同平均AoI的策略可能产生显著不同的跟踪成本。我们进一步将分析扩展到具有指数衰减自相关的扰动，并推导出揭示完整间隔分布作用的等效成本公式。最后，使用NGSIM US-101数据集中的真实车辆轨迹验证理论。实证结果与预测的性能趋势一致，表明仅凭平均AoI不足以进行面向控制的网络设计。

英文摘要

Age of Information (AoI) has become a central metric for the design of wireless update systems, especially in applications where fresh measurements support tracking, estimation, and control. Despite its popularity, the use of mean AoI or peak AoI as a surrogate for closed-loop performance is often motivated by intuition rather than by a control-theoretic derivation. This paper examines whether minimizing the mean AoI is in fact optimal for networked control systems. For scalar linear time-invariant systems with delayed intermittent updates, we show that, under state-independent scheduling policies, the infinite-horizon LQR tracking problem reduces to an optimization over the distribution of inter-scheduling intervals. The resulting objective depends on higher-order statistical moments, and in unstable or correlated regimes on exponential moments, of the inter-scheduling process rather than only on its mean. Consequently, policies with identical mean AoI can induce substantially different tracking costs. We further extend the analysis to disturbances with exponentially decaying autocorrelation and derive equivalent cost formulations that expose the role of the full interval distribution. Finally, we validate the theory using real vehicle trajectories from the NGSIM US-101 dataset. The empirical results match the predicted performance trends, demonstrating that mean AoI alone is insufficient for control-oriented network design.

URL PDF HTML ☆

赞 0 踩 0

2606.04355 2026-06-04 cs.RO 版本更新

Think Fast and Far: Long-Horizon Online POMDP Planning via Rapid State Sampling

快速思考与远见：通过快速状态采样实现长时域在线POMDP规划

Yuanchu Liang, Edward Kim, J. Arden Knoll, Wil Thomason, Zachary Kingston, Lydia E. Kavraki, Hanna Kurniawati

发表机构 * Australian National University（澳大利亚国立大学）； Rice University（里士大学）

AI总结提出一种基于快速状态采样的在线POMDP求解器ROP-RAS3，通过宏动作生成和信念空间采样，有效解决长时域POMDP问题，在多种高维连续/离散混合空间中显著优于现有方法。

Comments @inproceedings{Liang2026Thinking, title = {Think Fast and Far: Long-Horizon Online POMDP Planning via Rapid State Sampling}, author = {Yuanchu Liang and Edward Kim and J.Arden Knoll and Wil Thomason and Zachary Kingston and Lydia E. Kavraki and Hanna Kurniawati}, year = 2026, booktitle = {International Journal of Robotics Research (to appear)} }

详情

AI中文摘要

部分可观测马尔可夫决策过程（POMDP）是不确定性下运动规划的通用且原则性框架。尽管POMDP求解器的可扩展性有了巨大提升，但长时域POMDP仍然难以求解。为缓解这一困难，本文提出了一种新的近似在线POMDP求解器，称为基于参考的快速状态空间采样在线POMDP规划（ROP-RAS3）。ROP-RAS3利用新颖的极快采样运动规划技术对状态空间进行采样，并在线生成多样化的宏动作，然后用于偏置信念空间采样并推断高质量策略，而无需对动作空间进行穷举枚举——这是现代在线POMDP求解器的一个基本约束。ROP-RAS3以依赖于采样动作数量而非动作空间大小的速率收敛到近最优的基于参考的解。ROP-RAS3在多种长时域POMDP上进行了评估，这些POMDP具有高达3000个前瞻步骤和35维状态空间，其中状态、动作和观测空间可以是连续的、离散的或离散与连续的混合。尽管基于参考的最优解可能与最优POMDP解不同，但经验结果表明，在所有这些问题中，就成功率而言，ROP-RAS3优于其他最先进方法多达数倍。我们还通过物理机器人演示展示了我们方法的能力。这项工作扩展了我们ISRR24论文的理论和实证结果。代码可在 exttt{https://github.com/RDLLab/ROPRAS3} 找到。

英文摘要

Partially Observable Markov Decision Processes (POMDPs) are a general and principled framework for motion planning under uncertainty. Despite tremendous improvement in the scalability of POMDP solvers, long-horizon POMDPs remain difficult to solve. To alleviate the difficulty, this paper proposes a new approximate online POMDP solver, called Reference-Based Online POMDP Planning via Rapid State Space Sampling (ROP-RAS3). ROP-RAS3 uses novel extremely fast sampling-based motion planning techniques to sample the state space and generate a diverse set of macro actions online, which are then used to bias belief-space sampling and infer high-quality policies without requiring exhaustive enumeration of the action space -- a fundamental constraint for modern online POMDP solvers. ROP-RAS3 converges to a near-optimal reference-based solution at a rate that depends on the number of sampled actions, rather than the size of the action space. ROP-RAS3 is evaluated on various long-horizon POMDPs with up to 3000 lookahead steps and 35-dimensional state spaces, where the state, action and observation spaces can be continuous, discrete, or a hybrid of discrete and continuous. Although the reference-based optimal solution may not be the same as the optimal POMDP solution, empirical results indicate that in all of these problems, in terms of success rate, ROP-RAS3 outperforms other state-of-the-art methods by up to multiple folds. We also demonstrate the capability of our approach on a physical robot demonstration. This work extends the theory and empirical results of our ISRR24 paper. Code can be found at \texttt{https://github.com/RDLLab/ROPRAS3}.

URL PDF HTML ☆

赞 0 踩 0

2605.04607 2026-06-04 cs.RO 版本更新

Right Model, Right Time: Real-Time Cascaded-Fidelity MPC for Bipedal Walking

正确模型，正确时机：用于双足行走的实时级联保真度MPC

Franek Stark, Felix Wiebe, Shubham Vyas, Dennis Mronga, Frank Kirchner

发表机构 * Robotics Innovation Center at the German Research Center for Artificial Intelligence (DFKI)（德国人工智能研究中心机器人创新中心）

AI总结提出一种多阶段全身模型预测控制方法，结合近视野详细全身模型与远视野简化单刚体模型，降低计算复杂度并保持预测能力，在通用MPC框架acados中求解，无需预设足迹位置，在18自由度双足机器人HyPer-2上验证。

Comments Presented at IEEE ICRA 2026 Workshop "2cnd Workshop on Frontiers of Optimization for Robotics"

2606.04269 2026-06-04 cs.RO cs.AI cs.CV 版本更新

CoPark：通过自我对弈学习反应式泊车

Jiarong Wei, Yanxing Chen, Sinuo Song, Yin Wu, Anna Rehr, Abhinav Valada

发表机构 * Department of Computer Science, University of Freiburg（弗赖堡大学计算机科学系）； CARIAD SE（CARIAD SE公司）； Technical University of Munich（慕尼黑技术大学）

AI总结提出CoPark，一种基于残差策略的多智能体自我对弈强化学习方法，通过固定先验与残差头结合，在反应式泊车中实现高精度与安全交互的平衡，显著优于基线方法。

详情

AI中文摘要

学习一个能够以高几何精度达到目标同时与附近智能体安全交互的单一策略面临相互冲突的目标。精度有利于固定几何计划的执行，而交互则要求在另一智能体侵入时立即偏离，导致针对一个目标优化的策略往往在另一个目标上失败。我们在反应式自主泊车的背景下研究这一问题，其中多辆车必须达到指定车位，终端精度达到亚米级，同时在整个操作过程中对邻近车辆保持响应。我们提出CoPark，一种基于残差策略架构的多智能体自我对弈RL方法。预计算的离线计划提供固定的动作先验，而残差头学习反应式修正。残差策略在自我对弈下学习行为，弥补数据和脚本的不足，而固定先验保持纯策略难以可靠达到的车位框架几何。关键设计是一种合作伙伴威胁调制的通道非对称先验释放。连续威胁信号将纵向通道的权限转移给残差头以实现让行，而横向通道仍锚定在预计算参考上以保持亚米级车位对齐。闭环细化层修正动作网格离散化带来的残差终端误差。我们在六个停车场训练策略，并在我们的新反应式泊车基准（包括Dragon Lake Parking (DLP)和DeepScenario Open 3D (DSC3D)）上进行零样本评估。CoPark实现了约70-85%的成功率，碰撞率仅为3-6%，显著优于经典、模仿学习和大规模RL基线。重要的是，结果展示了涌现的交互行为，如倒车让行、中途让行、狭窄通道通行和排队。

英文摘要

Learning a single policy that reaches a goal with high geometric precision while interacting safely with nearby agents poses conflicting objectives. Precision favors commitment to a fixed geometric plan, whereas interaction requires immediate deviation when another agent intrudes, causing policies optimized for one objective to often fail at the other. We study this problem in the context of reactive autonomous parking, where multiple vehicles must reach assigned slots with sub-meter terminal accuracy while remaining responsive to neighboring vehicles throughout the maneuver. We propose CoPark, a multi-agent self-play RL approach built on a residual-policy architecture. A precomputed offline plan provides a fixed action prior, while a residual head learns the reactive corrections. The residual policy learns behaviors under self-play, where data and scripting fall short, while the fixed prior holds the slot-frame geometry that pure policies struggle to reach reliably. The key design is a partner-threat-modulated, channel-asymmetric release of the prior. A continuous threat signal shifts authority of the longitudinal channel to the residual head to enable yielding, while the lateral channel remains anchored to the precomputed reference to preserve sub-meter slot alignment. A closed-loop refinement layer corrects residual terminal error from action-grid discretization. We train our policy on six parking lots and evaluate zero-shot on our new reactive-parking benchmark spanning Dragon Lake Parking (DLP) and DeepScenario Open 3D (DSC3D). CoPark achieves ~70-85% success with only 3-6% collision rate, substantially outperforming classical, imitation-learning, and large-scale RL baselines. Importantly, the results demonstrate emergent interaction behaviors such as reverse-yielding, mid-maneuver yielding, tight-corridor passing, and queuing.

URL PDF HTML ☆

赞 0 踩 0

2606.04130 2026-06-04 cs.RO 版本更新

CLAW: Learning Continuous Latent Action World Models via Adversarial Latent Regularization

CLAW: 通过对抗潜在正则化学习连续潜在动作世界模型

Tewodros Ayalew, Matthew Jeung, Samuel Wheeler, Xiao Zhang, Andre de la Cruz Arce, Kaylene Stocking, Michael Maire, Matthew R. Walter

发表机构 * University of Chicago（芝加哥大学）； Toyota Technological Institute at Chicago（芝加哥丰田技术研究所）； Argonne National Laboratory（阿贡国家实验室）

AI总结提出CLAW框架，利用对抗潜在正则化和扩散视频生成，从无动作视频中端到端学习世界模型与连续潜在动作表示，支持观察模仿学习和目标导向规划。

Comments 8 pages, 15 pages of supplementary material

详情

AI中文摘要

我们引入了CLAW，一个完全端到端的自监督框架，用于直接从无动作视频中联合学习世界模型和连续潜在动作表示。我们的方法利用对抗潜在正则化和基于扩散的视频生成来捕获结构化和语义上有意义的动作表示，同时建模丰富的、可预测的环境动态，而不依赖于任何动作标签或注释。通过同时训练潜在动作模型和世界模型，CLAW学会仅从视觉观察中推理推断的动作如何引起环境转变。我们展示了由此产生的潜在动作世界模型支持从观察中模仿学习和目标导向规划。在模仿学习中，从原始视频中提取的潜在动作实现了行为克隆。对于规划，CLAW生成潜在动作序列并将其映射到可执行动作以达到期望目标。跨多种任务和实体的广泛实验表明，CLAW产生了语义上有意义的潜在动作表示，支持有效的动作迁移，并实现了规划和从观察中模仿，优于现有方法。

英文摘要

We introduce CLAW, a fully end-to-end self-supervised framework for learning a world model jointly with continuous latent action representations directly from action-free videos. Our approach leverages adversarial latent regularization and diffusion-based video generation to capture structured and semantically meaningful action representations while modeling rich, predictive environment dynamics, without relying on any action labels or annotations. By simultaneously training the Latent Action Model and world model, CLAW learns to reason about how inferred actions induce environment transitions from visual observations alone. We show that the resulting latent action world model supports both imitation learning from observation and goal-directed planning. In imitation learning, latent actions extracted from raw videos enable behavior cloning. For planning, CLAW generates sequences of latent actions and maps them to executable actions to reach desired goals. Extensive experiments across diverse tasks and embodiments demonstrate that CLAW produces semantically meaningful latent action representations, supports effective action transfer, and enables planning and imitation from observation, outperforming existing methods.

URL PDF HTML ☆

赞 0 踩 0

2606.04123 2026-06-04 math.OC cs.AI cs.RO 版本更新

Semantic Constraint Synthesis for Adaptive Trajectory Optimization via Large Language Models

基于大语言模型的语义约束综合用于自适应轨迹优化

Eleanor Brosius, Yuji Takubo, Daniele Gammelli, Simone D'Amico, Marco Pavone

发表机构 * Stanford University（斯坦福大学）

AI总结提出利用大语言模型将自然语言描述的任务需求转化为可执行的轨迹优化代码和数学公式，在航天器交会场景中实现了从语义需求重构凸轨迹优化问题的高成功率。

Comments 7 pages, 4 figures, Presented as a short paper at IEEE CVPR 2026, AI4Space Workshop

详情

AI中文摘要

轨迹优化是实现太空探索中安全可靠自主操作的关键组成部分。随着太空任务在频率、复杂性和范围上的增加，迫切需要快速制定数学上合理的轨迹优化问题，以准确反映任务目标和操作约束。然而，将任务意图转化为易于处理的轨迹优化分析公式需要大量的领域专业知识。本文提出一个框架，利用大语言模型（LLMs）将任务需求和约束的自然语言描述转化为可执行的轨迹优化代码及相应的数学公式。在航天器交会场景中的实验表明，从语义任务需求重构凸轨迹优化问题具有高成功率。最终，这项工作凸显了LLMs在连接高层意图与形式化优化模型方面的潜力，从而实现更灵活高效的航天器轨迹设计。

英文摘要

Trajectory optimization is a critical component for enabling safe and reliable autonomous operations in space exploration. As space missions increase in frequency, complexity, and scope, there is a growing need to rapidly formulate mathematically sound trajectory optimization problems that accurately reflect mission objectives and operational constraints. However, translating mission intent into tractable analytical formulations for trajectory optimization requires substantial domain expertise. This paper presents a framework that leverages large language models (LLMs) to translate natural language descriptions of mission requirements and constraints into executable trajectory optimization code and corresponding mathematical formulations. Experiments in spacecraft rendezvous scenarios demonstrate a high success rate in reconditioning a convex trajectory optimization problem from semantic mission requirements. Ultimately, this work highlights the potential of LLMs to bridge high-level intent and formal optimization models, enabling more flexible and efficient trajectory design of spacecraft.

URL PDF HTML ☆

赞 0 踩 0

2606.04111 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

AgenticDiffusion: Agentic Diffusion-based Path Planning for Vision-Based UAV Navigation

AgenticDiffusion：基于智能体扩散的视觉无人机导航路径规划

Faryal Batool, Muhammad Ahsan Mustafa, Fawad Mehboob, Valerii Serpiva, Dzmitry Tsetserukou

发表机构 * University of Engineering and Technology, Lahore（拉合尔工程与技术大学）

AI总结提出AgenticDiffusion多视角无人机导航框架，结合语言引导推理、开放词汇目标定位、视觉扩散规划与NMPC，通过协调第一人称和俯视图提升室内导航效率，在40次真实实验中实现80%任务成功率。

详情

AI中文摘要

室内无人机导航需要在有限视场观测下实现高效探索、场景理解和可靠轨迹执行。现有的基于视觉的导航框架通常依赖单视角观测，限制了其对遮挡、目标可见性和全局场景结构的推理能力。在这项工作中，我们提出了AgenticDiffusion，一个多视角无人机导航框架，在统一的空中导航流程中协调语言引导推理、开放词汇目标定位、基于视觉的扩散规划以及NMPC。给定自然语言指令和同步的第一人称视角（FPV）与俯视图观测，该框架在轨迹执行前确定最具信息量的导航视角并生成任务计划。使用开放词汇定位模型定位目标后，特定视角的扩散规划器生成用于无人机执行的导航轨迹。通过互补视角，所提框架减少了重复目标探索，并提高了在杂乱室内环境中的导航效率。该框架在四个真实无人机导航场景中进行了验证，涉及自适应视角选择、多阶段任务执行、长时域导航和安全着陆点选择。实验结果表明，在40次真实试验中，总体任务成功率达到80%，而扩散规划器实现了100%的轨迹生成成功率。

英文摘要

Indoor UAV navigation requires efficient exploration, scene understanding, and reliable trajectory execution under limited field-of-view observations. Existing vision-based navigation frameworks typically rely on single-view observations, limiting their ability to reason about occlusions, target visibility, and global scene structure. In this work, we propose AgenticDiffusion, a multi-view UAV navigation framework that coordinates language-guided reasoning, open-vocabulary target grounding, vision-based diffusion planning, and NMPC within a unified aerial navigation pipeline. Given a natural language instruction and synchronized first-person-view (FPV) and top-view observations, the framework determines the most informative viewpoint for navigation and generates a mission plan prior to trajectory execution. The targets are localized using an open-vocabulary grounding model, after which viewpoint-specific diffusion planners generate navigation trajectories for UAV execution. Using complementary viewpoints, the proposed framework reduces repeated target exploration and improves navigation efficiency in cluttered indoor environments. The framework was validated in four real-world UAV navigation scenarios involving adaptive viewpoint selection, multi-stage mission execution, long-horizon navigation, and safe landing-site selection. The experimental results demonstrated an overall mission success rate of 80% in 40 real-world trials, while the diffusion planners achieved a trajectory generation success rate of 100%.

URL PDF HTML ☆

赞 0 踩 0

2606.04072 2026-06-04 cs.RO cs.DC cs.LG cs.SY eess.SY 版本更新

CADET: A Modular Platform for Evaluating Distributed Cooperative Autonomy in Connected Autonomous Vehicles

CADET：用于评估网联自动驾驶车辆中分布式协作自主性的模块化平台

Pragya Sharma, Brian Wang, Mani Srivastava

发表机构 * UCLA ； Amazon Scholar（亚马逊学者）

AI总结提出CADET模块化平台，通过解耦自动驾驶堆栈并集成网络与工作负载仿真，系统评估分布式协作自主系统在真实部署条件下的安全性与性能。

详情

Journal ref: ICRA 2026

AI中文摘要

深度学习模型日益成为自动驾驶汽车（AV）管道的核心，然而其集成传统上遵循单一设计，即感知、规划和控制在同一车载计算机上执行。这种设计忽视了协作自主的新兴范式，即车辆通过车联网（V2X）连接与路侧单元（RSU）、边缘服务器和云托管智能进行交互。协作感知和控制提高了安全性和效率，但也引入了系统级挑战：网络延迟、计算异构性和多租户争用，所有这些都严重影响实时决策。这些挑战因对大型基础模型的日益依赖而进一步放大，这些模型的规模需要云部署。我们提出CADET（通过分布式实验工具包实现协作自主），这是一个模块化平台，用于在真实部署条件下对分布式协作自主系统进行系统化和可重复的评估。CADET将自动驾驶堆栈解耦为可组合的模块，这些模块可以灵活地部署在车辆、基础设施和边缘/云层级上。该框架集成了最先进的模型，引入了基于轨迹的网络和工作负载仿真，并提供了同步的模型级、系统级和任务级检测。通过V2V和V2I实验，我们表明分布式部署选择从根本上影响安全性，其中V2V意图数据包优于基于云的感知，而RSU辅助感知在过载并发请求之前维持安全性。尽管专为自动驾驶管道设计，CADET也支持数据集驱动的实验，使系统和机器学习研究人员能够独立于完整的车辆仿真来基准测试分布式推理工作负载。CADET是开源的，代码和演示可在https://nesl.github.io/cadet-web获取。

英文摘要

Deep learning models are increasingly central to autonomous vehicle (AV) pipelines, yet their integration has traditionally followed a monolithic design where perception, planning, and control execute on a single onboard computer. This design overlooks the emerging paradigm of cooperative autonomy, where vehicles interact with roadside units (RSUs), edge servers, and cloud-hosted intelligence through vehicle-to-everything (V2X) connectivity. Cooperative perception and control improve safety and efficiency, but also introduce systems-level challenges: network latency, compute heterogeneity, and multi-tenant contention, all critically affect real-time decision-making. These challenges are further amplified by the increasing reliance on large foundation models, whose scale necessitates cloud deployment. We present CADET (Cooperative Autonomy through Distributed Experimentation Toolkit), a modular platform for systematic and reproducible evaluation of distributed cooperative autonomy systems under realistic deployment conditions. CADET decouples the AV stack into composable modules that can be flexibly deployed across vehicles, infrastructure, and edge/cloud tiers. The framework integrates state-of-the-art models, incorporates trace-driven network and workload emulation, and provides synchronized model-, system-, and task-level instrumentation. Through V2V and V2I experiments, we show that distributed deployment choices fundamentally shape safety, with V2V intent packets outperforming cloud-based perception and RSU-assisted perception sustaining safety until overloaded by concurrent requests. Although designed for AV pipelines, CADET also supports dataset-driven experimentation, enabling systems and ML researchers to benchmark distributed inference workloads independently of full vehicle simulation. CADET is open source, with code and demo available at https://nesl.github.io/cadet-web.

URL PDF HTML ☆

赞 0 踩 0

2606.04046 2026-06-04 cs.CV cs.AI cs.CL cs.LG cs.RO 版本更新

Dive into the Scene: Breaking the Perceptual Bottleneck in Vision-Language Decision Making via Focus Plan Generation

深入场景：通过焦点计划生成打破视觉-语言决策中的感知瓶颈

Boyuan Xiao, Bohong Chen, Yumeng Li, Ji Feng, Yao-Xiang Ding, Kun Zhou

发表机构 * University of Science and Technology of China（中国科学技术大学）； Tsinghua University（清华大学）

AI总结提出SceneDiver方法，通过从粗到细的焦点计划生成，逐步构建场景图并分解任务，减少视觉幻觉，提升视觉-语言模型和视觉-语言-动作模型在具身决策任务中的表现。

Comments Accepted at ICML 2026

详情

AI中文摘要

在具身视觉-语言决策任务（如机器人操作和导航）中，视觉-语言模型和视觉-语言-动作模型（VLMs & VLAs）是具有不同优势的强大工具：VLMs更擅长长期规划，而VLAs更擅长反应控制。然而，它们的性能受到相同感知瓶颈的限制：由于模型无法区分任务相关对象与干扰物，导致视觉幻觉。原则上，准确识别并聚焦关键对象同时过滤无关对象是突破这一限制的关键。一个直接的解决方案是一步聚焦：直接关注重要对象。然而，这种方法被证明无效，因为有效的聚焦本质上需要深度场景理解。为此，我们提出SceneDiver，一种利用VLMs长期规划能力的从粗到细的焦点计划生成方法，首先构建整体场景图以建立初步理解，然后通过识别、理解和分析的迭代循环逐步将任务分解为更简单的子问题。为了实现反应控制，我们还设计了一个轻量级适配器，将深思熟虑的聚焦能力蒸馏到VLAs中。在标准具身AI基准上的评估证实，我们的方法显著减少了VLMs和VLAs的视觉幻觉，同时在需要快速执行的任务中保持了计算效率。我们的代码和数据发布在：https://future-item.github.io/SceneDiver。

英文摘要

In embodied vision-language decision making tasks such as robotic manipulation and navigation, Vision-Language and Vision-Language-Action Models (VLMs & VLAs) are powerful tools with different benefits: VLMs are better at long-term planning, while VLAs are better at reactive control. However, their performance is limited by the same perceptual bottleneck: visual hallucinations arise due to the models' inability to distinguish task-relevant objects from distractors. In principle, accurate identification and focus on critical objects while filtering out irrelevant ones is the key to break this limitation. A straightforward solution is one-step focus: directly attending to essential objects. However, this approach proves ineffective because effective focus inherently requires deep scene understanding. To this end, we propose SceneDiver, a coarse-to-fine focus plan generation method for VLMs leveraging their long-term planning abilities, that first constructs a holistic scene graph to establish initial comprehension, then progressively decomposes the task into simpler sub-problems through an iterative cycle of recognition, understanding, and analysis. To enable reactive control, we also design a lightweight adapter for distilling the deliberate focus ability into VLAs. Evaluations on standard embodied AI benchmarks confirm that our method substantially reduces visual hallucinations for both VLMs and VLAs, while preserving computational efficiency in tasks requiring fast execution. Our code and data are released at: https://future-item.github.io/SceneDiver.

URL PDF HTML ☆

赞 0 踩 0

2606.03943 2026-06-04 cs.RO cs.CV cs.LG 版本更新

PointAction: 3D Points as Universal Action Representations for Robot Control

PointAction: 3D点作为机器人控制的通用动作表示

Mutian Tong, Han Jiang, Qiao Feng, Lingjie Liu, Jiatao Gu

发表机构 * University of Pennsylvania（宾夕法尼亚大学）

AI总结提出PointAction框架，通过微调视频生成模型联合预测未来RGB帧和动态3D点图，将点动力学作为与具体本体无关的动作接口，再由扩散动作解码器映射为可执行动作，以减少RGB动作歧义并跨任务/本体迁移。

Comments Project page: https://oriontmt.github.io/pointaction/

详情

AI中文摘要

视频-动作模型（VAM）利用预训练视频扩散模型捕获的广泛视觉动态，为通用机器人操作提供了有前景的路径。然而，仅RGB视频展开无法直接操作：它们未明确指定度量3D运动、接触几何和细粒度空间约束，导致动作基础不明确。同时，跨不同任务和本体的动作监督扩展仍然成本高昂。我们提出PointAction，一个通过显式基于点的4D建模将视频预测桥接到机器人动作的框架。PointAction微调基础视频生成模型，联合预测未来RGB帧和动态3D点图，产生任务相关场景几何的时间一致3D运动。这些点动力学作为结构化的、与本体无关的动作接口，由基于扩散的动作解码器映射为可执行的机器人动作。通过使用度量3D点动力学作为视频预测和控制之间的接口，PointAction减少了仅RGB动作基础的不确定性，并支持在有限动作监督下跨任务和本体的迁移。实验表明，PointAction在机器人场景上实现了最先进的4D生成质量，在模拟中优于现有基线，并泛化到预训练中未见过的两个真实机器人手臂。

英文摘要

Video-Action Models (VAMs) leverage the broad visual dynamics captured by pre-trained video diffusion models, offering a promising path toward generalizable robot manipulation. However, RGB-only video rollouts are not directly actionable: they leave metric 3D motion, contact geometry, and fine-grained spatial constraints under-specified, making action grounding ambiguous. Meanwhile, scaling action supervision across diverse tasks and embodiments remains costly. We present PointAction, a framework that bridges video predictions to robot actions through explicit point-based 4D modeling. PointAction fine-tunes a foundation video generation model to jointly predict future RGB frames and dynamic 3D pointmaps, producing temporally consistent 3D motion of task-relevant scene geometry. These point dynamics serve as a structured, embodiment-agnostic action interface, which a diffusion-based action decoder maps to executable robot actions. By using metric 3D point dynamics as the interface between video prediction and control, PointAction reduces the ambiguity of RGB-only action grounding and supports transfer across tasks and embodiments with limited action supervision. Experiments show that PointAction achieves state-of-the-art 4D generation quality on robot scenes, outperforms existing baselines in simulation, and generalizes to two real robot arms unseen during pretraining.

URL PDF HTML ☆

赞 0 踩 0

2606.03784 2026-06-04 cs.RO 版本更新

Revisiting Embodied Chain-of-Thought for Generalizable Robot Manipulation

重新审视具身思维链以实现可泛化的机器人操作

Nan Sun, Yuan Zhang, Yongkun Yang, Wentao Zhao, Peiyan Li, Jun Guo, Wenxuan Song, Pengxiang Ding, Runze Suo, Yifei Su, Xin Xiao, Xinghang Li, Huaping Liu

发表机构 * Tsinghua University（清华大学）； Xiaomi Robotics（小米机器人）； Peking University（北京大学）； CASIA ； HKUST(GZ)（香港科技大学（广州））； Zhejiang University（浙江大学）； Fudan University（复旦大学）； Wuhan University（武汉大学）； Shanghai Innovation Institute（上海创新研究院）

AI总结本文通过构建最大规模具身思维链语料库，提出ERVLA模型，利用思维链作为表征塑造监督而非测试时推理，显著提升了机器人操作的可泛化性。

详情

AI中文摘要

具身思维链旨在连接语言推理与机器人控制，但其有效形式和集成策略仍待探索。本文在大规模视觉-语言-动作模型上重新审视具身思维链。我们构建了迄今为止最大的具身思维链语料库，包含978,743条轨迹、226.3M样本和2592.5小时机器人数据。通过大量实验，我们发现有效的具身思维链应将高层语义理解具体化为可操作的动作指导，例如末端执行器运动描述和图像空间轨迹，而仅靠高层推理带来的收益有限。我们进一步表明，当显式思维链用作自回归动作前缀时，其扩展性不可靠，因为存在复合推理错误和不稳定的推理-动作耦合。为解决这些问题，我们提出ERVLA，一种将具身思维链用作表征塑造监督而非强制性测试时推理的VLA模型。ERVLA采用推理丢弃策略进行训练，使模型在训练期间吸收丰富的推理痕迹，同时在推理时无需思维链解码直接预测动作。这种设计提高了随预训练数据增加的扩展性，并避免了自回归不稳定性。ERVLA在LIBERO-Plus上达到86.9%的成功率，在VLABench上达到53.2%的成功率，展现出强大的分布外泛化能力。在真实机器人实验中，ERVLA进一步优于竞争性基线，尤其是在需要语义消歧和长时域执行的任务上。代码、数据和模型检查点将发布。

英文摘要

Embodied chain-of-thought (CoT) aims to bridge linguistic reasoning and robotic control, but its effective form and integration strategy remain underexplored. In this paper, we revisit embodied CoT for vision-language-action (VLA) models at large scale. We construct the largest embodied CoT corpus to date, comprising 978,743 trajectories, 226.3M samples, and 2592.5 hours of robot data. Through extensive experiments, we find that effective embodied CoT should ground high-level semantic understanding into concrete action guidance, such as end-effector movement descriptions and image-space trajectories, while high-level reasoning alone brings only marginal gains. We further show that explicit CoT does not scale reliably when used as an autoregressive action prefix, as it suffers from compounding inference errors and unstable reasoning-action coupling. To address these limitations, we propose ERVLA, a VLA model that uses embodied CoT as representation-shaping supervision rather than mandatory test-time reasoning. ERVLA is trained with a reasoning-dropout strategy, enabling the model to absorb rich reasoning traces during training while predicting actions directly without CoT decoding during inference. This design improves scalability with increasing pre-training data and avoids autoregressive instability. ERVLA achieves state-of-the-art performance on LIBERO-Plus with an 86.9% success rate and reaches 53.2% success rate on VLABench, demonstrating strong out-of-distribution generalization. In real-robot experiments, ERVLA further outperforms competitive state-of-the-art baselines, especially on tasks requiring semantic disambiguation and long-horizon execution.

URL PDF HTML ☆

赞 0 踩 0

2606.03598 2026-06-04 cs.RO cs.AI cs.CV 版本更新

PHASER: Phase-Aware and Semantic Experience Replay for Vision-Language-Action Models

PHASER: 面向视觉-语言-动作模型的相位感知与语义经验回放

Ziyang Chen, Shaoguang Wang, Weiyu Guo, Qianyi Cai, He Zhang, Pengteng Li, Yiren Zhao, Yandong Guo

发表机构 * Thrust of AI, HKUST(Guangzhou)（人工智能 thrust，香港科技大学（广州））； AI 2 Robotics, Shenzhen, China（人工智能与机器人，深圳，中国）

AI总结提出PHASER框架，通过相位感知容量分配和多模态干扰路由策略，结合自动相位提取管线Auto-PC，解决VLA模型在持续学习中的灾难性遗忘问题，在LIBERO基准上平均成功率提升高达31%。

Comments 20 pages, 8 figures, 12 tables

详情

AI中文摘要

视觉-语言-动作（VLA）模型在语言条件机器人操作中取得了显著成功。然而，在开放环境中部署这些模型需要持续获取新技能，这一过程不可避免地会严重遗忘先前学习的行为。虽然经验回放（ER）是一种标准的缓解策略，但简单的均匀采样从根本上与操作轨迹的时间特征不一致。它系统性地欠采样短暂但因果关键的子技能，导致相位饥饿，并完全忽略了历史任务中不同程度的遗忘。为克服这些限制，我们提出PHASER，一种架构无关的持续学习框架。PHASER采用以相位为中心的容量分配，确保所有子技能获得平等的记忆支持，并结合多模态干扰路由策略，动态优先处理遗忘风险高的历史相位。此外，为实现完全自主的终身适应，我们集成了Auto-PC，一种轻量级管线，结合无监督动作信号变化点检测和基于VLM的语义验证，无需大量人工监督即可提取时间边界。在LIBERO持续学习套件上对三个VLA骨干网络的评估表明，PHASER取得了显著的实证改进，与匹配预算的ER相比，平均成功率（ASR）提升高达31%，并在LIBERO-Goal CL设置中达到87.8%的最终ASR。

英文摘要

Vision-Language-Action (VLA) models have achieved remarkable success in language-conditioned robotic manipulation. However, deploying these models in open-ended environments requires continuously acquiring novel skills, a process that inevitably triggers severe catastrophic forgetting of previously learned behaviors. While experience replay (ER) serves as a standard mitigating strategy, naive uniform sampling fundamentally misaligns with the temporal characteristics of manipulation trajectories. It systematically under-samples brief but causally critical sub-skills, leading to phase starvation, and completely overlooks the varying degrees of forgetting across historical tasks. To overcome these limitations, we introduce PHASER, an architecture-agnostic continual learning framework. PHASER employs a phase-centric capacity allocation to guarantee equal memory support for all sub-skills, coupled with a multi-modal interference routing strategy that dynamically prioritizes historical phases at high risk of forgetting. Furthermore, to enable fully autonomous lifelong adaptation, we integrate Auto-PC, a lightweight pipeline combining unsupervised action-signal change-point detection with VLM-based semantic verification to extract temporal boundaries without intensive manual supervision. Evaluated across three VLA backbones on LIBERO continual learning suites, PHASER yields substantial empirical improvements, increasing Average Success Rate (ASR) by up to 31% over matched-budget ER and achieving an 87.8% final ASR on the LIBERO-Goal CL setting.

URL PDF HTML ☆

赞 0 踩 0

2606.03441 2026-06-04 cs.RO cs.LG 版本更新

PerchRL: Vision-Based Agile Perching on Inclined Platforms under Rapid and Irregular Motion

PerchRL：基于视觉的快速不规则运动倾斜平台敏捷着陆

Zihong Lu, Zongzhuo Liu, Huaxu Li, Jinqiang Cui, Jie Mei, Youmin Gong, U Kei Cheang, Boyu Zhou

发表机构 * SUSTech（四川大学）； HITSZ（哈尔滨工业大学）； PCL（鹏城实验室）； Differential Robotics（差分机器人实验室）

AI总结提出PerchRL强化学习框架，通过两阶段学习策略（状态预训练+视觉微调）和混合学习框架（可见性感知状态增强+主动感知奖励），实现四旋翼在快速不规则运动倾斜平台上的自主视觉着陆。

详情

AI中文摘要

自主视觉引导的四旋翼在移动倾斜平台上的着陆对于空地协作至关重要，但由于有限的视场角（FOV）而具有挑战性。本文提出PerchRL，一种基于强化学习（RL）的框架，用于在快速和不规则运动下的倾斜平台上进行基于视觉的敏捷着陆。具体而言，我们采用两阶段学习策略，包括基于状态的预训练和基于视觉的微调。为了提高对不同平台运动的泛化能力，我们使用随机化的平台轨迹来防止过拟合，并采用时间增强方法从历史观测中捕捉潜在运动模式。在基于视觉的微调过程中，提出了一种混合学习框架，包括可见性感知状态增强和主动感知奖励，以提高在间歇性视觉丢失下的鲁棒性。大量的仿真和真实世界实验证明了PerchRL的可行性、稳定性和实时性能，而在不同四旋翼平台上的成功部署进一步验证了其适应性。源代码将发布以惠及社区。

英文摘要

Autonomous vision-based perching of quadrotors on moving inclined platforms is critical for air-ground collaboration but remains challenging due to the limited field of view (FOV). In this paper, we propose PerchRL, a reinforcement learning (RL) framework for vision-based agile perching on inclined platforms under rapid and irregular motion. Specifically, we employ a two-stage learning strategy consisting of state-based pre-training followed by vision-based fine-tuning. To improve generalization across diverse platform motions, we employ randomized platform trajectories to prevent overfitting and temporal augmentation methods to capture latent motion patterns from historical observations. During vision-based fine-tuning, a hybrid learning framework consisting of visibility-aware state augmentation and active perception rewards is presented to improve robustness under intermittent visual loss. Extensive simulation and real-world experiments demonstrate the feasibility, stability, and real-time performance of PerchRL, while successful deployment across distinct quadrotor platforms further validates its adaptability. The source code will be released to benefit the community.

URL PDF HTML ☆

赞 0 踩 0

2606.03175 2026-06-04 cs.CV cs.RO 版本更新

Ask When It Pays: Cost-Aware Open-Ended Interaction for Instance Goal Navigation

在值得时询问：面向实例目标导航的成本感知开放式交互

Xunyi Zhao, Sihao Lin, Gengze Zhou, Zerui Li, Shijie Li, Wei Tao, Jiajun Liu, Qi Wu

发表机构 * Adelaide University（阿德莱德大学）； Responsible AI Research Centre, Australian Institute for Machine Learning（负责任人工智能研究中心，澳大利亚机器学习研究所）； Institute for Infocomm Research (I2R), A*STAR（信息与通信研究院（I2R），A*STAR）； iMotion ； CSIRO Data61 Project Website（CSIRO Data61项目网站）

AI总结针对实例目标导航中语言歧义问题，提出一种成本敏感的不确定性减少方法，通过信息增益分析确定有效问题类型，并构建基准测试和加权成功率指标，实现零样本MLLM导航器仅在预期收益大于成本时查询。

详情

AI中文摘要

低成本机器人臂的可重复且物理可行的动力学参数辨识框架

Junji Oaki, Koki Yamane, Koki Inami, Sho Sakaino

发表机构 * Institute of Systems and Information Engineering, University of Tsukuba（系统与信息工程研究所，茨川大学）

AI总结针对低成本机器人臂CRANE-X7，提出一种结合最小二乘、半定规划投影和闭环输入误差精化的可重复且物理可行的动力学参数辨识方法，并通过主成分分析和惯性矩阵正定性审核确保模型统计一致性与物理可行性。

Comments 11 pages, 8 figures, 7 tables, 1 algorithm and 2 appendices

详情

AI中文摘要

本文针对由模块化智能驱动器驱动的低成本机器人臂CRANE-X7，提出了一种可重复且物理可行的动力学参数辨识框架。为提高实际可辨识性，根据近似连杆对称性移除惯性积，将刚体模型从65个基础参数减少至39个。辨识运动是在实际关节限位下，由结构化的单关节和相邻关节基元手工设计而成。所提出的流程结合了预处理、基于逆动力学回归的普通最小二乘（OLS）、用于可行性恢复的条件半定规划（SDP）投影以及闭环输入误差（CLIE）精化。在共同的主成分分析（PCA）空间中分析来自40个结构化轨迹的候选解，以选择一个统计上中心的代表性模型。由于统计中心性本身不能保证物理可接受性，最终选定的模型需通过所有位姿下的惯性矩阵正定性审核，并在必要时通过局部化的后CLIE SDP救援步骤进行修正。实验表明，参数云从OLS到SDP再到CLIE逐渐变得更加集中，而最终接受的模型在保留的验证运动上保持了高预测精度。这些结果为低成本机器人平台获得统计一致且物理可行的动力学模型提供了一条实用途径。

英文摘要

This paper presents a reproducible and physically feasible dynamic parameter identification framework for CRANE-X7, a low-cost robot arm driven by modular smart actuators. To improve practical identifiability, products of inertia are removed according to approximate link symmetry, reducing the rigid-body model from 65 to 39 base parameters. Identification motions are hand-designed from structured single-joint and adjacent-joint primitives under practical joint-range limits. The proposed pipeline combines preprocessing, inverse-dynamics-regressor-based ordinary least squares (OLS), conditional semidefinite-programming (SDP) projection for feasibility recovery, and closed-loop input error (CLIE) refinement. Candidate solutions from 40 structured trajectories are analyzed in a common principal component analysis (PCA) space to select a statistically central representative model. Because statistical centrality alone does not ensure physical acceptability, the selected model is finally screened by an all-pose positive-definiteness audit of the inertia matrix and, when necessary, corrected by a localized post-CLIE SDP rescue step. Experiments show that the parameter cloud becomes progressively more concentrated from OLS to SDP and CLIE, while the final accepted model preserves high predictive accuracy on held-out validation motions. These results demonstrate a practical route to statistically coherent and physically feasible dynamic models for low-cost robot platforms.

URL PDF HTML ☆

赞 0 踩 0

1902.10607 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Necessary and Sufficient Conditions for Passivity of Velocity-Sourced Impedance Control of Series Elastic Actuators

速度源阻抗控制系列弹性驱动器被动性的必要和充分条件

Fatih Emre Tosun, Volkan Patoglu

发表机构 * Faculty of Engineering and Natural Sciences, Sabancı University（工程与自然科学学院，萨班奇大学）

AI总结本文研究了系列弹性驱动器速度源阻抗控制架构的被动性条件，提出了非保守的设计指南以实现null阻抗和纯弹簧的触觉显示，并强调了在积分控制器中包含物理阻尼的重要性。

Comments Submitted to IEEE T-RO, 12 pages, 10 figures, 7 tables

详情

DOI: 10.1109/TRO.2019.2962332

AI中文摘要

系列弹性驱动（SEA）因其在物理人机交互应用中的稳定性鲁棒性和力控制精度而变得普遍。已提出几种SEA的阻抗控制架构。其中，具有内层速度环、中层扭矩环和外层阻抗环的级联控制器因其简单性、鲁棒性和性能而特别受欢迎。本文推导了确保该级联控制器架构在渲染两种最常见的虚拟阻抗模型时被动性的必要和充分条件。基于新建立的被动性条件，我们提供了非保守的设计指南，以触觉显示null阻抗和纯弹簧，同时确保交互的被动性。我们还展示了在推导被动性条件时，当使用积分控制器时，包含物理阻尼的重要性。特别是，我们展示了物理阻尼对系统被动性的影响。

英文摘要

Series Elastic Actuation (SEA) has become prevalent in applications involving physical human-robot interaction as it provides considerable advantages over traditional stiff actuators in terms of stability robustness and fidelity of force control. Several impedance control architectures have been proposed for SEA. Among these alternatives, the cascaded controller with an inner-most velocity loop, an intermediate torque loop and an outer-most impedance loop is particularly favoured for its simplicity, robustness, and performance. In this paper, we derive the \emph{necessary and sufficient conditions} to ensure the passivity of this cascade-controller architecture for rendering two most common virtual impedance models. Based on the newly established passivity conditions, we provide non-conservative design guidelines to haptically display a null impedance and a pure spring while ensuring the passivity of interaction. We also demonstrate the importance of including physical damping in the actuator model during derivation of passivity conditions, when integral controllers are utilized. In particular, we show the adversary effect of physical damping on system passivity.

URL PDF HTML ☆

赞 0 踩 0

2304.10891 2026-06-04 cs.LG cs.AI cs.CV cs.RO cs.SY eess.SY 版本更新

Transformer-Based Autonomous Driving Models and Deployment-Oriented Compression: A Survey

基于Transformer的自动驾驶模型与面向部署的压缩：综述

Juan Zhong, Yuhang Shi, Zukang Xu, Xi Chen

发表机构 * Renmin University of China（中国人民大学）； Artificial Intelligence Innovation and Incubation Institute, Fudan University（复旦大学人工智能创新与孵化院）； Shanghai Academy of AI for Science（上海人工智能科学研究院）； Department of houmo.ai（houmo.ai部门）

AI总结本文综述了基于Transformer的自动驾驶模型，并从部署角度分析了压缩与加速策略（如量化、剪枝、知识蒸馏等）如何影响模型设计、部署性、鲁棒性和安全性。

详情

AI中文摘要

基于Transformer的模型正成为自动驾驶的核心范式，因为它们能够捕捉感知、预测和规划中的长程空间依赖、多智能体交互和多模态上下文。然而，它们在真实车辆中的部署仍然困难，因为高容量注意力架构带来了显著的延迟、内存和能量开销。本综述回顾了具有代表性的基于Transformer的自动驾驶模型，并按任务角色、感知配置和架构设计进行组织。更重要的是，我们从面向部署的角度审视这些模型，分析效率约束如何在实际中重塑模型设计选择。我们进一步回顾了与基于Transformer的驾驶系统相关的压缩和加速策略，包括量化、剪枝、知识蒸馏、低秩近似和高效注意力，并讨论了它们的优势、局限性和任务依赖性。我们不将压缩视为孤立的后期处理步骤，而是强调其作为直接影响部署性、鲁棒性和安全性的系统级设计考虑。最后，我们指出了面向标准化、安全感知和硬件感知的高效自动驾驶系统评估的开放挑战和未来研究方向。

英文摘要

Transformer-based models are becoming a central paradigm in autonomous driving because they can capture long-range spatial dependencies, multi-agent interactions, and multimodal context across perception, prediction, and planning. At the same time, their deployment in real vehicles remains difficult because high-capacity attention-based architectures impose substantial latency, memory, and energy overhead. This survey reviews representative Transformer-based autonomous driving models and organizes them by task role, sensing configuration, and architectural design. More importantly, it examines these models from a deployment-oriented perspective and analyzes how efficiency constraints reshape model design choices in practice. We further review compression and acceleration strategies relevant to Transformer-based driving systems, including quantization, pruning, knowledge distillation, low-rank approximation, and efficient attention, and discuss their benefits, limitations, and task-dependent applicability. Rather than treating compression as an isolated post-processing step, we highlight it as a system-level design consideration that directly affects deployability, robustness, and safety. Finally, we identify open challenges and future research directions toward standardized, safety-aware, and hardware-conscious evaluation of efficient autonomous driving systems.

URL PDF HTML ☆

赞 0 踩 0

2605.00416 2026-06-04 cs.RO 版本更新

Learning While Deploying: Fleet-Scale Reinforcement Learning for Generalist Robot Policies

在部署中学习：面向通用机器人策略的机群规模强化学习

Yi Wang, Xinchen Li, Pengwei Xie, Pu Yang, Buqing Nie, Yunuo Cai, Qinglin Zhang, Chendi Qu, Jeffrey Wu, Jianheng Song, Xinlin Ren, Jingshun Huang, Mingjie Pan, Siyuan Feng, Zhi Chen, Jianlan Luo

发表机构 * Shanghai Innovation Institute（上海创新研究院）； AGIBOT Finch ； Columbia University（哥伦比亚大学）

AI总结提出LWD框架，通过机群规模的离线到在线强化学习，结合分布式隐式价值学习与伴随匹配Q学习，持续后训练通用视觉-语言-动作策略，在16台双臂机器人上实现95%平均成功率。

Comments No

详情

AI中文摘要

通用机器人策略日益受益于大规模预训练，但仅靠离线数据不足以实现稳健的实世界部署。已部署的机器人会遇到分布偏移、长尾故障、任务变化以及人类纠正机会，这些是固定演示数据集无法完全捕获的。我们提出了“在部署中学习”（LWD），一个机群规模的离线到在线强化学习框架，用于通用视觉-语言-动作（VLA）策略的持续后训练。从预训练的VLA策略开始，LWD通过使用在机器人机群中收集的自主 rollout 和人类干预，在部署、共享物理经验、策略改进和重新部署之间形成闭环。为了稳定地从异构、稀疏奖励的机群数据中学习，LWD结合了分布式隐式价值学习（DIVL）进行鲁棒的价值估计，以及通过伴随匹配的Q学习（QAM）在基于流的VLA动作生成器中进行策略提取。我们在一个由16台双臂机器人组成的机群上，在八个真实世界操作任务（包括语义杂货补货和3-5分钟的长时域任务）上验证了LWD。单个通用策略随着机群经验的积累而改进，平均成功率达到95%，在长时域任务上提升最大。

英文摘要

Generalist robot policies increasingly benefit from large-scale pretraining, but offline data alone is insufficient for robust real-world deployment. Deployed robots encounter distribution shifts, long-tail failures, task variations, and human correction opportunities that fixed demonstration datasets cannot fully capture. We present Learning While Deploying (LWD), a fleet-scale offline-to-online reinforcement learning framework for continual post-training of generalist Vision-Language-Action (VLA) policies. Starting from a pretrained VLA policy, LWD closes the loop between deployment, shared physical experience, policy improvement, and redeployment by using autonomous rollouts and human interventions collected across a robot fleet. To stabilize learning from heterogeneous, sparse-reward fleet data, LWD combines Distributional Implicit Value Learning (DIVL) for robust value estimation with Q-learning via Adjoint Matching (QAM) for policy extraction in flow-based VLA action generators. We validate LWD on a fleet of 16 dual-arm robots across eight real-world manipulation tasks, including semantic grocery restocking and 3--5 minute long-horizon tasks. A single generalist policy improves as fleet experience accumulates, reaching an average success rate of 95%, with the largest gains on long-horizon tasks.

URL PDF HTML ☆

赞 0 踩 0

2604.25050 2026-06-04 cs.RO 版本更新

DiscreteRTC: Discrete Diffusion Policies are Natural Asynchronous Executors

DiscreteRTC：离散扩散策略是自然的异步执行器

Pengcheng Wang, Kaiwen Hong, Chensheng Peng, Katherine Driggs-Campbell, Masayoshi Tomizuka, Chenfeng Xu, Chen Tang

发表机构 * UC Berkeley（加州大学伯克利分校）； UIUC（伊利诺伊大学香槟分校）； UT Austin（德克萨斯大学奥斯汀分校）； UCLA（加州大学洛杉矶分校）

AI总结针对同步执行器在动态任务中的致命停顿问题，提出DiscreteRTC方法，利用离散扩散策略的原生修复能力实现异步执行，在动态模拟和真实操作任务中取得更高成功率。

详情

AI中文摘要

与聊天机器人不同，物理AI必须在世界不断变化的同时行动。因此，无论推理速度有多快，同步执行器的块间停顿对于动态任务都是致命的。异步执行——边行动边思考——因此是一个结构性要求，而实时分块（RTC）通过将块转换重新定义为修复（冻结已承诺的动作并一致地生成剩余部分）使其可行。然而，基于流匹配策略的RTC在结构上并非最优：其修复来自推理时的修正而非基础策略，导致几乎没有预训练收益、需要特定微调、启发式指导以及增加延迟的额外计算。在这项工作中，我们观察到离散扩散策略通过迭代去掩码生成动作，是自然的异步执行器，一次性解决了所有限制：由于修复是其原生操作，因此无需微调，而提前停止进一步提供了自适应指导并降低了推理成本。我们提出了DiscreteRTC，用原生去掩码替代外部修正，并在动态模拟基准和真实世界动态操作任务上展示了其比连续RTC和其他基线更高的成功率。总之，DiscreteRTC实现更简单，无需额外代码即可启用异步修复；推理更快，仅需从头生成动作约0.7倍的计算量；执行更好，在真实世界曲棍球防守任务中，成功率比流匹配RTC高65%，比训练时流匹配RTC高30%。更多可视化见https://outsider86.github.io/DiscreteRTCSite/。

英文摘要

Unlike chatbots, physical AI must act while the world keeps evolving. Therefore, the inter-chunk pause of synchronous executors are fatal for dynamic tasks regardless of how fast the inference is. Asynchronous execution -- thinking while acting -- is therefore a structural requirement, and real-time chunking (RTC) makes it viable by recasting chunk transitions as inpainting: freezing committed actions and consistently generating the remainder. However, RTC with flow-matching policy is structurally suboptimal: its inpainting comes from inference-time corrections rather than the base policy, yielding little pre-training benefit, specific fine-tuning, heuristic guidance, and extra computation that inflates the latency. In this work, we observe that discrete diffusion policies, which generate actions by iteratively unmasking, are natural asynchronous executors that resolve all limitations at once: they are fine-tuning free since inpainting is their native operation, while early stopping further provides adaptive guidance and reduces inference cost. We propose DiscreteRTC, which replaces external corrections with native unmasking, and show on dynamic simulated benchmarks and real-world dynamic manipulation tasks that it achieves higher success rates than continuous RTC and other baselines. In summary, DiscreteRTC is simpler to implement with 0 lines of additional code to enable async inpainting, faster at inference with only ~0.7 computation compared with generating actions from scratch, and better at execution with 65% higher success rate in real-world hockey defend task compared with flow-matching RTC, and 30% higher compared with training-time flow-matching RTC. More visualizations are on https://outsider86.github.io/DiscreteRTCSite/.

URL PDF HTML ☆

赞 0 踩 0

2604.12645 2026-06-04 cs.RO cs.AI 版本更新

Contextual Multi-Task Reinforcement Learning for Autonomous Reef Monitoring

上下文多任务强化学习用于自主珊瑚礁监测

Melvin Laux, Yi-Ling Liu, Rina Alo, Sören Töpper, Mariela De Lucas Alvarez, Frank Kirchner, Rebecca Adam

发表机构 * University of Bremen（不莱梅大学）

AI总结针对水下动力学不确定性和任务变化，提出上下文多任务强化学习框架，学习可复用的控制策略，在模拟环境中实现高效训练、零样本泛化和鲁棒性。

Comments To be published in IEEE OCEANS 2026 (Sanya) conference proceedings

详情

AI中文摘要

尽管自主水下航行器有望实现海洋生态系统监测，但其部署从根本上受限于在高度不确定和非平稳的水下动力学下控制航行器的难度。为了解决这些挑战，我们采用数据驱动的强化学习方法来补偿未知动力学和任务变化。传统的单任务强化学习容易过拟合训练环境，从而限制了所学策略的长期实用性。因此，我们提出使用上下文多任务强化学习范式，允许我们学习可复用于各种任务的控制器，例如在一个珊瑚礁中检测牡蛎，在另一个珊瑚礁中检测珊瑚。我们评估上下文多任务强化学习是否能有效学习自主水下珊瑚礁监测的鲁棒且可泛化的控制策略。我们在HoloOcean中的模拟珊瑚礁环境中训练了一个单一上下文相关策略，该策略能够解决多个相关的监测任务。在我们的实验中，我们经验性地评估了上下文策略在样本效率、对未见任务的零样本泛化以及对变化水流的鲁棒性方面的表现。通过利用多任务强化学习，我们旨在提高训练效率以及所学策略的可重用性，从而向更可持续的自主珊瑚礁监测程序迈进一步。

英文摘要

Although autonomous underwater vehicles promise the capability of marine ecosystem monitoring, their deployment is fundamentally limited by the difficulty of controlling vehicles under highly uncertain and non-stationary underwater dynamics. To address these challenges, we employ a data-driven reinforcement learning approach to compensate for unknown dynamics and task variations. Traditional single-task reinforcement learning has a tendency to overfit the training environment, thus, limit the long-term usefulness of the learnt policy. Hence, we propose to use a contextual multi-task reinforcement learning paradigm instead, allowing us to learn controllers that can be reused for various tasks, e.g., detecting oysters in one reef and detecting corals in another. We evaluate whether contextual multi-task reinforcement learning can efficiently learn robust and generalisable control policies for autonomous underwater reef monitoring. We train a single context-dependent policy that is able to solve multiple related monitoring tasks in a simulated reef environment in HoloOcean. In our experiments, we empirically evaluate the contextual policies regarding sample-efficiency, zero-shot generalisation to unseen tasks, and robustness to varying water currents. By utilising multi-task reinforcement learning, we aim to improve the training effectiveness, as well as the reusability of learnt policies to take a step towards more sustainable procedures in autonomous reef monitoring.

URL PDF HTML ☆

赞 0 踩 0

2604.04974 2026-06-04 cs.RO 版本更新

From Video to Control: A Survey of Learning Manipulation Interfaces from Temporal Visual Data

从视频到控制：基于时间视觉数据学习操作接口的综述

Linfang Zheng, Zikai Ouyang, Chen Wang, Jia Pan, Wei Zhang

发表机构 * School of Automation and Intelligent Manufacturing (AiM)（自动化与智能制造学院）； Guangdong Provincial Key Laboratory of Fully Actuated System Control Theory and Technology（广东省全驱动系统控制理论与技术省级重点实验室）； Southern University of Science and Technology（南方科技大学）； The University of Hong Kong（香港大学）； Peng Cheng Laboratory（鹏城实验室）； LimX Dynamics

AI总结本文综述了利用无动作标注的时间视频数据学习机器人操作控制接口的方法，提出以接口为中心的三种分类：直接视频-动作策略、潜在动作方法和显式视觉接口，并分析了控制集成特性与未来研究方向。

详情

AI中文摘要

视频是物理动力学的可扩展观测：它捕捉物体如何移动、接触如何展开以及场景在交互中如何演变——所有这些都不需要机器人动作标签。然而，将这种时间结构转化为可靠的机器人控制仍然是一个开放的挑战，因为视频缺乏动作监督，并且在具身、视角和物理约束方面与机器人经验不同。本综述回顾了利用无动作标注的时间视频来学习机器人操作控制接口的方法。我们引入了一种以接口为中心的分类法，按照视频到控制接口的构建位置及其启用的控制属性进行组织，识别出三个家族：直接视频-动作策略（保持接口隐式）、潜在动作方法（通过紧凑的学习中间体路由时间结构）以及显式视觉接口（预测下游控制的可解释目标）。对于每个家族，我们分析了控制集成特性——如何闭合回路、执行前可以验证什么以及失败在何处进入。跨家族的综合分析表明，最紧迫的开放挑战集中在机器人集成层——将视频衍生的预测连接到可靠机器人行为的机制——我们概述了弥合这一差距的研究方向。

英文摘要

Video is a scalable observation of physical dynamics: it captures how objects move, how contact unfolds, and how scenes evolve under interaction -- all without requiring robot action labels. Yet translating this temporal structure into reliable robotic control remains an open challenge, because video lacks action supervision and differs from robot experience in embodiment, viewpoint, and physical constraints. This survey reviews methods that exploit non-action-annotated temporal video to learn control interfaces for robotic manipulation. We introduce an interface-centric taxonomy organized by where the video-to-control interface is constructed and what control properties it enables, identifying three families: direct video-action policies, which keep the interface implicit; latent-action methods, which route temporal structure through a compact learned intermediate; and explicit visual interfaces, which predict interpretable targets for downstream control. For each family, we analyze control-integration properties -- how the loop is closed, what can be verified before execution, and where failures enter. A cross-family synthesis reveals that the most pressing open challenges center on the robotics integration layer -- the mechanisms that connect video-derived predictions to dependable robot behavior -- and we outline research directions toward closing this gap.

URL PDF HTML ☆

赞 0 踩 0

2602.23312 2026-06-04 cs.HC cs.AI cs.LG cs.RO cs.SY eess.SY 版本更新

Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction

评估小语言模型在领导者-跟随者交互中的零样本和单样本适应

Rafael R. Baptista, André de Lima Salgado, Ricardo V. Godoy, Marcelo Becker, Thiago Boaventura, Gustavo J. G. Lahr

发表机构 * University of Sao Paulo（圣保罗大学）； Federal University of Lavras（拉瓦尔联邦大学）； Faculdade Israelita de Ensino e Pesquisa Albert Einstein（亚伯拉罕·林克·埃instein教育与研究学院）

AI总结本文通过微调小语言模型（Qwen2.5-0.5B）在领导者-跟随者交互中实现角色分类，零样本微调达到86.66%准确率且延迟低至22.2毫秒，但单样本模式因上下文长度增加导致性能下降。

详情

AI中文摘要

领导者-跟随者交互是人机交互（HRI）中的一个重要范式。然而，对于资源受限的移动和辅助机器人来说，实时分配角色仍然具有挑战性。虽然大型语言模型（LLMs）在自然通信方面显示出潜力，但其规模和延迟限制了设备端部署。小语言模型（SLMs）提供了一种潜在的替代方案，但它们在HRI中角色分类的有效性尚未得到系统评估。在本文中，我们提出了一个用于领导者-跟随者通信的SLMs基准测试，引入了一个源自已发表数据库的新数据集，并增加了合成样本以捕捉交互特定的动态。我们研究了两种适应策略：提示工程和微调，在零样本和单样本交互模式下进行研究，并与未训练的基线进行比较。使用Qwen2.5-0.5B的实验表明，零样本微调实现了稳健的分类性能（86.66%准确率），同时保持低延迟（每个样本22.2毫秒），显著优于基线和提示工程方法。然而，结果也表明在单样本模式下性能下降，其中增加的上下文长度挑战了模型的架构能力。这些发现表明，微调的SLMs为直接角色分配提供了有效的解决方案，同时突出了边缘端对话复杂性与分类可靠性之间的关键权衡。

英文摘要

Leader-follower interaction is an important paradigm in human-robot interaction (HRI). Yet, assigning roles in real time remains challenging for resource-constrained mobile and assistive robots. While large language models (LLMs) have shown promise for natural communication, their size and latency limit on-device deployment. Small language models (SLMs) offer a potential alternative, but their effectiveness for role classification in HRI has not been systematically evaluated. In this paper, we present a benchmark of SLMs for leader-follower communication, introducing a novel dataset derived from a published database and augmented with synthetic samples to capture interaction-specific dynamics. We investigate two adaptation strategies: prompt engineering and fine-tuning, studied under zero-shot and one-shot interaction modes, compared with an untrained baseline. Experiments with Qwen2.5-0.5B reveal that zero-shot fine-tuning achieves robust classification performance (86.66% accuracy) while maintaining low latency (22.2 ms per sample), significantly outperforming baseline and prompt-engineered approaches. However, results also indicate a performance degradation in one-shot modes, where increased context length challenges the model's architectural capacity. These findings demonstrate that fine-tuned SLMs provide an effective solution for direct role assignment, while highlighting critical trade-offs between dialogue complexity and classification reliability on the edge.

URL PDF HTML ☆

赞 0 踩 0

2603.09170 2026-06-04 cs.RO cs.AI 版本更新

ZeroWBC: Learning Natural Whole-Body Humanoid Interaction from Human Egocentric Data

ZeroWBC: 从人类自我中心数据学习自然全身人形交互

Haoran Yang, Jiacheng Bao, Yucheng Xin, Haoming Song, Yuyang Tian, Bin Zhao, Dong Wang, Xuelong Li

发表机构 * University of Science and Technology of China（中国科学技术大学）； Shanghai AI Laboratory（上海人工智能实验室）； Northwestern Polytechnical University（西北工业大学）； Tsinghua University（清华大学）； Shanghai Jiao Tong University（上海交通大学）； TeleAI, China Telecom（TeleAI，中国电信）

AI总结提出ZeroWBC框架，利用人类自我中心视频和同步全身运动数据，通过生成-跟踪方法实现无遥操作的人形机器人全身交互控制。

详情

AI中文摘要

由于全身遥操作数据成本高昂，实现多功能且自然的全身人形交互控制仍然具有挑战性。我们提出ZeroWBC，一种无需遥操作的框架，从人类自我中心视频以及同步的全身运动和文本注释中学习人形全身交互。ZeroWBC采用生成-跟踪公式来解决静态场景全身交互控制问题。给定初始自我中心图像和语言指令，微调的视觉-语言模型生成未来人类全身运动标记，这些标记被解码为连续运动并重定向到人形机器人。得到的参考运动，连同根部和关键身体部位轨迹，然后由通用交互运动跟踪策略执行。为了提高交互性能，我们引入了一种面向交互的跟踪奖励，该奖励优先考虑全局根部和关键身体部位轨迹对齐，同时保持自然的全身运动。在Unitree G1人形机器人上的实验表明，ZeroWBC无需机器人遥操作演示即可实现多样化的场景感知行为。这些结果表明了一种从人类自我中心数据学习自然人形全身交互的可扩展范式。

英文摘要

Achieving versatile and natural whole-body humanoid interaction control remains challenging due to the high cost of whole-body teleoperation data. We present ZeroWBC, a teleoperation-free framework that learns humanoid whole-body interaction from human egocentric videos paired with synchronized whole-body motion and text annotations. ZeroWBC adopts a generation-then-tracking formulation to tackle the static scene whole-body interaction control problem. Given an initial egocentric image and a language instruction, a fine-tuned Vision-Language Model generates future human whole-body motion tokens, which are decoded into continuous motions and retargeted to the humanoid. The resulting reference motions, together with root and key body-part trajectories, are then executed by a general interactive motion tracking policy. To improve interaction performance, we introduce an interaction-oriented tracking reward that prioritizes global root and key body-part trajectory alignment while preserving natural whole-body motion. Experiments on the Unitree G1 humanoid robot show that ZeroWBC enables diverse scene-aware behaviors without robot teleoperation demonstrations. These results suggest a scalable paradigm for learning natural humanoid whole-body interaction from human egocentric data.

URL PDF HTML ☆

赞 0 踩 0

2510.27191 2026-06-04 cs.RO cs.AI 版本更新

Vectorized Online POMDP Planning

向量化在线POMDP规划

Marcus Hoerger, Muhammad Sudrajat, Hanna Kurniawati

发表机构 * School of Computing, Australian National University（澳大利亚国立大学计算机学院）

AI总结提出向量化在线POMDP规划器（VOPP），通过全向量化计算消除并行瓶颈，实现大规模并行在线规划，计算效率比现有最先进并行求解器高至少20倍。

Comments 8 pages, 3 figures. Accepted at ICRA 2026

详情

AI中文摘要

部分可观测下的规划是自主机器人的一项基本能力。部分可观测马尔可夫决策过程（POMDP）为部分可观测问题下的规划提供了强大的框架，捕捉了动作的随机效应以及通过噪声观测获得的有限信息。POMDP求解可以极大受益于当今硬件上的大规模并行化，但并行化POMDP求解器一直具有挑战性。大多数求解器依赖于将动作上的数值优化与其价值估计交错进行，这会在并行进程之间产生依赖关系和同步瓶颈，从而抵消并行化的好处。在本文中，我们提出了向量化在线POMDP规划器（VOPP），一种新颖的并行在线求解器，它利用了最近的POMDP公式，该公式解析地解决了优化组件的一部分，将数值计算仅保留为期望的估计。VOPP将所有与规划相关的数据结构表示为张量集合，并将所有规划步骤实现为该表示上的全向量化计算。结果是一个大规模并行的在线求解器，并发进程之间没有依赖关系或同步瓶颈。实验结果表明，与现有的最先进并行在线求解器相比，VOPP在计算近最优解方面的效率至少高出20倍。此外，VOPP优于最先进的顺序在线求解器，同时使用的规划预算小1000倍。

英文摘要

Planning under partial observability is an essential capability of autonomous robots. The Partially Observable Markov Decision Process (POMDP) provides a powerful framework for planning under partial observability problems, capturing the stochastic effects of actions and the limited information available through noisy observations. POMDP solving could benefit tremendously from massive parallelization on today's hardware, but parallelizing POMDP solvers has been challenging. Most solvers rely on interleaving numerical optimization over actions with the estimation of their values, which creates dependencies and synchronization bottlenecks between parallel processes that can offset the benefits of parallelization. In this paper, we propose Vectorized Online POMDP Planner (VOPP), a novel parallel online solver that leverages a recent POMDP formulation which analytically solves part of the optimization component, leaving numerical computations to consist of only estimation of expectations. VOPP represents all data structures related to planning as a collection of tensors, and implements all planning steps as fully vectorized computations over this representation. The result is a massively parallel online solver with no dependencies or synchronization bottlenecks between concurrent processes. Experimental results indicate that VOPP is at least $20\times$ more efficient in computing near-optimal solutions compared to an existing state-of-the-art parallel online solver. Moreover, VOPP outperforms state-of-the-art sequential online solvers, while using a planning budget that is $1000\times$ smaller.

URL PDF HTML ☆

赞 0 踩 0

2603.08485 2026-06-04 cs.RO 版本更新

3PoinTr: 3D Point Tracks for Learning Manipulation from Unconstrained Human Videos

3PoinTr: 从无约束人类视频中通过3D点轨迹学习操作

Adam Hung, Bardienus Pieter Duisterhof, Jeffrey Ichnowski

发表机构 * Carnegie Mellon University（卡内基梅隆大学）

AI总结提出3PoinTr方法，通过预测密集3D点轨迹从无约束人类视频中预训练样本高效的机器人操作策略，在真实和模拟任务中显著优于基线方法。

详情

AI中文摘要

从人类视频中学习操作策略可以大大减少对昂贵的机器人演示的需求，但现有方法通常需要限制性假设，如编排的人类动作、预定义关键点、手动注释或已知的抓取位置。我们提出3PoinTr，一种通过预测密集3D点轨迹从无约束人类视频中预训练样本高效机器人策略的方法。在无约束的人类演示视频中，人类可以自由地遵循他们认为合适的任何轨迹和操作策略，而不是编排他们的动作来模仿机器人。3PoinTr使用轻量级的可见性感知Transformer从人类视频中学习场景点应如何移动，然后训练一个闭环多任务机器人策略，以从这些预测的点轨迹中灵活提取与动作相关的先验知识。仅使用20个带动作标签的机器人演示，3PoinTr在真实世界任务上的平均成功率比最强的行为克隆和视频预训练基线高出25.0个百分点，在模拟中平均成功率高出29.6个百分点。针对性的消融实验支持关键设计选择，并证实了从无动作视频中学习的好处。我们进一步表明，3PoinTr的点轨迹预测Transformer通过保留对部分遮挡点的监督，优于强基线。项目页面：https://adamhung60.github.io/3PoinTr/。

英文摘要

Learning manipulation policies from human videos could greatly reduce the need for expensive robot demonstrations, but existing approaches typically require restrictive assumptions such as choreographed human motions, predefined keypoints, manual annotations, or known grasp locations. We propose 3PoinTr, a method for pretraining sample-efficient robot policies from unconstrained human videos by predicting dense 3D point tracks. In the unconstrained human demonstration videos, humans are free to follow whatever trajectories and manipulation strategies they see fit, rather than choreographing their motions to mimic a robot. 3PoinTr uses a lightweight visibility-aware transformer to learn how scene points should move from human videos, and then trains a closed-loop multitask robot policy to flexibly extract action-relevant priors from those predicted point tracks. With only 20 action-labeled robot demonstrations, 3PoinTr achieves a 25.0 percentage point higher average success rate than the strongest behavior cloning and video-pretraining baselines on real-world tasks, and a 29.6 percentage point higher average success rate in simulation. Targeted ablations support the key design choices and confirm the benefit of learning from actionless videos. We further show that 3PoinTr's point track prediction transformer outperforms a strong baseline by preserving supervision over partially occluded points. Project page: https://adamhung60.github.io/3PoinTr/.

URL PDF HTML ☆

赞 0 踩 0

2601.04493 2026-06-04 cs.RO 版本更新

Continuum Robot State Estimation with Actuation Uncertainty

具有驱动不确定性的连续体机器人状态估计

James M. Ferguson, Alan Kuntz, Tucker Hermans

发表机构 * Department of Electrical and Computer Engineering and Department of Computer Science, Vanderbilt University（电气与计算机工程系和计算机科学系，范德比尔特大学）； Kahlert School of Computing and Robotics Center, University of Utah（计算与机器人中心，犹他大学）； NVIDIA（英伟达）

AI总结针对连续体机器人在未知交互力和模型不确定性下的形状估计问题，提出一种基于机械原理的驱动先验的离散Cosserat杆模型，联合估计机器人形状、外部载荷和驱动输入，并通过稀疏因子图实现高效非线性优化。

Comments Public preprint for IEEE RAL. Accepted May 2026

详情

AI中文摘要

连续体机器人是柔性、细长的操纵器，非常适合受限的手术环境。在这些环境中，未知的相互作用力和模型不确定性显著影响机器人形状，从而激发了从外部观测进行状态估计的需求。现有的估计方法要么忽略驱动建模，要么依赖于简化的确定性驱动模型。相比之下，我们使用机械原理的驱动先验联合估计机器人形状、外部载荷和驱动输入。为此，我们提出了一种具有分段线性应变积分的离散Cosserat杆公式，该公式提供了高数值精度，同时诱导出稀疏因子图结构以实现高效的非线性优化。我们将该框架扩展到仿真中的腱驱动和并联机器人，并在手术同心管机器人上进行了实验验证。总体而言，我们的方法能够在多种机器人架构上实现原理性的实时估计，同时通过线性化因子图直接访问操纵器雅可比矩阵。

英文摘要

Continuum robots are flexible, slender manipulators well suited for confined surgical environments. In these settings, unknown interaction forces and model uncertainty significantly affect robot shape, motivating state estimation from external observations. Existing estimation methods either neglect actuation modeling or rely on simplified deterministic actuation models. In contrast, we jointly estimate robot shape, external loads, and actuation inputs using mechanically principled actuation priors. To achieve this, we present a discrete Cosserat rod formulation with piecewise-linear strain integration that provides high numerical accuracy while inducing a sparse factor graph structure for efficient nonlinear optimization. We extend the framework to tendon-driven and parallel robots in simulation and validate it experimentally on a surgical concentric tube robot. Overall, our approach enables principled real-time estimation across multiple robot architectures while providing direct access to manipulator Jacobians through the linearized factor graph.

URL PDF HTML ☆

赞 0 踩 0

2602.13081 2026-06-04 cs.RO 版本更新

Agentic AI for Robot Control: Flexible but still Fragile

用于机器人控制的智能体AI：灵活但依然脆弱

Oscar Lima, Marc Vinci, Martin Günther, Marian Renz, Alexander Sung, Sebastian Stock, Johannes Brust, Lennart Niecksch, Zongyao Yi, Felix Igelbrink, Benjamin Kisliuk, Martin Atzmueller, Joachim Hertzberg

发表机构 * ETH Zurich（苏黎世联邦理工学院）； University of Cambridge（剑桥大学）； Technical University of Munich（慕尼黑技术大学）； Max Planck Institute for Intelligent Systems（智能系统马克斯·普朗克研究所）； German Aerospace Center (DLR)（德国航空航天中心（DLR））

AI总结本文提出一种基于推理型语言模型的智能体控制系统，通过迭代规划-执行循环选择并调用机器人技能完成任务，在两种物理平台上实验表明系统灵活但存在非确定性行为、指令遵循错误和提示敏感等脆弱性。

详情

DOI: 10.1609/aaaiss.v8i1.42578

AI中文摘要

最近的工作利用生成模型的能力和常识先验进行机器人控制。在本文中，我们提出了一种智能体控制系统，其中具有推理能力的语言模型通过迭代规划器和执行器循环选择并调用机器人技能来规划和执行任务。我们在两种物理机器人平台上部署该系统，分别用于：(i) 室内移动操作（Mobipick）中的桌面抓取、放置和箱子插入，以及 (ii) 自主农业导航和感知（Valdemar）。两种设置都涉及不确定性、部分可观测性、传感器噪声和模糊的自然语言命令。该系统公开了其规划和决策过程的结构化内省，通过显式事件检查对外部事件做出反应，并支持操作员干预以修改或重定向正在进行的执行。在两个平台上，我们的概念验证实验揭示了显著的脆弱性，包括非确定性次优行为、指令遵循错误以及对提示规范的高度敏感性。同时，该架构是灵活的：转移到不同的机器人和任务领域主要需要更新系统提示（领域模型、可供性和动作目录）并将相同的工具接口重新绑定到平台特定的技能API。

英文摘要

Recent work leverages the capabilities and commonsense priors of generative models for robot control. In this paper, we present an agentic control system in which a reasoning-capable language model plans and executes tasks by selecting and invoking robot skills within an iterative planner and executor loop. We deploy the system on two physical robot platforms in two settings: (i) tabletop grasping, placement, and box insertion in indoor mobile manipulation (Mobipick) and (ii) autonomous agricultural navigation and sensing (Valdemar). Both settings involve uncertainty, partial observability, sensor noise, and ambiguous natural-language commands. The system exposes structured introspection of its planning and decision process, reacts to exogenous events via explicit event checks, and supports operator interventions that modify or redirect ongoing execution. Across both platforms, our proof-of-concept experiments reveal substantial fragility, including non-deterministic suboptimal behavior, instruction-following errors, and high sensitivity to prompt specification. At the same time, the architecture is flexible: transfer to a different robot and task domain largely required updating the system prompt (domain model, affordances, and action catalogue) and re-binding the same tool interface to the platform-specific skill API.

URL PDF HTML ☆

赞 0 踩 0

2602.12215 2026-06-04 cs.RO 版本更新

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

LDA-1B：通过通用具身数据摄取扩展潜动力学动作模型

Jiangran Lyu, Kai Liu, Xuheng Zhang, Haoran Liao, Yusen Feng, Wenxuan Zhu, Tingrui Shen, Jiayi Chen, Jiazhao Zhang, Yifei Dong, Wenbo Cui, Senmao Qi, Shuo Wang, Yixin Zheng, Mi Yan, Xuesong Shi, Haoran Li, Dongbin Zhao, Ming-Yu Liu, Zhizheng Zhang, Li Yi, Yizhou Wang, He Wang

发表机构 * Peking University（北京大学）； Galbot ； CASIA（中国科学院自动化研究所）； BAAI（北京人工智能研究院）； Tsinghua University（清华大学）； Sun Yat-sen University（中山大学）； NVIDIA

AI总结提出LDA-1B机器人基础模型，通过统一格式的具身交互数据集EI-30k和结构化DINO潜空间中的动力学学习，联合建模动力学、策略和视觉预测，在接触丰富、灵巧和长时任务上分别提升高达21%、48%和23%。

Comments Accepted at RSS 2026, Project Page:https://pku-epic.github.io/LDA

详情

AI中文摘要

最近的机器人基础模型很大程度上依赖于大规模行为克隆，该方法模仿专家动作，但丢弃了嵌入在异构具身数据中的可迁移动力学知识。虽然统一世界模型（UWM）公式有潜力利用这种多样化数据，但由于粗粒度的数据使用和碎片化的数据集，现有实例难以扩展到基础模型级别。我们引入了LDA-1B，一个通过通用具身数据摄取进行扩展的机器人基础模型，它通过联合学习动力学、策略和视觉预测，为不同质量的数据分配不同的角色。为了大规模支持这种机制，我们组装并标准化了EI-30k，一个具身交互数据集，包含超过3万小时的人类和机器人轨迹，采用统一格式。通过结构化DINO潜空间中的预测实现了对这种异构数据的可扩展动力学学习，避免了冗余的像素空间外观建模。作为这种表示的补充，LDA-1B采用多模态扩散变换器来处理异步视觉和动作流，从而在1B参数规模上实现稳定训练。在模拟和真实世界的实验中，LDA-1B在接触丰富、灵巧和长时任务上分别比先前方法（例如π_{0.5}）高出高达21%、48%和23%。值得注意的是，LDA-1B实现了数据高效的微调，通过利用通常有害且被丢弃的30%低质量轨迹，获得了10%的提升。

英文摘要

Recent robot foundation models largely rely on large-scale behavior cloning, which imitates expert actions but discards transferable dynamics knowledge embedded in heterogeneous embodied data. While the Unified World Model (UWM) formulation has the potential to leverage such diverse data, existing instantiations struggle to scale to foundation-level due to coarse data usage and fragmented datasets. We introduce LDA-1B, a robot foundation model that scales through universal embodied data ingestion by jointly learning dynamics, policy, and visual forecasting, assigning distinct roles to data of varying quality. To support this regime at scale, we assemble and standardize EI-30k, an embodied interaction dataset comprising over 30k hours of human and robot trajectories in a unified format. Scalable dynamics learning over such heterogeneous data is enabled by prediction in a structured DINO latent space, which avoids redundant pixel-space appearance modeling. Complementing this representation, LDA-1B employs a multi-modal diffusion transformer to handle asynchronous vision and action streams, enabling stable training at the 1B-parameter scale. Experiments in simulation and the real world show LDA-1B outperforms prior methods (e.g., $π_{0.5}$) by up to 21\%, 48\%, and 23\% on contact-rich, dexterous, and long-horizon tasks, respectively. Notably, LDA-1B enables data-efficient fine-tuning, gaining 10\% by leveraging 30\% low-quality trajectories typically harmful and discarded.

URL PDF HTML ☆

赞 0 踩 0

2602.03920 2026-06-04 cs.RO cs.HC 版本更新

How Users Understand Robot Foundation Model Performance through Task Success Rates and Beyond

用户如何通过任务成功率及其他方式理解机器人基础模型性能

Isaac Sheidlower, Jindan Huang, James Staley, Bingyu Wu, Qicong Chen, Reuben Aronson, Elaine Short

发表机构 * Brown University（布朗大学）； Tufts University（塔夫茨大学）

AI总结通过用户研究，探讨非机器人专家如何理解机器人基础模型（RFM）评估中的任务成功率（TSR）及其他信息，发现用户不仅按专家预期使用TSR，还重视未常报告的失败案例，并希望获取历史评估数据和机器人对新任务的性能估计。

详情

AI中文摘要

机器人基础模型（RFM）代表了一种开发通用家用机器人的有前景的方法。鉴于RFM的广泛能力，用户不可避免地会要求基于RFM的机器人执行RFM未经训练或评估的任务。在这些情况下，由于失败成本相对较高，用户理解尝试新任务的相关风险至关重要。此外，了解RFM能力的知情用户将知道机器人能够处理哪些情况和任务。在本文中，我们研究非机器人专家如何解释RFM评估中的性能信息。这些评估通常报告任务成功率（TSR）作为主要性能指标。虽然TSR对专家来说是直观的，但有必要验证新手是否也按预期使用这些信息。为此，我们进行了一项研究，用户看到了真实的评估数据，包括TSR、失败案例描述以及来自多个已发表RFM研究项目的视频。结果强调，非专家不仅以与专家预期一致的方式使用TSR，而且还高度重视其他类型的信息，例如RFM评估中通常不报告的失败案例。此外，我们发现用户希望访问RFM先前评估的真实数据以及机器人关于其在新任务上表现如何的估计。

英文摘要

Robot Foundation Models (RFMs) represent a promising approach to developing general-purpose home robots. Given the broad capabilities of RFMs, users will inevitably ask an RFM-based robot to perform tasks that the RFM was not trained or evaluated on. In these cases, it is crucial that users understand the risks associated with attempting novel tasks due to the relatively high cost of failure. Furthermore, an informed user who understands an RFM's capabilities will know what situations and tasks the robot can handle. In this paper, we study how non-roboticists interpret performance information from RFM evaluations. These evaluations typically report task success rate (TSR) as the primary performance metric. While TSR is intuitive to experts, it is necessary to validate whether novices also use this information as intended. Toward this end, we conducted a study in which users saw real evaluation data, including TSR, failure case descriptions, and videos from multiple published RFM research projects. The results highlight that non-experts not only use TSR in a manner consistent with expert expectations but also highly value other information types, such as failure cases that are not often reported in RFM evaluations. Furthermore, we find that users want access to both real data from previous evaluations of the RFM and estimates from the robot about how well it will do on a novel task.

URL PDF HTML ☆

赞 0 踩 0

2602.01429 2026-06-04 cs.RO 版本更新

Sem-NaVAE: Semantically-Guided Outdoor Mapless Navigation via Generative Trajectory Priors

Sem-NaVAE: 基于语义引导的室外无地图导航通过生成式轨迹先验

Gonzalo Olguín, Javier Ruiz-del-Solar

发表机构 * Department of Electrical Engineering & the Advanced Mining Technology Center (AMTC), Universidad de Chile（电气工程系及先进采矿技术中心（AMTC）、智利大学）

AI总结提出Sem-NaVAE方法，结合条件变分自编码器生成多样化轨迹和轻量视觉语言模型进行语义选择，实现室外无地图实时导航，在未见环境中达到90%成功率。

Comments Accepted for publication in IEEE Robotics and Automation Letters (RA-L). 8 pages, 5 figures

2512.24698 2026-06-04 cs.RO 版本更新

Dynamic Policy Learning for Legged Robot with Simplified Model Pretraining and Model-Homotopy-Inspired Transfer

基于简化模型预训练与模型同伦启发迁移的足式机器人动态策略学习

Dongyun Kang, Min-Gyu Kim, Tae-Gyu Song, Hajun Kim, Sehoon Ha, Hae-Won Park

发表机构 * Department of Mechanical Engineering, Korea Advanced Institute of Science and Technology (KAIST)（机械工程系，韩国科学技术院（KAIST））； School of Interactive Computing, Georgia Institute of Technology（交互计算学院，佐治亚理工学院）

AI总结提出一种延续学习框架，结合简化模型预训练和模型同伦启发迁移，高效生成和优化足式机器人的复杂动态行为，并在真实四足机器人上验证了翻转和墙面辅助等动态任务。

Comments 8 pages

详情

DOI: 10.1109/LRA.2026.3692082
Journal ref: IEEE Robotics and Automation Letters, vol. 11, no. 7, pp. 8068-8075, July 2026

AI中文摘要

为足式机器人生成动态运动仍然是一个具有挑战性的问题。虽然强化学习在各种足式运动任务中取得了显著成功，但产生高度动态的行为通常需要大量的奖励调整或高质量的演示。利用降阶模型有助于缓解这些挑战。然而，当将策略迁移到全身动力学环境时，模型差异构成了重大挑战。在这项工作中，我们引入了一个基于延续的学习框架，该框架结合了简化模型预训练和模型同伦启发迁移，以高效生成和优化复杂的动态行为。首先，我们使用单刚体模型预训练策略，以在简化环境中捕获核心运动模式。接下来，我们采用延续策略逐步将策略迁移到全身环境，以最小化性能损失。为了定义延续路径，我们引入了一条从单刚体模型到全身模型的参数化过渡路径，通过逐步重新分配躯干和腿之间的质量和惯性。与基线方法相比，所提出的方法在迁移过程中实现了更快的收敛并表现出更优的稳定性。我们的框架在包括翻转和墙面辅助机动在内的多种动态任务上得到了验证，并成功部署在真实的四足机器人上。

英文摘要

Generating dynamic motions for legged robots remains a challenging problem. While reinforcement learning has achieved notable success in various legged locomotion tasks, producing highly dynamic behaviors often requires extensive reward tuning or high-quality demonstrations. Leveraging reduced-order models can help mitigate these challenges. However, the model discrepancy poses a significant challenge when transferring policies to full-body dynamics environments. In this work, we introduce a continuation-based learning framework that combines simplified model pretraining and model-homotopy-inspired transfer to efficiently generate and refine complex dynamic behaviors. First, we pretrain the policy using a single rigid body model to capture core motion patterns in a simplified environment. Next, we employ a continuation strategy to progressively transfer the policy to the full-body environment, minimizing performance loss. To define the continuation path, we introduce a parametric transition path from the single rigid body model to the full-body model by gradually redistributing mass and inertia between the trunk and legs. The proposed method achieves faster convergence and demonstrates superior stability during the transfer process compared to baseline methods. Our framework is validated on a range of dynamic tasks, including flips and wall-assisted maneuvers, and is successfully deployed on a real quadrupedal robot.

URL PDF HTML ☆

赞 0 踩 0

2512.16919 2026-06-04 cs.CV cs.AI cs.RO 版本更新

DVGT: Driving Visual Geometry Transformer

DVGT: 驾驶视觉几何变换器

Sicheng Zuo, Zixun Xie, Wenzhao Zheng, Shaoqing Xu, Fang Li, Shengyin Jiang, Long Chen, Zhi-Xin Yang, Jiwen Lu

发表机构 * Tsinghua University（清华大学）； University of Macau（澳门大学）； Xiaomi EV（小米电动车）； Peking University（北京大学）

AI总结提出DVGT，一种从无位姿多视角图像序列重建全局稠密3D点图的视觉几何变换器，通过交替注意力机制学习几何关系，无需相机参数和后处理对齐，在多个驾驶数据集上显著优于现有模型。

Comments Code is available at https://github.com/wzzheng/DVGT

详情

AI中文摘要

从视觉输入中感知和重建3D场景几何对于自动驾驶至关重要。然而，目前仍缺乏一种能够适应不同场景和相机配置的、面向驾驶的稠密几何感知模型。为弥补这一空白，我们提出了驾驶视觉几何变换器（DVGT），它从一系列无位姿的多视角视觉输入中重建全局稠密3D点图。我们首先使用DINO骨干网络为每张图像提取视觉特征，并采用交替的视角内局部注意力、跨视角空间注意力和跨帧时间注意力来推断图像间的几何关系。然后，我们使用多个头解码第一帧自车坐标系下的全局点图以及每帧的自车位姿。与依赖精确相机参数的传统方法不同，DVGT无需显式的3D几何先验，能够灵活处理任意相机配置。DVGT直接从图像序列预测度量尺度的几何，消除了与外部传感器后对齐的需求。在包含nuScenes、OpenScene、Waymo、KITTI和DDAD的大型驾驶数据集混合训练下，DVGT在各种场景中显著优于现有模型。代码可在https://github.com/wzzheng/DVGT获取。

英文摘要

Perceiving and reconstructing 3D scene geometry from visual inputs is crucial for autonomous driving. However, there still lacks a driving-targeted dense geometry perception model that can adapt to different scenarios and camera configurations. To bridge this gap, we propose a Driving Visual Geometry Transformer (DVGT), which reconstructs a global dense 3D point map from a sequence of unposed multi-view visual inputs. We first extract visual features for each image using a DINO backbone, and employ alternating intra-view local attention, cross-view spatial attention, and cross-frame temporal attention to infer geometric relations across images. We then use multiple heads to decode a global point map in the ego coordinate of the first frame and the ego poses for each frame. Unlike conventional methods that rely on precise camera parameters, DVGT is free of explicit 3D geometric priors, enabling flexible processing of arbitrary camera configurations. DVGT directly predicts metric-scaled geometry from image sequences, eliminating the need for post-alignment with external sensors. Trained on a large mixture of driving datasets including nuScenes, OpenScene, Waymo, KITTI, and DDAD, DVGT significantly outperforms existing models on various scenarios. Code is available at https://github.com/wzzheng/DVGT.

URL PDF HTML ☆

赞 0 踩 0

2504.03038 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Learning to Adapt Control Barrier Functions Under Epistemic and Aleatoric Uncertainty

在认知不确定性和偶然不确定性下学习自适应控制障碍函数

Taekyung Kim, Robin Inho Kee, Dimitra Panagou

发表机构 * Department of Robotics, University of Michigan（机器人学系，密歇根大学）； Charles Stark Draper Laboratory（查尔斯·斯泰克·德拉珀实验室）； Department of Aerospace Engineering, University of Michigan（航空航天工程系，密歇根大学）

AI总结提出在线自适应CBF框架（OA-CBF），通过概率集成神经网络和图注意力编码器动态调整CBF参数，在保证安全性的同时减少保守性。

Comments Extended journal version of the IEEE CDC 2025 paper (available as arXiv:2504.03038v5). Project page: https://www.taekyung.me/oa-cbf

详情

四足履轮腿机器人自主运动模式切换：基于仿真的步阶跨越分析

Jie Wang, Krispin Davies

发表机构 * University of Cambridge（剑桥大学）； ClearPath AI

AI总结本文提出了一种用于四足混合机器人自主切换运动模式的方法，特别是在跨越不同高度台阶时，通过能量效率评估机制实现平稳过渡。

详情

DOI: 10.1016/j.simpat.2024.102893

AI中文摘要

混合履轮腿机器人结合了轮式和腿式运动的优势，通过高效切换滚动和行走模式，在多种地形中实现适应性。然而，自动实现这些切换仍然是重大挑战。本文介绍了一种用于四足混合机器人自主模式切换的方法，特别是在跨越台阶时。我们的方法基于一种决策机制，利用所提出的基于能量的准则评估两种运动模式的能量效率。为了确保平稳跨越台阶，我们结合了两种攀爬步态，用于评估行走运动的能量使用情况。仿真结果验证了该方法的有效性，显示在不同高度的台阶上实现了成功的自主切换。我们提出的方法具有通用性，可以修改以适应类似机械配置的其他混合机器人，前提是其运动能量性能已先进行研究。

英文摘要

Hybrid track/wheel-legged robots combine the advantages of wheel-based and leg-based locomotion, granting adaptability across varied terrains through efficient transitions between rolling and walking modes. However, automating these transitions remains a significant challenge. In this paper, we introduce a method designed for autonomous mode transition in a quadruped hybrid robot with a track/wheel-legged configuration, especially during step negotiation. Our approach hinges on a decision-making mechanism that evaluates the energy efficiency of both locomotion modes using a proposed energy-based criterion. To guarantee a smooth negotiation of steps, we incorporate two climbing gaits designated for the assessment of energy usage in walking locomotion. Simulation results validate the method's effectiveness, showing successful autonomous transitions across steps of diverse heights. Our suggested approach has universal applicability and can be modified to suit other hybrid robots of similar mechanical configuration, provided their locomotion energy performance is studied beforehand.

URL PDF HTML ☆

赞 0 踩 0

1702.03433 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Path Assignment Techniques For Vehicle Tracking

车辆跟踪中的路径分配技术

Richard Altendorfer, Sebastian Wirkert

发表机构 * Driver Assistance Systems, ZF TRW（驾驶辅助系统，ZF TRW）； Deutsches Krebsforschungszentrum（德国癌症研究中心）

AI总结本文提出两种路径分配方法，旨在通过延迟处理阶段来过滤测量数据，以避免延迟和其他中间滤波器的伪影，通过生成离散后验概率分布并使用中位数估计器提取路径或车道索引，通过ROC曲线展示方法性能。

Comments 6 pages, 9 figures

详情

DOI: 10.1109/IVS.2014.6856510
Journal ref: Proceedings of the IEEE Intelligent Vehicles Symposium (2014) 1451-1456

AI中文摘要

许多驾驶员辅助系统，如自适应巡航控制系统，需要识别处于主机车辆路径中的最近车辆。这涉及将检测到的车辆分配给主机车辆的路径或邻近路径。在回顾了主机车辆路径估计和车道分配技术的方法后，我们介绍了两种方法，这些方法受到在尽可能晚的处理阶段过滤测量数据的动机，以避免延迟和其他中间滤波器的伪影。这些滤波器生成离散后验概率分布，从中通过中位数估计器提取路径或“车道”索引。通过使用实验数据和标记的地面真实数据，展示了这些方法的相对性能。

英文摘要

Many driver assistance systems such as Adaptive Cruise Control require the identification of the closest vehicle that is in the host vehicle's path. This entails an assignment of detected vehicles to the host vehicle path or neighboring paths. After reviewing approaches to the estimation of the host vehicle path and lane assignment techniques we introduce two methods that are motivated by the rationale to filter measured data as late in the processing stages as possible in order to avoid delays and other artifacts of intermediate filters. These filters generate discrete posterior probability distributions from which a path or "lane" index is extracted by a median estimator. The relative performance of those methods is illustrated by a ROC using experimental data and labeled ground truth data.

URL PDF HTML ☆

赞 0 踩 0

1508.04124 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Complete Derivation Of The Association Log-Likelihood Distance For Multi-Object Tracking

多目标跟踪中关联对数似然距离的完整推导

Richard Altendorfer, Sebastian Wirkert

发表机构 * Driver Assistance Systems, ZF TRW（ZF TRW驾驶辅助系统）； German Cancer Research Center（德国癌症研究中心）

AI总结本文基于多目标跟踪中的关联问题，推导了关联对数似然距离，并通过蒙特卡洛模拟验证了其在关联关系准确性上的优越性。

Comments 7 pages, 3 figures

详情

DOI: 10.1109/IVS.2016.7535395
Journal ref: 2016 IEEE Intelligent Vehicles Symposium (IV)

AI中文摘要

Mahalanobis距离常用于多目标跟踪中的测量-跟踪关联。从Mahalanobis距离的原始定义出发，我们回顾了其在关联中的应用。由于多目标跟踪中没有原则将Mahalanobis距离视为一种独特的统计距离，我们重新审视了多假设跟踪中的全局关联假设作为最通用的关联设置。这些关联假设诱导出一种用于分配的距离似然量，我们称之为关联对数似然距离。我们比较了Mahalanobis距离与关联对数似然距离在蒙特卡洛模拟中产生正确关联关系的能力。结果表明，基于关联对数似然的距离在平均上比Mahalanobis距离表现更好，证实了最大化全局关联假设比最小化特定统计距离度量是一种更根本的关联方法。

英文摘要

The Mahalanobis distance is commonly used in multi-object trackers for measurement-to-track association. Starting with the original definition of the Mahalanobis distance we review its use in association. Given that there is no principle in multi-object tracking that sets the Mahalanobis distance apart as a distinguished statistical distance we revisit the global association hypotheses of multiple hypothesis tracking as the most general association setting. Those association hypotheses induce a distance-like quantity for assignment which we refer to as association log-likelihood distance. We compare the ability of the Mahalanobis distance to the association log-likelihood distance to yield correct association relations in Monte-Carlo simulations. It turns out that on average the distance based on association log-likelihood performs better than the Mahalanobis distance, confirming that the maximization of global association hypotheses is a more fundamental approach to association than the minimization of a certain statistical distance measure.

URL PDF HTML ☆

赞 0 踩 0

1809.07870 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Suspended Load Path Tracking Control Using a Tilt-rotor UAV Based on Zonotopic State Estimation

基于退化拓扑状态估计的倾斜旋翼无人机悬浮负载路径跟踪控制

Brenner S. Rego, Guilherme V. Raffo

发表机构 * Graduate Program in Electrical Engineering, Federal University of Minas Gerais, Belo Horizonte, MG 31270-901, Brazil（巴西联邦矿务工程师学院电气工程研究生项目）； Department of Electronics Engineering, Federal University of Minas Gerais, Belo Horizonte, MG 31270-901, Brazil（巴西联邦矿务工程师学院电子工程系）

AI总结本文研究了利用倾斜旋翼无人机进行悬浮负载路径跟踪控制的问题，通过建立多体机械系统的动力学模型，提出退化拓扑状态估计器来估计负载位置和姿态，并设计了具有极点放置约束的离散时间混合H2/H∞控制器以实现鲁棒的路径跟踪控制。

详情

DOI: 10.1016/j.jfranklin.2018.08.028

AI中文摘要

本文解决了一种倾斜旋翼无人机悬浮负载路径跟踪控制的问题。主要挑战来自于负载动态行为，通常通过绳索与无人机连接，增加了系统的未驱动自由度。此外，为了执行负载运输任务，通常需要知道负载的位置信息。由于可用传感器通常嵌入在移动平台上，负载位置的信息可能无法直接获取。为了解决这个问题，本文首先从负载的角度出发，推导了多体机械系统的运动学，利用欧拉-拉格朗日方法推导出详细的动力学模型，得到一个高度耦合、非线性的状态空间表示，输入是仿射的，负载的位置和姿态直接由状态变量表示。提出了一种退化拓扑状态估计器来解决负载位置和姿态的估计问题，该估计器基于飞机上的传感器，具有不同的采样时间，并且测量噪声是未知但有界的。为了解决路径跟踪问题，设计了一个具有极点放置约束的离散时间混合H2/H∞控制器，具有保证的时间响应特性，并对未建模动态、参数不确定性以及外部干扰具有鲁棒性。通过在基于Gazebo模拟器的平台上进行的数值实验以及系统计算机辅助设计（CAD）模型上的实验，验证了退化拓扑状态估计器和设计的控制器的性能。

英文摘要

This work addresses the problem of path tracking control of a suspended load using a tilt-rotor UAV. The main challenge in controlling this kind of system arises from the dynamic behavior imposed by the load, which is usually coupled to the UAV by means of a rope, adding unactuated degrees of freedom to the whole system. Furthermore, to perform the load transportation it is often needed the knowledge of the load position to accomplish the task. Since available sensors are commonly embedded in the mobile platform, information on the load position may not be directly available. To solve this problem in this work, initially, the kinematics of the multi-body mechanical system are formulated from the load's perspective, from which a detailed dynamic model is derived using the Euler-Lagrange approach, yielding a highly coupled, nonlinear state-space representation of the system, affine in the inputs, with the load's position and orientation directly represented by state variables. A zonotopic state estimator is proposed to solve the problem of estimating the load position and orientation, which is formulated based on sensors located at the aircraft, with different sampling times, and unknown-but-bounded measurement noise. To solve the path tracking problem, a discrete-time mixed $\mathcal{H}_2/\mathcal{H}_\infty$ controller with pole-placement constraints is designed with guaranteed time-response properties and robust to unmodeled dynamics, parametric uncertainties, and external disturbances. Results from numerical experiments, performed in a platform based on the Gazebo simulator and on a Computer Aided Design (CAD) model of the system, are presented to corroborate the performance of the zonotopic state estimator along with the designed controller.

URL PDF HTML ☆

赞 0 踩 0

1809.03225 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Gait learning for soft microrobots controlled by light fields

基于光场控制的软微机器人步态学习

Alexander von Rohr, Sebastian Trimpe, Alonso Marco, Peer Fischer, Stefano Palagi

发表机构 * Micro, Nano, and Molecular Systems Group, Max Planck Institute for Intelligent Systems（微、纳、分子系统组，人工智能系统马克斯·普朗克研究所）； Max Planck ETH Center for Learning Systems（马克斯·普朗克-ETH学习系统中心）

AI总结本文提出一种基于贝叶斯优化和高斯过程的概率学习方法，用于优化光场控制的软微机器人步态，通过有限实验预算实现高效且鲁棒的运动性能提升。

Comments 8 pages, 7 figures, to appear in the proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems 2018

详情

DOI: 10.1109/IROS.2018.8594092

AI中文摘要

基于光场控制的软微机器人可以生成多种不同的步态。这种内在的灵活性可以用来最大化其在特定环境中的运动性能，并用于适应变化的条件。然而，由于缺乏准确的运动模型以及微机器人之间的固有变异性，分析控制设计是不可能的。另一方面，常见的数据驱动方法需要运行大量的实验，导致非常特定于样本的结果。本文提出了一种基于贝叶斯优化（BO）和高斯过程（GPs）的概率学习方法，用于光场控制的软微机器人。所提出的方法产生了一种学习方案，该方案在数据效率方面表现优异，能够在有限的实验预算下进行步态优化，并且对微机器人样本之间的差异具有鲁棒性。这些特性是通过在半合成数据集上比较不同的GP先验和BO设置来设计学习方案获得的。开发的学习方案在微机器人实验中得到验证，结果在仅20次实验的预算下，使微机器人的运动性能提高了115%。这些令人鼓舞的结果为基于光场控制的软微机器人和概率学习控制的自适应微机器人系统铺平了道路。

英文摘要

Soft microrobots based on photoresponsive materials and controlled by light fields can generate a variety of different gaits. This inherent flexibility can be exploited to maximize their locomotion performance in a given environment and used to adapt them to changing conditions. Albeit, because of the lack of accurate locomotion models, and given the intrinsic variability among microrobots, analytical control design is not possible. Common data-driven approaches, on the other hand, require running prohibitive numbers of experiments and lead to very sample-specific results. Here we propose a probabilistic learning approach for light-controlled soft microrobots based on Bayesian Optimization (BO) and Gaussian Processes (GPs). The proposed approach results in a learning scheme that is data-efficient, enabling gait optimization with a limited experimental budget, and robust against differences among microrobot samples. These features are obtained by designing the learning scheme through the comparison of different GP priors and BO settings on a semi-synthetic data set. The developed learning scheme is validated in microrobot experiments, resulting in a 115% improvement in a microrobot's locomotion performance with an experimental budget of only 20 tests. These encouraging results lead the way toward self-adaptive microrobotic systems based on light-controlled soft microrobots and probabilistic learning control.

URL PDF HTML ☆

赞 0 踩 0

1811.04333 2026-06-04 cs.RO cs.FL cs.SY eess.SY 版本更新

Reactive Task and Motion Planning for Robust Whole-Body Dynamic Locomotion in Constrained Environments

面向受限环境的鲁棒全身体动态运动的反应任务与运动规划

Ye Zhao, Yinan Li, Luis Sentis, Ufuk Topcu, Jun Liu

发表机构 * George W. Woodruff School of Mechanical Engineering, Georgia Tech, USA（佐治亚理工学院机械工程学院）； Department of Applied Mathematics, University of Waterloo, Canada（滑铁卢大学应用数学系）； Department of Aerospace Engineering and Engineering Mechanics, UT Austin, USA（得克萨斯大学奥斯汀分校航空航天工程与工程力学系）； Institute for Computational Engineering and Sciences, UT Austin, USA（得克萨斯大学奥斯汀分校计算工程与科学研究所）

AI总结本文提出了一种基于时序逻辑的游戏框架，用于在受限和动态变化的环境中进行全身体动态运动的任务规划与控制，通过符号系统的方法确保运动行为的正确性。

Comments 49 pages, 23 figures, 1 table

详情

AI中文摘要

基于接触的决策和规划方法越来越重要，以赋予四足机器人更高的自主性。源自符号系统的正式合成方法在推理高层运动决策和实现复杂 maneuvering 行为方面具有巨大潜力。本文首次尝试正式设计由任务规划和全身体动态运动控制组成的架构，在受限和动态变化的环境中。在高层，我们构建了一个多肢体运动规划器与其动态环境之间的双玩家时序逻辑游戏，以合成一个获胜策略，提供符号运动动作。这些运动动作满足由时序逻辑片段表达的期望高层任务规范。这些动作发送到一个稳健的有限状态转换系统，该系统合成一个满足状态可达性约束的运动控制器。此控制器通过低层运动规划器进一步执行，生成可行的运动轨迹。我们构建了一系列动态运动模型用于腿部机器人，作为处理多样化环境事件的模板库。我们设计了一种重新规划策略，考虑突发环境变化或大状态扰动，以提高最终运动行为的鲁棒性。我们正式证明了分层运动框架的正确性，通过运动规划层保证鲁棒实现。在多种环境中的反应运动行为模拟表明，我们的框架有潜力成为智能运动行为的理论基础。

英文摘要

Contact-based decision and planning methods are becoming increasingly important to endow higher levels of autonomy for legged robots. Formal synthesis methods derived from symbolic systems have great potential for reasoning about high-level locomotion decisions and achieving complex maneuvering behaviors with correctness guarantees. This study takes a first step toward formally devising an architecture composed of task planning and control of whole-body dynamic locomotion behaviors in constrained and dynamically changing environments. At the high level, we formulate a two-player temporal logic game between the multi-limb locomotion planner and its dynamic environment to synthesize a winning strategy that delivers symbolic locomotion actions. These locomotion actions satisfy the desired high-level task specifications expressed in a fragment of temporal logic. Those actions are sent to a robust finite transition system that synthesizes a locomotion controller that fulfills state reachability constraints. This controller is further executed via a low-level motion planner that generates feasible locomotion trajectories. We construct a set of dynamic locomotion models for legged robots to serve as a template library for handling diverse environmental events. We devise a replanning strategy that takes into consideration sudden environmental changes or large state disturbances to increase the robustness of the resulting locomotion behaviors. We formally prove the correctness of the layered locomotion framework guaranteeing a robust implementation by the motion planning layer. Simulations of reactive locomotion behaviors in diverse environments indicate that our framework has the potential to serve as a theoretical foundation for intelligent locomotion behaviors.

URL PDF HTML ☆

赞 0 踩 0

1710.05465 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Flow: A Modular Learning Framework for Mixed Autonomy Traffic

Flow: 一种用于混合自主性的模块化学习框架

Cathy Wu, Aboudy Kreidieh, Kanaad Parvate, Eugene Vinitsky, Alexandre M Bayen

发表机构 * Laboratory for Information and Decision Systems, Massachusetts Institute of Technology（信息与决策实验室，麻省理工学院）； Institute of Data, Systems, and Society, Massachusetts Institute of Technology（数据、系统与社会研究所，麻省理工学院）； Department of Mechanical Engineering, University of California, Berkeley（机械工程系，加州大学伯克利分校）

AI总结本文提出了一种模块化学习框架，利用深度强化学习解决复杂交通动态问题，通过提高系统层面的速度，使学习到的控制法则在仅有4-7%的自动驾驶汽车参与度下，相比人类驾驶性能提升高达57%。此外，在单车道交通中，一个仅使用局部观测的小型神经网络控制法则能够消除拥堵现象，达到近最优性能。

Comments 17 pages, 8 figures, 5 tables. 2021 IEEE Transactions on Robotics (T-RO)

详情

DOI: 10.1109/TRO.2021.3087314

AI中文摘要

自动驾驶车辆（AVs）的快速发展为交通系统带来了巨大的潜力，通过提高安全性和效率以及出行可及性。然而，随着AVs的采用，这些影响的发展进程并不清楚。从分析部分自动驾驶的总体目标来看，出现了许多技术挑战：部分控制和观测、多车辆交互以及现实世界网络所代表的大量场景。为了深入了解近期AV的影响，本文研究了深度强化学习（RL）在低AV采用率环境下克服这些挑战的适用性。本文提出了一种模块化学习框架，利用深度RL来处理复杂的交通动态。模块由多个部分组成，以捕捉常见的交通现象（如停止-启动交通拥堵、车道变换、交叉口）。学习到的控制法则在系统层面的速度上优于人类驾驶性能，仅在4-7%的AVs参与度下，提高了高达57%。此外，在单车道交通中，一个仅使用局部观测的小型神经网络控制法则被发现能够消除停止-启动交通现象，超越了所有已知的基于模型的控制器，达到近最优性能，并且能够推广到非分布交通密度。

英文摘要

The rapid development of autonomous vehicles (AVs) holds vast potential for transportation systems through improved safety, efficiency, and access to mobility. However, the progression of these impacts, as AVs are adopted, is not well understood. Numerous technical challenges arise from the goal of analyzing the partial adoption of autonomy: partial control and observation, multi-vehicle interactions, and the sheer variety of scenarios represented by real-world networks. To shed light into near-term AV impacts, this article studies the suitability of deep reinforcement learning (RL) for overcoming these challenges in a low AV-adoption regime. A modular learning framework is presented, which leverages deep RL to address complex traffic dynamics. Modules are composed to capture common traffic phenomena (stop-and-go traffic jams, lane changing, intersections). Learned control laws are found to improve upon human driving performance, in terms of system-level velocity, by up to 57% with only 4-7% adoption of AVs. Furthermore, in single-lane traffic, a small neural network control law with only local observation is found to eliminate stop-and-go traffic - surpassing all known model-based controllers to achieve near-optimal performance - and generalize to out-of-distribution traffic densities.

URL PDF HTML ☆

赞 0 踩 0

1610.04091 2026-06-04 eess.SY cs.DC cs.RO cs.SY math.OC 版本更新

Optimizing Communication and Computation for Multi-UAV Information Gathering Applications

为多UAV信息采集应用优化通信与计算

Mason Thammawichai, Sujit P. Baliyarasimhuni, Eric C. Kerrigan, João B. Sousa

发表机构 * Department of Aeronautics, Imperial College London（帝国理工学院伦敦校区航空系）； Department of Electronics and Communications Engineering, Indraprastha Institute of Information Technology（印度拉普拉兹信息技术学院电子与通信工程系）； Department of Electrical & Electronic Engineering and the Department of Aeronautics, Imperial College London（帝国理工学院伦敦校区电子与电气工程系及航空系）； Department of Electrical and Computer Engineering, University of Porto（葡萄牙波尔图大学电气与计算机工程系）

AI总结本文针对多UAV系统中通信与计算能耗的优化问题，提出了一种混合整数优化方法，通过数据聚合和多跳分层聚类实现高效的路由方案，以延长系统寿命。

详情

DOI: 10.1109/TAES.2017.2761139
Journal ref: IEEE Transactions on Aerospace and Electronic Systems ( Volume: 54, Issue: 2, April 2018)

AI中文摘要

移动代理网络，如多UAV系统，受到资源限制的约束。特别是，有限的能源直接影响系统性能，如系统寿命。在无线传感器网络文献中已证明，通信能耗主导了计算和传感能耗。因此，通过优化通信数据量可以显著延长多UAV系统的寿命，但会增加计算成本。在本文中，我们旨在取得通信与计算能耗之间的最佳权衡。具体而言，我们提出了一种混合整数优化公式，用于多跳分层聚类基于自组织UAV网络的数据聚合，以获得节能的信息路由方案。所提出的框架在两个应用上进行了测试，即目标跟踪和区域映射。基于仿真结果，我们的方法相比没有数据聚合和聚类方案的基线策略，能显著节省能量。

英文摘要

Mobile agent networks, such as multi-UAV systems, are constrained by limited resources. In particular, limited energy affects system performance directly, such as system lifetime. It has been demonstrated in the wireless sensor network literature that the communication energy consumption dominates the computational and the sensing energy consumption. Hence, the lifetime of the multi-UAV systems can be extended significantly by optimizing the amount of communication data, at the expense of increasing computational cost. In this work, we aim at attaining an optimal trade-off between the communication and the computational energy. Specifically, we propose a mixed-integer optimization formulation for a multi-hop hierarchical clustering-based self-organizing UAV network incorporating data aggregation, to obtain an energy-efficient information routing scheme. The proposed framework is tested on two applications, namely target tracking and area mapping. Based on simulation results, our method can significantly save energy compared to a baseline strategy, where there is no data aggregation and clustering scheme.

URL PDF HTML ☆

赞 0 踩 0

1806.06723 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Towards Manipulability of Interactive Lagrangian Systems

面向交互拉格朗日系统的可操作性

Hanlei Wang

发表机构 * Science and Technology on Space Intelligent Control Laboratory（航天智能控制实验室）

AI总结本文研究了具有参数不确定性和通信/传感约束的交互拉格朗日系统的可操作性，提出了一种新的动态反馈方法，设计了能够实现无限可操作性和对通信/传感约束鲁棒性的自适应控制器，解决了非线性双侧远程操作中的任意未知时变延迟问题。

Comments 15 pages, 15 figures

详情

DOI: 10.1016/j.automatica.2020.108913
Journal ref: Automatica, 119: 108913, 2020

AI中文摘要

本文研究了具有参数不确定性和通信/传感约束的交互拉格朗日系统的可操作性。两个标准例子是主从远程操作和机器人教学操作。我们系统地提出了通用动力系统中无限可操作性的概念，并探讨了这种统一动机如何形成一种设计范式，以保证交互动力系统的无限可操作性，并特别促进了交互拉格朗日系统的非线性自适应控制器的设计和分析。具体而言，基于一个新的动态反馈类，我们提出了一种自适应控制器，能够实现受控拉格朗日系统的无限可操作性和对通信/传感约束的鲁棒性，主要归因于由此产生的动态级联框架。所提出的范式在人机交互的网络耦合要求和受控动力学之间实现了理想的平衡。我们还证明了我们主要结果的一个特殊情况解决了长期存在的非线性双侧远程操作问题，即任意未知时变延迟。仿真结果展示了所提出自适应控制器下交互机器人系统的性能。

英文摘要

This paper investigates manipulability of interactive Lagrangian systems with parametric uncertainty and communication/sensing constraints. Two standard examples are teleoperation with a master-slave system and teaching operation of robots. We here systematically formulate the concept of infinite manipulability for general dynamical systems, and investigate how such a unified motivation yields a design paradigm towards guaranteeing the infinite manipulability of interactive dynamical systems and in particular facilitates the design and analysis of nonlinear adaptive controllers for interactive Lagrangian systems. Specifically, based on a new class of dynamic feedback, we propose adaptive controllers that achieve both the infinite manipulability of the controlled Lagrangian systems and the robustness with respect to the communication/sensing constraints, mainly owing to the resultant dynamic-cascade framework. The proposed paradigm yields the desirable balance between network coupling requirements and controlled dynamics of human-system interaction. We also show that a special case of our main result resolves the longstanding nonlinear bilateral teleoperation problem with arbitrary unknown time-varying delay. Simulation results show the performance of the interactive robotic systems under the proposed adaptive controllers.

URL PDF HTML ☆

赞 0 踩 0

1710.04465 2026-06-04 cs.RO cs.SY eess.SY stat.CO 版本更新

Markerless visual servoing on unknown objects for humanoid robot platforms

无标记未知物体的人形机器人视觉伺服

Claudio Fantacci, Giulia Vezzani, Ugo Pattacini, Vadim Tikhanoff, Lorenzo Natale

AI总结本文提出了一种无标记未知物体的人形机器人视觉伺服框架，通过立体视觉计算可抓取物体的体积，利用递归贝叶斯滤波估计末端执行器的6D姿态，结合非线性约束优化问题计算目标姿态，并通过图像基于视觉伺服控制实现末端执行器的精确控制。

详情

DOI: 10.1109/ICRA.2018.8462914
Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2018

AI中文摘要

为了精确地抓住一个物体，人形机器人需要对末端执行器、物体姿态和形状有良好的了解。本文提出了一种无标记未知物体的视觉伺服框架，分为四个主要部分：I) 通过立体视觉建立最小二乘问题来计算机器人手可抓取的物体体积；II) 基于序贯蒙特卡洛（SMC）滤波的递归贝叶斯滤波技术，用于在不使用标记的情况下估计机器人末端执行器的6D姿态；III) 建立非线性约束优化问题来计算关于物体的目标可抓取姿态；IV) 通过图像基于视觉伺服控制命令机器人末端执行器向目标姿态移动。我们通过大量实验在iCub人形机器人平台上验证了该方法的有效性和鲁棒性，实现了实时计算、平滑轨迹和亚像素精度。

英文摘要

To precisely reach for an object with a humanoid robot, it is of central importance to have good knowledge of both end-effector, object pose and shape. In this work we propose a framework for markerless visual servoing on unknown objects, which is divided in four main parts: I) a least-squares minimization problem is formulated to find the volume of the object graspable by the robot's hand using its stereo vision; II) a recursive Bayesian filtering technique, based on Sequential Monte Carlo (SMC) filtering, estimates the 6D pose (position and orientation) of the robot's end-effector without the use of markers; III) a nonlinear constrained optimization problem is formulated to compute the desired graspable pose about the object; IV) an image-based visual servo control commands the robot's end-effector toward the desired pose. We demonstrate effectiveness and robustness of our approach with extensive experiments on the iCub humanoid robot platform, achieving real-time computation, smooth trajectories and sub-pixel precisions.

URL PDF HTML ☆

赞 0 踩 0

1807.06172 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Experimental Resilience Assessment of An Open-Source Driving Agent

开放源代码驾驶代理的实验韧性评估

Abu Hasnat Mohammad Rubaiyat, Yongming Qin, Homa Alemzadeh

发表机构 * Department of Electrical and Computer Engineering（电气与计算机工程系）

AI总结本文提出基于系统理论过程分析（STPA）的故障注入框架，用于评估开放源代码驾驶代理openpilot在不同环境条件和传感器数据故障下的韧性，通过战略性软件故障注入方法提高不安全场景的覆盖率，从而更有效地模拟安全关键故障并测试自动驾驶车辆。

Comments 10 pages, 7 figures

详情

DOI: 10.1109/PRDC.2018.00016

AI中文摘要

自动驾驶车辆（AV）依赖雷达和相机等传感器进行环境感知、路径规划和控制。随着自主性提高和与复杂环境的交互增加，对AV安全性和可靠性的关注日益增长。本文提出一种基于系统理论过程分析（STPA）的故障注入框架，用于评估开放源代码驾驶代理openpilot在不同环境条件和影响传感器数据的故障下的韧性。为了增加测试期间不安全场景的覆盖率，我们采用战略性软件故障注入方法，其中触发故障注入的触发器来源于系统高级危险分析中识别出的不安全场景。实验结果表明，所提出的战略性故障注入方法相比随机故障注入提高了危险覆盖率，从而有助于更有效地模拟安全关键故障并测试AV。此外，本文还提供了关于openpilot安全机制性能及其在及时检测和恢复故障输入能力方面的见解。

英文摘要

Autonomous vehicles (AV) depend on the sensors like RADAR and camera for the perception of the environment, path planning, and control. With the increasing autonomy and interactions with the complex environment, there have been growing concerns regarding the safety and reliability of AVs. This paper presents a Systems-Theoretic Process Analysis (STPA) based fault injection framework to assess the resilience of an open-source driving agent, called openpilot, under different environmental conditions and faults affecting sensor data. To increase the coverage of unsafe scenarios during testing, we use a strategic software fault-injection approach where the triggers for injecting the faults are derived from the unsafe scenarios identified during the high-level hazard analysis of the system. The experimental results show that the proposed strategic fault injection approach increases the hazard coverage compared to random fault injection and, thus, can help with more effective simulation of safety-critical faults and testing of AVs. In addition, the paper provides insights on the performance of openpilot safety mechanisms and its ability in timely detection and recovery from faulty inputs.

URL PDF HTML ☆

赞 0 踩 0

1605.04344 2026-06-04 eess.SY cs.RO cs.SY 版本更新

On the Effects of Measurement Uncertainty in Optimal Control of Contact Interactions

关于测量不确定性在接触相互作用最优控制中的影响

Brahayam Ponton, Stefan Schaal, Ludovic Righetti

发表机构 * Max-Planck Institute for Intelligent Systems, Tuebingen-Germany（图灵智能研究所，图宾根-德国）； University of Southern California, Los Angeles-USA（南加州大学，洛杉矶-美国）

AI总结本文研究了在机器人应用中，接触相互作用的不确定性不仅来自过程模型的噪声，还来自对世界知识的不精确，提出了一种基于风险敏感控制的SOC算法，通过引入观测器动态来显式依赖当前测量不确定性，仿真结果显示测量不确定性导致低阻尼行为，与过程噪声导致的刚性行为形成对比。

Comments 17 pages, 5 figures - this version is the one published at WAFR 2016 to fulfill the open access requirements of the EU commission, please refer to the previous version for the complete derivation of the algorithm

详情

DOI: 10.1007/978-3-030-43089-4_50

AI中文摘要

随机最优控制（SOC）通常仅考虑过程模型中的噪声，即未知干扰。然而，在许多涉及与环境交互的机器人应用中，如运动和操作，不确定性还来自对世界的不精确知识，这并非实际干扰。我们通过开发基于风险敏感控制的SOC算法，分析同时考虑测量模型中的噪声的影响，该算法将观测器动态纳入其中，使得控制律显式依赖于当前的测量不确定性。在简单2D机械臂的仿真结果中，我们观察到测量不确定性导致低阻尼行为，这一结果与过程噪声产生刚性行为的效果形成对比。这表明考虑测量不确定性可能是解决涉及不确定接触相互作用问题的一种很有前途的方法。

英文摘要

Stochastic Optimal Control (SOC) typically considers noise only in the process model, i.e. unknown disturbances. However, in many robotic applications involving interaction with the environment, such as locomotion and manipulation, uncertainty also comes from lack of precise knowledge of the world, which is not an actual disturbance. We analyze the effects of also considering noise in the measurement model, by developing a SOC algorithm based on risk-sensitive control, that includes the dynamics of an observer in such a way that the control law explicitly depends on the current measurement uncertainty. In simulation results on a simple 2D manipulator, we have observed that measurement uncertainty leads to low impedance behaviors, a result in contrast with the effects of process noise that creates stiff behaviors. This suggests that taking into account measurement uncertainty could be a potentially very interesting way to approach problems involving uncertain contact interactions.

URL PDF HTML ☆

赞 0 踩 0

1905.12191 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

CARE: Cooperative Autonomy for Resilience and Efficiency of Robot Teams for Complete Coverage of Unknown Environments under Robot Failures

CARE: 机器人团队在未知环境中鲁棒性和效率的协同自主性

Junnan Song, Shalabh Gupta

AI总结本文提出了一种分布式算法CARE，用于解决未知环境中多机器人覆盖率路径规划问题，该算法在机器人故障情况下提供鲁棒性并提高整体效率，通过事件驱动的重新规划实现任务重新分配，实验结果表明其在故障情况下能够实现完全覆盖、减少覆盖时间和加快目标发现。

详情

DOI: 10.1007/s10514-019-09870-3
Journal ref: Autonomous Robots, volume 44, 2020

AI中文摘要

本文针对未知环境中多机器人覆盖率路径规划（MCPP）问题，特别是在机器人故障情况下，提出了一个分布式算法，称为协同自主性以实现鲁棒性和效率（CARE）。该算法不仅为机器人团队提供故障容忍能力，还通过事件驱动的重新规划提高整体操作效率。算法使用分布式离散事件监督器（DESs），在机器人故障或空闲时触发一组可行玩家之间的游戏，以做出协作决策进行任务重新分配。游戏理论结构通过潜在游戏构建，其中每个玩家的效用与所有玩家的共享目标函数对齐。该算法已在各种复杂场景的高保真机器人模拟器上得到验证，结果表明，与三种替代方法相比，团队在故障情况下实现了完全覆盖，减少了覆盖时间，并加快了目标发现。

英文摘要

This paper addresses the problem of Multi-robot Coverage Path Planning (MCPP) for unknown environments in the presence of robot failures. Unexpected robot failures can seriously degrade the performance of a robot team and in extreme cases jeopardize the overall operation. Therefore, this paper presents a distributed algorithm, called Cooperative Autonomy for Resilience and Efficiency (CARE), which not only provides resilience to the robot team against failures of individual robots, but also improves the overall efficiency of operation via event-driven replanning. The algorithm uses distributed Discrete Event Supervisors (DESs), which trigger games between a set of feasible players in the event of a robot failure or idling, to make collaborative decisions for task reallocations. The game-theoretic structure is built using Potential Games, where the utility of each player is aligned with a shared objective function for all players. The algorithm has been validated in various complex scenarios on a high-fidelity robotic simulator, and the results demonstrate that the team achieves complete coverage under failures, reduced coverage time, and faster target discovery as compared to three alternative methods.

URL PDF HTML ☆

赞 0 踩 0

1803.07696 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Inverse Optimal Control from Incomplete Trajectory Observations

从不完整轨迹观测中逆最优控制

Wanxin Jin, Dana Kulić, Shaoshuai Mou, Sandra Hirche

发表机构 * School of Aeronautics and Astronautics, Purdue University（航空与航天学院，普渡大学）； Monash University（墨尔本大学）； Chair of Information-oriented Control, Technical University of Munich（信息导向控制教授职位，慕尼黑技术大学）

AI总结本文提出了一种从不完整轨迹观测中学习最优控制系统目标函数的方法，通过恢复矩阵确定候选特征的权重，并开发了增量逆最优控制算法。

Comments Codes: https://github.com/wanxinjin/IOC-from-Incomplete-Trajectory-Observations

详情

DOI: 10.1177/0278364921996384
Journal ref: The International Journal of Robotics Research. 2021;40(6-7):848-865

AI中文摘要

本文开发了一种方法，使能够从不完整的轨迹观测中学习最优控制系统的的目标函数。假设目标函数是未知权重的特征（或基函数）的加权和，观测数据是系统状态和输入轨迹的一段。所提出的技术引入了恢复矩阵的概念，以建立任何可用轨迹段与给定候选特征权重之间的关系。恢复矩阵的秩表明是否可以在候选特征中找到相关子集，并且可以从段数据中学习相应的权重。恢复矩阵可以迭代获得，其秩非递减的性质表明额外的观测可能有助于目标学习。基于恢复矩阵，建立了一种使用不完整轨迹观测学习所选特征权重的方法，并通过自动寻找所需的最小观测开发了增量逆最优控制算法。该方法的有效性在线性二次调节系统和模拟机器人机械臂上得到了验证。

英文摘要

This article develops a methodology that enables learning an objective function of an optimal control system from incomplete trajectory observations. The objective function is assumed to be a weighted sum of features (or basis functions) with unknown weights, and the observed data is a segment of a trajectory of system states and inputs. The proposed technique introduces the concept of the recovery matrix to establish the relationship between any available segment of the trajectory and the weights of given candidate features. The rank of the recovery matrix indicates whether a subset of relevant features can be found among the candidate features and the corresponding weights can be learned from the segment data. The recovery matrix can be obtained iteratively and its rank non-decreasing property shows that additional observations may contribute to the objective learning. Based on the recovery matrix, a method for using incomplete trajectory observations to learn the weights of selected features is established, and an incremental inverse optimal control algorithm is developed by automatically finding the minimal required observation. The effectiveness of the proposed method is demonstrated on a linear quadratic regulator system and a simulated robot manipulator.

URL PDF HTML ☆

赞 0 踩 0

1904.00378 2026-06-04 cs.RO cs.SY eess.SY 版本更新

MAT-Fly: An Educational Platform for Simulating Unmanned Aerial Vehicles Aimed to Detect and Track Moving Objects

MAT-Fly：一种用于模拟无人驾驶航空器的教育平台，旨在检测和跟踪移动物体

Giuseppe Silano, Luigi Iannelli

发表机构 * Faculty of Electrical Engineering, Czech Technical University in Prague（布拉格捷克技术大学电气工程系）； Department of Engineering, University of Sannio in Benevento, Piazza Roma 21（巴内维诺萨恩尼奥大学工程系，罗马广场21号）

AI总结本文提出了一种用于无人驾驶航空器领域特定任务的模拟方法，即视觉检测和跟踪任意移动物体，介绍了MAT-Fly平台，该平台基于Matlab和MathWorks虚拟现实（VR）和计算机视觉系统（CVS）工具箱，用于模拟四旋翼飞行器跟踪沿复杂路径移动的汽车，并开源供教育使用。

Comments 11 pages, 15 figures, journal paper

详情

DOI: 10.1109/ACCESS.2021.3064758
Journal ref: IEEE Access, 2021

AI中文摘要

本文的主要动机是提出一种针对无人驾驶航空器领域特定任务的模拟方法，即视觉检测和跟踪任意移动物体。特别地，介绍了MAT-Fly，一个具有易用性和控制开发特点的多旋翼飞行器数值模拟平台。该平台基于Matlab和MathWorks虚拟现实（VR）和计算机视觉系统（CVS）工具箱，共同模拟四旋翼飞行器在跟踪沿复杂路径移动的汽车时的行为。VR工具箱被选择是因为学生对Matlab比较熟悉，并且由于其结构简单，用户在学习和开发阶段不需要付出显著的努力。整体架构非常模块化，使得每个模块可以轻松替换，从而简化代码重用和平台定制。一些简单的测试环境被展示以证明该方法的有效性以及平台的工作方式。该模拟器作为开源发布，使用户能够查看系统中的任何部分，并用于教育目的。

英文摘要

The main motivation of this work is to propose a simulation approach for a specific task within the Unmanned Aerial Vehicle (UAV) field, i.e., the visual detection and tracking of arbitrary moving objects. In particular, it is described MAT-Fly, a numerical simulation platform for multi-rotor aircraft characterized by the ease of use and control development. The platform is based on Matlab and the MathWorks Virtual Reality (VR) and Computer Vision System (CVS) toolboxes that work together to simulate the behavior of a quad-rotor while tracking a car that moves along a nontrivial path. The VR toolbox has been chosen due to the familiarity that students have with Matlab and because it does not require a notable effort by the user for the learning and development phase thanks to its simple structure. The overall architecture is quite modular so that each block can be easily replaced with others simplifying the code reuse and the platform customization. Some simple testbeds are presented to show the validity of the approach and how the platform works. The simulator is released as open-source, making it possible to go through any part of the system, and available for educational purposes.

URL PDF HTML ☆

赞 0 踩 0

1903.08818 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Contingency Model Predictive Control for Automated Vehicles

应急模型预测控制用于自动驾驶车辆

John P. Alsterda, Matthew Brown, J. Christian Gerdes

发表机构 * Department of Mechanical Engineering, Stanford University（斯坦福大学机械工程系）

AI总结本文提出了一种新的应急模型预测控制（CMPC）框架，该框架在跟踪期望路径的同时，维护一个替代轨迹以避免已识别的潜在紧急情况。通过在经典递推MPC时间 horizon中并行添加额外的预测时间 horizon，CMPC能够预见可能发生的情况，而不是在紧急情况发生后才反应。通过数学定义框架并实验比较其性能与最先进的确定性MPC，展示了CMPC在应对潜在摩擦损失（如冰面）时的有效性。

Comments American Control Conference, July 2019; 6 pages

详情

DOI: 10.23919/ACC.2019.8815260
Journal ref: IEEE American Control Conference (ACC) (2019) 717-722

AI中文摘要

我们提出了应急模型预测控制（CMPC），一种新颖且可实施的控制框架，该框架在跟踪期望路径的同时，同时维护一个应急计划——一个替代轨迹以避免已识别的潜在紧急情况。通过在经典递推MPC时间 horizon中并行添加额外的预测时间 horizon，CMPC能够预见可能发生的情况，而不是在紧急情况发生后才反应。通过数学定义框架并实验比较其性能与最先进的确定性MPC，展示了CMPC在应对潜在摩擦损失（如冰面）时的有效性。

英文摘要

We present Contingency Model Predictive Control (CMPC), a novel and implementable control framework which tracks a desired path while simultaneously maintaining a contingency plan -- an alternate trajectory to avert an identified potential emergency. In this way, CMPC anticipates events that might take place, instead of reacting when emergencies occur. We accomplish this by adding an additional prediction horizon in parallel to the classical receding MPC horizon. The contingency horizon is constrained to maintain a feasible avoidance solution; as such, CMPC is selectively robust to this emergency while tracking the desired path as closely as possible. After defining the framework mathematically, we demonstrate its effectiveness experimentally by comparing its performance to a state-of-the-art deterministic MPC. The controllers drive an automated research platform through a left-hand turn which may be covered by ice. Contingency MPC prepares for the potential loss of friction by purposefully and intuitively deviating from the prescribed path to approach the turn more conservatively; this deviation significantly mitigates the consequence of encountering ice.

URL PDF HTML ☆

赞 0 踩 0

1607.01202 2026-06-04 eess.SY cs.AI cs.RO cs.SY 版本更新

Optimal control for a robotic exploration, pick-up and delivery problem

机器人探索、拾取和配送问题的最优控制

Vladislav Nenchev, Christos G. Cassandras, Jörg Raisch

AI总结本文研究了机器人在最小时间内寻找并收集有限数量物体并运送到集散地的最优控制问题，采用递推时间窗方案解决混合系统中的最优控制问题，并提出基于运动参数化和梯度优化的事件驱动方法，以提高计算效率。

Comments 14 pages, 23 figures

详情

DOI: 10.1016/j.nahs.2018.06.004

AI中文摘要

本文解决了一个机器人在最小时间内寻找并收集有限数量物体并运送到集散地的最优控制问题。该机器人具有四阶动力学，其在拾取或放下物体时会瞬间改变。物体被建模为具有预先未知位置的点质量，在有界二维空间中可能包含未知障碍物。对于这种混合系统，通过递推时间窗方案近似求解最优控制问题（OCP），其中推导出的成本到目标的下界在最坏情况和概率情况下进行评估，假设物体位置服从均匀分布。首先，基于时间和位置空间离散化和混合整数规划的时间驱动近似解被提出。由于该解的计算成本较高，提出了一种基于合适运动参数化和梯度优化的事件驱动近似方法。在数值示例中比较了解决方案，表明后一种方法在计算上具有显著优势，同时与前者产生相似的定性结果。这些方法特别适用于各种机器人应用，如自动化清洁、搜索和救援、收割或制造。

英文摘要

This paper addresses an optimal control problem for a robot that has to find and collect a finite number of objects and move them to a depot in minimum time. The robot has fourth-order dynamics that change instantaneously at any pick-up or drop-off of an object. The objects are modeled by point masses with a-priori unknown locations in a bounded two-dimensional space that may contain unknown obstacles. For this hybrid system, an Optimal Control Problem (OCP) is approximately solved by a receding horizon scheme, where the derived lower bound for the cost-to-go is evaluated for the worst and for a probabilistic case, assuming a uniform distribution of the objects. First, a time-driven approximate solution based on time and position space discretization and mixed integer programming is presented. Due to the high computational cost of this solution, an alternative event-driven approximate approach based on a suitable motion parameterization and gradient-based optimization is proposed. The solutions are compared in a numerical example, suggesting that the latter approach offers a significant computational advantage while yielding similar qualitative results compared to the former. The methods are particularly relevant for various robotic applications like automated cleaning, search and rescue, harvesting or manufacturing.

URL PDF HTML ☆

赞 0 踩 0

1903.01577 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems

具有控制李雅普诺夫函数的不确定性机器人系统的经验学习

Andrew J. Taylor, Victor D. Dorobantu, Hoang M. Le, Yisong Yue, Aaron D. Ames

发表机构 * California Institute of Technology（加州理工学院）

AI总结本文提出了一种基于控制李雅普诺夫函数的机器学习框架，用于适应机器人系统中的参数不确定性和未建模动态，通过迭代更新李雅普诺夫函数导数的估计和改进控制器，最终获得一个稳定性的二次规划基于控制器，并在平面Segway模拟中验证了方法的有效性。

详情

DOI: 10.1109/IROS40897.2019.8967820

AI中文摘要

许多现代非线性控制方法旨在赋予系统保证性质，如稳定性或安全性，并已成功应用于机器人领域。然而，模型不确定性仍然是持续的挑战，削弱了理论保证并导致物理系统中的实施失败。本文开发了一种以控制李雅普诺夫函数（CLFs）为中心的机器学习框架，以适应一般机器人系统中的参数不确定性和未建模动态。我们提出的方法通过迭代更新李雅普诺夫函数导数的估计并改进控制器，最终获得一个基于二次规划的稳定控制器。我们在平面Segway模拟中验证了我们的方法，通过迭代改进基础无模型控制器，展示了显著的性能提升。

英文摘要

Many modern nonlinear control methods aim to endow systems with guaranteed properties, such as stability or safety, and have been successfully applied to the domain of robotics. However, model uncertainty remains a persistent challenge, weakening theoretical guarantees and causing implementation failures on physical systems. This paper develops a machine learning framework centered around Control Lyapunov Functions (CLFs) to adapt to parametric uncertainty and unmodeled dynamics in general robotic systems. Our proposed method proceeds by iteratively updating estimates of Lyapunov function derivatives and improving controllers, ultimately yielding a stabilizing quadratic program model-based controller. We validate our approach on a planar Segway simulation, demonstrating substantial performance improvements by iteratively refining on a base model-free controller.

URL PDF HTML ☆

赞 0 踩 0

1710.02066 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Feedback Regularization and Geometric PID Control for Robust Stabilization of a Planar Three-link Hybrid Bipedal Walking Model

反馈校正与几何PID控制用于平面三连杆混合双足步行模型的鲁棒稳定化

W. M. L. T. Weerakoon, T. W. U. Madhushani, D. H. S. Maithripala, J. M. Berg

发表机构 * Department of Mechanical Engineering, University of Peradeniya（珀斯德尼亚大学机械工程系）； Postgraduate and Research Unit, Sri Lanka Technological Campus（斯里兰卡科技校园研究生与研究单位）； Department of Mechanical Engineering, Texas Tech University（德克萨斯技术大学机械工程系）

AI总结本文应用一种 recently 开发的几何PID控制器来稳定一个平面三连杆混合动态步行模型。该模型有三个连杆，代表机器人躯干和两个无膝腿，每个髋关节有独立的控制力矩。几何PID控制器是为完全驱动的机械系统开发的，但在摆动相中，三连杆双足机器人有三个自由度但只有两个控制输入。通过选择两个“虚拟约束”来强制执行，解决欠驱动问题，并验证所得到的二维零动力学的稳定性。所得到的受控动力学不具有机械系统的结构，但通过“反馈校正”恢复了这种结构，随后使用几何PID控制来提供对虚拟约束的鲁棒渐近调节。所提出的方法可以容忍更大的倾斜变化，展示了几何方法的价值和积分作用的益处。

Comments Preprint submitted to 2018 American Control Conference

详情

DOI: 10.23919/ACC.2018.8431829

AI中文摘要

This paper applies a recently developed geometric PID controller to stabilize a three-link planar bipedal hybrid dynamic walking model. The three links represent the robot torso and two kneeless legs, with an independent control torque available at each hip joint. The geometric PID controller is derived for fully actuated mechanical systems, however in the swing phase the three-link biped robot has three degrees of freedom and only two controls. Following the bipedal walking literature, underactuation is addressed by choosing two "virtual constraints" to enforce, and verifying the stability of the resulting two-dimensional zero dynamics. The resulting controlled dynamics do not have the structure of a mechanical system, however this structure is restored using "feedback regularization," following which geometric PID control is used to provide robust asymptotic regulation of the virtual constraints. The proposed method can tolerate significantly greater variations in inclination, showing the value of the geometric methods, and the benefit of integral action.

英文摘要

This paper applies a recently developed geometric PID controller to stabilize a three-link planar bipedal hybrid dynamic walking model. The three links represent the robot torso and two kneeless legs, with an independent control torque available at each hip joint. The geometric PID controller is derived for fully actuated mechanical systems, however in the swing phase the three-link biped robot has three degrees of freedom and only two controls. Following the bipedal walking literature, underactuation is addressed by choosing two "virtual constraints" to enforce, and verifying the stability of the resulting two-dimensional zero dynamics. The resulting controlled dynamics do not have the structure of a mechanical system, however this structure is restored using "feedback regularization," following which geometric PID control is used to provide robust asymptotic regulation of the virtual constraints. The proposed method can tolerate significantly greater variations in inclination, showing the value of the geometric methods, and the benefit of integral action.

URL PDF HTML ☆

赞 0 踩 0

1904.02851 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Planning under non-rational perception of uncertain spatial costs

在不确定空间成本下的非理性感知规划

Aamodh Suresh, Sonia Martinez

AI总结本文研究了在不确定空间成本下考虑非理性风险感知的运动规划策略，提出基于累积前景理论（CPT）生成感知风险地图的方法，并通过理论和仿真验证了CPT模型的建模能力，与CVaR等其他风险感知模型相比，展示了在路径规划中的优势。

Comments 12 pages and 10 figures. This revision adds more explanation and clearer figures

详情

AI中文摘要

本工作探讨了设计一种考虑与不确定空间成本相关的风险感知的运动规划策略。我们提出的方法利用累积前景理论（CPT）来生成给定环境中的感知风险地图。CPT-like感知风险和路径长度指标被结合以定义一个符合采样运动规划器（RRT*）渐近最优要求的成本函数。通过理论和仿真展示了CPT的建模能力，并与其他风险感知模型如条件价值-at-风险（CVaR）进行了比较。理论上，我们定义了风险感知模型的表达性概念，并证明CPT的表达性高于CVaR和期望风险。然后我们展示了这种表达性在路径规划设置中的转化，其中我们观察到一个配备CPT和同时扰动随机近似（SPSA）方法的规划器可以更好地近似任意环境中的路径。此外，我们通过仿真展示了我们的规划器能够捕捉一组丰富的有意义路径，代表了不同风险感知的自定义环境。然后我们通过在拥挤和动态环境中的仿真比较了我们的规划器与T-RRT*（连续成本空间的规划器）和Risk-RRT*（动态人类障碍物的风险感知规划器）的性能，展示了我们所提规划器的优势。

英文摘要

This work investigates the design of risk-perception-aware motion-planning strategies that incorporate non-rational perception of risks associated with uncertain spatial costs. Our proposed method employs the Cumulative Prospect Theory (CPT) to generate a perceived risk map over a given environment. CPT-like perceived risks and path-length metrics are then combined to define a cost function that is compliant with the requirements of asymptotic optimality of sampling-based motion planners (RRT*). The modeling power of CPT is illustrated in theory and in simulation, along with a comparison to other risk perception models like Conditional Value at Risk (CVaR). Theoretically, we define a notion of expressiveness for a risk perception model and show that CPT's is higher than that of CVaR and expected risk. We then show that this expressiveness translates to our path planning setting, where we observe that a planner equipped with CPT together with a simultaneous perturbation stochastic approximation (SPSA) method can better approximate arbitrary paths in an environment. Additionally, we show in simulation that our planner captures a rich set of meaningful paths, representative of different risk perceptions in a custom environment. We then compare the performance of our planner with T-RRT* (a planner for continuous cost spaces) and Risk-RRT* (a risk-aware planner for dynamic human obstacles) through simulations in cluttered and dynamic environments respectively, showing the advantage of our proposed planner.

URL PDF HTML ☆

赞 0 踩 0

1902.10139 2026-06-04 eess.SY cs.LG cs.RO cs.SY 版本更新

Learning Dynamic-Objective Policies from a Class of Optimal Trajectories

从一类最优轨迹中学习动态目标策略

Christopher Iliffe Sprague, Dario Izzo, Petter Ögren

AI总结本文提出了一种新颖且简单的方法，通过轨迹优化、同伦持续和模仿学习相结合，合成能够在线切换目标函数的最优状态反馈控制器，并在倒立摆摆起和航天器轨道转移问题中验证了其有效性。

Comments Accepted to the 59th IEEE Conference on Decision and Control (CDC)

详情

AI中文摘要

最优状态反馈控制器，能够切换不同的目标函数，在可能遇到意外情况的系统中具有优势。然而，即使对于单一目标函数，合成此类控制器也是极具挑战性的。本文提出了一种新颖且简单的方法，通过轨迹优化、同伦持续和模仿学习相结合，来合成这些策略。我们使用数值持续法高效地生成多个目标函数和边界条件下的最优演示，并利用这些演示来训练我们的策略。此外，我们展示了我们的策略能够有效学习一系列最优状态反馈控制器，这些控制器可以在线切换目标函数。我们通过两个轨迹优化问题，即倒立摆摆起和航天器轨道转移，展示了该方法，并证明在仿真中合成的策略产生的轨迹接近最优。这些结果表明，轨迹优化和同伦持续对动态目标情境下的控制器合成具有益处。

英文摘要

Optimal state-feedback controllers, capable of changing between different objective functions, are advantageous to systems in which unexpected situations may arise. However, synthesising such controllers, even for a single objective, is a demanding process. In this paper, we present a novel and straightforward approach to synthesising these policies through a combination of trajectory optimisation, homotopy continuation, and imitation learning. We use numerical continuation to efficiently generate optimal demonstrations across several objectives and boundary conditions, and use these to train our policies. Additionally, we demonstrate the ability of our policies to effectively learn families of optimal state-feedback controllers, which can be used to change objective functions online. We illustrate this approach across two trajectory optimisation problems, an inverted pendulum swingup and a spacecraft orbit transfer, and show that the synthesised policies, when evaluated in simulation, produce trajectories that are near-optimal. These results indicate the benefit of trajectory optimisation and homotopy continuation to the synthesis of controllers in dynamic-objective contexts.

URL PDF HTML ☆

赞 0 踩 0

1710.09691 2026-06-04 eess.SY cs.LG cs.RO cs.SY 版本更新

Iterative Machine Learning for Precision Trajectory Tracking with Series Elastic Actuators

迭代机器学习用于系列弹性执行器的高精度轨迹跟踪

Nathan Banka, W. Tony Piaskowy, Joseph Garbini, Santosh Devasia

发表机构 * Ultra-Precision Controls Lab（超精密控制实验室）； University of Washington（华盛顿大学）

AI总结本文研究了在系列弹性执行器中使用迭代学习方法提高位置跟踪精度的问题，通过迭代学习生成前馈命令，利用复值高斯过程回归技术估计局部系统模型，从而减少跟踪误差。

Comments 9 pages, 16 figure. Submitted to AMC Workshop

详情

DOI: 10.1109/AMC.2019.8371094
Journal ref: 2018 IEEE 15th International Workshop on Advanced Motion Control (AMC), Tokyo, 2018, pp. 234-239

AI中文摘要

当机器人在未知环境中操作时，位置的小误差可能导致接触力的大幅变化，尤其是对于典型的高阻抗设计。这可能会损坏周围环境或机器人本身。系列弹性执行器（SEAs）是一种减少机器人手臂输出阻抗以提高对环境施加力的控制能力的流行方法。然而，这种增加的力控制能力伴随着较低的位置精度和带宽。本文探讨了使用迭代学习的前馈命令来改进使用SEAs时的位置跟踪。在每次迭代中，系统对量化输入的输出响应被用来估计线性化的局部系统模型。这些估计的模型是通过复值高斯过程回归（cGPR）技术获得的，然后用于基于前一次迭代的误差生成新的前馈输入命令。本文展示了该迭代机器学习（IML）技术在双自由度（2-DOF）机器人手臂上的应用，并证明了IML方法能够成功收敛以减少跟踪误差。

英文摘要

When robots operate in unknown environments small errors in postions can lead to large variations in the contact forces, especially with typical high-impedance designs. This can potentially damage the surroundings and/or the robot. Series elastic actuators (SEAs) are a popular way to reduce the output impedance of a robotic arm to improve control authority over the force exerted on the environment. However this increased control over forces with lower impedance comes at the cost of lower positioning precision and bandwidth. This article examines the use of an iteratively-learned feedforward command to improve position tracking when using SEAs. Over each iteration, the output responses of the system to the quantized inputs are used to estimate a linearized local system models. These estimated models are obtained using a complex-valued Gaussian Process Regression (cGPR) technique and then, used to generate a new feedforward input command based on the previous iteration's error. This article illustrates this iterative machine learning (IML) technique for a two degree of freedom (2-DOF) robotic arm, and demonstrates successful convergence of the IML approach to reduce the tracking error.

URL PDF HTML ☆

赞 0 踩 0

1905.03419 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Testing Scenario Library Generation for Connected and Automated Vehicles, Part I: Methodology

连接和自动化车辆测试场景库生成方法研究，第一部分：方法论

Shuo Feng, Yiheng Feng, Chunhui Yu, Yi Zhang, Henry X. Liu

发表机构 * University of Michigan（密歇根大学）

AI总结本文提出了一种系统框架，用于生成连接和自动化车辆（CAVs）的测试场景库（TSLG），考虑不同的操作设计域（ODDs）、CAV模型和性能指标，通过引入新的场景关键性度量标准和多起始优化方法，提高测试效率。

Comments 11 pages,3 figures

详情

DOI: 10.1109/TITS.2020.2972211
Journal ref: IEEE Transactions on Intelligent Transportation Systems, 2020

AI中文摘要

在连接和自动化车辆（CAVs）的发展和部署过程中，测试和评估是一个关键步骤，但目前尚无系统框架用于生成测试场景库。本研究旨在提供一种通用框架，用于解决测试场景库生成（TSLG）问题，考虑不同的操作设计域（ODDs）、CAV模型和性能指标。给定一个ODD，测试场景库被定义为一组关键场景，可用于CAV测试。每个测试场景通过新提出的度量标准——场景关键性进行评估，该度量标准可以计算为机动挑战和暴露频率的组合。为了寻找关键场景，设计了辅助目标函数，并应用了多起始优化方法和种子填充技术。所提出的框架在理论上被证明可以以远少的测试次数获得准确的评估结果，与道路测试方法相比。在本研究的第二部分中，通过三个案例研究来演示所提出的方法。基于强化学习的技术被应用于在高维场景下增强搜索方法。

英文摘要

Testing and evaluation is a critical step in the development and deployment of connected and automated vehicles (CAVs), and yet there is no systematic framework to generate testing scenario library. This study aims to provide a general framework for the testing scenario library generation (TSLG) problem with different operational design domains (ODDs), CAV models, and performance metrics. Given an ODD, the testing scenario library is defined as a critical set of scenarios that can be used for CAV test. Each testing scenario is evaluated by a newly proposed measure, scenario criticality, which can be computed as a combination of maneuver challenge and exposure frequency. To search for critical scenarios, an auxiliary objective function is designed, and a multi-start optimization method along with seed-filling is applied. The proposed framework is theoretically proved to obtain accurate evaluation results with much fewer number of tests, if compared with the on-road test method. In part II of the study, three case studies are investigated to demonstrate the proposed methodologies. Reinforcement learning based technique is applied to enhance the searching method under high-dimensional scenarios.

URL PDF HTML ☆

赞 0 踩 0

1810.00182 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.OC 版本更新

Collaborative target-tracking control using multiple autonomous fixed-wing UAVs with constant speeds

使用多架自主固定翼无人机进行协同目标跟踪控制

Zhiyong Sun, Hector Garcia de Marina, Brian D. O. Anderson, Changbin Yu

发表机构 * Department of Electrical Engineering, Eindhoven University of Technology（埃因霍温理工大学电子工程系）； Universidad Complutense de Madrid（马德里complutense大学）； Research School of Electrical, Energy and Material Engineering, Australian National University（澳大利亚国立大学电气、能源和材料工程研究学校）； Optus-Curtin Centre of Excellence in Artificial Intelligence, Curtin University（Curtin大学人工智能卓越中心）

AI总结本文研究了使用多架固定翼无人机进行协同跟踪控制的问题，通过设计控制器使无人机群体能够协同跟踪目标的位置和速度，提出了相对速度条件和参考速度构造方法，并讨论了常速约束下的控制器设计和性能限制。

Comments 33 pages (single column). To be published in the AIAA Journal of Guidance, Dynamics, and Control

详情

AI中文摘要

本文考虑了使用一组具有恒定且非相同速度的固定翼无人飞行器（UAV）进行协同跟踪控制的问题。固定翼UAV的动力学通过具有非完整约束的单轮型方程建模，假设UAV在名义操作模式下以恒定高度飞行。控制器设计使得所有固定翼UAV作为群体能够协同跟踪目标的位置和速度。我们首先提出了跟踪UAV与目标之间相对速度的条件，以确保在UAV受恒定速度约束时跟踪目标的可行性。我们构造了一个包含目标速度和位置作为反馈的参考速度，该参考速度由群体质心跟踪。通过这种方式，所有车辆的航向被控制，使群体质心沿着成功跟踪目标轨迹的参考轨迹移动。进一步设计了一个间距控制器，以确保所有车辆靠近群体质心轨迹。还详细讨论了控制器设计中的权衡以及由于恒速约束导致的目标跟踪控制性能限制。提供了三架固定翼UAV跟踪目标旋翼机的实验结果。

英文摘要

This paper considers a collaborative tracking control problem using a group of fixed-wing unmanned aerial vehicles (UAVs) with constant and non-identical speeds. The dynamics of fixed-wing UAVs are modelled by unicycle-type equations with nonholonomic constraints, assuming that UAVs fly at constant altitudes in the nominal operation mode. The controller is designed such that all fixed-wing UAVs as a group can collaboratively track a desired target's position and velocity. We first present conditions on the relative speeds of tracking UAVs and the target to ensure that the tracking objective can be achieved when UAVs are subject to constant speed constraints. We construct a reference velocity that includes both the target's velocity and position as feedback, which is to be tracked by the group centroid. In this way, all vehicles' headings are controlled such that the group centroid follows a reference trajectory that successfully tracks the target's trajectory. A spacing controller is further devised to ensure that all vehicles stay close to the group centroid trajectory. Trade-offs in the controller design and performance limitations of the target tracking control due to the constant-speed constraint are also discussed in detail. Experimental results with three fixed-wing UAVs tracking a target rotorcraft are provided.

URL PDF HTML ☆

赞 0 踩 0

1802.09099 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Pareto optimal multi-robot motion planning

Pareto最优多机器人运动规划

Guoxiang Zhao, Minghui Zhu

发表机构 * School of Electrical Engineering and Computer Science, Pennsylvania State University（宾夕法尼亚州立大学电气工程与计算机科学学院）

AI总结本文研究了一类多机器人协调问题，目标是使机器人以最短时间到达目标区域并避免碰撞。提出了一种新的数值算法来识别Pareto最优解，确保没有机器人可以单方面减少旅行时间而不延长其他机器人的。通过室内多机器人平台和计算机模拟实验，展示了该算法的 anytime 特性。

1806.04225 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY math.OC 版本更新

PAC-Bayes Control: Learning Policies that Provably Generalize to Novel Environments

PAC-Bayes 控制：学习能够证明在新环境中泛化的能力的策略

Anirudha Majumdar, Alec Farid, Anoopkumar Sonar

发表机构 * Department of Mechanical and Aerospace Engineering（1,2 机械与航空航天工程系）； Department of Computer Science Princeton University（3 计算机科学系纽约大学普林斯顿分校）

AI总结本文提出了一种基于PAC-Bayes框架的机器人策略学习方法，通过在新环境中泛化能力的理论分析，为机器人系统提供强泛化保证。

Comments Extended version of paper presented at the 2018 Conference on Robot Learning (CoRL)

详情

AI中文摘要

我们的目标是学习能够证明在新环境中泛化能力的机器人控制策略，给定一组示例环境的数据集。我们方法的关键技术思想是利用机器学习中的泛化理论工具，通过精确的类比（以缩减形式呈现）将控制策略在新环境中的泛化与监督学习中的假设泛化联系起来。特别是，我们利用Probably Approximately Correct (PAC)-Bayes框架，这使我们能够获得在新环境中（随机）控制策略预期成本的上界。我们提出策略学习算法，明确寻求最小化此上界。相应的优化问题可以在有限策略空间的设置中通过凸优化（特别是相对熵编程）解决。在更一般的情况下，对于连续参数化策略（例如神经网络策略），我们使用随机梯度下降来最小化此上界。我们展示了所提出方法应用于学习（1）反应性障碍物回避策略和（2）基于神经网络的抓取策略的模拟结果。我们还展示了Parrot Swing无人机在不同障碍物环境中的硬件结果。我们的例子展示了该方法在具有连续状态和动作空间、复杂（例如非线性）动态、丰富感官输入（例如深度图像）和基于神经网络的策略的机器人系统中提供强泛化保证的潜力。

英文摘要

Our goal is to learn control policies for robots that provably generalize well to novel environments given a dataset of example environments. The key technical idea behind our approach is to leverage tools from generalization theory in machine learning by exploiting a precise analogy (which we present in the form of a reduction) between generalization of control policies to novel environments and generalization of hypotheses in the supervised learning setting. In particular, we utilize the Probably Approximately Correct (PAC)-Bayes framework, which allows us to obtain upper bounds that hold with high probability on the expected cost of (stochastic) control policies across novel environments. We propose policy learning algorithms that explicitly seek to minimize this upper bound. The corresponding optimization problem can be solved using convex optimization (Relative Entropy Programming in particular) in the setting where we are optimizing over a finite policy space. In the more general setting of continuously parameterized policies (e.g., neural network policies), we minimize this upper bound using stochastic gradient descent. We present simulated results of our approach applied to learning (1) reactive obstacle avoidance policies and (2) neural network-based grasping policies. We also present hardware results for the Parrot Swing drone navigating through different obstacle environments. Our examples demonstrate the potential of our approach to provide strong generalization guarantees for robotic systems with continuous state and action spaces, complicated (e.g., nonlinear) dynamics, rich sensory inputs (e.g., depth images), and neural network-based policies.

URL PDF HTML ☆

赞 0 踩 0

1903.11483 2026-06-04 cs.LG cs.NE cs.RO cs.SY eess.SY stat.ML 版本更新

Constructing Parsimonious Analytic Models for Dynamic Systems via Symbolic Regression

通过符号回归构建动态系统的简洁解析模型

Erik Derner, Jiří Kubalík, Nicola Ancona, Robert Babuška

发表机构 * Czech Institute of Informatics, Robotics, and Cybernetics（捷克信息学、机器人学与自动化研究所）； Czech Technical University in Prague（布拉格捷克技术大学）； Department of Control Engineering, Faculty of Electrical Engineering（电气工程系控制工程系）； Delft University of Technology（代尔夫特理工大学）

AI总结本文提出利用符号回归构建动态系统的简洁解析模型，通过两种先进的符号回归算法在状态空间域和输入输出域中应用，展示了在模拟示例和真实系统中的优越性能。

详情

DOI: 10.1016/j.asoc.2020.106432
Journal ref: Applied Soft Computing, Volume 94, September 2020, 106432

AI中文摘要

构建动态系统的数学模型对于许多工程和科学学科至关重要。模型有助于模拟、分析系统行为、决策制定和自动控制算法的设计。即使像强化学习（RL）这样的无模型控制技术也已被证明能从使用模型中受益，通常这些模型是在线学习的。任何模型构建方法都必须处理模型的准确性和复杂性之间的权衡，这很难做到。本文提出利用符号回归（SR）来构建由解析方程描述的简洁过程模型。我们为方法配备了两种最先进的符号回归算法，它们自动搜索适合测量数据的方程：单节点遗传编程（SNGP）和多基因遗传编程（MGGP）。除了状态空间域中的标准问题表述外，我们还展示了该方法如何应用于非线性自回归加外生输入（NARX）类型的输入输出模型。我们展示了该方法在三个模拟示例中的应用，这些示例的状态空间最高可达14维：倒立摆、移动机器人和双足行走机器人。与深度神经网络和局部线性回归的比较表明，SR在大多数情况下优于这些常用替代方法。我们在真实摆系统上展示了解析模型的发现使RL控制器能够成功完成摆起任务，该模型仅基于100个数据样本构建。

英文摘要

Developing mathematical models of dynamic systems is central to many disciplines of engineering and science. Models facilitate simulations, analysis of the system's behavior, decision making and design of automatic control algorithms. Even inherently model-free control techniques such as reinforcement learning (RL) have been shown to benefit from the use of models, typically learned online. Any model construction method must address the tradeoff between the accuracy of the model and its complexity, which is difficult to strike. In this paper, we propose to employ symbolic regression (SR) to construct parsimonious process models described by analytic equations. We have equipped our method with two different state-of-the-art SR algorithms which automatically search for equations that fit the measured data: Single Node Genetic Programming (SNGP) and Multi-Gene Genetic Programming (MGGP). In addition to the standard problem formulation in the state-space domain, we show how the method can also be applied to input-output models of the NARX (nonlinear autoregressive with exogenous input) type. We present the approach on three simulated examples with up to 14-dimensional state space: an inverted pendulum, a mobile robot, and a bipedal walking robot. A comparison with deep neural networks and local linear regression shows that SR in most cases outperforms these commonly used alternative methods. We demonstrate on a real pendulum system that the analytic model found enables a RL controller to successfully perform the swing-up task, based on a model constructed from only 100 data samples.

URL PDF HTML ☆

赞 0 踩 0

1809.04048 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Accurate Tracking of Aggressive Quadrotor Trajectories using Incremental Nonlinear Dynamic Inversion and Differential Flatness

使用增量非线性动态逆反与微分平坦性准确跟踪攻击性四旋翼轨迹

Ezra Tal, Sertac Karaman

AI总结本文提出了一种新的控制律，用于跟踪四旋翼的位置、偏航角及其至四阶导数，包括速度、加速度、加加速度和加加加速度以及偏航速率和偏航加速度。通过基于四旋翼动力学的微分平坦性，使用前馈输入跟踪加加速度和加加加速度。通过闭环电机速度控制，直接控制机体扭矩以实现加加加速度跟踪。控制器利用增量非线性动态逆反（INDI）在外部干扰如空气动力学阻力的情况下实现对线性和角加速度的鲁棒跟踪。通过响应分析严格分析所提出的控制律，并通过实验演示。控制器使四旋翼无人机能够跟踪复杂的3D轨迹，达到最高12.9 m/s的速度和2.1g的加速度，同时保持均方根跟踪误差仅为6.6 cm，在约18m×7m×3m的飞行体积内。此外，通过在飞行测试中附加拖板和在悬停时拉拽无人机来演示控制器的鲁棒性。

Comments To be published in IEEE Transactions on Control Systems Technology. Revision: new set of experiments at increased speed (up to 12.9 m/s), updated controller design using quaternion representation, new video available at https://youtu.be/K15lNBAKDCs

详情

AI中文摘要

自主无人机（UAV）能够执行攻击性（即高速和高加速度）动作已经引起了极大的关注。本文聚焦于准确跟踪攻击性四旋翼轨迹。我们提出了一种新的控制律，用于跟踪位置和偏航角及其导数至四阶，具体包括速度、加速度、加加速度和加加加速度以及偏航速率和偏航加速度。通过基于四旋翼动力学的微分平坦性，使用前馈输入跟踪加加速度和加加加速度。加加加速度跟踪需要直接控制机体扭矩，我们通过闭环电机速度控制，基于安装在电机上的光学编码器的测量来实现。控制器利用增量非线性动态逆反（INDI）在外部干扰如空气动力学阻力的情况下实现对线性和角加速度的鲁棒跟踪。因此，不需要先验建模空气动力学效应。我们通过响应分析严格分析所提出的控制律，并通过实验演示。控制器使四旋翼UAV能够跟踪复杂的3D轨迹，达到最高12.9 m/s的速度和2.1g的加速度，同时保持根均方跟踪误差仅为6.6 cm，在约18m×7m×3m的飞行体积内。我们还通过在飞行测试中附加拖板和在悬停时拉拽UAV来演示控制器的鲁棒性。

英文摘要

Autonomous unmanned aerial vehicles (UAVs) that can execute aggressive (i.e., high-speed and high-acceleration) maneuvers have attracted significant attention in the past few years. This paper focuses on accurate tracking of aggressive quadcopter trajectories. We propose a novel control law for tracking of position and yaw angle and their derivatives of up to fourth order, specifically, velocity, acceleration, jerk, and snap along with yaw rate and yaw acceleration. Jerk and snap are tracked using feedforward inputs for angular rate and angular acceleration based on the differential flatness of the quadcopter dynamics. Snap tracking requires direct control of body torque, which we achieve using closed-loop motor speed control based on measurements from optical encoders attached to the motors. The controller utilizes incremental nonlinear dynamic inversion (INDI) for robust tracking of linear and angular accelerations despite external disturbances, such as aerodynamic drag forces. Hence, prior modeling of aerodynamic effects is not required. We rigorously analyze the proposed control law through response analysis, and we demonstrate it in experiments. The controller enables a quadcopter UAV to track complex 3D trajectories, reaching speeds up to 12.9 m/s and accelerations up to 2.1g, while keeping the root-mean-square tracking error down to 6.6 cm, in a flight volume that is roughly 18 m by 7 m and 3 m tall. We also demonstrate the robustness of the controller by attaching a drag plate to the UAV in flight tests and by pulling on the UAV with a rope during hover.

URL PDF HTML ☆

赞 0 踩 0

1904.02209 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

The Green Choice: Learning and Influencing Human Decisions on Shared Roads

绿色选择：学习和影响共享道路上的人类决策

Erdem Bıyık, Daniel A. Lazar, Dorsa Sadigh, Ramtin Pedarsani

发表机构 * Department of Electrical Engineering, Stanford University（斯坦福大学电气工程系）； Department of Electrical and Computer Engineering, UC Santa Barbara（加州大学圣芭芭拉分校电气与计算机工程系）； Department of Computer Science, Stanford University（斯坦福大学计算机科学系）

AI总结本文研究如何通过设计价格策略来影响人类在共享道路上的决策，以最大化道路使用效率并减少交通延误，核心方法是基于用户偏好算法和交通基本图模型的规划优化。

Comments Submitted to CDC 2019

详情

DOI: 10.1109/CDC40024.2019.9030169

AI中文摘要

自动驾驶车辆通过编队有潜力提高道路容量，即使在与人类驾驶员共享道路的情况下。然而，当道路网络用户自私地选择路线时，所产生的交通配置可能非常低效。为此，我们考虑如何影响人类决策以减少道路上的拥堵。我们考虑一个由平行道路组成的网络，有两种交通模式：(i) 人类驾驶员会选择他们可用的最快路线，(ii) 滚动出行服务提供一系列自动驾驶车辆出行选项，每种选项有不同的价格。在本工作中，我们试图设计这些价格，使得当自动驾驶服务用户选择这些选项，而人类驾驶员自私地选择他们的路线时，道路使用最大化，交通延误最小化。为此，我们正式化了自动驾驶服务用户在不同价格/延误值路线之间做出选择的模型。开发基于偏好的算法来学习用户的偏好，并使用与交通基本图相关的车辆流模型，我们制定了一种规划优化以最大化社会目标，并展示了所提出路线和学习方案的好处。

英文摘要

Autonomous vehicles have the potential to increase the capacity of roads via platooning, even when human drivers and autonomous vehicles share roads. However, when users of a road network choose their routes selfishly, the resulting traffic configuration may be very inefficient. Because of this, we consider how to influence human decisions so as to decrease congestion on these roads. We consider a network of parallel roads with two modes of transportation: (i) human drivers who will choose the quickest route available to them, and (ii) ride hailing service which provides an array of autonomous vehicle ride options, each with different prices, to users. In this work, we seek to design these prices so that when autonomous service users choose from these options and human drivers selfishly choose their resulting routes, road usage is maximized and transit delay is minimized. To do so, we formalize a model of how autonomous service users make choices between routes with different price/delay values. Developing a preference-based algorithm to learn the preferences of the users, and using a vehicle flow model related to the Fundamental Diagram of Traffic, we formulate a planning optimization to maximize a social objective and demonstrate the benefit of the proposed routing and learning scheme.

URL PDF HTML ☆

赞 0 踩 0

1904.01068 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Efficient and Safe Exploration in Deterministic Markov Decision Processes with Unknown Transition Models

在未知转移模型的确定性马尔可夫决策过程中实现高效且安全的探索

Erdem Bıyık, Jonathan Margoliash, Shahrouz Ryan Alimo, Dorsa Sadigh

发表机构 * Stanford University（斯坦福大学）； Jet Propulsion Laboratory（喷气推进实验室）； California Institute of Technology（加州理工学院）

AI总结本文提出了一种安全探索算法，通过利用Lipschitz连续性确保在探索过程中不访问危险状态，该算法在确定性马尔可夫决策过程中提供了确定性的安全保证，并通过模拟导航任务验证了其性能。

Comments Proceedings of the American Control Conference (ACC), July 2019. The first two authors have equal contribution

1905.10706 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Interactive Differentiable Simulation

交互式可微分模拟

Eric Heiden, David Millard, Hejia Zhang, Gaurav S. Sukhatme

发表机构 * University of Southern California（南加州大学）

AI总结本文提出交互式可微分模拟（IDS），一种能够高效准确推断刚体系统物理属性的可微分物理引擎，通过视觉输入实现系统识别，从而建立具有物理意义的世界模型，并在非线性动态系统中实现自动任务机器人设计和参数估计，显著提升了非线性控制领域的样本效率。

详情

AI中文摘要

智能体需要对世界有物理理解才能预测其未来行动的影响。虽然基于学习的环境动力学模型在样本效率上相比无模型强化学习算法有所改进，但通常无法泛化到训练数据之外的系统状态，且往往依赖于非解释性的潜在变量。我们引入交互式可微分模拟（IDS），一种可微分的物理引擎，能够高效准确地推断刚体系统的物理属性。将模型集成到深度学习架构中，该模型能够利用视觉输入实现系统识别，从而建立具有物理意义的世界模型。我们展示了通过自动计算IDS中的梯度，实现非线性动态系统的自动任务机器人设计和参数估计。当与自适应模型预测控制算法结合时，我们的方法在具有挑战性的非线性控制领域中，相比无模型强化学习算法显示出数量级的样本效率提升。

英文摘要

Intelligent agents need a physical understanding of the world to predict the impact of their actions in the future. While learning-based models of the environment dynamics have contributed to significant improvements in sample efficiency compared to model-free reinforcement learning algorithms, they typically fail to generalize to system states beyond the training data, while often grounding their predictions on non-interpretable latent variables. We introduce Interactive Differentiable Simulation (IDS), a differentiable physics engine, that allows for efficient, accurate inference of physical properties of rigid-body systems. Integrated into deep learning architectures, our model is able to accomplish system identification using visual input, leading to an interpretable model of the world whose parameters have physical meaning. We present experiments showing automatic task-based robot design and parameter estimation for nonlinear dynamical systems by automatically calculating gradients in IDS. When integrated into an adaptive model-predictive control algorithm, our approach exhibits orders of magnitude improvements in sample efficiency over model-free reinforcement learning algorithms on challenging nonlinear control domains.

URL PDF HTML ☆

赞 0 踩 0

1804.08871 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Representing the Unknown - Impact of Uncertainty on the Interaction between Decision Making and Trajectory Generation

表示未知 - 不确定性对决策制定与轨迹生成交互的影响

Marcus Nolte, Susanne Ernst, Jan Richelmann, Markus Maurer

发表机构 * Institute of Control Engineering（控制工程研究所）

AI总结本文探讨了不确定性对自动驾驶车辆运动规划问题参数和环境模型要求的影响，强调了决策制定与轨迹生成之间明确接口的重要性。

1810.03345 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Bounded Collision Force by the Sobolev Norm

通过Sobolev范数限制碰撞力

Kevin Haninger, Dragoljub Surdilovic

AI总结本文提出利用Sobolev范数作为系统范数，为动态系统中的最大碰撞力提供严格界限，通过实验和仿真验证了该方法在分析机器人碰撞力与控制策略、关节灵活性及末端执行器柔顺性之间的关系方面的有效性。

Comments Accepted ICRA2019, suppoprted by EU H2020 programme, Grant #820689

详情

DOI: 10.1109/ICRA.2019.8793711

AI中文摘要

机器人与环境或人类接触时可能产生安全风险，包括过大的碰撞力。尽管文献中有关于机器人惯性、相对速度和界面刚度对碰撞影响的实验研究，但关于最大碰撞力的分析模型仍局限于简化质量-弹簧机器人模型。该简化模型限制了对控制（力/扭矩、阻抗或顺应性）或柔顺机器人（关节和末端执行器柔顺性）的分析。本文将Sobolev范数适配为系统范数，为一般动态系统中刚度元件的最大力提供严格界限，允许使用更精确的模型和反馈控制进行碰撞研究。Sobolev范数可通过转换系统的$\mathcal{H}_2$范数找到，允许高效计算、连接现有控制理论并合成控制器以最小化碰撞力。Sobolev范数首先通过顺应性控制机器人进行实验验证，然后在模拟线性柔性关节机器人中进行验证。随后，它被用于研究控制、关节灵活性和末端执行器柔顺性对碰撞的影响，并展示了碰撞性能与环境估计不确定性之间的权衡。

英文摘要

A robot making contact with an environment or human presents potential safety risks, including excessive collision force. While experiments on the effect of robot inertia, relative velocity, and interface stiffness on collision are in literature, analytical models for maximum collision force are limited to a simplified mass-spring robot model. This simplified model limits the analysis of control (force/torque, impedance, or admittance) or compliant robots (joint and end-effector compliance). Here, the Sobolev norm is adapted to be a system norm, giving rigorous bounds on the maximum force on a stiffness element in a general dynamic system, allowing the study of collision with more accurate models and feedback control. The Sobolev norm can be found through the $\mathcal{H}_2$ norm of a transformed system, allowing efficient computation, connection with existing control theory, and controller synthesis to minimize collision force. The Sobolev norm is validated, first experimentally with an admittance-controlled robot, then in simulation with a linear flexible-joint robot. It is then used to investigate the impact of control, joint flexibility and end-effector compliance on collision, and a trade-off between collision performance and environmental estimation uncertainty is shown.

URL PDF HTML ☆

赞 0 踩 0

1602.04450 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Bayesian Optimization with Safety Constraints: Safe and Automatic Parameter Tuning in Robotics

具有安全约束的贝叶斯优化：机器人中的安全自动参数调节

Felix Berkenkamp, Andreas Krause, Angela P. Schoellig

发表机构 * 1 Learning \& Adaptive Systems Group, Department of Computer Science, ETH Zurich, Switzerland. 2 Dynamic Systems Lab, Institute for Aerospace Studies, University of Toronto, Canada.

AI总结本文提出了一种通用算法，允许在目标函数之外存在多个独立的安全约束。该算法在给定初始安全参数的情况下，最大化性能，但仅评估满足所有安全约束的参数。通过利用高斯过程先验的正则性假设，该算法仔细探索参数空间，并展示了如何利用上下文变量安全地将知识转移到新任务中。

详情

AI中文摘要

机器人算法通常依赖于各种参数，这些参数的选择对机器人的性能有显著影响。虽然初始参数猜测可以从机器人的动态模型中获得，但通常需要在真实系统上手动调整参数以达到最佳性能。优化算法，如贝叶斯优化，已被用来自动化这一过程。然而，这些方法在优化过程中可能会评估不安全的参数，导致安全关键系统的故障。最近，一种称为SafeOpt的安全贝叶斯优化算法已被开发，该算法保证系统性能永远不会低于临界值；即，安全性是基于性能函数定义的。然而，在机器人中，将性能和安全性结合往往并不理想。例如，高增益控制器可能实现低平均跟踪误差（性能），但可能会超调并违反输入约束。在本文中，我们提出了一种通用算法，允许在目标函数之外存在多个独立的安全约束。给定初始的安全参数集，该算法最大化性能，但只评估满足所有约束的参数，以高概率。为此，它通过利用高斯过程先验的正则性假设来仔细探索参数空间。此外，我们展示了如何利用上下文变量安全地将知识转移到新情况和任务中。我们提供了理论分析，并证明所提出的算法能够实现快速、自动和安全的参数调节优化，在四旋翼飞行器的实验中得到了验证。

英文摘要

Robotic algorithms typically depend on various parameters, the choice of which significantly affects the robot's performance. While an initial guess for the parameters may be obtained from dynamic models of the robot, parameters are usually tuned manually on the real system to achieve the best performance. Optimization algorithms, such as Bayesian optimization, have been used to automate this process. However, these methods may evaluate unsafe parameters during the optimization process that lead to safety-critical system failures. Recently, a safe Bayesian optimization algorithm, called SafeOpt, has been developed, which guarantees that the performance of the system never falls below a critical value; that is, safety is defined based on the performance function. However, coupling performance and safety is often not desirable in robotics. For example, high-gain controllers might achieve low average tracking error (performance), but can overshoot and violate input constraints. In this paper, we present a generalized algorithm that allows for multiple safety constraints separate from the objective. Given an initial set of safe parameters, the algorithm maximizes performance but only evaluates parameters that satisfy safety for all constraints with high probability. To this end, it carefully explores the parameter space by exploiting regularity assumptions in terms of a Gaussian process prior. Moreover, we show how context variables can be used to safely transfer knowledge to new situations and tasks. We provide a theoretical analysis and demonstrate that the proposed algorithm enables fast, automatic, and safe optimization of tuning parameters in experiments on a quadrotor vehicle.

URL PDF HTML ☆

赞 0 踩 0

1712.01491 2026-06-04 eess.SY cs.RO cs.SY 版本更新

TrackerBots: Autonomous Unmanned Aerial Vehicle for Real-Time Localization and Tracking of Multiple Radio-Tagged Animals

TrackerBots: 用于实时定位和跟踪多个无线电标签动物的自主无人机

Hoa Van Nguyen, Michael Chesser, Lian Pin Koh, S. Hamid Rezatofighi, Damith C. Ranasinghe

发表机构 * Wiley（威立出版集团）

AI总结本文提出了一种名为TrackerBots的自主无人机系统，利用RSSI测量实现低成本轻量级的多只无线电标签动物的实时定位和跟踪，通过粒子滤波和POMDP动态路径规划提高导航效率并节省电量。

Comments The accepted version to the Journal of Field Robotics, published after the embargo period (12 months)

详情

DOI: 10.1002/rob.21857
Journal ref: Journal of Field Robotics. 2019; 36: 617 - 635

AI中文摘要

自主空中机器人为通过高效收集时间空间粒度信息研究濒危物种的栖息地和行为提供了新可能。我们提出了一种新的自主空中车辆系统-TrackerBots，用于跟踪和定位多个无线电标签动物。利用非常规场常用的高频（VHF）无线电颈环接收信号强度指示（RSSI）值的简单性，实现了一种低成本轻量级的跟踪平台，适合与无人机（UAV）集成。由于基于RSSI测量的系统存在不确定性和非线性，我们的跟踪和规划方法整合了粒子滤波用于跟踪和定位；部分可观测马尔可夫决策过程（POMDP）用于动态路径规划。这种方法允许无人机在最大信息增益方向上自主导航以定位多个移动动物并减少探索时间；从而节省机载电池电量。我们还采用了搜索终止标准的概念以在空中系统的功率限制内最大化所定位动物的数量。我们通过广泛的模拟和使用两个移动VHF无线电标签的实地实验验证了我们的实时和在线方法。

英文摘要

Autonomous aerial robots provide new possibilities to study the habitats and behaviors of endangered species through the efficient gathering of location information at temporal and spatial granularities not possible with traditional manual survey methods. We present a novel autonomous aerial vehicle system-TrackerBots-to track and localize multiple radio-tagged animals. The simplicity of measuring the received signal strength indicator (RSSI) values of very high frequency (VHF) radio-collars commonly used in the field is exploited to realize a low cost and lightweight tracking platform suitable for integration with unmanned aerial vehicles (UAVs). Due to uncertainty and the nonlinearity of the system based on RSSI measurements, our tracking and planning approaches integrate a particle filter for tracking and localizing; a partially observable Markov decision process (POMDP) for dynamic path planning. This approach allows autonomous navigation of a UAV in a direction of maximum information gain to locate multiple mobile animals and reduce exploration time; and, consequently, conserve onboard battery power. We also employ the concept of a search termination criteria to maximize the number of located animals within power constraints of the aerial system. We validated our real-time and online approach through both extensive simulations and field experiments with two mobile VHF radio-tags.

URL PDF HTML ☆

赞 0 踩 0

1711.00493 2026-06-04 eess.SY cs.RO cs.SY eess.SP 版本更新

Event-Triggered Diffusion Kalman Filters

事件触发扩散卡尔曼滤波器

Amr Alanwar, Hazem Said, Ankur Mehta, Matthias Althoff

发表机构 * Technical University of Munich（慕尼黑技术大学）； Ain Shams University（艾因夏姆斯大学）； University of California, Los Angeles（加州大学洛杉矶分校）

AI总结本文提出了一种事件触发的扩散卡尔曼滤波器，通过本地信号指示估计误差来收集测量并交换信息，从而减少资源消耗并提高分布式状态估计的有效性。

详情

AI中文摘要

分布式状态估计强烈依赖于协作信号处理，这通常需要在资源受限的传感器节点上执行过多的通信和计算。为了解决这个问题，我们提出了一种事件触发的扩散卡尔曼滤波器，该滤波器根据本地信号指示的估计误差收集测量并交换信息。在此基础上，我们开发了一种节能的状态估计算法，该算法调节无线网络中的资源消耗，并确保每个消耗的资源的有效性。所提出的算法不需要节点共享其本地协方差矩阵，从而允许显著减少传输信息的数量。为了验证其效率，我们将所提出算法应用于分布式的同时定位和时间同步问题，并在移动四旋翼节点和静止的定制超宽带无线设备的物理测试台上进行评估。获得的实验结果表明，所提出的算法在通信开销方面节省了与原始扩散卡尔曼滤波器相关的86%，而仅导致性能下降16%。我们在线提供了Matlab代码和实际测试数据。

英文摘要

Distributed state estimation strongly depends on collaborative signal processing, which often requires excessive communication and computation to be executed on resource-constrained sensor nodes. To address this problem, we propose an event-triggered diffusion Kalman filter, which collects measurements and exchanges messages between nodes based on a local signal indicating the estimation error. On this basis, we develop an energy-aware state estimation algorithm that regulates the resource consumption in wireless networks and ensures the effectiveness of every consumed resource. The proposed algorithm does not require the nodes to share its local covariance matrices, and thereby allows considerably reducing the number of transmission messages. To confirm its efficiency, we apply the proposed algorithm to the distributed simultaneous localization and time synchronization problem and evaluate it on a physical testbed of a mobile quadrotor node and stationary custom ultra-wideband wireless devices. The obtained experimental results indicate that the proposed algorithm allows saving 86% of the communication overhead associated with the original diffusion Kalman filter while causing deterioration of performance by 16% only. We make the Matlab code and the real testing data available online.

URL PDF HTML ☆

赞 0 踩 0

1612.04704 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Distributed area coverage control with imprecise robot localization: Simulation and experimental studies

分布式区域覆盖控制与不精确机器人定位：仿真与实验研究

Sotiris Papatheodorou, Anthony Tzes, Konstantinos Giannousakis, Yiannis Stergiopoulos

AI总结本文研究了具有不精确定位的移动机器人网络的区域覆盖问题，提出了一种基于保证Voronoi原理的分布式控制方法，通过仿真和实验验证了其有效性。

Comments 12 pages, 28 figures, submitted to IEEE Transactions on Automatic Control in 14 December 2016

1904.13317 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

A data-efficient geometrically inspired polynomial kernel for robot inverse dynamics

一种数据高效且受几何启发的多项式核用于机器人逆动力学

Alberto Dalla Libera, Ruggero Carli

AI总结本文提出了一种基于高斯过程回归的数据驱动逆动力学估计器，引入了几何启发多项式核（GIP），该核在合适输入空间上将逆动力学描述为多项式函数，并证明其定义了有限维的再生核希尔伯特空间，包含刚体动力学计算的逆动力学函数，实验表明该方法在数据效率和泛化能力上优于其他数据驱动方法，同时相比模型驱动方法需要更少的先验信息且不受模型偏差影响。

详情

DOI: 10.1109/LRA.2019.2945240
Journal ref: IEEE Robotics and Automation Letters, vol. 5, no. 1, pp. 24-31, Jan. 2020

AI中文摘要

在本文中，我们介绍了一种基于高斯过程回归的新数据驱动逆动力学估计器。受逆动力学可以描述为合适输入空间上的多项式函数的启发，我们提出了一个名为几何启发多项式核（GIP）的新核。所得到的估计器在数据效率方面与基于模型的方法相似。事实上，我们证明了GIP核定义了一个有限维的再生核希尔伯特空间，该空间包含通过刚体动力学计算的逆动力学函数。所提出的核基于最近引入的乘法多项式核，这是经典多项式核的重新定义，配备了允许更高正则化的参数集。我们已在模拟环境和UR10机器人的真实实验中测试了所提出的方法。获得的结果证实，与其它数据驱动估计器相比，所提出的方法在数据效率和泛化能力上更优。相反，与基于模型的估计器相比，我们的方法需要更少的先验信息且不受模型偏差影响。

英文摘要

In this paper, we introduce a novel data-driven inverse dynamics estimator based on Gaussian Process Regression. Driven by the fact that the inverse dynamics can be described as a polynomial function on a suitable input space, we propose the use of a novel kernel, called Geometrically Inspired Polynomial Kernel (GIP). The resulting estimator behaves similarly to model-based approaches as concerns data efficiency. Indeed, we proved that the GIP kernel defines a finite-dimensional Reproducing Kernel Hilbert Space that contains the inverse dynamics function computed through the Rigid Body Dynamics. The proposed kernel is based on the recently introduced Multiplicative Polynomial Kernel, a redefinition of the classical polynomial kernel equipped with a set of parameters that allows for a higher regularization. We tested the proposed approach in a simulated environment, and also in real experiments with a UR10 robot. The obtained results confirm that, compared to other data-driven estimators, the proposed approach is more data-efficient and exhibits better generalization properties. Instead, with respect to model-based estimators, our approach requires less prior information and is not affected by model bias.

URL PDF HTML ☆

赞 0 踩 0

1905.03416 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Prioritized Inverse Kinematics: Nonsmoothness, Trajectory Existence, Task Convergence, Stability

优先级逆运动学：非光滑性、轨迹存在性、任务收敛性、稳定性

Sang-ik An, Dongheui Lee

发表机构 * German Excellence Initiative（德国卓越计划）； Institute of Robotics and Mechatronics, German Aerospace Center（机器人与机电研究所，德国航空航天中心）

AI总结本文研究了一类优先级逆运动学（PIK）解的理论性质，探讨了其作为动态系统优先级多输出的输出调节或跟踪控制律的特性。首先，开发了研究PIK解非光滑性的工具，并发现非光滑性的充分条件。这表明经典定理无法保证满足PIK解的关节轨迹存在性和唯一性。因此，构建了一个利用PIK解结构信息的替代存在性和唯一性定理。接着，将PIK解的类别缩小到所有任务都遵循某些期望任务轨迹的情况，并发现与任务收敛性相关的性质。进一步分析了当所有任务都达到某些期望任务位置时，由PIK解作为右侧的微分方程的平衡点的稳定性。最后，通过一个双臂机械臂的例子，展示了如何利用这些发现来分析由PIK解生成的关节轨迹行为。

Comments 18 pages

详情

AI中文摘要

在本文中，我们研究了一类优先级逆运动学（PIK）解的各种理论性质，这些解可以被视为动态系统优先级多输出的输出调节或跟踪控制律。我们首先开发了研究PIK解非光滑性的工具，并发现非光滑性的充分条件。这表明经典定理无法保证满足PIK解的关节轨迹存在性和唯一性。因此，我们构建了一个利用PIK解结构信息的替代存在性和唯一性定理。接着，我们将PIK解的类别缩小到所有任务都遵循某些期望任务轨迹的情况，并发现与任务收敛性相关的性质。研究进一步分析了当所有任务都达到某些期望任务位置时，由PIK解作为右侧的微分方程的平衡点的稳定性。最后，我们通过一个双臂机械臂的例子，展示了如何利用这些发现来分析由PIK解生成的关节轨迹行为。

英文摘要

In this paper, we study various theoretical properties of a class of prioritized inverse kinematics (PIK) solutions that can be considered as a class of (output regulation or tracking) control laws of a dynamical system with prioritized multiple outputs. We first develop tools to investigate nonsmoothness of PIK solutions and find a sufficient condition for nonsmoothness. It implies that existence and uniqueness of a joint trajectory satisfying a PIK solution cannot be guaranteed by the classical theorems. So, we construct an alternative existence and uniqueness theorem that uses structural information of PIK solutions. Then, we narrow the class of PIK solutions down to the case that all tasks are designed to follow some desired task trajectories and discover a few properties related to task convergence. The study goes further to analyze stability of equilibrium points of the differential equation whose right hand side is a PIK solution when all tasks are designed to reach some desired task positions. Finally, we furnish an example with a two-link manipulator that shows how our findings can be used to analyze the behavior of a joint trajectory generated from a PIK solution.

URL PDF HTML ☆

赞 0 踩 0

1808.08252 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Inverse Statics Optimization for Compound Tensegrity Robots

复合张力结构机器人的逆静态优化

Andrew P. Sabelhaus, Albert H. Li, Kimberly A. Sover, Jacob Madden, Andrew Barkan, Adrian K. Agogino, Alice M. Agogino

发表机构 * Department of Mechanical Engineering, University of California Berkeley（加州大学伯克利分校机械工程系）； Intelligent Systems Division, NASA Ames Research Center（NASA阿姆斯研究中心智能系统部）

AI总结本文提出了一种方法，用于计算复合张力结构机器人的电缆张力，以实现静态平衡。该方法基于改进的力密度法，通过二次优化问题解决电缆张力计算，并通过仿真和硬件实验验证了其在脊柱机器人和四足机器人设计与控制中的有效性。

详情

AI中文摘要

由电缆驱动的张力结构（张力-完整性）机器人具有软机器人许多优点，如灵活性和鲁棒性，同时仍遵循简单的静态和动态模型。然而，现有的张力结构建模方法无法原生描述具有任意刚体的张力网络的机器人。本文提出了一种方法，用于计算此类张力结构机器人的电缆张力，这里定义为复合张力结构。首先，将复合张力结构机器人的静态平衡模型重新表述为用于其他张力结构的标准力密度法。接下来，我们提出了在所提出模型下计算机器人电缆张力的问题。提出了解决方案作为带有实际约束的二次优化问题。仿真展示了如何利用该逆静态优化问题来设计和控制两种不同的复合张力结构应用：由该脊柱制成的脊柱机器人和四足机器人。最后，通过硬件实验验证了逆静态模型的准确性，证明了使用所提出方法进行低误差开环控制的可行性。

英文摘要

Robots built from cable-driven tensegrity (`tension-integrity') structures have many of the advantages of soft robots, such as flexibility and robustness, while still obeying simple statics and dynamics models. However, existing tensegrity modeling approaches cannot natively describe robots with arbitrary rigid bodies in their tension network. This work presents a method to calculate the cable tensions in static equilibrium for such tensegrity robots, here defined as compound tensegrity. First, a static equilibrium model for compound tensegrity robots is reformulated from the standard force density method used with other tensegrity structures. Next, we pose the problem of calculating tension forces in the robot's cables under our proposed model. A solution is proposed as a quadratic optimization problem with practical constraints. Simulations illustrate how this inverse statics optimization problem can be used for both the design and control of two different compound tensegrity applications: a spine robot and a quadruped robot built from that spine. Finally, we verify the accuracy of the inverse statics model through a hardware experiment, demonstrating the feasibility of low-error open-loop control using our proposed methodology.

URL PDF HTML ☆

赞 0 踩 0

1705.05415 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Robotic Wireless Sensor Networks

机器人无线传感器网络

Pradipta Ghosh, Andrea Gasparri, Jiong Jin, Bhaskar Krishnamachari

发表机构 * University of Southern California（南加州大学）； Università degli studi "Roma Tre"（罗马三大学）； Swinburne University of Technology（斯winburne技术大学）

AI总结本文综述了机器人与无线传感器网络交叉领域的新兴研究领域，探讨了多机器人系统在满足通信性能要求的同时实现传感目标的协同控制、学习和适应方法，并指出现有文献中存在的一些研究空白和未来研究方向。

详情

DOI: 10.1007/978-3-319-92384-0_16

AI中文摘要

在本章中，我们介绍了机器人与无线传感器网络（WSN）交叉领域的新兴、前沿且跨学科的研究领域，称为机器人无线传感器网络（RWSN）。我们定义RWSN为一种自主的多机器人网络系统，旨在通过协同控制、学习和适应，实现特定的传感目标，同时满足和维持一定的通信性能要求。尽管机器人和WSN这两个领域都非常知名且研究充分，但这两个领域交叉处存在大量新的机会和研究方向，这些方向要么相对未被探索，要么完全未被探索。例如，使用一组机器人路由器来建立发送方和接收方之间的临时通信路径，利用受控的移动性优势来改进数据路由。我们发现，直接归类为RWSN相关研究的文献数量非常有限，而机器人和WSN文献中存在一些相关于这一新研究领域的文章。为了连接这些点，我们首先识别了与RWSN相关的核心问题和研究趋势，如连通性、定位、路由和信息的鲁棒流动。接着，我们根据第一步中识别的问题和趋势，将现有的RWSN研究以及机器人和WSN社区的相关最新进展进行分类。最后，我们分析现有文献中缺失的部分，并确定未来需要更多研究关注的主题。

英文摘要

In this chapter, we present a literature survey of an emerging, cutting-edge, and multi-disciplinary field of research at the intersection of Robotics and Wireless Sensor Networks (WSN) which we refer to as Robotic Wireless Sensor Networks (RWSN). We define a RWSN as an autonomous networked multi-robot system that aims to achieve certain sensing goals while meeting and maintaining certain communication performance requirements, through cooperative control, learning and adaptation. While both of the component areas, i.e., Robotics and WSN, are very well-known and well-explored, there exist a whole set of new opportunities and research directions at the intersection of these two fields which are relatively or even completely unexplored. One such example would be the use of a set of robotic routers to set up a temporary communication path between a sender and a receiver that uses the controlled mobility to the advantage of packet routing. We find that there exist only a limited number of articles to be directly categorized as RWSN related works whereas there exist a range of articles in the robotics and the WSN literature that are also relevant to this new field of research. To connect the dots, we first identify the core problems and research trends related to RWSN such as connectivity, localization, routing, and robust flow of information. Next, we classify the existing research on RWSN as well as the relevant state-of-the-arts from robotics and WSN community according to the problems and trends identified in the first step. Lastly, we analyze what is missing in the existing literature, and identify topics that require more research attention in the future.

URL PDF HTML ☆

赞 0 踩 0

1905.08314 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Longitudinal Dynamic versus Kinematic Models for Car-Following Control Using Deep Reinforcement Learning

纵向动态模型与运动学模型在使用深度强化学习的汽车跟随控制中的比较

Yuan Lin, John McPhee, Nasser L. Azad

发表机构 * University of Waterloo, Ontario, Canada（加拿大温哥华大学）

AI总结本文研究了在考虑车辆动力学的情况下，使用深度强化学习的纵向汽车跟随控制问题，通过引入延迟的控制输入和实际车辆加速度到强化学习环境状态中，改进了DRL框架，从而在考虑车辆动力学时实现了接近最优的控制性能。

Comments Accepted to 2019 IEEE Intelligent Transportation Systems Conference

详情

DOI: 10.1109/ITSC.2019.8916781

AI中文摘要

目前大多数关于通过深度强化学习（DRL）实现自动驾驶车辆控制的研究都使用点质量运动学模型，忽略了车辆动力学，包括加速度延迟和加速度命令动力学。加速度延迟源于传感和执行延迟，导致控制输入执行延迟。加速度命令动力学决定了实际车辆加速度不会立即达到期望的命令加速度，因为存在动力学限制。在本工作中，我们研究了将使用车辆运动学模型训练的DRL控制器应用于更现实的驾驶控制中的可行性。我们考虑了一个特定的纵向汽车跟随控制问题，即自适应巡航控制系统（ACC），该问题通过使用点质量运动学模型的DRL解决。当此类控制器应用于具有车辆动力学的汽车跟随时，我们观察到显著退化的汽车跟随性能。因此，我们重新设计DRL框架，通过将延迟的控制输入和实际车辆加速度分别添加到强化学习环境状态中，以适应加速度延迟和加速度命令动力学。训练结果表明，改进后的DRL控制器在考虑车辆动力学时的汽车跟随控制性能接近最优，与动态规划解决方案相比。

英文摘要

The majority of current studies on autonomous vehicle control via deep reinforcement learning (DRL) utilize point-mass kinematic models, neglecting vehicle dynamics which includes acceleration delay and acceleration command dynamics. The acceleration delay, which results from sensing and actuation delays, results in delayed execution of the control inputs. The acceleration command dynamics dictates that the actual vehicle acceleration does not rise up to the desired command acceleration instantaneously due to dynamics. In this work, we investigate the feasibility of applying DRL controllers trained using vehicle kinematic models to more realistic driving control with vehicle dynamics. We consider a particular longitudinal car-following control, i.e., Adaptive Cruise Control (ACC), problem solved via DRL using a point-mass kinematic model. When such a controller is applied to car following with vehicle dynamics, we observe significantly degraded car-following performance. Therefore, we redesign the DRL framework to accommodate the acceleration delay and acceleration command dynamics by adding the delayed control inputs and the actual vehicle acceleration to the reinforcement learning environment state, respectively. The training results show that the redesigned DRL controller results in near-optimal control performance of car following with vehicle dynamics considered when compared with dynamic programming solutions.

URL PDF HTML ☆

赞 0 踩 0

1903.12311 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Mesh-based Tools to Analyze Deep Reinforcement Learning Policies for Underactuated Biped Locomotion

基于网格的工具用于分析深度强化学习策略在欠驱动双足运动中的稳定性与鲁棒性

Nihar Talele, Katie Byl

AI总结本文提出了一种基于网格的方法，用于分析通过深度强化学习获得的双足运动策略的稳定性与鲁棒性，通过量化评估策略鲁棒性的程度，提供更高效的工具来评估此类策略的鲁棒性特性。

详情

AI中文摘要

在本文中，我们提出了一种基于网格的方法，用于分析通过深度强化学习获得的五连杆平面模型各种双足步态策略的稳定性与鲁棒性。直观上，人们可能会认为在训练过程中包含扰动和/或其它类型的噪声会导致最终的控制策略更加鲁棒。然而，人们也希望有一个定量且计算高效的手段来评估这种可能性的程度。而不是依赖蒙特卡洛模拟，这种模拟在量化性能指标时可能会变得计算负担很重，我们的目标是提供更复杂的工具来评估此类策略的鲁棒性特性。我们的工作受双重视假的启发，即当动态收缩可行时，可以简化所需控制策略的复杂性，并且通过深度学习获得的控制策略可能会倾向于收缩到全状态空间中的低维流形，从而产生这种倾向。本文中网格工具的可操作性提供了一些证据表明这可能是正确的。

英文摘要

In this paper, we present a mesh-based approach to analyze stability and robustness of the policies obtained via deep reinforcement learning for various biped gaits of a five-link planar model. Intuitively, one would expect that including perturbations and/or other types of noise during training would likely result in more robustness of the resulting control policy. However, one would also like to have a quantitative and computationally-efficient means of evaluating the degree to which this might be so. Rather than relying on Monte Carlo simulations, which can become quite computationally burdensome in quantifying performance metrics, our goal is to provide more sophisticated tools to assess robustness properties of such policies. Our work is motivated by the twin hypotheses that contraction of dynamics, when achievable, can simplify the required complexity of a control policy and that control policies obtained via deep learning may therefore exhibit tendency to contract to lower-dimensional manifolds within the full state space, as a result. The tractability of our mesh-based tools in this work provides some evidence that this may be so.

URL PDF HTML ☆

赞 0 踩 0

1904.11898 2026-06-04 cs.RO cs.CV cs.LG cs.SY eess.SY 版本更新

Perceptual Attention-based Predictive Control

基于感知注意力的预测控制

Keuntaek Lee, Gabriel Nakajima An, Viacheslav Zakharov, Evangelos A. Theodorou

发表机构 * Georgia Institute of Technology（佐治亚理工学院）

AI总结本文提出了一种新的信息处理架构，用于安全的深度学习视觉导航系统，通过模型预测控制（MPC）、卷积神经网络（CNNs）和不确定性量化方法，实现基于感知注意力的预测控制算法，提高了系统对不安全状况的快速检测能力。

详情

AI中文摘要

在本文中，我们提出了一种新的信息处理架构，用于安全的基于深度学习的视觉导航自主系统。所提出的信息处理架构用于支持一种基于感知注意力的预测控制算法，该算法利用模型预测控制（MPC）、卷积神经网络（CNNs）和不确定性量化方法。我们的方法新颖之处在于利用MPC学习如何在视觉输入的相关区域上放置注意力，从而最终使系统能够更快速地检测到不安全状况。我们通过使用MPC学习如何选择输入图像中的感兴趣区域，这些区域用于输出控制动作以及在注意力感知的视觉输入中的epistemic和aleatoric不确定性估计。我们使用这些不确定性估计来量化在当前导航条件下网络控制器的安全性。所提出的架构和算法在1:5比例的陆地车辆上进行了测试。实验结果表明，所提出的算法在早期检测不安全状况方面优于先前的方法，例如当导航环境中出现新障碍物时。所提出的架构是向在安全关键领域使用基于深度学习的感知控制策略迈出的第一步。

英文摘要

In this paper, we present a novel information processing architecture for safe deep learning-based visual navigation of autonomous systems. The proposed information processing architecture is used to support a perceptual attention-based predictive control algorithm that leverages model predictive control (MPC), convolutional neural networks (CNNs), and uncertainty quantification methods. The novelty of our approach lies in using MPC to learn how to place attention on relevant areas of the visual input, which ultimately allows the system to more rapidly detect unsafe conditions. We accomplish this by using MPC to learn to select regions of interest in the input image, which are used to output control actions as well as estimates of epistemic and aleatoric uncertainty in the attention-aware visual input. We use these uncertainty estimates to quantify the safety of our network controller under the current navigation condition. The proposed architecture and algorithm is tested on a 1:5 scale terrestrial vehicle. Experimental results show that the proposed algorithm outperforms previous approaches on early detection of unsafe conditions, such as when novel obstacles are present in the navigation environment. The proposed architecture is the first step towards using deep learning-based perceptual control policies in safety-critical domains.

URL PDF HTML ☆

赞 0 踩 0

1809.03674 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Planar Cooperative Extremum Seeking with Guaranteed Convergence Using A Three-Robot Formation

平面协作极值搜索与保证收敛的三机器人编队

Anna Skobeleva, Baris Fidan, V. Ugrinovskii, Ian R. Petersen

发表机构 * Mechanical and Mechatronics Engineering Department, University of Waterloo（滑铁卢大学机械与机电工程系）

AI总结本文提出了一种结合编队获取和协作极值搜索控制的方案，用于在平面上移动的三机器人团队。极值搜索任务是找到一个未知二维函数在平面上的最大值点。该函数表示由于位于最大值点的源产生的信号强度场，并假设该函数在最大值点附近局部凹凸且随距离源点的增加而单调递减。通过在特定领头机器人位置和最大值点处对场函数进行泰勒展开，并结合基于机器人信号强度测量的梯度估计器，设计并分析了所提出的控制方案。所提出的方案被证明可以指数收敛并同时（i）获取指定的几何编队，（ii）将领头机器人驱动到指定的最大值点附近的盘内，该盘的半径取决于指定的期望编队大小以及场函数的海森矩阵范数上限。通过一组仿真实验评估了所提出控制方案的性能。

Comments Presented at the 2018 IEEE Conference on Decision and Control (CDC), Miami Beach, FL, USA

详情

DOI: 10.1109/CDC.2018.8619784

AI中文摘要

本文提出了一种结合编队获取和协作极值搜索控制的方案，用于在平面上移动的三机器人团队。极值搜索任务是找到一个未知二维函数在平面上的最大值点。该函数表示由于位于最大值点的源产生的信号强度场，并假设该函数在最大值点附近局部凹凸且随距离源点的增加而单调递减。通过在特定领头机器人位置和最大值点处对场函数进行泰勒展开，并结合基于机器人信号强度测量的梯度估计器，设计并分析了所提出的控制方案。所提出的方案被证明可以指数收敛并同时（i）获取指定的几何编队，（ii）将领头机器人驱动到指定的最大值点附近的盘内，该盘的半径取决于指定的期望编队大小以及场函数的海森矩阵范数上限。通过一组仿真实验评估了所提出控制方案的性能。

英文摘要

In this paper, a combined formation acquisition and cooperative extremum seeking control scheme is proposed for a team of three robots moving on a plane. The extremum seeking task is to find the maximizer of an unknown two-dimensional function on the plane. The function represents the signal strength field due to a source located at maximizer, and is assumed to be locally concave around maximizer and monotonically decreasing in distance to the source location. Taylor expansions of the field function at the location of a particular lead robot and the maximizer are used together with a gradient estimator based on signal strength measurements of the robots to design and analyze the proposed control scheme. The proposed scheme is proven to exponentially and simultaneously (i) acquire the specified geometric formation and (ii) drive the lead robot to a specified neighborhood disk around maximizer, whose radius depends on the specified desired formation size as well as the norm bounds of the Hessian of the field function. The performance of the proposed control scheme is evaluated using a set of simulation experiments.

URL PDF HTML ☆

赞 0 踩 0

1402.3735 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Decentralized Goal Assignment and Safe Trajectory Generation in Multi-Robot Networks via Multiple Lyapunov Functions

多机器人网络中基于多重李雅普诺夫函数的去中心化目标分配与安全轨迹生成

Dimitra Panagou, Matthew Turpin, Vijay Kumar

发表机构 * GRASP Lab, School of Engineering and Applied Science, University of Pennsylvania（宾夕法尼亚大学工程与应用科学学院GRASP实验室）

AI总结本文研究了在仅本地通信可用的情况下多机器人网络的去中心化目标分配与轨迹生成问题，提出了一种基于切换系统和集合不变性方法的解决方案。通过使用一组李雅普诺夫-like函数编码候选目标分配的局部决策，使连接的代理选择最短总距离到目标的分配。当最优分配可能导致碰撞轨迹时，使用另一组李雅普诺夫-like屏障函数来维持系统安全并保持收敛保证。所提出的切换策略产生计算高效且可扩展的反馈控制策略，适用于在有限信息共享下机器人网络的快速响应部署。

详情

AI中文摘要

本文考虑了在仅本地通信可用的情况下多机器人网络的去中心化目标分配与轨迹生成问题，并提出了一种基于与切换系统和集合不变性相关的方法的解决方案。采用一组李雅普诺夫-like函数来编码候选目标分配之间的（局部）决策，使得一组连接的代理选择导致到目标的最短总距离的分配。在最优分配可能导致碰撞轨迹的情况下，激活另一组李雅普诺夫-like屏障函数，从而在保持系统安全的同时保留收敛保证。所提出的切换策略产生了计算高效且可扩展的反馈控制策略，因此适用于在有限信息共享下机器人网络的快速响应部署。通过模拟结果和六台地面机器人的实验验证了所提出方法的有效性。

英文摘要

This paper considers the problem of decentralized goal assignment and trajectory generation for multi-robot networks when only local communication is available, and proposes an approach based on methods related to switched systems and set invariance. A family of Lyapunov-like functions is employed to encode the (local) decision making among candidate goal assignments, under which a group of connected agents chooses the assignment that results in the shortest total distance to the goals. An additional family of Lyapunov-like barrier functions is activated in the case when the optimal assignment may lead to colliding trajectories, maintaining thus system safety while preserving the convergence guarantees. The proposed switching strategies give rise to feedback control policies that are computationally efficient and scalable with the number of agents, and therefore suitable for applications including first-response deployment of robotic networks under limited information sharing. The efficacy of the proposed method is demonstrated via simulation results and experiments with six ground robots.

URL PDF HTML ☆

赞 0 踩 0

1811.12819 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Structure-Preserving Constrained Optimal Trajectory Planning of a Wheeled Inverted Pendulum

保持结构的约束最优轨迹规划 of 一个轮式倒置摆

Klaus Albert, Karmvir Singh Phogat, Felix Anhalt, Ravi N Banavar, Debasish Chatterjee, Boris Lohmann

发表机构 * Systems and Control Engineering（系统与控制工程）

AI总结本文研究了轮式倒置摆的约束最优轨迹规划问题，通过离散力学推导了离散时间模型，并利用离散时间约束最优控制问题生成最优轨迹，同时设计了非线性连续时间模型和闭环LQ控制器，通过实验验证了非线性模型和控制方案的有效性。

Comments 12 pages, 8 figures, 1 table. arXiv admin note: text overlap with arXiv:1710.10932

详情

AI中文摘要

轮式倒置摆（WIP）是一个欠驱动、非完整机电系统，已被商业化为Segway。设计一个控制律进行运动规划，同时考虑状态和控制约束，并尊重配置流形，是一个具有挑战性的问题。本文通过离散力学推导了WIP系统的离散时间模型，并通过求解离散时间约束最优控制问题生成WIP系统的最优轨迹。进一步，我们描述了一个带有参数的非线性连续时间模型，用于设计闭环LQ控制器。所设计的最优轨迹作为参考提供给机器人，同时最优控制轨迹作为前馈控制作用，反馈模式下的LQ控制器用于抑制噪声和干扰，以确保WIP系统的稳定运动。在进行涉及剧烈操作和较为急转弯的WIP系统实验时，我们发现设计的最优轨迹与机器人跟踪这些轨迹时所走的路径高度一致。这证实了非线性模型和控制方案的有效性。最后，这些实验展示了WIP系统的高度非线性特性和控制方案的鲁棒性。

英文摘要

The Wheeled Inverted Pendulum (WIP) is an underactuated, nonholonomic mechatronic system, and has been popularized commercially as the Segway. Designing a control law for motion planning, that incorporates the state and control constraints, while respecting the configuration manifold, is a challenging problem. In this article we derive a discrete-time model of the WIP system using discrete mechanics and generate optimal trajectories for the WIP system by solving a discrete-time constrained optimal control problem. Further, we describe a nonlinear continuous-time model with parameters for designing a closed loop LQ-controller. A dual control architecture is implemented in which the designed optimal trajectory is then provided as a reference to the robot with the optimal control trajectory as a feedforward control action, and an LQ-controller in the feedback mode is employed to mitigate noise and disturbances for ensuing stable motion of the WIP system. While performing experiments on the WIP system involving aggressive maneuvers with fairly sharp turns, we found a high degree of congruence in the designed optimal trajectories and the path traced by the robot while tracking these trajectories. This corroborates the validity of the nonlinear model and the control scheme. Finally, these experiments demonstrate the highly nonlinear nature of the WIP system and robustness of the control scheme.

URL PDF HTML ☆

赞 0 踩 0

1802.00714 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Incremental Control and Guidance of Hybrid Aircraft Applied to a Tailsitter UAV

增量控制与引导用于混合式飞行器的尾旋无人机

E. J. J. Smeur, M. Bronz, G. C. H. E. de Croon

发表机构 * Delft University of Technology（代尔夫特理工大学）； ENAC, MAIAA, University of Toulouse（法国图卢兹大学）

AI总结本文提出了一种增量非线性动态逆控制方法，用于混合式飞行器的姿态和位置控制，实现了一个连续的控制器，能够跨飞行包线跟踪飞行器的期望加速度，并在尾旋无人机上进行了多场户外实验验证。

Comments 20 pages, 26 figures

详情

DOI: 10.2514/1.G004520
Journal ref: Journal of Guidance, Control and Dynamics, September 2019 [online]

AI中文摘要

混合无人飞行器可以显著提高微空气车辆的潜力，因为它们结合了悬停能力和机翼以实现快速高效的前进飞行。然而，这些车辆很难控制，因为其空气动力学难以建模且容易受到风切变影响。这通常导致复杂的复合控制器，具有悬停、过渡和前进飞行的不同模式。在本文中，我们提出了一种增量非线性动态逆控制用于姿态和位置控制。结果是一个单一的连续控制器，能够跨飞行包线跟踪飞行器的期望加速度。所提出的控制器在Cyclone混合无人机上实现。进行了多次户外实验，显示未建模的力和力矩被增量控制结构有效补偿。最后，我们提供了一种全面的程序，用于在其他类型的混合无人机上实现该控制器。

英文摘要

Hybrid unmanned aircraft can significantly increase the potential of micro air vehicles, because they combine hovering capability with a wing for fast and efficient forward flight. However, these vehicles are very difficult to control, because their aerodynamics are hard to model and they are susceptible to wind gusts. This often leads to composite and complex controllers, with different modes for hover, transition and forward flight. In this paper, we propose incremental nonlinear dynamic inversion control for the attitude and position control. The result is a single, continuous controller, that is able to track the desired acceleration of the vehicle across the flight envelope. The proposed controller is implemented on the Cyclone hybrid UAV. Multiple outdoor experiments are performed, showing that unmodeled forces and moments are effectively compensated by the incremental control structure. Finally, we provide a comprehensive procedure for the implementation of the controller on other types of hybrid UAVs.

URL PDF HTML ☆

赞 0 踩 0

1904.03830 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Hybrid Compositional Approach to Optimal Mission Planning for Multi-rotor UAVs using Metric Temporal Logic

一种基于度量时序逻辑的混合组合方法用于多旋翼无人机最优任务规划

Usman A. Fiaz, John S. Baras

AI总结本文提出了一种混合组合方法，用于多旋翼无人机在受约束环境中的时间敏感搜索救援任务规划，通过将度量时序逻辑形式化描述任务规范，并利用混合模型捕捉无人机的各种操作模式，将任务分解为子任务并使用混合整数线性规划求解器求解最优路径，从而降低计算复杂度并实现实时应用。

Comments 8 pages, 5 figures, 1 table. Fixed typos, added new references

详情

AI中文摘要

本文研究了一种混合组合方法用于多旋翼无人机的最优任务规划。我们考虑了一个时间敏感的搜索和救援场景，其中两个四旋翼无人机在一个受约束的环境中运行。度量时序逻辑（MTL）用于正式描述任务规范。为了捕捉无人机的各种操作模式，我们使用一个混合模型，该模型在不同操作点周围线性化动力学。我们通过利用各种任务规范的不变性质，即路径上的安全性和时间约束的相互独立性以及机器人不同模式（即动力学）来将任务分解为多个子任务。对于每个子任务，我们将MTL公式转换为线性约束，并使用混合整数线性规划（MILP）求解器求解所需的路径最优控制问题。完整的路径由各个最优子路径的组合构成。我们证明所得到的轨迹满足任务规范，并且所提出的方法显著降低了问题的计算复杂度，使其能够实现实时应用。

英文摘要

This paper investigates a hybrid compositional approach to optimal mission planning for multi-rotor Unmanned Aerial Vehicles (UAVs). We consider a time critical search and rescue scenario with two quadrotors in a constrained environment. Metric Temporal Logic (MTL) is used to formally describe the task specifications. In order to capture the various modes of UAV operation, we utilize a hybrid model for the system with linearized dynamics around different operating points. We divide the mission into several sub-tasks by exploiting the invariant nature of various task specifications i.e., the mutual independence of safety and timing constraints along the way, and the different modes (i,e., dynamics) of the robot. For each sub-task, we translate the MTL formulae into linear constraints, and solve the associated optimal control problem for desired path using a Mixed Integer Linear Program (MILP) solver. The complete path is constructed by the composition of individual optimal sub-paths. We show that the resulting trajectory satisfies the task specifications, and the proposed approach leads to significant reduction in computational complexity of the problem, making it possible to implement in real-time.

URL PDF HTML ☆

赞 0 踩 0

1610.04391 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

A guiding vector field algorithm for path following control of nonholonomic mobile robots

一种基于引导矢量场的路径跟随控制非holonomic移动机器人的算法

Yuri A. Kapitanyuk, Anton V. Proskurnikov, Ming Cao

发表机构 * Delft Center for Systems and Control at Delft University of Technology（代尔夫特理工大学系统与控制中心）

AI总结本文提出了一种基于引导矢量场（GVF）思想的非holonomic移动机器人路径跟随控制算法，该算法能够将任意光滑曲线作为期望路径，通过设计引导矢量场使其积分曲线收敛到轨迹，并通过非线性运动控制器使机器人沿该积分曲线运动，最终到达期望路径，同时通过实验验证了算法的全局收敛性和有效性。

Comments under review in IEEE Transactions on Control Systems Technology

1905.05574 2026-06-04 cs.IT cs.DC cs.PF cs.RO cs.SY eess.SY math.IT 版本更新

Coded Distributed Tracking

编码分布式追踪

Albin Severinson, Eirik Rosnes, Alexandre Graell i Amat

发表机构 * Department of Electrical Engineering, Chalmers University of Technology（电气工程系，查尔姆斯理工大学）

AI总结本文提出了一种云辅助的分布式追踪方案，利用编码分布式计算方法提高追踪的及时性和准确性，并通过MDS码进一步提升大规模更新间隔下的估计精度，同时揭示了信息年龄与估计精度之间的权衡。

Comments Accepted for publication at IEEE GLOBECOM 2019

详情

AI中文摘要

我们考虑在分布式环境中跟踪随时间演化的过程状态的问题，其中多个观察者各自观测状态的部分。我们提出了一种云辅助方案，其中追踪在云上进行。为了提供及时且准确的更新，并缓解云计算中的straggler问题，我们提出了一种编码分布式计算方法，其中编码的观测被分布到多个工作者上。所提出的方案基于一种编码版本的卡尔曼滤波器，该滤波器在使用擦除纠正码编码的数据上运行，使得状态可以从部分由子集工作者计算的更新中进行估计。我们将所提出的方法应用于跟踪多个车辆的问题。我们显示，复制实现了比相应未编码方案显著更高的精度。使用最大距离可分离（MDS）码进一步在较大的更新间隔下提高精度。在两种情况下，所提出的方案在更新间隔足够大时接近理想集中式方案的精度。最后，我们观察到MDS码在信息年龄和估计精度之间存在权衡。

英文摘要

We consider the problem of tracking the state of a process that evolves over time in a distributed setting, with multiple observers each observing parts of the state, which is a fundamental information processing problem with a wide range of applications. We propose a cloud-assisted scheme where the tracking is performed over the cloud. In particular, to provide timely and accurate updates, and alleviate the straggler problem of cloud computing, we propose a coded distributed computing approach where coded observations are distributed over multiple workers. The proposed scheme is based on a coded version of the Kalman filter that operates on data encoded with an erasure correcting code, such that the state can be estimated from partial updates computed by a subset of the workers. We apply the proposed scheme to the problem of tracking multiple vehicles. We show that replication achieves significantly higher accuracy than the corresponding uncoded scheme. The use of maximum distance separable (MDS) codes further improves accuracy for larger update intervals. In both cases, the proposed scheme approaches the accuracy of an ideal centralized scheme when the update interval is large enough. Finally, we observe a trade-off between age-of-information and estimation accuracy for MDS codes.

URL PDF HTML ☆

赞 0 踩 0

1903.12605 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Stable, Concurrent Controller Composition for Multi-Objective Robotic Tasks

多目标机器人任务的稳定并发控制器组合

Anqi Li, Ching-An Cheng, Byron Boots, Magnus Egerstedt

发表机构 * Institute for Robotics and Intelligent Machines, Georgia Institute of Technology（机器人与智能机器研究所，佐治亚理工学院）

AI总结本文提出了一种稳定且并发的控制器组合方法，用于多目标机器人任务，通过分解任务为子任务并独立设计子任务控制器，再利用RMPflow框架结合生成整体控制策略，同时通过CLF分析确保系统稳定性。

Comments The 58th IEEE Conference on Decision and Control (CDC), 2019

详情

AI中文摘要

机器人系统往往需要同时考虑多个任务。这一挑战要求控制器合成算法能够在满足多个控制规范的同时保持系统稳定性。本文将多目标任务分解为子任务，其中每个子任务控制器独立设计，然后结合生成整体控制策略。特别地，我们采用最近提出的机器人控制器结构Riemannian Motion Policies（RMPs），以及其相关的计算框架RMPflow，用于结合RMP控制器。我们通过严格的控制Lyapunov函数（CLF）处理重新建立并扩展了RMPflow的稳定性结果。然后我们证明RMPflow能够稳定地结合满足一定CLF约束的独立设计的子任务控制器。这一新见解导致了一种高效的基于CLF的计算框架，用于生成同时考虑所有子任务的稳定控制器。与原始使用RMPflow相比，我们的框架为用户提供了一种通过名义控制器将设计启发式融入子任务的灵活性。我们通过数值模拟和机器人实现验证了所提出的计算框架。

英文摘要

Robotic systems often need to consider multiple tasks concurrently. This challenge calls for controller synthesis algorithms that fulfill multiple control specifications while maintaining the stability of the overall system. In this paper, we decompose multi-objective tasks into subtasks, where individual subtask controllers are designed independently and then combined to generate the overall control policy. In particular, we adopt Riemannian Motion Policies (RMPs), a recently proposed controller structure in robotics, and, RMPflow, its associated computational framework for combining RMP controllers. We re-establish and extend the stability results of RMPflow through a rigorous Control Lyapunov Function (CLF) treatment. We then show that RMPflow can stably combine individually designed subtask controllers that satisfy certain CLF constraints. This new insight leads to an efficient CLF-based computational framework to generate stable controllers that consider all the subtasks simultaneously. Compared with the original usage of RMPflow, our framework provides users the flexibility to incorporate design heuristics through nominal controllers for the subtasks. We validate the proposed computational framework through numerical simulation and robotic implementation.

URL PDF HTML ☆

赞 0 踩 0

1905.01261 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Lyapunov-Based Approach to Exploit Asymmetries in Robotic Dual-Arm Task Resolution

基于李雅普诺夫的方法利用双臂机器人任务解决中的不对称性

Diogo Almeida, Yiannis Karayiannidis

发表机构 * Dept. of Electrical Eng., Chalmers University of Technology（电气工程系，查尔姆斯理工大学）

AI总结本文提出了一种基于李雅普诺夫的方法，用于设计绝对运动任务的控制律并更新双臂之间的相对任务分布，通过数值实验展示了该方法相较于对称分布的优势。

Comments Accepted for publication at CDC 2019

1905.01248 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Asymmetric Dual-Arm Task Execution using an Extended Relative Jacobian

使用扩展相对雅可比的非对称双臂任务执行

Diogo Almeida, Yiannis Karayiannidis

发表机构 * Division of Robotics, Perception and Learning, KTH Royal Institute of Technology（机器人、感知与学习 division，皇家理工学院）； Dept. of Electrical Eng., Chalmers University of Technology（电气工程系，查尔默斯技术大学）

AI总结本文提出了一种基于扩展相对雅可比的非对称双臂任务执行方法，通过定义非对称相对运动空间，允许用户在不指定绝对运动目标的情况下设置任务执行的不对称程度，同时保留绝对运动作为功能性冗余。

Comments Accepted for presentation at ISRR19. 16 Pages

详情

AI中文摘要

协调的双臂操作任务可以大致分为具有绝对和相对运动成分。特别是相对运动任务，其在如何分配给末端执行器方面本质上是冗余的。在本工作中，我们从非对称解决相对运动任务的角度分析协作操作。我们讨论了现有方法如何使相对运动任务的非对称执行成为可能，并展示了如何定义非对称相对运动空间。我们利用这一结果提出了一种扩展的相对雅可比来建模协作系统，这使用户能够在不指定绝对运动目标的情况下设置任务执行的具体不对称程度。这无需规定绝对运动目标，而是将绝对运动保留为系统的功能性冗余。我们通过一种新的微分逆运动学算法的数值模拟来展示所提出雅可比的性质。

英文摘要

Coordinated dual-arm manipulation tasks can be broadly characterized as possessing absolute and relative motion components. Relative motion tasks, in particular, are inherently redundant in the way they can be distributed between end-effectors. In this work, we analyse cooperative manipulation in terms of the asymmetric resolution of relative motion tasks. We discuss how existing approaches enable the asymmetric execution of a relative motion task, and show how an asymmetric relative motion space can be defined. We leverage this result to propose an extended relative Jacobian to model the cooperative system, which allows a user to set a concrete degree of asymmetry in the task execution. This is achieved without the need for prescribing an absolute motion target. Instead, the absolute motion remains available as a functional redundancy to the system. We illustrate the properties of our proposed Jacobian through numerical simulations of a novel differential Inverse Kinematics algorithm.

URL PDF HTML ☆

赞 0 踩 0

1807.08229 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Optimal Continuous State POMDP Planning with Semantic Observations: A Variational Approach

基于语义观测的最优连续状态POMDP规划：一种变分方法

Luke Burks, Ian Loefgren, Nisar Ahmed

AI总结本文提出了一种基于变分方法的最优规划策略，针对语义观测下的连续状态部分可观测马尔可夫决策过程（CPOMDP）进行改进，通过变分贝叶斯方法解决混合连续-离散概率模型的表示和推理问题，提升了动态决策任务的效率和鲁棒性。

Comments Final version accepted to IEEE Transactions on Robotics (in press as of August 2019)

详情

AI中文摘要

本文开发了用于利用语义观测进行最优规划的新策略，使用连续状态部分可观测马尔可夫决策过程（CPOMDP）。在高斯混合（GM）CPOMDP策略近似方法方面，提出了两项主要创新。尽管现有方法具有许多有益的理论性质，但它们无法高效地表示和推理混合连续-离散概率模型。第一项主要创新是通过softmax模型推导出连续-离散语义观测概率的变分贝叶斯GM近似，用于点基值迭代贝尔曼策略备份。这种方法的关键优势是可以在复杂的非高斯不确定性下进行动态决策，同时利用连续动态状态空间模型（从而避免繁琐且昂贵的离散化）。第二项主要创新是一种基于聚类的混合物凝聚技术，能够很好地扩展到非常大的GM策略函数和信念函数。针对目标搜索和拦截任务的仿真结果表明，这些创新所产生的GM策略比其他最先进的策略近似方法产生的策略更有效，但需要显著较少的建模开销和在线运行时间成本。此外，结果还显示该方法对模型误差具有鲁棒性，并能扩展到更高维度。

英文摘要

This work develops novel strategies for optimal planning with semantic observations using continuous state partially observable markov decision processes (CPOMDPs). Two major innovations are presented in relation to Gaussian mixture (GM) CPOMDP policy approximation methods. While existing methods have many desirable theoretical properties, they are unable to efficiently represent and reason over hybrid continuous-discrete probabilistic models. The first major innovation is the derivation of closed-form variational Bayes GM approximations of Point-Based Value Iteration Bellman policy backups, using softmax models of continuous-discrete semantic observation probabilities. A key benefit of this approach is that dynamic decision-making tasks can be performed with complex non-Gaussian uncertainties, while also exploiting continuous dynamic state space models (thus avoiding cumbersome and costly discretization). The second major innovation is a new clustering-based technique for mixture condensation that scales well to very large GM policy functions and belief functions. Simulation results for a target search and interception task with semantic observations show that the GM policies resulting from these innovations are more effective than those produced by other state of the art policy approximations, but require significantly less modeling overhead and online runtime cost. Additional results show the robustness of this approach to model errors and scaling to higher dimensions.

URL PDF HTML ☆

赞 0 踩 0

1905.04835 2026-06-04 cs.LG cs.CV cs.MA cs.RO cs.SY eess.SY stat.ML 版本更新

Multi-Agent Image Classification via Reinforcement Learning

通过强化学习进行多智能体图像分类

Hossein K. Mousavi, Mohammadreza Nazari, Martin Takáč, Nader Motee

AI总结本文研究了利用多个能够收集未知环境部分姿态依赖观测的移动智能体进行图像分类的问题，提出了一种网络架构，用于指导智能体形成局部信念、采取局部行动并从原始部分观测中提取相关特征，通过与邻居智能体交换信息更新自身信念，并利用强化学习技术实现分类问题的去中心化实现。

Comments Preprint of the paper to be published in IROS'19 proceedings

1801.09627 2026-06-04 cs.LG cs.RO cs.SY eess.SY 版本更新

Barrier-Certified Adaptive Reinforcement Learning with Applications to Brushbot Navigation

具有应用的障碍证书自适应强化学习：Brushbot导航

Motoya Ohnishi, Li Wang, Gennaro Notomista, Magnus Egerstedt

发表机构 * School of Electrical Engineering, Royal Institute of Technology（皇家理工学院电气工程学院）； Georgia Institute of Technology（佐治亚理工学院）； RIKEN Center for Advanced Intelligence Project（日本理化学研究所高级智能研究中心）； School of Mechanical Engineering（机械工程学院）

AI总结本文提出了一种安全学习框架，结合自适应模型学习算法和障碍证书，用于具有可能非平稳智能体动态的系统。通过稀疏优化技术提取模型的动态结构，并利用学习的模型结合控制障碍证书来约束策略（反馈控制器），以保持安全性，即避免特定的不利状态空间区域。在某些条件下，保证了在安全被非平稳性破坏后，以李雅普诺夫稳定性的方式恢复安全。此外，将动作-价值函数近似重新公式化，使任何基于内核的非线性函数估计方法都能应用于我们的自适应学习框架。最后，保证了障碍证书策略优化的解是全局最优的，确保在温和条件下进行贪心策略改进。所得到的框架通过四旋翼无人机的模拟进行验证，该无人机此前在安全学习文献中被假设为平稳性，然后在动态未知、高度复杂且非平稳的Brushbot机器人上进行测试。

Comments ©2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

详情

DOI: 10.1109/TRO.2019.2920206
Journal ref: Published in IEEE Transactions on Robotics, 2019

AI中文摘要

本文提出了一种安全学习框架，该框架结合了自适应模型学习算法和障碍证书，用于具有可能非平稳智能体动态的系统。为了提取模型的动态结构，我们使用了稀疏优化技术。我们利用学习的模型结合控制障碍证书，以约束策略（反馈控制器）从而保持安全性，即避免特定的状态空间区域中的不利区域。在某些条件下，恢复安全性的保证是在安全被非平稳性破坏后以李雅普诺夫稳定性的方式恢复。此外，我们重新公式化了动作-价值函数近似，使任何基于内核的非线性函数估计方法都能应用于我们的自适应学习框架。最后，保证了障碍证书策略优化的解是全局最优的，确保在温和条件下进行贪心策略改进。所得到的框架通过四旋翼无人机的模拟进行验证，该无人机此前在安全学习文献中被假设为平稳性，然后在动态未知、高度复杂且非平稳的Brushbot机器人上进行测试。

英文摘要

This paper presents a safe learning framework that employs an adaptive model learning algorithm together with barrier certificates for systems with possibly nonstationary agent dynamics. To extract the dynamic structure of the model, we use a sparse optimization technique. We use the learned model in combination with control barrier certificates which constrain policies (feedback controllers) in order to maintain safety, which refers to avoiding particular undesirable regions of the state space. Under certain conditions, recovery of safety in the sense of Lyapunov stability after violations of safety due to the nonstationarity is guaranteed. In addition, we reformulate an action-value function approximation to make any kernel-based nonlinear function estimation method applicable to our adaptive learning framework. Lastly, solutions to the barrier-certified policy optimization are guaranteed to be globally optimal, ensuring the greedy policy improvement under mild conditions. The resulting framework is validated via simulations of a quadrotor, which has previously been used under stationarity assumptions in the safe learnings literature, and is then tested on a real robot, the brushbot, whose dynamics is unknown, highly complex and nonstationary.

URL PDF HTML ☆

赞 0 踩 0

1903.11683 2026-06-04 stat.ML cs.CV cs.LG cs.RO cs.SY eess.SY stat.AP 版本更新

Outlier-Robust Spatial Perception: Hardness, General-Purpose Algorithms, and Guarantees

抗异常的空域感知：难度、通用算法和保证

Vasileios Tzoumas, Pasquale Antonante, Luca Carlone

AI总结本文研究了空域感知中异常数据的影响，提出了一种通用算法来有效去除异常，并提供了对算法性能的理论保证。

详情

AI中文摘要

空域感知是许多机器人应用的核心，涵盖了定位与建图、点云对齐和从相机图像中估计相对姿态等广泛的研究问题。异常数据的存在会威胁到空域感知的鲁棒性，而一般情况下，异常值是主要问题。尽管已有处理异常值的技术，但它们可能以不可预测的方式失败（例如RANSAC、鲁棒估计器），或具有指数级的运行时间（例如分支界限法）。在本文中，我们通过三个贡献推动了异常拒绝的前沿。首先，我们证明了即使是最简单的线性异常拒绝实例也是近似不可行的：在最坏情况下，无法设计出一个准多项式时间算法来高效计算近似解。我们的第二个贡献是提供第一个实例级的次优界限，以评估给定异常拒绝结果的近似质量。我们的第三个贡献是提出了一种简单的通用算法，称为自适应修剪，用于去除异常值。我们的算法利用了最近提出的一类全局求解器，能够解决无异常的问题，并通过迭代去除误差较大的测量值。我们在三个空域感知问题上展示了所提出的算法：三维配准、双视几何和SLAM。结果表明，我们的算法在各种应用中优于几种最先进的方法，同时是一种通用的方法。

英文摘要

Spatial perception is the backbone of many robotics applications, and spans a broad range of research problems, including localization and mapping, point cloud alignment, and relative pose estimation from camera images. Robust spatial perception is jeopardized by the presence of incorrect data association, and in general, outliers. Although techniques to handle outliers do exist, they can fail in unpredictable manners (e.g., RANSAC, robust estimators), or can have exponential runtime (e.g., branch-and-bound). In this paper, we advance the state of the art in outlier rejection by making three contributions. First, we show that even a simple linear instance of outlier rejection is inapproximable: in the worst-case one cannot design a quasi-polynomial time algorithm that computes an approximate solution efficiently. Our second contribution is to provide the first per-instance sub-optimality bounds to assess the approximation quality of a given outlier rejection outcome. Our third contribution is to propose a simple general-purpose algorithm, named adaptive trimming, to remove outliers. Our algorithm leverages recently-proposed global solvers that are able to solve outlier-free problems, and iteratively removes measurements with large errors. We demonstrate the proposed algorithm on three spatial perception problems: 3D registration, two-view geometry, and SLAM. The results show that our algorithm outperforms several state-of-the-art methods across applications while being a general-purpose method.

URL PDF HTML ☆

赞 0 踩 0

1902.07747 2026-06-04 eess.SY cs.AI cs.DC cs.RO cs.SY 版本更新

Lookup Table-Based Consensus Algorithm for Real-Time Longitudinal Motion Control of Connected and Automated Vehicles

基于查找表的共识算法用于连接和自动化车辆的实时纵向运动控制

Ziran Wang, Kyuntae Han, BaekGyu Kim, Guoyuan Wu, Matthew J. Barth

AI总结本文提出了一种基于查找表的共识算法，用于实时控制连接和自动化车辆的纵向运动，通过动态生成查找表来实时寻找最佳控制增益，优于之前的工作和线性反馈算法。

Comments 2019 American Control Conference (ACC)Philadelphia, PA, USA, July 10-12, 2019978-1-5386-7928-9

详情

AI中文摘要

连接和自动化车辆（CAV）技术是解决当前交通系统安全、机动性和可持续性问题的有前途的解决方案。具体而言，控制算法在CAV系统中起重要作用，因为它执行由前一步生成的命令，如通信、感知和规划。在本研究中，我们提出了一种共识算法，用于实时控制CAV的纵向运动。与该领域之前的研究不同，这些研究中的共识算法的控制增益是预先确定并固定的，我们开发了算法来构建查找表，实时寻找不同CAV初始条件下的理想控制增益。数值模拟显示，所提出的基于查找表的共识算法在四种不同场景中，针对各种CAV初始条件，在收敛时间和最大 jerk 方面均优于作者之前的工作以及van Arem的基于线性反馈的纵向运动控制算法。

英文摘要

Connected and automated vehicle (CAV) technology is one of the promising solutions to addressing the safety, mobility and sustainability issues of our current transportation systems. Specifically, the control algorithm plays an important role in a CAV system, since it executes the commands generated by former steps, such as communication, perception, and planning. In this study, we propose a consensus algorithm to control the longitudinal motion of CAVs in real time. Different from previous studies in this field where control gains of the consensus algorithm are pre-determined and fixed, we develop algorithms to build up a lookup table, searching for the ideal control gains with respect to different initial conditions of CAVs in real time. Numerical simulation shows that, the proposed lookup table-based consensus algorithm outperforms the authors' previous work, as well as van Arem's linear feedback-based longitudinal motion control algorithm in all four different scenarios with various initial conditions of CAVs, in terms of convergence time and maximum jerk of the simulation run.

URL PDF HTML ☆

赞 0 踩 0

1808.00649 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Robust Tracking with Model Mismatch for Fast and Safe Planning: an SOS Optimization Approach

具有模型不匹配的鲁棒跟踪：一种SOS优化方法

Sumeet Singh, Mo Chen, Sylvia L. Herbert, Claire J. Tomlin, Marco Pavone

发表机构 * Dept. of Aeronautics and Astronautics, Stanford University（航空航天系，斯坦福大学）； Dept. of Electrical Engineering and Computer Science, University of California, Berkeley（电气工程与计算机科学系，加州大学伯克利分校）

AI总结本文提出了一种基于SOS优化的方法，用于在快速且安全的规划中处理模型不匹配问题，通过设计反馈跟踪控制器和跟踪界限，以在不精确模型下保证系统安全性。

Comments Presented at WAFR 2018; final version v2 -- fixed typos

详情

AI中文摘要

在实时运动规划中，通常的做法是通过在简化低维动态模型上运行规划算法计算轨迹，然后使用反馈跟踪控制器来跟踪该轨迹，该控制器考虑了完整的高维系统动态。虽然这种规划与模型不匹配的策略通常能带来快速的计算时间，但缺乏动态可行性保证，这阻碍了其在安全关键系统中的应用。基于最近通过汉密尔顿-雅可比（HJ）可达性视角解决此问题的工作，我们提出了一种算法框架，用于离线计算一对“规划器”（即低维）和“跟踪器”（即高维）模型的反馈跟踪控制器及其相关跟踪界限。该界限随后用作在通过低维模型生成运动计划时的安全边际。具体而言，我们利用求和平方（SOS）编程的计算工具，设计了一个双线性优化算法，用于计算反馈跟踪控制器及其相关跟踪界限。该算法通过数值实验进行演示，重点研究SOS带来的计算可扩展性增加与内在保守性之间的权衡。总体而言，我们的结果使规划与模型不匹配的有吸引力策略能够扩展到HJ分析无法触及的系统，同时保持安全性保证。

英文摘要

In the pursuit of real-time motion planning, a commonly adopted practice is to compute a trajectory by running a planning algorithm on a simplified, low-dimensional dynamical model, and then employ a feedback tracking controller that tracks such a trajectory by accounting for the full, high-dimensional system dynamics. While this strategy of planning with model mismatch generally yields fast computation times, there are no guarantees of dynamic feasibility, which hampers application to safety-critical systems. Building upon recent work that addressed this problem through the lens of Hamilton-Jacobi (HJ) reachability, we devise an algorithmic framework whereby one computes, offline, for a pair of "planner" (i.e., low-dimensional) and "tracking" (i.e., high-dimensional) models, a feedback tracking controller and associated tracking bound. This bound is then used as a safety margin when generating motion plans via the low-dimensional model. Specifically, we harness the computational tool of sum-of-squares (SOS) programming to design a bilinear optimization algorithm for the computation of the feedback tracking controller and associated tracking bound. The algorithm is demonstrated via numerical experiments, with an emphasis on investigating the trade-off between the increased computational scalability afforded by SOS and its intrinsic conservativeness. Collectively, our results enable scaling the appealing strategy of planning with model mismatch to systems that are beyond the reach of HJ analysis, while maintaining safety guarantees.

URL PDF HTML ☆

赞 0 踩 0

1809.00975 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Minimum Violation Control Synthesis on Cyber-Physical Systems under Attacks

在攻击下的网络物理系统最小违规控制综合

Luyao Niu, Jie Fu, Andrew Clark

AI总结本文研究了在存在攻击的情况下，如何在线性时序逻辑约束下综合出最小违规控制器，通过构建Stackelberg博弈模型并提出非线性规划问题来解决最优控制策略，最后通过数值案例验证了方法的有效性。

详情

DOI: 10.1109/CDC.2018.8619174

AI中文摘要

网络物理系统正日益执行复杂任务，通常使用时序逻辑等形式化语言进行建模。恶意攻击者通过智能攻击可能会削弱系统执行所需任务的能力。然而，目前在存在此类攻击的情况下，综合研究受到关注有限。特别是，当由于对抗性攻击无法完全满足所需规范时，控制器综合问题尚未被研究。本文聚焦于在存在对抗者的情况下，对随机有限状态离散时间系统进行线性时序逻辑约束下的最小违规控制综合问题。最小违规控制策略是在用户定义的重要任务上满足，而违反次要任务。我们通过构建控制器与攻击者之间的并发Stackelberg博弈模型，并提出非线性规划问题来制定和解决最优控制策略。为了减少计算努力，我们开发了一种启发式算法，高效地解决该问题，并通过数值案例研究展示了所提出方法。

英文摘要

Cyber-physical systems are conducting increasingly complex tasks, which are often modeled using formal languages such as temporal logic. The system's ability to perform the required tasks can be curtailed by malicious adversaries that mount intelligent attacks. At present, however, synthesis in the presence of such attacks has received limited research attention. In particular, the problem of synthesizing a controller when the required specifications cannot be satisfied completely due to adversarial attacks has not been studied. In this paper, we focus on the minimum violation control synthesis problem under linear temporal logic constraints of a stochastic finite state discrete-time system with the presence of an adversary. A minimum violation control strategy is one that satisfies the most important tasks defined by the user while violating the less important ones. We model the interaction between the controller and adversary using a concurrent Stackelberg game and present a nonlinear programming problem to formulate and solve for the optimal control policy. To reduce the computation effort, we develop a heuristic algorithm that solves the problem efficiently and demonstrate our proposed approach using a numerical case study.

URL PDF HTML ☆

赞 0 踩 0

1809.07012 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Enhancing the settling time estimation of a class of fixed-time stable systems

增强一类固定时间稳定系统的 settling 时间估计

R. Aldana-López, D. Gómez-Gutiérrez, E. Jiménez-Rodríguez, J. D. Sánchez-Torres, M. Defoort

发表机构 * Multi-agent autonomous systems lab, Intel Labs, Intel Tecnología de M\'exico, Av. del Bosque 1001, Colonia El Bajío, Zapopan, 45019, Jalisco, M\'exico. ； Research Laboratory on Optimal Design, Devices ； Advanced Materials -OPTIMA-, Department of Mathematics ； LAMIH, CNRS UMR 8201, Univ. Valenciennes, Valenciennes 59313, France.

AI总结本文研究了一类固定时间稳定系统的收敛时间分析，提出了一种新的非保守上界用于估计其 settling 时间，通过改进方法提供了更精确的上界，并展示了预定义时间控制器在第一和第二阶系统中的应用。

详情

DOI: 10.1002/rnc.4600
Journal ref: International Journal of Robust and Nonlinear Control, Volume29, Issue12, Pages 4135-4148,2019

AI中文摘要

本文研究了一类固定时间稳定系统的收敛时间分析，旨在提供一种新的非保守上界用于其 settling 时间的估计。我们的贡献包括四个方面：首先，重新审视已知的固定时间稳定系统类，展示经典上界估计的保守性；其次，提供一个统一上界，该上界适用于系统中任意轨迹的 settling 时间；第三，通过略微修改之前的固定时间系统类，提出了一种新的预定义时间收敛算法，其中 settling 时间的上界作为系统参数预先设定；最后，介绍了用于第一阶和第二阶系统的预定义时间控制器。一些仿真结果展示了所提方案在 settling 时间估计方面的性能，与现有方法相比具有优势。

英文摘要

This paper deals with the convergence time analysis of a class of fixed-time stable systems with the aim to provide a new non-conservative upper bound for its settling time. Our contribution is fourfold. First, we revisit the well-known class of fixed-time stable systems, given in (Polyakov et al.,2012}, while showing the conservatism of the classical upper estimate of the settling time. Second, we provide the smallest constant that uniformly upper bounds the settling time of any trajectory of the system under consideration. Third, introducing a slight modification of the previous class of fixed-time systems, we propose a new predefined-time convergent algorithm where the least upper bound of the settling time is set a priori as a parameter of the system. At last, predefined-time controllers for first order and second order systems are introduced. Some simulation results highlight the performance of the proposed scheme in terms of settling time estimation compared to existing methods.

URL PDF HTML ☆

赞 0 踩 0

1903.02531 2026-06-04 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY 版本更新

Combining Optimal Control and Learning for Visual Navigation in Novel Environments

将最优控制与学习相结合用于新环境中的视觉导航

Somil Bansal, Varun Tolani, Saurabh Gupta, Jitendra Malik, Claire Tomlin

发表机构 * University of California, Berkeley（加州大学伯克利分校）； Facebook AI Research（脸书人工智能研究）

AI总结本文提出了一种结合模型控制与学习感知的方法，用于在新环境中实现可靠的视觉导航，通过生成无碰撞路径的 waypoints，使机器人能够高效地到达目标位置，同时在低帧率和仿真到现实的迁移中表现良好。

Comments Project website: https://vtolani95.github.io/WayPtNav/

详情

AI中文摘要

基于模型的控制是机器人导航的流行范式，因为它可以利用已知的动力学模型来高效地规划鲁棒的机器人轨迹。然而，在环境事先未知且只能通过机器人上的传感器部分观测的情况下，使用基于模型的方法具有挑战性。在本工作中，我们通过将基于模型的控制与基于学习的感知相结合来解决这一不足。基于学习的感知模块生成一系列 waypoints，通过无碰撞路径引导机器人到达目标。这些 waypoints 被用于基于模型的规划器生成平滑且动态可行的轨迹，该轨迹通过反馈控制在物理系统上执行。我们在模拟的真实世界复杂环境中以及在实际地面车辆上的实验表明，与纯几何映射或端到端学习方法相比，所提出的方法在新环境中能够更可靠、更高效地到达目标位置。我们的方法不依赖于详细的显式 3D 环境地图，能够与低帧率工作，并且在仿真到现实的迁移中表现良好。描述我们方法和实验的视频可在项目网站上获得。

英文摘要

Model-based control is a popular paradigm for robot navigation because it can leverage a known dynamics model to efficiently plan robust robot trajectories. However, it is challenging to use model-based methods in settings where the environment is a priori unknown and can only be observed partially through on-board sensors on the robot. In this work, we address this short-coming by coupling model-based control with learning-based perception. The learning-based perception module produces a series of waypoints that guide the robot to the goal via a collision-free path. These waypoints are used by a model-based planner to generate a smooth and dynamically feasible trajectory that is executed on the physical system using feedback control. Our experiments in simulated real-world cluttered environments and on an actual ground vehicle demonstrate that the proposed approach can reach goal locations more reliably and efficiently in novel environments as compared to purely geometric mapping-based or end-to-end learning-based alternatives. Our approach does not rely on detailed explicit 3D maps of the environment, works well with low frame rates, and generalizes well from simulation to the real world. Videos describing our approach and experiments are available on the project website.

URL PDF HTML ☆

赞 0 踩 0

1805.00222 2026-06-04 eess.SY cs.RO cs.SY eess.SP 版本更新

Model-Free Active Input-Output Feedback Linearization of a Single-Link Flexible Joint Manipulator: An Improved ADRC Approach

无模型主动输入输出反馈线性化单连杆柔性关节机械臂：一种改进的ADRC方法

Wameedh Riyadh Abdul Adheem, Ibraheem Kasim Ibraheem

发表机构 * Electrical Engineering Department（电气工程系）； College of Engineering, Baghdad University（巴格达大学工程学院）

AI总结本文提出了一种基于改进主动扰动抑制控制（IADRC）范式的无模型主动输入输出反馈线性化（AIOFL）技术，用于设计具有已知相对次数的非线性系统的反馈线性化控制律，通过改进非线性扩展状态观测器（INLESO）估计广义扰动并结合改进非线性状态误差反馈（INLSEF）生成名义控制律，实现实时消除所有不需要的动力学、外源扰动和系统不确定性，将系统转化为积分链，仅需非线性系统的相对次数信息，并通过李雅普诺夫函数分析证明了系统的渐近稳定性。

详情

AI中文摘要

传统输入输出反馈线性化（IOFL）要求系统动力学的完整知识，并假设输入通道无扰动且系统无不确定性。本文提出了一种基于改进主动扰动抑制控制（IADRC）范式的无模型主动输入输出反馈线性化（AIOFL）技术，用于设计具有已知相对次数的非线性系统的反馈线性化控制律。线性化控制律（LCL）由改进非线性扩展状态观测器（INLESO）估计的具有饱和行为的广义扰动缩放估计和由改进非线性状态误差反馈（INLSEF）生成的名义控制律组成。所提出的AIOFL能够实时消除代表所有不需要的动力学、外源扰动和系统不确定性的广义扰动，并将系统转化为积分链，直到系统的相对次数。通过李雅普诺夫函数进行了稳定性分析，证明了INLESO的收敛性和闭环系统的渐近稳定性。通过将所提出的AIOFL技术应用于柔性关节单连杆机械臂（SLFJM）验证了结果。仿真结果验证了基于IADRC的所提出AIOFL工具的有效性，与传统ADRC基于AIOFL和传统IOFL技术相比。

英文摘要

Traditional Input-Output Feedback Linearization (IOFL) requires full knowledge of system dynamics and assumes no disturbance at the input channel and no system's uncertainties. In this paper, a model-free Active Input-Output Feedback Linearization (AIOFL) technique based on an Improved Active Disturbance Rejection Control (IADRC) paradigm is proposed to design feedback linearization control law for a generalized nonlinear system with known relative degree. The Linearization Control Law(LCL) is composed of a scaled generalized disturbance estimated by an Improved Nonlinear Extended State Observer (INLESO) with saturation-like behavior and the nominal control law produced by an Improved Nonlinear State Error Feedback (INLSEF). The proposed AIOFL cancels in real-time fashion the generalized disturbances which represent all the unwanted dynamics, exogenous disturbances, and system uncertainties and transforms the system into a chain of integrators up to the relative degree of the system, the only information required about the nonlinear system. Stability analysis has been conducted based on Lyapunov functions and revealed the convergence of the INLESO and the asymptotic stability of the closed-loop system. Verification of the outcomes has been achieved by applying the proposed AIOFL technique on the Flexible Joint Single Link Manipulator (SLFJM). The simulations results validated the effectiveness of the proposed AIOFL tool based on IADRC as compared to the conventional ADRC based AIOFL and the traditional IOFL techniques.

URL PDF HTML ☆

赞 0 踩 0

1802.08953 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robust Target-relative Localization with Ultra-Wideband Ranging and Communication

利用超宽带测距与通信的稳健目标相对定位

Thien-Minh Nguyen, Abdul Hanif Zaini, Chen Wang, Kexin Guo, Lihua Xie

AI总结本文提出了一种方法，利用四旋翼无人机上的超宽带测距传感器实现目标的相对定位和跟踪，通过策略性安装传感器融合相对位置和方位信息，采用扩展卡尔曼滤波器估计器融合UWB测距数据和 onboard 传感器数据，并利用UWB通信能力传输目标的方位以处理目标方位与测距数据的耦合，实验结果表明无人机能够自主控制其相对于静态和移动目标的位置。

Comments 2018 International Conference on Robotics and Automation (ICRA 2018)

详情

DOI: 10.1109/ICRA.2018.8460844
Journal ref: 2018 IEEE International Conference on Robotics and Automation (ICRA)

AI中文摘要

在本文中，我们提出了一种方法，利用超宽带（UWB）测距传感器实现四旋翼无人机对目标的相对定位和跟踪。这些传感器被策略性安装，以帮助获取四旋翼无人机与目标之间的相对位置和方位。为了实现即使在目标速度不确定的情况下也能进行稳健定位的自主飞行，开发了两个主要特征。首先，开发了一个基于扩展卡尔曼滤波器（EKF）的估计器，用于融合UWB测距测量数据与机载传感器数据，包括惯性测量单元（IMU）、气压计和光学流数据。其次，为了正确处理目标的方位与测距测量的耦合，利用UWB通信能力将目标的方位传输到四旋翼无人机。实验结果表明，四旋翼无人机在目标静止和移动的情况下都能自主控制其相对于目标的位置。

英文摘要

In this paper we propose a method to achieve relative positioning and tracking of a target by a quadcopter using Ultra-wideband (UWB) ranging sensors, which are strategically installed to help retrieve both relative position and bearing between the quadcopter and target. To achieve robust localization for autonomous flight even with uncertainty in the speed of the target, two main features are developed. First, an estimator based on Extended Kalman Filter (EKF) is developed to fuse UWB ranging measurements with data from onboard sensors including inertial measurement unit (IMU), altimeters and optical flow. Second, to properly handle the coupling of the target's orientation with the range measurements, UWB based communication capability is utilized to transfer the target's orientation to the quadcopter. Experiment results demonstrate the ability of the quadcopter to control its position relative to the target autonomously in both cases when the target is static and moving.

URL PDF HTML ☆

赞 0 踩 0

1904.04968 2026-06-04 cs.RO cs.SY eess.SY math.OC 版本更新

Asymptotic Optimality of a Time Optimal Path Parametrization Algorithm

时间最优路径参数化算法的渐近最优性

Igor Spasojevic, Varun Murali, Sertac Karaman

AI总结本文研究了时间最优路径参数化问题，证明了线性时间算法在所有由凸优化方法最优解决的问题中都是渐近最优的，并且刻画了该问题的最优解。

1903.05073 2026-06-04 cs.RO cs.LO cs.SY eess.SY 版本更新

A Formal Safety Net for Waypoint Following in Ground Robots

为地面机器人路径跟随提供形式安全网

Brandon Bohrer, Yong Kiam Tan, Stefan Mitsch, Andrew Sogokon, André Platzer

AI总结本文提出一个可重用的形式验证安全网，为具有容差和加速度的Dubins型地面机器人提供端到端的安全性和活跃性保证，通过形式化方法验证安全性和活跃性属性，并合成监控器以确保运行时的模型合规性。

详情

DOI: 10.1109/LRA.2019.2923099

AI中文摘要

我们提出一个可重用的形式验证安全网，为具有容差和加速度的Dubins型地面机器人提供端到端的安全性和活跃性保证。我们：i) 用微分动态逻辑（dL）建模机器人，并对控制器和机器人运动学制定假设；ii) 证明具有速度限制的路径跟随形式安全性和活跃性属性；iii) 合成一个监控器，该监控器被自动证明在运行时强制执行模型合规性；iv) 我们使用VeriPhy工具链使这些保证延伸到不受信任的控制器、环境和计划的机器代码层面。安全网的保证适用于任何机器人，只要路径被安全选择且其模型中的物理假设成立。实验表明这些假设在实践中成立，存在合规性与性能之间的内在权衡。

英文摘要

We present a reusable formally verified safety net that provides end-to-end safety and liveness guarantees for 2D waypoint-following of Dubins-type ground robots with tolerances and acceleration. We: i) Model a robot in differential dynamic logic (dL), and specify assumptions on the controller and robot kinematics, ii) Prove formal safety and liveness properties for waypoint-following with speed limits, iii) Synthesize a monitor, which is automatically proven to enforce model compliance at runtime, and iv) Our use of the VeriPhy toolchain makes these guarantees carry over down to the level of machine code with untrusted controllers, environments, and plans. The guarantees for the safety net apply to any robot as long as the waypoints are chosen safely and the physical assumptions in its model hold. Experiments show these assumptions hold in practice, with an inherent trade-off between compliance and performance.

URL PDF HTML ☆

赞 0 踩 0

1605.00604 2026-06-04 eess.SY cs.LO cs.RO cs.SY 版本更新

Formal Verification of Obstacle Avoidance and Navigation of Ground Robots

地面机器人的障碍回避与导航的正式验证

Stefan Mitsch, Khalil Ghorbal, David Vogelbacher, André Platzer

发表机构 * Computer Science Department, Carnegie Mellon University（卡内基梅隆大学计算机科学系）； INRIA（法国国家信息与自动化研究所）； Karlsruhe Institute of Technology（卡尔斯鲁厄理工学院）

AI总结本文研究了移动机器人在动态环境中安全性的核心问题，通过形式化验证方法验证了控制器在回避静态和移动障碍物时的安全性、被动安全性、主动友好安全性及被动方向安全性，并证明了在传感器不确定性和执行器扰动下仍能保证安全性的结论。

详情

DOI: 10.1177/0278364917733549
Journal ref: International Journal of Robotics Research. 36(12), pp. 1312-1340, 2017

AI中文摘要

移动机器人在动态环境中的安全性依赖于确保其不与障碍物碰撞。为了支持此类安全论证，我们分析并形式化验证了一系列日益强大的控制器安全属性：(i) 静态安全性，确保不会与静态障碍物发生碰撞；(ii) 被动安全性，确保在机器人移动时不会与静态或移动障碍物发生碰撞；(iii) 更强的被动友好安全性，其中机器人进一步保持足够的机动距离以避免障碍物碰撞；(iv) 被动方向安全性，允许机器人传感器覆盖不完美，即机器人意识到其环境并非所有内容都可见。我们补充了这些可证明正确的安全属性以活化属性：我们证明可证明安全的运动足够灵活，使机器人仍能导航航点并通过交叉口。我们使用混合系统模型和定理证明技术，描述并形式化验证机器人离散的控制决策及其连续的物理运动。此外，我们正式证明了在传感器不确定性和执行器扰动，以及引入更激进的操控选择时，安全性仍能得到保证。我们的验证结果是通用的，因为它们不限于特定控制算法的选择，而是识别出使它们同时适用于广泛控制算法类别的条件。

英文摘要

The safety of mobile robots in dynamic environments is predicated on making sure that they do not collide with obstacles. In support of such safety arguments, we analyze and formally verify a series of increasingly powerful safety properties of controllers for avoiding both stationary and moving obstacles: (i) static safety, which ensures that no collisions can happen with stationary obstacles, (ii) passive safety, which ensures that no collisions can happen with stationary or moving obstacles while the robot moves, (iii) the stronger passive friendly safety in which the robot further maintains sufficient maneuvering distance for obstacles to avoid collision as well, and (iv) passive orientation safety, which allows for imperfect sensor coverage of the robot, i. e., the robot is aware that not everything in its environment will be visible. We complement these provably correct safety properties with liveness properties: we prove that provably safe motion is flexible enough to let the robot still navigate waypoints and pass intersections. We use hybrid system models and theorem proving techniques that describe and formally verify the robot's discrete control decisions along with its continuous, physical motion. Moreover, we formally prove that safety can still be guaranteed despite sensor uncertainty and actuator perturbation, and when control choices for more aggressive maneuvers are introduced. Our verification results are generic in the sense that they are not limited to the particular choices of one specific control algorithm but identify conditions that make them simultaneously apply to a broad class of control algorithms.

URL PDF HTML ☆

赞 0 踩 0

1904.05728 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Technical Report: Safe, Aggressive Quadrotor Flight via Reachability-based Trajectory Design

技术报告：基于可达性的安全激进四旋翼飞行轨迹设计

Shreyas Kousik, Patrick Holmes, Ramanarayan Vasudevan

发表机构 * Mechanical Engineering, University of Michigan, Ann Arbor, MI（机械工程，密歇根大学，安娜堡，MI）

AI总结本文提出了一种基于可达性的四旋翼飞行轨迹设计方法，通过在线规划中创新性地使用Zonotopes来确保在存在轨迹依赖性跟踪误差的情况下飞行安全，实验显示在500个随机复杂环境中实现了最高5m/s的激进飞行且无碰撞。

Comments 12 Pages, 3 Figures, 1 Table

详情

AI中文摘要

四旋翼可以提供基础设施检查和搜索救援等服务，需要在复杂环境中自主操作。通常通过滚动时间规划实现自主性，即在执行短期计划的同时计算新的计划，因为传感器在任何时候都只能获得有限的信息。为了确保安全并防止机器人丢失，计划必须被验证为在存在不确定性（例如跟踪误差）的情况下仍然无碰撞。现有的样条基规划器通过均匀扩张障碍物来补偿不确定性，这可能会过于保守。另一方面，基于可达性的规划器可以将轨迹依赖性的不确定性作为轨迹函数来考虑。本文将基于可达性的轨迹设计（RTD）应用于规划四旋翼轨迹，以在存在轨迹依赖性跟踪误差的情况下保持安全。这通过在在线规划中以新颖的方式使用Zonotopes来实现。模拟显示在500个随机复杂环境中实现了最高5m/s的激进飞行且无碰撞。

英文摘要

Quadrotors can provide services such as infrastructure inspection and search-and-rescue, which require operating autonomously in cluttered environments. Autonomy is typically achieved with receding-horizon planning, where a short plan is executed while a new one is computed, because sensors receive limited information at any time. To ensure safety and prevent robot loss, plans must be verified as collision free despite uncertainty (e.g, tracking error). Existing spline-based planners dilate obstacles uniformly to compensate for uncertainty, which can be conservative. On the other hand, reachability-based planners can include trajectory-dependent uncertainty as a function of the planned trajectory. This work applies Reachability-based Trajectory Design (RTD) to plan quadrotor trajectories that are safe despite trajectory-dependent tracking error. This is achieved by using zonotopes in a novel way for online planning. Simulations show aggressive flight up to 5 m/s with zero crashes in 500 cluttered, randomized environments.

URL PDF HTML ☆

赞 0 踩 0

1805.09703 2026-06-04 eess.SY cs.RO cs.SY 版本更新

No More Differentiator in PID:Development of Nonlinear Lead for Precision Mechatronics

PID中不再需要微分器：非线性前置器在精密机电系统中的开发

Arun Palanikumar, Niranjan Saikumar, S. Hassan HosseinNia

AI总结本文提出了一种新型非线性前置器，该前置器在精度和稳定性方面优于传统线性前置器，可用于替代传统前置器。通过洛伦兹驱动的纳米级精度平台验证了该概念，并展示了在精度、跟踪和带宽方面的改进。

Comments European Control Conference 2018

详情

DOI: 10.23919/ECC.2018.8550088

AI中文摘要

工业PID控制系统由三个组成部分构成：滞后（积分器）、前置（微分器）和低通滤波器（LPF）。由于PID是一种线性控制方法，因此其固有地受到水床效应的限制，导致精度与跟踪之间存在权衡，一方面由滞后和LPF提供，另一方面由前置器提供稳定性与鲁棒性。应用于滞后和LPF元素的非线性重置策略在减少这种权衡方面非常有效。然而，开发重置前置器的研究却很少。在本文中，我们开发了一种新型前置器，其精度和稳定性均优于传统线性前置器，可作为其替代品使用。该概念在洛伦兹驱动的纳米级精度平台上进行了呈现和验证。通过两个独立设计展示了精度、跟踪和带宽的改进。性能在时域和频域中均得到验证，以确保实际设置中达到的相位裕度与设计理论相匹配。

英文摘要

Industrial PID consists of three elements: Lag (integrator), Lead (Differentiator) and Low Pass Filters (LPF). PID being a linear control method is inherently bounded by the waterbed effect due to which there exists a trade-off between precision \& tracking, provided by Lag and LPF on one side and stability \& robustness, provided by Lead on the other side. Nonlinear reset strategies applied in Lag and LPF elements have been very effective in reducing this trade-off. However, there is lack of study in developing a reset Lead element. In this paper, we develop a novel lead element which provides higher precision and stability compared to the linear lead filter and can be used as a replacement for the same. The concept is presented and validated on a Lorentz-actuated nanometer precision stage. Improvements in precision, tracking and bandwidth are shown through two separate designs. Performance is validated in both time and frequency domain to ensure that phase margin achieved on the practical setup matches design theories.

URL PDF HTML ☆

赞 0 踩 0

1709.04906 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

On the interaction between Autonomous Mobility-on-Demand systems and the power network: models and coordination algorithms

关于自主出行-on-demand系统与电力网络相互作用：模型和协调算法

Federico Rossi, Ramon Iglesias, Mahnoosh Alizadeh, Marco Pavone

AI总结本文研究了自主出行-on-demand系统与电力网络的相互作用，提出了一种模型来捕捉两者之间的耦合关系，并通过联合优化算法协调两个系统。

Comments Extended version of the paper presented at Robotics: Science and Systems XIV and accepted by TCNS. In Version 4, the body of the paper is largely rewritten for clarity and consistency, and new numerical simulations are presented. All source code is available (MIT) at https://dx.doi.org/10.5281/zenodo.3241651

详情

AI中文摘要

我们研究了由电动自动驾驶车辆提供按需运输请求的车队（称为自主出行-on-demand系统，或AMoD系统）与电力网络之间的相互作用。我们提出了一种模型，该模型捕捉了由于车辆充电需求而产生的两个系统之间的耦合，同时捕捉时间变化的客户需求、电力生成成本、道路拥堵、电池损耗以及电力传输和分配约束。随后，我们利用该模型联合优化两个系统的运行。我们设计了一种算法程序，通过捆绑客户需求来无损地减少问题规模，使该问题能够被现成的线性规划求解器高效求解。接下来，我们证明了联合问题的社会最优解可以作为一般均衡来强制执行，并提供了一种对偶分解算法，使自利的代理能够计算市场清算价格而无需共享私人信息。我们通过研究达拉斯-沃思堡的假设AMoD系统及其对德克萨斯电力网络的影响来评估该模型的性能。AMoD系统与电力网络之间缺乏协调会导致达拉斯-沃思堡电力价格增加4.4%；相反，AMoD系统与电力网络之间的协调可以减少电力支出，即使电力需求增加，也比没有汽车的情况更节省1.47亿美元/年。最后，我们提供了一种滚动时域实现，并通过基于代理的模拟评估其性能。本文的结果为电力驱动的AMoD系统与电力网络之间的相互作用提供了首次系统性的描述，并进一步揭示了AMoD的经济和社会价值。

英文摘要

We study the interaction between a fleet of electric, self-driving vehicles servicing on-demand transportation requests (referred to as Autonomous Mobility-on-Demand, or AMoD, system) and the electric power network. We propose a model that captures the coupling between the two systems stemming from the vehicles' charging requirements and captures time-varying customer demand and power generation costs, road congestion, battery depreciation, and power transmission and distribution constraints. We then leverage the model to jointly optimize the operation of both systems. We devise an algorithmic procedure to losslessly reduce the problem size by bundling customer requests, allowing it to be efficiently solved by off-the-shelf linear programming solvers. Next, we show that the socially optimal solution to the joint problem can be enforced as a general equilibrium, and we provide a dual decomposition algorithm that allows self-interested agents to compute the market clearing prices without sharing private information. We assess the performance of the mode by studying a hypothetical AMoD system in Dallas-Fort Worth and its impact on the Texas power network. Lack of coordination between the AMoD system and the power network can cause a 4.4% increase in the price of electricity in Dallas-Fort Worth; conversely, coordination between the AMoD system and the power network could reduce electricity expenditure compared to the case where no cars are present (despite the increased demand for electricity) and yield savings of up $147M/year. Finally, we provide a receding-horizon implementation and assess its performance with agent-based simulations. Collectively, the results of this paper provide a first-of-a-kind characterization of the interaction between electric-powered AMoD systems and the power network, and shed additional light on the economic and societal value of AMoD.

URL PDF HTML ☆

赞 0 踩 0

1905.05926 2026-06-04 cs.NI cs.RO cs.SY eess.SY 版本更新

Connectivity-Aware UAV Path Planning with Aerial Coverage Maps

具有空中覆盖图的连接意识UAV路径规划

Hongyu Yang, Jun Zhang, S. H. Song, Khaled B. Lataief

发表机构 * Department of ECE, Hong Kong University of Science and Technology, Hong Kong（香港科技大学电子工程系）； Department of EIE, The Hong Kong Polytechnic University, Hong Kong（香港理工大学电子工程系）

AI总结本文提出了一种利用UAV可控移动来克服地面用户优化的蜂窝网络在空中覆盖不连续问题的连接意识UAV路径规划方法，通过引入两个新指标量化UAV路径的蜂窝连接质量，并利用空中覆盖图提供准确的散射覆盖孔位置，将UAV路径规划问题转化为在连接约束下寻找最短路径的问题。

Comments This paper has been accepted by IEEE WCNC 2019

详情

AI中文摘要

蜂窝网络有潜力支持无人机的有效无线通信，有助于实现各种远程无人机应用。然而，这些网络是为地面用户提供优化的，因此无法保证无缝的空中覆盖。在本文中，我们通过利用UAV的可控移动来克服这一困难，并研究连接意识的UAV路径规划。为了明确将通信需求施加于UAV路径规划上，我们引入了两个新的指标来量化UAV路径的蜂窝连接质量。此外，空中覆盖图用于提供复杂传播环境中散落覆盖孔的准确位置。我们将UAV路径规划问题 formulation 为在连接约束下寻找最短路径的问题。基于图搜索方法，提出了一种新颖的连接意识路径规划算法，具有较低的复杂度。通过弗吉尼亚一个城区的空中覆盖图（通过射线追踪构建）验证了所提算法的有效性和优越性。仿真结果也展示了UAV路径长度和连接质量之间的权衡。

英文摘要

Cellular networks are promising to support effective wireless communications for unmanned aerial vehicles (UAVs), which will help to enable various long-range UAV applications. However, these networks are optimized for terrestrial users, and thus do not guarantee seamless aerial coverage. In this paper, we propose to overcome this difficulty by exploiting controllable mobility of UAVs, and investigate connectivity-aware UAV path planning. To explicitly impose communication requirements on UAV path planning, we introduce two new metrics to quantify the cellular connectivity quality of a UAV path. Moreover, aerial coverage maps are used to provide accurate locations of scattered coverage holes in the complicated propagation environment. We formulate the UAV path planning problem as finding the shortest path subject to connectivity constraints. Based on graph search methods, a novel connectivity-aware path planning algorithm with low complexity is proposed. The effectiveness and superiority of our proposed algorithm are demonstrated using the aerial coverage map of an urban section in Virginia, which is built by ray tracing. Simulation results also illustrate a tradeoff between the path length and connectivity quality of UAVs.

URL PDF HTML ☆

赞 0 踩 0

1905.12240 2026-06-04 eess.SY cs.HC cs.RO cs.SY 版本更新

Research on fuzzy PID Shared control method of small brain-controlled uav

小脑控无人机模糊PID共享控制方法研究

Na Dong, Wen-qi Zhang, Zhong-ke Gao

发表机构 * School of Electrical and Information Engineering, Tianjin University（天津大学电气与信息工程学院）

AI总结本文针对脑控无人机在信号识别精度、时间限制和命令数量方面的不足，提出基于共享控制的模糊PID辅助控制器，实现自动控制与脑控制的协同，通过评估当前飞行状态和设置切换率来决定自动控制与脑控制的切换机制，提升系统控制性能，并通过矩形轨迹跟踪实验验证算法有效性。

详情

AI中文摘要

脑控无人机是一种通过BCI分析人类脑电信号来获取飞行指令的无人机。脑控无人机研究有助于推动脑机接口的整合，并具有广阔的应用前景。目前，BCI仍存在一些问题，例如识别精度有限、识别时间有限以及在分析EEG信号以获取控制指令时识别命令数量有限。因此，仅由脑控制的四旋翼无人机的控制性能并不理想。基于共享控制的概念，本文设计了一个辅助控制器，使用模糊PID控制，实现了自动控制与脑控制的协同控制。通过评估当前飞行状态并设置切换率，可以决定自动控制与脑控制的切换机制，以提高系统控制性能。最后，为小型四旋翼无人机设计了一个相同高度的矩形轨迹跟踪控制实验，以验证算法的有效性。

英文摘要

Brain-controlled unmanned aerial vehicle (uav) is a uav that can analyze human brain electrical signals through BCI to obtain flight commands. The research of brain-controlled uav can promote the integration of brain-computer and has a broad application prospect. At present, BCI still has some problems, such as limited recognition accuracy, limited recognition time and small number of recognition commands in the acquisition of control commands by analyzing eeg signals. Therefore, the control performance of the quadrotor which is controlled only by brain is not ideal. Based on the concept of Shared control, this paper designs an assistant controller using fuzzy PID control, and realizes the cooperative control between automatic control and brain control. By evaluating the current flight status and setting the switching rate, the switching mechanism of automatic control and brain control can be decided to improve the system control performance. Finally, a rectangular trajectory tracking control experiment of the same height is designed for small quadrotor to verify the algorithm.

URL PDF HTML ☆

赞 0 踩 0

1905.11176 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Temporally Coupled Dynamical Movement Primitives in Cartesian Space

笛卡尔空间中耦合的动态运动素体

Martin Karlsson, Anders Robertsson, Rolf Johansson

AI总结本文提出了一种在笛卡尔空间中耦合动态运动素体（DMPs）的控制方法，利用单位四元数表示姿态，并证明单位四元数集减去一个点是可缩的，从而设计出连续且全局渐近稳定的反馈控制系统。

1905.11130 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Autonomous Interpretation of Demonstrations for Modification of Dynamical Movement Primitives

自主解释示范以修改动力学运动原语

Martin Karlsson, Anders Robertsson, Rolf Johansson

发表机构 * LCCC Linnaeus Center（LCCC林纳尤斯中心）； ELLIIT Excellence Center（ELLIIT卓越中心）

AI总结本文提出了一种框架，使机器人操作员能够直观地调整动力学运动原语（DMPs）。通过使用引导通过编程，操作员可以演示纠正轨迹，从而生成一个结合故障轨迹前部分和纠正轨迹后部分的修改DMP。

1905.11129 2026-06-04 cs.RO cs.SY eess.SY 版本更新

On Motion Control and Machine Learning for Robotic Assembly

关于机器人装配的运动控制与机器学习

Martin Karlsson

发表机构 * Department of Automatic Control, Lund University（自动控制系，卢恩大学）

AI总结本研究通过减少机器人编程所需的工程工作量并增强机器人应对意外事件的能力，提出了新的方法，从而加快编程速度并使非工程背景用户也能使用机器人。

1905.09396 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Predictive Control for Chasing a Ground Vehicle using a UAV

使用无人机追击地面车辆的预测控制

Jaeseung Byun, Karan P. Jain, Siddharth H. Nair, Haoyun Xu, Jiaming Zha

AI总结本文提出了一种多旋翼无人机追击地面车辆的高层规划器，同时满足各种状态和输入约束。假设地面车辆的最小运动学模型，利用在线收集的数据在模型预测控制框架内生成预测，通过仿真和稳定四旋翼平台的实验验证了该方案。

1905.06978 2026-06-04 eess.SY cs.LG cs.RO cs.SY stat.AP 版本更新

Randomized Algorithms for Data-Driven Stabilization of Stochastic Linear Systems

数据驱动的随机算法用于随机线性系统的稳定化

Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

AI总结本文提出两种随机算法用于数据驱动的随机线性系统稳定化，通过数值分析研究了随机反馈和随机参数方法的稳定速度和失败概率，证明在统计独立随机化数量不小时可以保证快速稳定化。

1905.05946 2026-06-04 cs.RO cs.CV cs.SY eess.SY 版本更新

Depth map estimation methodology for detecting free-obstacle navigation areas

用于自由障碍区域检测的深度图估计方法

Sergio Trejo, Karla Martinez, Gerardo Flores

AI总结本文提出了一种基于视觉的方法，利用立体相机和一维LiDAR估计四旋翼导航中的自由障碍区域。通过加权最小二乘滤波器过滤深度图，并通过卡尔曼滤波算法融合信息，确定四旋翼可通过的足够大自由空间区域。整个过程在Jetson TX2嵌入式计算机上用ROS实现。

1811.01516 2026-06-04 cs.RO cs.SY eess.SY 版本更新

SLAMBooster: An Application-aware Controller for Approximation in SLAM

SLAMBooster: 一种面向应用的近似控制算法用于SLAM

Yan Pei, Swarnendu Biswas, Donald S. Fussell, Keshav Pingali

发表机构 * University of Texas at Austin, USA（德克萨斯大学奥斯汀分校）； Indian Institute of Technology Kanpur, India（印度理工学院坎普尔分校）

AI总结本文提出SLAMBooster，一种面向应用的在线控制算法，用于在SLAM中实现近似计算，通过动态调整近似参数来减少计算时间和能耗，同时保持定位精度。

Comments 13 pages

详情

AI中文摘要

同时定位与建图（SLAM）是构建移动代理环境地图的同时定位代理的问题。密集SLAM算法在像素粒度上执行重建和定位。这些算法需要大量计算资源，这限制了它们在低功耗资源受限设备上的应用。近似计算可以用于加速SLAM实现，只要近似不会阻止代理正确导航通过环境。先前的SLAM近似研究假设代理的整个轨迹在开始前已知，并且专注于离线控制器，在轨迹开始时设置近似参数。在实践中，轨迹并不事先已知，允许在运行时动态调整参数提供了更多减少计算时间和能耗的机会。我们描述了SLAMBooster，一种面向应用的在线控制系统，用于密集SLAM，它在代理运动过程中自适应地控制近似参数。SLAMBooster基于比例-积分-微分（PID）控制器技术，但我们的实验表明这种通用控制器导致定位精度显著下降。为了解决这个问题，SLAMBooster还利用领域知识通过执行平滑表面检测和姿态校正来控制近似。我们实现了SLAMBooster在开源的SLAMBench框架中，并在多个轨迹上进行了评估。我们的实验表明，在嵌入式平台上，SLAMBooster平均减少了72%的计算时间和35%的能量消耗，同时保持定位精度在合理范围内。这些改进使得在更广泛设备上部署SLAM成为可能。

英文摘要

Simultaneous Localization and Mapping (SLAM) is the problem of constructing a map of a mobile agent's environment while localizing the agent within the map. Dense SLAM algorithms perform reconstruction and localization at pixel granularity. These algorithms require a lot of computational power, which has hindered their use on low-power resource-constrained devices. Approximate computing can be used to speed up SLAM implementations as long as the approximations do not prevent the agent from navigating correctly through the environment. Previous studies of approximation in SLAM have assumed that the entire trajectory of the agent is known before the agent starts, and they have focused on offline controllers that set approximation knobs at the start of the trajectory. In practice, the trajectory is not known ahead of time, and allowing knob settings to change dynamically opens up more opportunities for reducing computation time and energy. We describe SLAMBooster, an application-aware, online control system for dense SLAM that adaptively controls approximation knobs during the motion of the agent. SLAMBooster is based on a control technique called proportional-integral-derivative (PID) controller but our experiments showed this application-agnostic controller led to an unacceptable reduction in localization accuracy. To address this problem, SLAMBooster also exploits domain knowledge for controlling approximation by performing smooth surface detection and pose correction. We implemented SLAMBooster in the open-source SLAMBench framework and evaluated it on several trajectories. Our experiments show that on the average, SLAMBooster reduces the computation time by 72% and energy consumption by 35% on an embedded platform, while maintaining the accuracy of localization within reasonable bounds. These improvements make it feasible to deploy SLAM on a wider range of devices.

URL PDF HTML ☆

赞 0 踩 0

1810.03076 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Online Center of Mass Estimation for a Humanoid Wheeled Inverted Pendulum Robot

人形轮式反重力摆机器人在线质心估计

Munzir Zafar, Akash Patel, Bogdan Vlahov, Nathaniel Glaser, Sergio Aguillera, Seth Hutchinson

AI总结本文提出了一种新颖的鲁棒控制与在线学习结合的方法，用于平衡具有n自由度的轮式反重力摆人形机器人，通过在线学习更新质量模型以获得更准确的质心估计，实验表明该方法提升了整体控制效率。

详情

AI中文摘要

我们提出了一种新颖的鲁棒控制和在线学习应用，用于平衡具有n个自由度（DoF）的轮式反重力摆（WIP）人形机器人。我们的技术将质量模型的不准确性转化为质心（CoM）误差，并在存在误差的情况下实现平衡，同时利用在线学习更新质量模型以获得更好的质心估计。使用我们机器人的模拟模型，我们元学习了一组激励关节姿态，使我们的梯度下降算法快速收敛到准确的（CoM）估计。该模拟流程完全在线执行，使用主动扰动抵消来解决由于持续演变的质量模型所产生的质量误差。在19个自由度的WIP上进行了实验，我们手动获取了学习姿态集的数据，并展示了由梯度下降产生的质量模型产生的质心估计能够提高整体控制和效率。本工作为Golem Krang人形机器人整体控制贡献了更丰富的文献。

英文摘要

We present a novel application of robust control and online learning for the balancing of a n Degree of Freedom (DoF), Wheeled Inverted Pendulum (WIP) humanoid robot. Our technique condenses the inaccuracies of a mass model into a Center of Mass (CoM) error, balances despite this error, and uses online learning to update the mass model for a better CoM estimate. Using a simulated model of our robot, we meta-learn a set of excitory joint poses that makes our gradient descent algorithm quickly converge to an accurate (CoM) estimate. This simulated pipeline executes in a fully online fashion, using active disturbance rejection to address the mass errors that result from a steadily evolving mass model. Experiments were performed on a 19 DoF WIP, in which we manually acquired the data for the learned set of poses and show that the mass model produced by a gradient descent produces a CoM estimate that improves overall control and efficiency. This work contributes to a greater corpus of whole body control on the Golem Krang humanoid robot.

URL PDF HTML ☆

赞 0 踩 0

1809.06715 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Controller Synthesis for Discrete-time Hybrid Polynomial Systems via Occupation Measures

通过占用测度进行离散时间混合多项式系统的控制器合成

Weiqiao Han, Russ Tedrake

AI总结本文提出了一种基于占用测度的控制器合成方法，用于稳定具有多个与环境接触的刚体系统，通过解决有限维半正定规划问题来近似无限维线性规划问题，以实现系统稳定。

Comments Accepted by ICRA 2019. Some text overlap with arXiv:1803.09022 in introducing standard notations and preliminary knowledge. arXiv admin note: text overlap with arXiv:1803.09022

1905.03349 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A New Hybrid Control Architecture to Attenuate Large Horizontal Wind Disturbance for a Small-Scale Unmanned Helicopter

一种新型混合控制架构用于抑制小型无人驾驶直升机的大水平风扰动

Xiaorui Zhu, Wenwu Zeng, Zexiang Li, Chunyang Zheng

发表机构 * School of Mechanical Engineering and Automation, Harbin Institute of Technology Shenzhen Graduate School（哈尔滨工业大学机械工程与自动化学院深圳研究生院）； Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology（香港理工大学电子与计算机工程系）

AI总结本文提出了一种结合风洞实验数据和反步算法的新方法，用于抑制小型无人驾驶直升机的大水平风扰动，通过混合控制架构实现更精确快速的扰动抑制。

详情

AI中文摘要

本文提出了一种新颖的方法，用于抑制小型无人驾驶自主直升机的大水平风扰动，结合风洞实验数据和反步算法。大水平风扰动对自主直升机有害，尤其是小型直升机，由于其低惯性以及多输入间的强交叉耦合效应。为实现更精确和快速的扰动抑制，提出了一种新的混合控制架构，利用基于风洞实验数据的直接力/力矩补偿。在该架构中，大水平风扰动被视为控制系统的附加输入，而非平衡状态附近的微小扰动。然后设计反步算法以保证直升机稳定收敛至期望位置。所提出的技术最终在HIROBO Eagle平台上通过仿真进行评估，并与传统风速补偿方法进行比较。

英文摘要

This paper presents a novel method to attenuate large horizontal wind disturbance for a small-scale unmanned autonomous helicopter combining wind tunnel-based experimental data and a backstepping algorithm. Large horizontal wind disturbance is harmful to autonomous helicopters, especially to small ones because of their low inertia and the high cross-coupling effects among the multiple inputs. In order to achieve more accurate and faster attenuation of large wind disturbance, a new hybrid control architecture is proposed to take advantage of the direct force/moment compensation based on the wind tunnel experimental data. In this architecture, large horizontal wind disturbance is treated as an additional input to the control system instead of a small perturbation around the equilibrium state. A backstepping algorithm is then designed to guarantee the stable convergence of the hilicopter to the desired position. The proposed technique is finally evaluated in simulation on the platform, HIROBO Eagle, compared with a traditional wind velocity compensation method.

URL PDF HTML ☆

赞 0 踩 0

1905.03131 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Vision-based Unscented FastSLAM for Mobile Robot

基于视觉的无迹快速SLAM

Chunxin Qiu, Xiaorui Zhu, Xiaobing Zhao

AI总结本文提出一种结合无迹粒子滤波和无迹卡尔曼滤波的视觉无迹快速SLAM算法，通过双目视觉检测地标实现定位与建图，利用无迹快速SLAM提升定位与建图的精度和鲁棒性。

1905.03130 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Cooperative Distributed Robust Control of Modular Mobile Robots with Bounded Curvature and Velocity

模块化移动机器人协同分布式鲁棒控制：具有有限曲率和速度的控制

Xiaorui Zhu, Youngshik Kim, Mark A. Minor

发表机构 * University of Utah（犹他大学）

AI总结本文研究了一种新型的运动控制系统，用于具有柔性框架轮式模块化移动机器人（CFMMR）。该系统结合了基于有限曲率的运动控制器和非线性阻尼动态控制器，通过多种控制器交互形式评估不同控制结构，实验结果验证了姿态调节的有效性。

1905.03051 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Bayesian Optimization for Polynomial Time Probabilistically Complete STL Trajectory Synthesis

基于多项式时间概率完备STL轨迹综合的贝叶斯优化

Vince Kurtz, Hai Lin

AI总结本文提出了一种基于贝叶斯优化的STL轨迹综合方法，替代传统的混合整数线性规划方法，实现了多项式时间复杂度和线性预测数量的高效轨迹综合，同时保证了概率完备性。

Comments CDC 2019 Submission

详情

AI中文摘要

近年来，信号临时逻辑（STL）作为机器人和网络物理系统控制目标编码的实用且表达能力强的手段获得了广泛关注。STL轨迹综合的最新进展是将问题形式化为混合整数线性规划（MILP）。MILP方法在有界规范下是正确且完整的，但这种强正确性保证以指数复杂度和规范的时间界为代价。在本工作中，我们提出了一种替代的综合范式，依赖于贝叶斯优化而不是混合整数规划。这将完备性保证放松为概率完备性，但显著更高效：我们的方法在STL时间界上呈多项式复杂度，在预测数量上呈线性复杂度。我们证明我们的方法是正确且概率完备的，并通过一个非平凡的例子展示了其可扩展性。

英文摘要

In recent years, Signal Temporal Logic (STL) has gained traction as a practical and expressive means of encoding control objectives for robotic and cyber-physical systems. The state-of-the-art in STL trajectory synthesis is to formulate the problem as a Mixed Integer Linear Program (MILP). The MILP approach is sound and complete for bounded specifications, but such strong correctness guarantees come at the price of exponential complexity in the number of predicates and the time bound of the specification. In this work, we propose an alternative synthesis paradigm that relies on Bayesian optimization rather than mixed integer programming. This relaxes the completeness guarantee to probabilistic completeness, but is significantly more efficient: our approach scales polynomially in the STL time-bound and linearly in the number of predicates. We prove that our approach is sound and probabilistically complete, and demonstrate its scalability with a nontrivial example.

URL PDF HTML ☆

赞 0 踩 0

1905.01028 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Robust Cooperative Formation Control of Fixed-Wing Unmanned Aerial Vehicles

固定翼无人飞行器鲁棒协同编队控制

Qingrui Zhang, Hugh H. T. Liu

发表机构 * Institute for Aerospace Studies, University of Toronto（航空航天研究学院，多伦多大学）

AI总结本文研究了固定翼无人机在紧密编队飞行中用于节能的鲁棒协同编队控制问题，提出了一种新的协同控制方法，通过虚拟结构概念解决大规模无人机编队中虚拟领导者设计的难题，并采用协作者滤波器生成平滑的参考信号，同时利用不确定性与扰动观测器补偿气动耦合导致的模型不确定性，最终实现了鲁棒的编队控制。

详情

AI中文摘要

本文研究了固定翼无人飞行器在紧密编队飞行中用于节能的鲁棒协同编队控制问题。提出了一种新的协同控制方法。采用虚拟结构的概念来解决在编队飞行中为大量无人机设计虚拟领导者所面临的困难。为了提高暂态性能，期望轨迹通过一组协作者滤波器生成平滑的参考信号，即虚拟领导者的状态。利用不确定性与扰动观测器估计并补偿由于无人机之间气动耦合导致的模型不确定性。因此，整个设计包含三个主要组成部分：用于运动规划的协作者滤波器、基准协同控制以及不确定性与扰动观测。所提出的编队控制器至少可以保证编队跟踪的最终有界控制性能。如果满足某些条件，则可以实现渐近编队跟踪控制。本文的主要贡献在于两个方面：1）通过虚拟结构概念解决了设计虚拟领导者的问题；2）提出了一种针对大量无人机之间存在气动耦合的紧密编队飞行的鲁棒协同控制器。所提出设计的效率将通过五架无人机紧密编队飞行的数值模拟来展示。

英文摘要

Robust cooperative formation control is investigated in this paper for fixed-wing unmanned aerial vehicles in close formation flight to save energy. A novel cooperative control method is developed. The concept of virtual structure is employed to resolve the difficulty in designing virtual leaders for a large number of UAVs in formation flight. To improve the transient performance, desired trajectories are passed through a group of cooperative filters to generate smooth reference signals, namely the states of the virtual leaders. Model uncertainties due to aerodynamic couplings among UAVs are estimated and compensated using uncertainty and disturbance observers. The entire design, therefore, contains three major components: cooperative filters for motion planning, baseline cooperative control, and uncertainty and disturbance observation. The proposed formation controller could at least secure ultimate bounded control performance for formation tracking. If certain conditions are satisfied, asymptotic formation tracking control could be obtained. Major contributions of this paper lie in two aspects: 1) the difficulty in designing virtual leaders is resolved in terms of the virtual structure concept; 2) a robust cooperative controller is proposed for close formation flight of a large number of UAVs suffering from aerodynamic couplings in between. The efficiency of the proposed design will be demonstrated using numerical simulations of five UAVs in close formation flight.

URL PDF HTML ☆

赞 0 踩 0

1809.07874 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Task-Driven Estimation and Control via Information Bottlenecks

基于信息瓶颈的任务驱动估计与控制

Vincent Pacelli, Anirudha Majumdar

发表机构 * Princeton University（普林斯顿大学）

AI总结本文提出了一种基于信息瓶颈理论的任务驱动估计与控制框架，通过任务相关的变量构建高效表示，并设计了在线算法进行状态估计以实现鲁棒的控制策略。

Comments 9 pages, 4 figures, abridged version accepted to ICRA2019; Incorporates changes in final conference submission

详情

AI中文摘要

我们的目标是开发一种原理明确且通用的算法框架，用于机器人系统的任务驱动估计与控制。最先进的机器人控制系统通常依赖于准确估计机器人完整状态（例如，运行中的机器人可能估计关节角度和速度、躯干状态以及相对于目标的位置）。然而，完整状态表示通常对于特定任务来说过于丰富，可能导致显著的计算不高效和对状态估计误差的脆弱性。相反，我们提出了一种方法，避免使用这种丰富的表示，并试图创建任务驱动的表示。关键的技术洞察是利用信息瓶颈理论来形式化“任务驱动表示”的概念，以信息理论量度来衡量表示的最小性。我们提出了新的迭代算法，用于自动合成（离线）任务驱动表示（以一组任务相关变量（TRVs）给出）和一个以TRVs为函数的高效控制策略。我们还提出了在线算法来估计TRVs，以便应用控制策略。我们证明了我们的方法在理论和彻底的模拟实验（包括向目标位置奔跑的弹簧加载倒立摆）中都对未建模的测量不确定性具有显著的鲁棒性。

英文摘要

Our goal is to develop a principled and general algorithmic framework for task-driven estimation and control for robotic systems. State-of-the-art approaches for controlling robotic systems typically rely heavily on accurately estimating the full state of the robot (e.g., a running robot might estimate joint angles and velocities, torso state, and position relative to a goal). However, full state representations are often excessively rich for the specific task at hand and can lead to significant computational inefficiency and brittleness to errors in state estimation. In contrast, we present an approach that eschews such rich representations and seeks to create task-driven representations. The key technical insight is to leverage the theory of information bottlenecks}to formalize the notion of a "task-driven representation" in terms of information theoretic quantities that measure the minimality of a representation. We propose novel iterative algorithms for automatically synthesizing (offline) a task-driven representation (given in terms of a set of task-relevant variables (TRVs)) and a performant control policy that is a function of the TRVs. We present online algorithms for estimating the TRVs in order to apply the control policy. We demonstrate that our approach results in significant robustness to unmodeled measurement uncertainty both theoretically and via thorough simulation experiments including a spring-loaded inverted pendulum running to a goal location.

URL PDF HTML ☆

赞 0 踩 0

1905.01683 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Path Planning for Autonomous Bus Driving in Urban Environments

城市环境中自动驾驶巴士的路径规划

Rui Oliveira, Pedro F. Lima, Gonçalo Collares Pereira, Jonas Mårtensson, Bo Wahlberg

AI总结本文提出了一种针对城市环境中巴士驾驶的路径规划框架，通过优化问题解决巴士的复杂驾驶任务，利用道路对齐车辆模型，并考虑巴士的过道特性以实现安全的避障约束。

Comments 6 pages, 8 figures

详情

AI中文摘要

在城市环境中驾驶往往面临需要专家操作车辆的困难情况。当考虑大型车辆，如巴士时，这些情况变得更加具有挑战性。我们提出了一种路径规划框架，以解决巴士在城市区域中的复杂驾驶任务。该方法使用道路对齐的车辆模型进行建模。道路对齐的坐标系引入了对车辆本体和障碍物的扭曲，促使开发新的近似方法来捕捉这种扭曲。这些近似方法允许安全且非保守的碰撞避免约束的制定。与其他路径规划方法不同，我们的方法利用了 curb 和其他可扫过的区域，这些区域巴士在执行某些操作时必须扫过。此外，它充分利用了巴士的特定特性，即过道，这是车辆底盘的升高部分，可以扫过 curb。进行了模拟，展示了所提出方法的适用性和优势。

英文摘要

Driving in urban environments often presents difficult situations that require expert maneuvering of a vehicle. These situations become even more challenging when considering large vehicles, such as buses. We present a path planning framework that addresses the demanding driving task of buses in urban areas. The approach is formulated as an optimization problem using the road-aligned vehicle model. The road-aligned frame introduces a distortion on the vehicle body and obstacles, motivating the development of novel approximations that capture this distortion. These approximations allow for the formulation of safe and non-conservative collision avoidance constraints. Unlike other path planning approaches, our method exploits curbs and other sweepable regions, which a bus must often sweep over in order to manage certain maneuvers. Furthermore, it takes full advantage of the particular characteristics of buses, namely the overhangs, an elevated part of the vehicle chassis, that can sweep over curbs. Simulations are presented, showing the applicability and benefits of the proposed method.

URL PDF HTML ☆

赞 0 踩 0

1903.10623 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Attitude- and Cruise Control of a VTOL Tiltwing UAV

VTOL 倾转翼无人机的姿态与巡航控制

David Rohr, Thomas Stastny, Sebastian Verling, Roland Siegwart

发表机构 * IEEE

AI总结本文提出了一种过量驱动的垂直起降（VTOL）倾转翼无人机的数学建模、控制器设计和飞行测试方法，通过简化空气动力学和第一原理，建立了能够捕捉关键空气动力学效应的动态模型，包括螺旋桨滑流对机翼的影响和机翼后失速特性。通过优化如功率最优调平等方法解决了无人机的过量驱动问题，并开发了由低层姿态控制器和高层巡航控制器组成的控制系统，通过系统线性化和查找表确定飞行包线内调平的强非线性变化，通过广泛的飞行测试验证了控制系统在所有飞行阶段（悬停、过渡、巡航）的性能。

Comments 8 pages

详情

DOI: 10.1109/LRA.2019.2914340

AI中文摘要

本文提出了一种垂直起降（VTOL）倾转翼无人机的数学建模、控制器设计和飞行测试方法。基于简化空气动力学和第一原理，建立了能够捕捉关键空气动力学效应的动态模型，包括螺旋桨滑流对机翼的影响和机翼后失速特性。基于该模型分析了稳态飞行包线和相应的调平作用，并通过优化如功率最优调平等方法解决了无人机的过量驱动问题。所开发的控制系统由两个控制器组成：首先是一个基于动态反向和串行方法的低层姿态控制器，用于处理冗余执行器的分配。其次是一个高层巡航控制器，用于跟踪所需的垂直速度。该控制器基于系统线性化和查找表，以确定飞行包线内调平的强非线性变化。我们通过广泛的飞行测试验证了该控制系统的性能，适用于所有飞行阶段（悬停、过渡、巡航）。

英文摘要

This paper presents the mathematical modeling, controller design, and flight-testing of an over-actuated Vertical Take-off and Landing (VTOL) tiltwing Unmanned Aerial Vehicle (UAV). Based on simplified aerodynamics and first-principles, a dynamical model of the UAV is developed which captures key aerodynamic effects including propeller slipstream on the wing and post-stall characteristics of the airfoils. The model-based steady-state flight envelope and the corresponding trim-actuation is analyzed and the overactuation of the UAV solved by optimizing for, e.g., power-optimal trims. The developed control system is composed of two controllers: First, a low-level attitude controller based on dynamic inversion and a daisy-chaining approach to handle allocation of redundant actuators. Secondly, a higher-level cruise controller to track a desired vertical velocity. It is based on a linearization of the system and look-up tables to determine the strong and nonlinear variation of the trims throughout the flight-envelope. We demonstrate the performance of the control-system for all flight phases (hover, transition, cruise) in extensive flight-tests.

URL PDF HTML ☆

赞 0 踩 0

1905.01150 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

A Right-of-Way Based Strategy to Implement Safe and Efficient Driving at Non-Signalized Intersections for Automated Vehicles

基于通行权的非信号交叉口安全高效自动驾驶策略

Yadong Xing, Can Zhao, ZhiHeng Li, Yi Zhang, Li Li, Fei-Yue Wang, Xiao Wang, Yujing Wang, Yuelong Su, Dongpu Cao

发表机构 * Graduate School at Shenzhen, Tsinghua University（清华大学深圳研究生院）； Department of Automation, BNRist, Tsinghua University（清华大学自动化系，北京理工大学）； traffic management solution division, traffic throughput through simulation experiments（交通管理解决方案部）

AI总结本文提出一种基于通行权分配的策略，用于在非信号交叉口实现安全高效的自动驾驶，通过测试比较现有策略，证明该策略在交通效率上优于原有策略，但受限于通信范围和缺乏长期规划，不如协作驾驶策略，但通信成本更低。

Comments 6 pages, 7 figures

1904.12394 2026-06-04 math.DS cs.NA cs.RO math.NA 版本更新

Stability conditions of an ODE arising in human motion and its numerical simulation

来自人体运动的ODE的稳定性条件及其数值模拟

Takahiro Kosugi, Hitoshi Kino, Masaaki Goto, Yuki Matsutani

发表机构 * Department of Intelligent Mechanical Engineering, Faculty of Engineering, Fukuoka Institute of Technology（福冈技术学院智能机械工程系）； Department of Robotics, Faculty of Engineering, Kindai University（近畿大学机器人系）

AI总结本文研究了一个来自肌骨系统前馈位置控制的ODE的平衡点稳定性，提出了一种渐近稳定的充分条件，并通过惩罚ODE的数值模拟和实验结果验证了该条件。

Comments 15 pages, 7 figures

1808.07921 2026-06-04 cs.RO cs.AI cs.PL cs.SE cs.SY eess.SY 版本更新

SOTER: A Runtime Assurance Framework for Programming Safe Robotics Systems

SOTER：一种用于安全机器人系统编程的运行时保证框架

Ankush Desai, Shromona Ghosh, Sanjit A. Seshia, Natarajan Shankar, Ashish Tiwari

发表机构 * University of California at Berkeley, CA, USA（加州大学伯克利分校）； SRI International（SRI国际）； Microsoft（微软）

AI总结本文提出SOTER框架，通过一种编程语言和集成的运行时保证系统，为安全机器人系统提供保障，确保在使用未经认证组件时仍能满足安全要求。

详情

AI中文摘要

近年来，机器人实现更高自主性和智能性的趋势导致了高度复杂性。自主机器人越来越多地依赖第三方现成组件和复杂的机器学习技术。这种趋势使得提供强设计时认证的正确操作变得具有挑战性。为了解决这些挑战，我们提出了SOTER，一种机器人编程框架，包含两个关键组件：（1）一种用于实现和测试高层反应式机器人软件的编程语言；（2）一个集成的运行时保证（RTA）系统，该系统帮助在使用未经认证的组件时仍能提供安全保证。SOTER提供了语言原语，用于声明性地构建RTA模块，该模块包含一个高级高性能控制器（未经认证）、一个安全但性能较低的控制器（认证）以及期望的安全规范。该框架提供正式保证，确保一个良好的RTA模块始终满足安全规范，而无需完全牺牲性能，通过在安全时使用高性能未经认证的组件。SOTER允许复杂的机器人软件堆栈作为RTA模块的组合来构建，其中每个未经认证的组件都通过RTA模块进行保护。为了证明我们框架的有效性，我们考虑了一个现实世界案例研究，即构建一个安全的无人机监视系统。我们的实验在模拟和实际无人机上均表明，SOTER启用的RTA确保了系统的安全性，包括在不可信的第三方组件有bug或偏离预期行为时。

英文摘要

The recent drive towards achieving greater autonomy and intelligence in robotics has led to high levels of complexity. Autonomous robots increasingly depend on third party off-the-shelf components and complex machine-learning techniques. This trend makes it challenging to provide strong design-time certification of correct operation. To address these challenges, we present SOTER, a robotics programming framework with two key components: (1) a programming language for implementing and testing high-level reactive robotics software and (2) an integrated runtime assurance (RTA) system that helps enable the use of uncertified components, while still providing safety guarantees. SOTER provides language primitives to declaratively construct a RTA module consisting of an advanced, high-performance controller (uncertified), a safe, lower-performance controller (certified), and the desired safety specification. The framework provides a formal guarantee that a well-formed RTA module always satisfies the safety specification, without completely sacrificing performance by using higher performance uncertified components whenever safe. SOTER allows the complex robotics software stack to be constructed as a composition of RTA modules, where each uncertified component is protected using a RTA module. To demonstrate the efficacy of our framework, we consider a real-world case-study of building a safe drone surveillance system. Our experiments both in simulation and on actual drones show that the SOTER-enabled RTA ensures the safety of the system, including when untrusted third-party components have bugs or deviate from the desired behavior.

URL PDF HTML ☆

赞 0 踩 0

1904.08833 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Passivity-based Nonlinear Admittance Control with Application to Powered Upper-limb Control under Unknown Environmental Interactions

一种基于被动性的非线性阻抗控制及其在未知环境交互下的动力上肢控制应用

Min Jun Kim, Woongyong Lee, Jae Yeon Choi, Goobong Chung, Kyung-Lyong Han, Il Seop Choi, Christian Ott, Wan Kyun Chung

AI总结本文提出了一种基于被动性理论的动力上肢外骨骼机器人的阻抗控制器，通过非线性运动方程建模，利用被动性理论将人类操作员和环境交互纳入控制回路，通过力/扭矩传感器与人类交互，通过末端执行器与环境交互，尽管环境交互无法被任何传感器检测到（未知），但被动性允许自然交互。分析表明，当控制增益趋于无穷大时，实际系统的行为与名义模型相似，表明所提出的方法是一种阻抗控制器。然而，由于实际中控制增益无法无限增长，根据可实现的控制增益性能限制也被分析。分析结果表明，按无限范数意义，性能与控制增益成线性关系。在实验中，使用1自由度测试台验证了所提出的方法，并用实际的动力上肢外骨骼设备来提升和操控未知负载。

Comments Accepted in IEEE/ASME Transactions on Mechatronics (T-MECH)

详情

AI中文摘要

本文提出了一种基于被动性理论的动力上肢外骨骼机器人的阻抗控制器，该机器人由非线性运动方程支配。被动性允许我们将人类操作员和环境交互纳入控制回路。机器人通过F/T传感器与人类操作员交互，主要通过末端执行器与环境交互。尽管环境交互无法被任何传感器检测到（因此未知），被动性允许我们实现自然交互。分析表明，当控制增益趋于无穷大时，实际系统的行为与名义模型相似，这表明所提出的方法是一种阻抗控制器。然而，由于实际中控制增益无法无限增长，根据可实现的控制增益的性能限制也被分析。分析结果表明，按无限范数意义，性能与控制增益成线性关系。在实验中，所提出的方法通过1自由度测试台进行了验证，并使用实际的动力上肢外骨骼设备来提升和操控未知负载。

英文摘要

This paper presents an admittance controller based on the passivity theory for a powered upper-limb exoskeleton robot which is governed by the nonlinear equation of motion. Passivity allows us to include a human operator and environmental interaction in the control loop. The robot interacts with the human operator via F/T sensor and interacts with the environment mainly via end-effectors. Although the environmental interaction cannot be detected by any sensors (hence unknown), passivity allows us to have natural interaction. An analysis shows that the behavior of the actual system mimics that of a nominal model as the control gain goes to infinity, which implies that the proposed approach is an admittance controller. However, because the control gain cannot grow infinitely in practice, the performance limitation according to the achievable control gain is also analyzed. The result of this analysis indicates that the performance in the sense of infinite norm increases linearly with the control gain. In the experiments, the proposed properties were verified using 1 degree-of-freedom testbench, and an actual powered upper-limb exoskeleton was used to lift and maneuver the unknown payload.

URL PDF HTML ☆

赞 0 踩 0

1904.08361 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems

基于解耦数据的方法用于学习控制非线性动力学系统

Ran Wang, Karthikeya Parunandi, Dan Yu, Dileep Kalathil, Suman Chakravorty

发表机构 * College of Astronautics, Nanjing University and hence, run into the curse of dimensionality（南京大学航天学院）； Department of Electrical and Computer Engineering, Texas A&M University, Texas, USA（德克萨斯A&M大学电气与计算机工程系）

AI总结本文提出了一种解耦数据基于的方法，用于学习控制具有连续状态空间、连续动作空间和未知动态的非线性随机动力学系统，通过解耦的开环-闭环方法，利用黑盒仿真模型解决开环确定性轨迹优化问题，并通过线性化动态在该名义轨迹上开发闭环控制，从而使用线性二次调节器算法，证明了该方法的性能近似最优，并在训练时间上显著优于其他先进算法。

详情

AI中文摘要

本文解决了一个非线性随机动力学系统学习最优控制策略的问题，该系统具有连续状态空间、连续动作空间和未知动态。此类问题通常在随机自适应控制和强化学习文献中使用基于模型和无模型的方法分别解决。这两种方法都依赖于解决动态规划问题，无论是直接还是间接，以找到最优闭环控制策略。动态规划方法固有的'维度灾难'使这些方法也变得计算上困难。本文提出了一种新颖的解耦数据基于控制（D2C）算法，通过解耦的'开环-闭环'方法解决这个问题。首先，使用动力学系统的黑盒仿真模型解决一个开环确定性轨迹优化问题。然后，通过在该名义轨迹上线性化动态，开发围绕该开环轨迹的闭环控制。通过线性化，可以使用基于线性二次调节器的算法来实现该闭环控制。我们证明了D2C算法的性能近似最优。此外，仿真性能表明，与其它先进算法相比，训练时间显著减少。

英文摘要

This paper addresses the problem of learning the optimal control policy for a nonlinear stochastic dynamical system with continuous state space, continuous action space and unknown dynamics. This class of problems are typically addressed in stochastic adaptive control and reinforcement learning literature using model-based and model-free approaches respectively. Both methods rely on solving a dynamic programming problem, either directly or indirectly, for finding the optimal closed loop control policy. The inherent `curse of dimensionality' associated with dynamic programming method makes these approaches also computationally difficult. This paper proposes a novel decoupled data-based control (D2C) algorithm that addresses this problem using a decoupled, `open loop - closed loop', approach. First, an open-loop deterministic trajectory optimization problem is solved using a black-box simulation model of the dynamical system. Then, a closed loop control is developed around this open loop trajectory by linearization of the dynamics about this nominal trajectory. By virtue of linearization, a linear quadratic regulator based algorithm can be used for this closed loop control. We show that the performance of D2C algorithm is approximately optimal. Moreover, simulation performance suggests significant reduction in training time compared to other state of the art algorithms.

URL PDF HTML ☆

赞 0 踩 0

1904.07479 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Robust nonlinear control of close formation flight

紧密编队飞行的鲁棒非线性控制

Qingrui Zhang, Hugh H. T. Liu

发表机构 * University of Toronto（多伦多大学）

AI总结本文研究了在动态飞行操作中，为跟随飞行器在受领航机生成的尾涡影响下实现精确位置控制的问题，提出了一种鲁棒非线性编队控制算法，通过基线控制律和扰动观测器实现精确编队跟踪控制，通过数值模拟验证了设计的有效性。

Comments 33 pages, 20 figures

详情

AI中文摘要

本文研究了紧密编队飞行的鲁棒非线性控制问题。其目标是在动态飞行操作中，为跟随飞行器在受领航机生成的尾涡影响下实现精确位置控制。一个关键问题是如何确保在存在不确定性和扰动的情况下，位置误差保持有界。本文提出了一种鲁棒非线性编队控制算法，以实现精确的紧密编队跟踪控制。所提出的控制算法由基线控制律和扰动观测器组成。基线控制律用于稳定紧密编队飞行的非线性动力学，而扰动观测器用于补偿系统不确定性和编队相关的空气动力学扰动。通过所提出的设计，可以保证位置控制性能在期望的有界范围内，从而为在紧密编队中使用该设计获得足够的阻力减少。所提出设计的有效性通过两架飞行器的紧密编队飞行数值模拟得以验证。

英文摘要

This paper investigates the robust nonlinear close formation control problem. It aims to achieve precise position control at dynamic flight operation for a follower aircraft under the aerodynamic impact due to the trailing vortices generated by a leader aircraft. One crucial concern is the control robustness that ensures the boundedness of position error subject to uncertainties and disturbances to be regulated with accuracy. This paper develops a robust nonlinear formation control algorithm to fulfill precise close formation tracking control. The proposed control algorithm consists of baseline control laws and disturbance observers. The baseline control laws are employed to stabilize the nonlinear dynamics of close formation flight, while the disturbance observers are introduced to compensate system uncertainties and formation-related aerodynamic disturbances. The position control performance can be guaranteed within the desired boundedness to harvest enough drag reduction for a follower aircraft in close formation using the proposed design. The efficacy of the proposed design is demonstrated via numerical simulations of close formation flight of two aircraft.

URL PDF HTML ☆

赞 0 踩 0

1904.06892 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control

学习引导：基于深度元学习和模型预测路径积分控制的引导法

Chen Liang, Weihong Wang, Zhenghua Liu, Chao Lai, Benchun Zhou

发表机构 * School of Automation Science and Electrical Engineering, Beihang University（北京航空航天大学自动化科学与电气工程学院）； Navigation and Control Technology Research Institute of China North Industries Group Corporation（中国北方工业集团有限公司导航与控制技术研究院）

AI总结本文提出了一种基于模型驱动深度强化学习的新型引导方案，通过将深度神经网络作为引导动力学的预测模型融入模型预测路径积分（MPPI）控制框架中，利用元学习技术使深度神经动力学模型能够在线适应环境变化，从而缓解标准MPPI控制因实际环境与训练数据差异导致的性能下降，并构建了在存在作动器故障时拦截机动目标的新型引导律。

Comments Code available at https://github.com/tccliangchen/deep_meta-learning_guidance_law . in IEEE Access 2019

详情

DOI: 10.1109/ACCESS.2019.2909579

AI中文摘要

在本文中，我们提出了一种基于模型驱动深度强化学习（RL）技术的新型引导方案。利用模型驱动深度RL方法，训练一个深度神经网络作为引导动力学的预测模型，并将其纳入模型预测路径积分（MPPI）控制框架中。然而，传统的MPPI框架假设实际环境与训练数据集相似，这在实践中由于目标机动、其他扰动和作动器故障等因素而难以实现。为了解决这个问题，我们的方法利用元学习技术，使深度神经动力学模型能够在线适应这些变化。通过这种方法，我们可以减轻标准MPPI控制因实际环境与训练数据差异导致的性能下降。然后，基于上述技术，构建了一种新型引导律，用于在存在作动器故障的情况下拦截具有期望终端撞击角的机动目标。不同情况下的仿真和实验结果表明，所提出的引导律在实现成功拦截机动目标方面具有有效性与鲁棒性。

英文摘要

In this paper, we present a novel guidance scheme based on model-based deep reinforcement learning (RL) technique. With model-based deep RL method, a deep neural network is trained as a predictive model of guidance dynamics which is incorporated into a model predictive path integral (MPPI) control framework. However the traditional MPPI framework assumes the actual environment similar to the training dataset for the deep neural network which is impractical in practice with different maneuvering of target, other perturbations and actuator failures. To address this problem, our method utilize meta-learning technique to make the deep neural dynamics model adapt to such changes online. With this approach we can alleviate the performance deterioration of standard MPPI control caused by the difference between actual environment and training data. Then, a novel guidance law for a varying velocity interceptor intercepting maneuvering target with desired terminal impact angle under actuator failure is constructed based on aforementioned techniques. Simulation and experiment results under different cases show the effectiveness and robustness of the proposed guidance law in achieving successful interceptions of maneuvering target.

URL PDF HTML ☆

赞 0 踩 0

1904.06680 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Online Sampling in the Parameter Space of a Neural Network for GPU-accelerated Motion Planning of Autonomous Vehicles

神经网络参数空间中的在线采样用于自动驾驶车辆的GPU加速运动规划

Mogens Graf Plessen

发表机构 * MPG

AI总结本文提出了一种用于自动驾驶车辆GPU加速运动规划的神经网络参数空间在线采样方法，通过神经网络作为控制器参数化来处理非线性和非凸系统，并在预测时间范围内保持参数化不变，同时通过变化的特征向量确定转向和纵向加速度控制。

Comments 8 pages, 8 figures, 3 tables, conference paper

详情

AI中文摘要

本文提出了一种用于自动驾驶车辆GPU加速运动规划的神经网络参数空间在线采样方法。神经网络被用作控制器参数化，因为它们能够处理非线性和非凸系统，且其复杂性不随预测时间长度而增加。在网络参数化在每个采样时间点被采样后，在预测时间范围内保持不变。由于输入到网络的特征向量在预测时间范围内变化，因此控制仍然在预测时间范围内变化。全维车辆通过多面体建模。在障碍物点数据假设下，并在其预测时间范围内以恒定速度假设进行外推的情况下，碰撞避免减少为线性不等式检查。转向和纵向加速度控制同时确定。所提出的方法设计用于并行化，因此非常适合从GPU等硬件的持续进步中受益。所提出方法的特点在5个数值模拟实验中得到说明，包括动态障碍物避让、需要交替前进和倒车的路径点跟踪以及倒车停车场景。

英文摘要

This paper proposes online sampling in the parameter space of a neural network for GPU-accelerated motion planning of autonomous vehicles. Neural networks are used as controller parametrization since they can handle nonlinear non-convex systems and their complexity does not scale with prediction horizon length. Network parametrizations are sampled at each sampling time and then held constant throughout the prediction horizon. Controls still vary over the prediction horizon due to varying feature vectors fed to the network. Full-dimensional vehicles are modeled by polytopes. Under the assumption of obstacle point data, and their extrapolation over a prediction horizon under constant velocity assumption, collision avoidance reduces to linear inequality checks. Steering and longitudinal acceleration controls are determined simultaneously. The proposed method is designed for parallelization and therefore well-suited to benefit from continuing advancements in hardware such as GPUs. Characteristics of proposed method are illustrated in 5 numerical simulation experiments including dynamic obstacle avoidance, waypoint tracking requiring alternating forward and reverse driving with maximal steering, and a reverse parking scenario.

URL PDF HTML ☆

赞 0 踩 0

1904.06524 2026-06-04 cs.RO cs.SY eess.SY 版本更新

On Model Adaptation for Sensorimotor Control of Robots

关于机器人传感器运动控制的模型适应

David Navarro-Alarcon, Andrea Cherubini, Xiang Li

发表机构 * The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong（香港理工大学）； University of Montpellier / LIRMM, Montpellier, France（蒙彼利埃大学 / LIRMM）； The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong（香港中文大学）

AI总结本文研究了如何计算用于引导具有不确定动作-感知关系的机器人系统运动的自适应传感器运动模型，通过两个案例研究展示了所提出的方法：变形物体的形状控制和未校准传感器下的超声探头软操控。

Comments 38th Chinese Control Conference

1904.05814 2026-06-04 cs.CV cs.GR cs.LG cs.NA cs.RO math.NA 版本更新

多相刚体接触动力学中的微分动态规划

Rohan Budhiraja, Justin Carpentier, Carlos Mastalli, Nicolas Mansard

发表机构 * CNRS, LAAS（法国国家科学研究中心，拉拉斯研究所）； INRIA, France（法国国家信息与自动化研究所，法国）

AI总结本文提出使用微分动态规划算法来优化多相刚体接触动力学的全身轨迹，通过利用角动量提高运动效率，减少力和冲击，并在无外力情况下实现姿态控制。

Comments 6 pages, IEEE RAS International Conference on Humanoid Robots

详情

DOI: 10.1109/HUMANOIDS.2018.8624925

AI中文摘要

当今生成高效运动的常见策略是将问题分解为两个连续步骤：第一步生成接触序列和质心轨迹，第二步计算遵循质心模式的全身轨迹。然而，第二步通常由简单的程序如逆运动学求解器处理。相反，我们提出使用局部最优控制求解器，即微分动态规划（DDP），来计算全身轨迹。我们的方法通过利用角动量产生更高效的运动，具有较低的力和较小的冲击。为此，我们提出了一种原始的DDP公式，利用刚体接触模型的Kuhn-Tucker约束。通过在真实HRP-2机器人上执行大步行走和无外力情况下的姿态控制问题，我们实验性地展示了这种方法的重要性。

英文摘要

A common strategy today to generate efficient locomotion movements is to split the problem into two consecutive steps: the first one generates the contact sequence together with the centroidal trajectory, while the second one computes the whole-body trajectory that follows the centroidal pattern. Yet the second step is generally handled by a simple program such as an inverse kinematics solver. In contrast, we propose to compute the whole-body trajectory by using a local optimal control solver, namely Differential Dynamic Programming (DDP). Our method produces more efficient motions, with lower forces and smaller impacts, by exploiting the Angular Momentum (AM). With this aim, we propose an original DDP formulation exploiting the Karush-Kuhn-Tucker constraint of the rigid contact model. We experimentally show the importance of this approach by executing large steps walking on the real HRP-2 robot, and by solving the problem of attitude control under the absence of external forces.

URL PDF HTML ☆

赞 0 踩 0

1904.04600 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Hierarchical Planning of Dynamic Movements without Scheduled Contact Sequences

无调度接触序列的动态运动分层规划

Carlos Mastalli, Ioannis Havoutis, Michele Focchi, Darwin G. Caldwell, Claudio Semini

发表机构 * Department of Advanced Robotics, Istituto Italiano di Tecnologia（意大利技术研究院先进机器人部）； Robot Learning and Interaction Group, Idiap Research Institute（伊迪普研究所机器人学习与交互小组）

AI总结本文提出了一种分层轨迹优化方法，用于规划无需调度接触序列的动态运动，通过计算能够实现无法通过刚体运动达到的目标的全身运动，首先根据机器人的质心动力学找到可行的质心运动，然后通过应用完整的动力学模型进行优化，利用可行的质心轨迹作为预热起点，通过互补约束描述接触模型，即环境几何和非滑动主动接触，两个优化阶段均作为互补约束数学规划问题（MPCC）进行求解，实验表明该规划方法在一系列具有挑战性的任务中表现出色。

Comments 6 pages, IEEE International Conference on Robotics and Automation (ICRA)

详情

DOI: 10.1109/ICRA.2016.7487664

AI中文摘要

大多数动物和人类完成复杂任务的运动行为涉及动态运动和丰富的接触交互。事实上，复杂操作需要同时考虑动态运动和接触事件。我们提出了一种分层轨迹优化方法，用于规划具有无调度接触序列的动态运动。我们计算出能够实现无法通过刚体运动达到的目标的全身运动。首先，我们根据机器人的质心动力学找到可行的质心运动。然后，我们通过应用机器人的完整动力学模型来优化解决方案，其中可行的质心轨迹用作预热起点。为了实现无调度的接触行为，我们使用互补约束来描述接触模型，即环境几何和非滑动主动接触。两个优化阶段均被提出为互补约束数学规划问题（MPCC）。实验测试展示了我们的规划方法在一系列具有挑战性的任务中的性能。

英文摘要

Most animal and human locomotion behaviors for solving complex tasks involve dynamic motions and rich contact interaction. In fact, complex maneuvers need to consider dynamic movement and contact events at the same time. We present a hierarchical trajectory optimization approach for planning dynamic movements with unscheduled contact sequences. We compute whole-body motions that achieve goals that cannot be reached in a kinematic fashion. First, we find a feasible CoM motion according to the centroidal dynamics of the robot. Then, we refine the solution by applying the robot's full-dynamics model, where the feasible CoM trajectory is used as a warm-start point. To accomplish the unscheduled contact behavior, we use complementarity constraints to describe the contact model, i.e. environment geometry and non-sliding active contacts. Both optimization phases are posed as Mathematical Program with Complementarity Constraints (MPCC). Experimental trials demonstrate the performance of our planning approach in a set of challenging tasks.

URL PDF HTML ☆

赞 0 踩 0

1904.04595 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Simultaneous Contact, Gait and Motion Planning for Robust Multi-Legged Locomotion via Mixed-Integer Convex Optimization

通过混合整数凸优化实现鲁棒多足运动的同步接触、步态和运动规划

Bernardo Aceituno-Cabezas, Carlos Mastalli, Hongkai Dai, Michele Focchi, Andreea Radulescu, Darwin G. Caldwell, Jose Cappelletto, Juan C. Grieco, Gerardo Fernandez-Lopez, Claudio Semini

发表机构 * School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332 USA（电气与计算机工程系，佐治亚理工学院，亚特兰大，GA 30332 USA）； Twentieth Century Fox, Springfield, USA（二十世纪福克斯，斯普林菲尔德，USA）； Starfleet Academy, San Francisco, CA 96678 USA（星际舰队学院，旧金山，CA 96678 USA）； Tyrell Inc., 123 Replicant Street, Los Angeles, CA 90210 USA（泰勒尔公司，123 复制人街，洛杉矶，CA 90210 USA）

AI总结本文提出了一种混合整数凸优化方法，用于同时规划多足机器人的接触位置、步态转换和运动，以提高运动的通用性并保持低计算时间。

Comments 8 pages, IEEE Robotics and Automation Letters

详情

DOI: 10.1109/LRA.2017.2779821

AI中文摘要

传统多足运动规划方法将问题分为多个阶段，如接触搜索和轨迹生成。然而，同时考虑接触和运动对于生成复杂的全身行为至关重要。目前，将这些问题耦合在一起需要假设固定的步态序列和平坦地形条件，或者使用非凸优化，计算时间不可行。本文提出了一种混合整数凸公式，以高效的方式同时规划接触位置、步态转换和运动。与之前的工作不同，我们的方法不限于平坦地形或预设的步态序列。相反，我们纳入摩擦锥稳定性边际，近似机器人扭矩限制，并使用混合整数凸约束规划步态。我们通过在HyQ机器人上实验验证了我们的方法，穿越了不同具有挑战性的地形，其中非凸性和平坦地形假设可能导致次优或不稳定计划。我们的方法在保持低计算时间的同时提高了运动的通用性。

英文摘要

Traditional motion planning approaches for multi-legged locomotion divide the problem into several stages, such as contact search and trajectory generation. However, reasoning about contacts and motions simultaneously is crucial for the generation of complex whole-body behaviors. Currently, coupling theses problems has required either the assumption of a fixed gait sequence and flat terrain condition, or non-convex optimization with intractable computation time. In this paper, we propose a mixed-integer convex formulation to plan simultaneously contact locations, gait transitions and motion, in a computationally efficient fashion. In contrast to previous works, our approach is not limited to flat terrain nor to a pre-specified gait sequence. Instead, we incorporate the friction cone stability margin, approximate the robot's torque limits, and plan the gait using mixed-integer convex constraints. We experimentally validated our approach on the HyQ robot by traversing different challenging terrains, where non-convexity and flat terrain assumptions might lead to sub-optimal or unstable plans. Our method increases the motion generality while keeping a low computation time.

URL PDF HTML ☆

赞 0 踩 0

1904.03742 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Nonlinear Model Predictive Control for 3D Formation of Multirotor Micro Aerial Vehicles with Relative Sensing in Local Coordinates

多旋翼微型飞行器在局部坐标系中基于相对感知的三维编队非线性模型预测控制

I. Kagan Erunsal, Rodrigo Ventura, Alcherio Martinoli

发表机构 * Institute for Systems and Robotics (ISR), IST, Lisbon, Portugal（系统与机器人研究所（ISR），IST，里斯本，葡萄牙）

AI总结本文提出了一种基于相对感知信息的多旋翼微型飞行器三维编队控制方法，采用集中式非线性模型预测控制策略，通过引入六自由度数学模型实现了对编队的鲁棒控制。

Comments 8 pages, 10 figures, IROS'2019 (submitted)

详情

AI中文摘要

复杂的任务如监视、建设、搜索和救援可以受益于多旋翼微型飞行器（MAVs）的机动性，以获得稳健、协作的系统行为和编队控制是这些复杂任务的重要组成部分。本文聚焦于利用仅有的相对传感信息实现多旋翼MAVs的三维编队控制。它提出了一种集中式非线性模型预测控制（NMPC）方法，在领导者-追随者方案中。介绍了一个现实的六自由度数学模型，并用于控制律。问题的制定基于NMPC和相对传感框架，相对于机器人的局部坐标系。这种制定使编队不依赖于全局或共同参考系的完整知识以及昂贵的全局定位传感器。通过考虑新的制定，提出了基于实时迭代（RTI）的最优控制问题（OCP）的解决方案。设计了一个广泛的场景来测试和验证该策略。结果评估表明，在模型不确定性和本地传感器噪声以及编队动态突然变化的情况下，取得了令人满意的鲁棒性能。

英文摘要

The complex tasks such as surveillance, construction, search and rescue can benefit of the maneuverability of multirotor Micro Aerial Vehicles (MAVs) to obtain robust, cooperative system behavior and formation control is a prominent component of the these complex tasks. This work focuses on the problem of three-dimensional formation control of multirotor MAVs by using exclusively relative sensory information. It proposes a centralized Nonlinear Model Predictive Control (NMPC) approach in a leader-follower scheme. A realistic six degrees of freedom mathematical model of a multirotor MAVs is introduced and leveraged in the control laws. The formulation of the problem is performed based on NMPC and relative sensing framework with respect to local coordinate frames of the robots. This type of formulation makes the formation independent of the full knowledge of global or common reference frames and the utilization of expensive global localization sensors. Real-time Iteration (RTI) based solution to optimal control problem (OCP) is proposed by taking the novel formulation into account. An extensive scenario is designed to test and validate the strategy. Evaluation of the results suggests that satisfactory robust performance is achieved and maintained under model uncertainty and noise in local sensors and even in cases where the dynamics of the formation suddenly changes.

URL PDF HTML ☆

赞 0 踩 0

1904.03665 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Learning to Control Highly Accelerated Ballistic Movements on Muscular Robots

学习控制高加速度的球形运动在肌肉机器人上

Dieter Büchler, Roberto Calandra, Jan Peters

发表机构 * Max Planck Institute for Intelligent Systems（智能系统马克斯·普朗克研究所）； Facebook AI Research（脸书人工智能研究）

AI总结本文研究了如何通过学习方法提高肌肉机器人在高速高加速度运动中的控制精度，提出了一种四自由度的机器人臂，利用气动人工肌肉实现高关节角加速度，并通过贝叶斯优化直接在硬件上调整控制参数，从而在快速轨迹上实现了优于以往的结果。

Comments 12 pages, preprint submitted to Journal of Robotics and Autonomous Systems

详情

AI中文摘要

高速和高加速度的运动本质上很难控制。在人形机器人臂上应用学习方法来控制此类运动可以提高控制的准确性，但可能会损害系统。学习方法的内在探索可能导致不稳定性和机器人在高速下达到关节极限。因此，具有安全探索高速和高加速度运动硬件的需求是必要的。为了解决这个问题，我们提出使用由气动人工肌肉（PAMs）驱动的机器人。在本文中，我们展示了一种四自由度的机器人臂，能够达到高达28000度/秒²的关节角加速度，同时通过拮抗驱动和空气压力范围限制避免危险的关节极限。利用这种机器人臂，我们能够通过贝叶斯优化直接在硬件上调整控制参数，而无需额外的安全考虑。在快速轨迹上的跟踪性能超过了以往在类似PAM驱动机器人上的结果。我们还展示了由于电缆弯曲最小、轻量级动力学和PAMs与链接之间的最小接触等精心设计考虑，系统能够使用PID控制器在慢速轨迹上良好控制。最后，我们提出了一种新的技术来控制拮抗肌肉对的协同收缩。实验结果表明，选择最佳的协同收缩水平对于达到更好的跟踪性能至关重要。通过使用PAM驱动机器人和学习，我们朝着未来能够实现更像人类运动的机器人发展迈出了小一步。

英文摘要

High-speed and high-acceleration movements are inherently hard to control. Applying learning to the control of such motions on anthropomorphic robot arms can improve the accuracy of the control but might damage the system. The inherent exploration of learning approaches can lead to instabilities and the robot reaching joint limits at high speeds. Having hardware that enables safe exploration of high-speed and high-acceleration movements is therefore desirable. To address this issue, we propose to use robots actuated by Pneumatic Artificial Muscles (PAMs). In this paper, we present a four degrees of freedom (DoFs) robot arm that reaches high joint angle accelerations of up to 28000 deg/s^2 while avoiding dangerous joint limits thanks to the antagonistic actuation and limits on the air pressure ranges. With this robot arm, we are able to tune control parameters using Bayesian optimization directly on the hardware without additional safety considerations. The achieved tracking performance on a fast trajectory exceeds previous results on comparable PAM-driven robots. We also show that our system can be controlled well on slow trajectories with PID controllers due to careful construction considerations such as minimal bending of cables, lightweight kinematics and minimal contact between PAMs and PAMs with the links. Finally, we propose a novel technique to control the the co-contraction of antagonistic muscle pairs. Experimental results illustrate that choosing the optimal co-contraction level is vital to reach better tracking performance. Through the use of PAM-driven robots and learning, we do a small step towards the future development of robots capable of more human-like motions.

URL PDF HTML ☆

赞 0 踩 0

1811.07049 2026-06-04 cs.RO cs.SY eess.SY 版本更新

RMPflow: A Computational Graph for Automatic Motion Policy Generation

RMPflow：一种用于自动运动策略生成的计算图

Ching-An Cheng, Mustafa Mukadam, Jan Issac, Stan Birchfield, Dieter Fox, Byron Boots, Nathan Ratliff

发表机构 * NVIDIA, Seattle Robotics Lab（NVIDIA西雅图机器人实验室）； Georgia Institute of Technology, Robot Learning Lab（佐治亚理工学院机器人学习实验室）； University of Washington, Robotics and State Estimation Lab（华盛顿大学机器人与状态估计实验室）

AI总结本文提出了一种基于Riemannian Motion Policies几何一致变换的新型策略合成算法RMPflow，通过结合局部策略生成全局策略并利用稀疏结构提高计算效率，同时研究了其几何性质并验证了在高自由度 manipulation 系统中通过障碍物规划的简化效果。

Comments WAFR 2018

1809.02472 2026-06-04 eess.SY cs.RO cs.SY 版本更新

An Analytical Design Optimization Method for Electric Propulsion Systems of Multicopter UAVs with Desired Hovering Endurance

多旋翼无人机电推进系统分析设计优化方法

Xunhua Dai, Quan Quan, Jinrui Ren, Kai-Yuan Cai

AI总结本文提出了一种简单实用的方法，帮助设计师根据给定的设计要求找到最优的推进系统，通过研究四个基本组件的建模方法，将整体优化设计问题分解为多个子问题，从而获得各组件的最佳参数，并快速从相应数据库中确定最优产品。

详情

DOI: 10.1109/TMECH.2019.2890901
Journal ref: IEEE/ASME Transactions on Mechatronics, 2019, 24: 228-239

AI中文摘要

多旋翼无人机在民用和军事领域正变得越来越重要。目前，大多数多旋翼推进系统是通过经验和试错实验设计的，这成本高且效率低。本文提出了一种简单实用的方法，帮助设计师根据给定的设计要求找到最优的推进系统。首先，分别研究了推进系统四个基本组件（螺旋桨、电机、电子调速器和电池）的建模方法。其次，将整体优化设计问题简化并分解为几个子问题。通过解决这些子问题，可以分别获得各组件的最佳参数。最后，基于获得的最佳组件参数，可以快速从相应数据库中确定每个组件的最优产品。实验和统计分析证明了所提方法的有效性。

英文摘要

Multicopters are becoming increasingly important in both civil and military fields. Currently, most multicopter propulsion systems are designed by experience and trial-and-error experiments, which are costly and ineffective. This paper proposes a simple and practical method to help designers find the optimal propulsion system according to the given design requirements. First, the modeling methods for four basic components of the propulsion system including propellers, motors, electric speed controls, and batteries are studied respectively. Secondly, the whole optimization design problem is simplified and decoupled into several sub-problems. By solving these sub-problems, the optimal parameters of each component can be obtained respectively. Finally, based on the obtained optimal component parameters, the optimal product of each component can be quickly located and determined from the corresponding database. Experiments and statistical analyses demonstrate the effectiveness of the proposed method.

URL PDF HTML ☆

赞 0 踩 0

1904.02765 2026-06-04 cs.RO cs.LG cs.SY eess.SY math.OC 版本更新

Intent-Aware Probabilistic Trajectory Estimation for Collision Prediction with Uncertainty Quantification

意图感知的概率轨迹估计用于碰撞预测与不确定性量化

Andrew Patterson, Arun Lakshmanan, Naira Hovakimyan

发表机构 * Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign（伊利诺伊大学厄巴纳-香槟分校机械科学与工程系）

AI总结本文提出了一种基于高斯过程的概率轨迹估计方法，用于在不确定环境中预测碰撞，通过概率方法替代确定性假设，以考虑更广泛的障碍物类型，并通过案例研究展示了在有限障碍物行为知识下预测碰撞的能力。

详情

AI中文摘要

在动态和未知的环境中，碰撞预测依赖于对环境变化的理解。许多碰撞预测方法依赖于对障碍物运动的确定性知识，但完全确定性的障碍物运动知识往往不可用。本文提出了一种基于高斯过程的预测方法，用概率知识替代对每个障碍物未来行为的确定性假设，以考虑更广泛的障碍物。该方法仅依赖位置和速度测量来预测与动态障碍物的碰撞。我们证明，障碍物位置的不确定性区域可以表示为通过高斯过程回归生成的多项式的组合。为了控制任意时间范围内不确定性的增长，假设概率障碍物意图作为障碍物位置和速度的分布，这可以自然地包含在高斯过程框架中。我们的方法在两个案例研究中得到验证：(i) 障碍物超越代理；(ii) 障碍物垂直穿过代理的路径。在这些模拟中，我们展示了即使在有限的障碍物行为知识下也能预测碰撞。

英文摘要

Collision prediction in a dynamic and unknown environment relies on knowledge of how the environment is changing. Many collision prediction methods rely on deterministic knowledge of how obstacles are moving in the environment. However, complete deterministic knowledge of the obstacles' motion is often unavailable. This work proposes a Gaussian process based prediction method that replaces the assumption of deterministic knowledge of each obstacle's future behavior with probabilistic knowledge, to allow a larger class of obstacles to be considered. The method solely relies on position and velocity measurements to predict collisions with dynamic obstacles. We show that the uncertainty region for obstacle positions can be expressed in terms of a combination of polynomials generated with Gaussian process regression. To control the growth of uncertainty over arbitrary time horizons, a probabilistic obstacle intention is assumed as a distribution over obstacle positions and velocities, which can be naturally included in the Gaussian process framework. Our approach is demonstrated in two case studies in which (i), an obstacle overtakes the agent and (ii), an obstacle crosses the agent's path perpendicularly. In these simulations we show that the collision can be predicted despite having limited knowledge of the obstacle's behavior.

URL PDF HTML ☆

赞 0 踩 0

1904.02341 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Online Risk-Bounded Motion Planning for Autonomous Vehicles in Dynamic Environments

在线风险受限的自主车辆动态环境中的运动规划

Xin Huang, Sungkweon Hong, Andreas Hofmann, Brian C. Williams

发表机构 * MIT Computer Science and Artificial Intelligence Laboratory（麻省理工学院计算机科学与人工智能实验室）

AI总结本文提出了一种在线风险受限的运动规划方法，通过结合意图识别算法和POMDP求解器，生成安全高效的路径规划方案，尤其在无保护左转和变道等复杂环境中表现更优。

Comments Accepted at ICAPS'19. 10 pages, 6 figures, 1 table

详情

AI中文摘要

高效且稳健的自主车辆运动规划面临的关键挑战是理解周围代理的意图。忽略动态环境中其他代理的意图会导致风险或过于保守的规划。本文将运动规划问题建模为部分可观测马尔可夫决策过程（POMDP），并提出一个在线系统，结合意图识别算法和POMDP求解器，为自主车辆生成风险受限的路径规划。意图识别算法利用贝叶斯过滤和预学习的机动运动模型，预测每个代理车辆在有限时间 horizon 内的混合运动状态。我们实时更新POMDP模型，并使用启发式搜索算法求解，生成具有碰撞概率上界保证的策略。我们证明，与基线方法相比，我们的系统在多个具有挑战性的环境中，能够生成更高效和安全的运动规划。

英文摘要

A crucial challenge to efficient and robust motion planning for autonomous vehicles is understanding the intentions of the surrounding agents. Ignoring the intentions of the other agents in dynamic environments can lead to risky or over-conservative plans. In this work, we model the motion planning problem as a partially observable Markov decision process (POMDP) and propose an online system that combines an intent recognition algorithm and a POMDP solver to generate risk-bounded plans for the ego vehicle navigating with a number of dynamic agent vehicles. The intent recognition algorithm predicts the probabilistic hybrid motion states of each agent vehicle over a finite horizon using Bayesian filtering and a library of pre-learned maneuver motion models. We update the POMDP model with the intent recognition results in real time and solve it using a heuristic search algorithm which produces policies with upper-bound guarantees on the probability of near colliding with other dynamic agents. We demonstrate that our system is able to generate better motion plans in terms of efficiency and safety in a number of challenging environments including unprotected intersection left turns and lane changes as compared to the baseline methods.

URL PDF HTML ☆

赞 0 踩 0

1903.09749 2026-06-04 cs.RO cs.SY eess.SY math.DS math.OC 版本更新

Passivity guaranteed stiffness control with multiple frequency band specifications for a cable-driven series elastic actuator

具有多频率带规范的电缆驱动串联弹性执行器的被动保证刚度控制

Ningbo Yu, Wulin Zou, Yubo Sun

发表机构 * Tianjin Key Laboratory of Intelligent Robotics, Nankai University（天津智能机器人重点实验室，南开大学）

AI总结本文针对电缆驱动串联弹性执行器的刚度控制问题，提出了一种基于H∞综合方法的改进方案，通过在特定频率带内满足被动性、执行器限制、扰动抑制和噪声抑制等约束条件，提升了刚度控制的精度和鲁棒性。

Comments 10 pages, already published in Mechanical Systems and Signal Processing

详情

DOI: 10.1016/j.ymssp.2018.08.007

AI中文摘要

阻抗控制和特别是刚度控制被广泛应用于物理人机交互。串联弹性执行器（SEA）提供了固有的柔顺性、安全性和进一步的好处。本文旨在改进电缆驱动SEA的刚度控制性能。现有的阻抗控制器是在全频域内设计的，尽管人机交互通常发生在低频范围内。我们通过制定的被动性、执行器限制、扰动抑制和特定频率范围内的噪声抑制约束条件来增强刚度渲染性能。首先，我们将多频率带优化问题重新公式化为H∞综合框架。然后，性能目标通过各自受限频域规范作为范数界限来定量描述。进一步，直接综合出一个结构化的控制器以满足所有竞争性性能要求。仿真和实验结果表明，所生成的控制器能够为每个期望的刚度（从0到1倍的物理弹簧常数）提供良好的交互性能。与基于被动性的PID方法相比，所提出的H∞综合方法在保证被动性的前提下实现了更精确和鲁棒的刚度控制性能。

英文摘要

Impedance control and specifically stiffness control are widely applied for physical human-robot interaction. The series elastic actuator (SEA) provides inherent compliance, safety and further benefits. This paper aims to improve the stiffness control performance of a cable-driven SEA. Existing impedance controllers were designed within the full frequency domain, though human-robot interaction commonly falls in the low frequency range. We enhance the stiffness rendering performance under formulated constraints of passivity, actuator limitation, disturbance attenuation, noise rejection at their specific frequency ranges. Firstly, we reformulate this multiple frequency-band optimization problem into the $H_\infty$ synthesis framework. Then, the performance goals are quantitatively characterized by respective restricted frequency-domain specifications as norm bounds. Further, a structured controller is directly synthesized to satisfy all the competing performance requirements. Both simulation and experimental results showed that the produced controller enabled good interaction performance for each desired stiffness varying from 0 to 1 times of the physical spring constant. Compared with the passivity-based PID method, the proposed $H_\infty$ synthesis method achieved more accurate and robust stiffness control performance with guaranteed passivity.

URL PDF HTML ☆

赞 0 踩 0

1904.00035 2026-06-04 cs.RO cs.LG cs.SY eess.SY stat.ML 版本更新

Autonomous Highway Driving using Deep Reinforcement Learning

使用深度强化学习实现自动驾驶高速公路驾驶

Subramanya Nageshrao, Eric Tseng, Dimitar Filev

发表机构 * Ford Greenfield Labs（福特绿谷实验室）； Ford Research and Innovation Center（福特研究与创新中心）

AI总结本文提出了一种基于强化学习的方法，通过与模拟交通直接交互，使自动驾驶车辆在复杂和多变的环境中做出决策，解决了传统规则和预设成本函数在实时优化中的不足，提高了学习效率和安全性。

详情

AI中文摘要

自动驾驶车辆的操作空间可以是多样的，并且可能显著变化。这可能导致设计阶段未预料到的场景。因此，基于规则的决策者选择动作可能并不理想。同样，设计一个先验成本函数然后在实时中求解最优控制问题可能也不够有效。为了应对这些问题并避免在遇到意外场景时出现异常行为，我们提出了一种基于强化学习（RL）的方法，其中自动驾驶车辆通过与模拟交通直接交互来学习决策。决策者由深度神经网络实现，根据给定的系统状态提供动作选择。在关键应用如驾驶中，没有明确安全概念的RL代理可能无法收敛，或者需要极大量的样本才能找到可靠的策略。为了更好地解决这个问题，本文将强化学习与额外的短时间安全检查（SC）相结合。在关键场景中，安全检查还将为代理提供替代的安全动作，如果存在的话。这导致了两个新的贡献。首先，它扩展了可能导致不良“接近事件”或“碰撞”的状态。其次，安全检查的加入可以提供一个安全且稳定的训练环境。这显著提高了学习效率，同时不抑制有意义的探索，以确保安全和最优的学习行为。我们展示了所开发算法在高速公路驾驶场景中的性能，其中训练好的自动驾驶车辆在高速公路环境下遇到不同交通密度的情况。

英文摘要

The operational space of an autonomous vehicle (AV) can be diverse and vary significantly. This may lead to a scenario that was not postulated in the design phase. Due to this, formulating a rule based decision maker for selecting maneuvers may not be ideal. Similarly, it may not be effective to design an a-priori cost function and then solve the optimal control problem in real-time. In order to address these issues and to avoid peculiar behaviors when encountering unforeseen scenario, we propose a reinforcement learning (RL) based method, where the ego car, i.e., an autonomous vehicle, learns to make decisions by directly interacting with simulated traffic. The decision maker for AV is implemented as a deep neural network providing an action choice for a given system state. In a critical application such as driving, an RL agent without explicit notion of safety may not converge or it may need extremely large number of samples before finding a reliable policy. To best address the issue, this paper incorporates reinforcement learning with an additional short horizon safety check (SC). In a critical scenario, the safety check will also provide an alternate safe action to the agent provided if it exists. This leads to two novel contributions. First, it generalizes the states that could lead to undesirable "near-misses" or "collisions ". Second, inclusion of safety check can provide a safe and stable training environment. This significantly enhances learning efficiency without inhibiting meaningful exploration to ensure safe and optimal learned behavior. We demonstrate the performance of the developed algorithm in highway driving scenario where the trained AV encounters varying traffic density in a highway setting.

URL PDF HTML ☆

赞 0 踩 0

1903.11204 2026-06-04 cs.RO cs.SY eess.SY math.OC 版本更新

Priority Maps for Surveillance and Intervention of Wildfires and other Spreading Processes

优先地图用于监视和干预野火及其他扩散过程

Vera L. J. Somers, Ian R. Manchester

发表机构 * Australian Centre for Field Robotics（澳大利亚田径场机器人研究中心）； University of Sydney（悉尼大学）

AI总结本文提出了一种生成优先地图的方法，用于监视或干预动态扩散过程，如野火。该方法利用正系统性质，特别是价值函数的分离结构，提供可扩展的算法。通过16和1000节点示例展示了方法如何响应系统动态变化，并结合旅行商问题进行无人机路径规划。

Comments Accepted for ICRA 2019

1903.09890 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Physics-Based Freely Scalable Continuum Deformation for UAS Traffic Coordination

基于物理的自由可扩展连续变形用于无人机交通协调

Hossein Rastgoftar, Ella Atkins

AI总结本文提出了一种新的基于物理的交通协调方法，并将其应用于无人机（UAS）交通管理。该方法扩展了之前应用于高速公路和城市街道1-D交通流的基于物理的方法，以支持更高维度空域中的交通协调模型，特别是在没有预定义路径的情况下。论文将空域视为有限控制体积，而无人机协调（视为连续变形）在空域边界处进行控制。通过将空域划分为计划和非计划空间，论文将计划空域中的名义协调建模为带有时空参数的偏微分方程的解。此外，本文改进了对车辆故障的韧性，通过一种鲁棒边界控制算法来更新计划空间的几何形状，当无人机问题威胁到现有可航行空域通道的安全协调时。为了支持微观层面的无人机协调，我们提出根据车辆性能限制对车辆进行聚类。无人机集群，每个无人机视为虚拟刚体的粒子，使用领导者-追随者包容策略来获取宏观期望轨迹。

Comments 11 pages; 9 figures; submitted for publication

详情

AI中文摘要

本文开发了一种新颖的基于物理的交通协调方法，并将其应用于无人机（UAS）交通管理。我们扩展了之前应用于高速公路和城市街道1-D交通流的基于物理的方法，以支持更高维度空域中的交通协调模型，特别是在没有预定义路径的情况下。论文将空域视为有限控制体积，而无人机协调（视为连续变形）在空域边界处进行控制。通过将空域划分为计划和非计划空间，论文将计划空域中的名义协调建模为带有时空参数的偏微分方程的解。本文还改进了对车辆故障的韧性，通过一种鲁棒边界控制算法来更新计划空间的几何形状，当无人机问题威胁到现有可航行空域通道的安全协调时。为了支持微观层面的无人机协调，我们提出根据车辆性能限制对车辆进行聚类。无人机集群，每个无人机视为虚拟刚体的粒子，使用领导者-追随者包容策略来获取宏观期望轨迹。

英文摘要

This paper develops a novel physics-inspired traffic coordination approach and applies it to Unmanned Aircraft System (UAS) traffic management. We extend available physics-inspired approaches previously applied to 1-D traffic flow on highways and urban streets to support models of traffic coordination in higher dimension airspace for cases where no predefined paths exist. The paper considers airspace as a finite control volume while UAS coordination, treated as continuum deformation, is controlled at the airspace boundaries. By partitioning airspace into planned and unplanned spaces, the paper models nominal coordination in the planned airspace as the solution of a partial differential equation with spatiotemporal parameters. This paper also improves resilience to vehicle failures with a resilient boundary control algorithm to update the geometry of the planned space when UAS problems threaten safe coordination in existing navigable airspace channels. To support UAS coordination at the microscopic level, we propose clustering vehicles based on vehicle performance limits. UAS clusters, with each UAS treated as a particle of a virtual rigid body, use leader-follower containment to acquire the macroscopic desired trajectory.

URL PDF HTML ☆

赞 0 踩 0

1903.09748 2026-06-04 cs.RO cs.SY eess.SY math.DS math.OC 版本更新

Impedance control of a cable-driven SEA with mixed $H_2/H_\infty$ synthesis

电缆驱动串联弹性执行器的阻抗控制：混合H2/H∞综合方法

Ningbo Yu, Wulin Zou

发表机构 * Institute of Robotics and Automatic Information Systems, Nankai University（机器人与自动信息系统研究所，南开大学）； Tianjin Key Laboratory of Intelligent Robotics, Nankai University（天津智能机器人重点实验室，南开大学）

AI总结本文提出了一种基于混合H2/H∞综合和放松被动性的电缆驱动串联弹性执行器的阻抗控制方法，用于物理人机交互。

Comments 11 pages, already published in Assembly Automation

详情

Journal ref: Assembly Automation, Vol. 37, Issue: 3, pp.296-303, 2017

AI中文摘要

目的：本文提出了一种混合H2/H∞综合和放松被动性的阻抗控制方法，用于电缆驱动串联弹性执行器，以应用于物理人机交互。设计/方法/研究途径：为了使系统的阻抗匹配所需的动态模型，将阻抗控制问题重新公式化为阻抗匹配结构。所需的竞争性能要求以及来自物理系统的约束可以通过针对各自信号的加权函数来表征。考虑到人类运动的频率特性，被动约束对于稳定的人机交互，其在整个频谱上要求，可能会带来保守的解决方案，已被放松成仅限制低频带。因此，阻抗控制成为混合H2/H∞综合问题，并可以得到动态输出反馈控制器。发现：所提出的阻抗控制策略已针对各种期望的阻抗进行了测试，包括在电缆驱动串联弹性执行器平台上进行的仿真和实验。实际的交互扭矩在期望的范数范围内良好跟踪了期望的扭矩，且控制输入被调节在电机速度限制以下。闭环系统可以在低频上保证放松的被动性。仿真和实验结果都验证了所提出方法的可行性和有效性。原创性/价值：这种基于混合H2/H∞综合和放松被动性的阻抗控制策略提供了一种新颖、有效且更少保守的方法用于物理人机交互控制。

英文摘要

Purpose: This paper presents an impedance control method with mixed $H_2/H_\infty$ synthesis and relaxed passivity for a cable-driven series elastic actuator to be applied for physical human-robot interaction. Design/methodology/approach: To shape the system's impedance to match a desired dynamic model, the impedance control problem was reformulated into an impedance matching structure. The desired competing performance requirements as well as constraints from the physical system can be characterized with weighting functions for respective signals. Considering the frequency properties of human movements, the passivity constraint for stable human-robot interaction, which is required on the entire frequency spectrum and may bring conservative solutions, has been relaxed in such a way that it only restrains the low frequency band. Thus, impedance control became a mixed $H_2/H_\infty$ synthesis problem, and a dynamic output feedback controller can be obtained. Findings: The proposed impedance control strategy has been tested for various desired impedance with both simulation and experiments on the cable-driven series elastic actuator platform. The actual interaction torque tracked well the desired torque within the desired norm bounds, and the control input was regulated below the motor velocity limit. The closed loop system can guarantee relaxed passivity at low frequency. Both simulation and experimental results have validated the feasibility and efficacy of the proposed method. Originality/value: This impedance control strategy with mixed $H_2/H_\infty$ synthesis and relaxed passivity provides a novel, effective and less conservative method for physical human-robot interaction control.

URL PDF HTML ☆

赞 0 踩 0

1903.09673 2026-06-04 cs.RO cs.SY eess.SY math.DS math.OC 版本更新

Compliance Shaping for Control of Strength Amplification Exoskeletons with Elastic Cuffs

合规性塑形用于具有弹性围裙的强度放大外骨骼控制

Gray Cortright Thomas, Jeremiah M. Coholich, Luis Sentis

发表机构 * Human Centered Robotics Lab in the University of Texas at Austin（德克萨斯大学奥斯汀分校人本机器人实验室）

AI总结本文提出了一种双合规性塑形方法，通过在力敏感围裙中串联弹簧来设计外骨骼的合规行为，以实现高放大比下的稳定性和鲁棒性，同时引入反馈控制器和增益调节方法，并通过单自由度肘部外骨骼验证了方法的有效性。

Comments 8 pages, 9 figures, conference

详情

AI中文摘要

能够放大操作者力量的外骨骼可以实现对未知物体的重载操作。然而，这种行为难以实现，因为外骨骼需要感知并放大操作者交互力的同时保持稳定。然而，放大与连接到操作者时的鲁棒稳定性目标本质上是冲突的。为此，我们引入了一种设计，在力敏感围裙中串联一个弹簧。这允许我们设计出名义上被动的外骨骼合规行为，即使具有高放大比。实际上，时间延迟和离散时间滤波器阻止我们的策略实际上实现被动性，但设计的合规性仍使外骨骼对弹簧状人类行为更具鲁棒性。我们的外骨骼由串联弹性执行器（SEA）驱动，这向系统引入了另一个弹簧。我们表明，为外骨骼塑形围裙的合规性可以近似转化为对SEA弹簧合规性的塑形问题。因此，我们引入了一种反馈控制器和增益调节方法，利用现有的SEAs合规性塑形技术。我们称之为“双合规性塑形”方法。在大放大比下，此控制器倾向于放大非线性传动摩擦效应，因此我们还提出了“传动扰动观测器”以缓解这一缺点。我们的方法在单自由度肘部外骨骼上进行了验证。

英文摘要

Exoskeletons which amplify the strength of their operators can enable heavy-duty manipulation of unknown objects. However, this type of behavior is difficult to accomplish; it requires the exoskeleton to sense and amplify the operator's interaction forces while remaining stable. But, the goals of amplification and robust stability when connected to the operator fundamentally conflict. As a solution, we introduce a design with a spring in series with the force sensitive cuff. This allows us to design an exoskeleton compliance behavior which is nominally passive, even with high amplification ratios. In practice, time delay and discrete time filters prevent our strategy from actually achieving passivity, but the designed compliance still makes the exoskeleton more robust to spring-like human behaviors. Our exoskeleton is actuated by a series elastic actuator (SEA), which introduces another spring into the system. We show that shaping the cuff compliance for the exoskeleton can be made into approximately the same problem as shaping the spring compliance of an SEA. We therefore introduce a feedback controller and gain tuning method which takes advantage of an existing compliance shaping technique for SEAs. We call our strategy the "double compliance shaping" method. With large amplification ratios, this controller tends to amplify nonlinear transmission friction effects, so we additionally propose a "transmission disturbance observer" to mitigate this drawback. Our methods are validated on a single-degree-of-freedom elbow exoskeleton.

URL PDF HTML ☆

赞 0 踩 0

1903.08781 2026-06-04 cs.AR cs.DC cs.RO cs.SY eess.SY 版本更新

Fault-Tolerant Nanosatellite Computing on a Budget

在预算内实现容错的纳卫星计算

Christian M. Fuchs, Nadia Murillo, Aske Plaat, Erik Van der Kouwe, Daniel Harsono, Todor Stefanov

发表机构 * Leiden Institute of Advanced Computer Science（莱顿先进计算机科学研究所）； Leiden Observatory（莱顿天文台）； Leiden University（莱顿大学）； European Space Agency（欧洲航天局）； Netherlands Research School for Astronomy（荷兰天文研究学校）； Royal Netherlands Academy of Arts and Sciences（荷兰皇家艺术与科学学院）

AI总结本文提出了一种基于线程级粗粒度锁步的软件容错方法，通过故障注入验证，利用FPGA实现 tiled MPSoC 架构，以满足未来科学和商业航天任务的高性能需求，同时提供强故障覆盖保障。

详情

Journal ref: Conference on Radiation Effects on Components and Systems 2018 (RADECS)

AI中文摘要

微卫星和纳卫星已成为各种商业和科学应用的流行平台，但目前主要适用于短时和低优先级空间任务，由于其可靠性较低。部分原因在于其依赖于便宜、低功能尺寸的COTS组件，这些组件最初为嵌入式和移动市场设计，传统硬件投票概念对此无效。软件容错概念已被证明对这类系统有效，但因成熟度低而被航天行业忽视，因为大多数仅在理论中研究。在实践中，载荷仪器和微型卫星的设计者通常被迫牺牲可靠性以满足尖端科学和创新商业应用所需的性能水平。因此，我们开发了一种基于线程级粗粒度锁步的软件容错方法，通过故障注入验证。为了提供强长期故障覆盖，我们的架构实现为FPGA上的tiled MPSoC，利用部分重新配置以及混合关键性。该架构可以在极低成本下满足当前和未来科学和商业航天任务的高性能需求，同时为长期任务提供必要的强故障覆盖保障。该架构是为一个为期4年的ESA项目开发的。与两家工业合作伙伴一起，我们正在开发原型，然后进行辐射测试。

英文摘要

Micro- and nanosatellites have become popular platforms for a variety of commercial and scientific applications, but today are considered suitable mainly for short and low-priority space missions due to their low reliability. In part, this can be attributed to their reliance upon cheap, low-feature size, COTS components originally designed for embedded and mobile-market applications, for which traditional hardware-voting concepts are ineffective. Software-fault-tolerance concepts have been shown effective for such systems, but have largely been ignored by the space industry due to low maturity, as most have only been researched in theory. In practice, designers of payload instruments and miniaturized satellites are usually forced to sacrifice reliability in favor deliver the level of performance necessary for cutting-edge science and innovative commercial applications. Thus, we developed a software-fault-tolerance-approach based upon thread-level coarse-grain lockstep, which was validated using fault-injection. To offer strong long-term fault coverage, our architecture is implemented as tiled MPSoC on an FPGA, utilizing partial reconfiguration, as well as mixed criticality. This architecture can satisfy the high performance requirements of current and future scientific and commercial space missions at very low cost, while offering the strong fault-coverage guarantees necessary for platform control even for missions with a long duration. This architecture was developed for a 4-year ESA project. Together with two industrial partners, we are developing a prototype to then undergo radiation testing.

URL PDF HTML ☆

赞 0 踩 0

1903.07353 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Joint axis estimation for fast and slow movements using weighted gyroscope and acceleration constraints

利用加权陀螺仪和加速度约束的快速和慢速运动联合轴估计

Fredrik Olsson, Thomas Seel, Dustin Lehmann, Kjartan Halvorsen

发表机构 * 1 4 Department of Information Technology, Uppsala University, Uppsala, Sweden ； 2 3 Department of Electrical Engineering ； 4 Department of Mecatronics, Tecnol \'o gico de Monterrey, Mexico City, Mexico

AI总结本文提出了一种结合加速度和陀螺仪约束的新型方法，用于准确估计关节轴，通过调整残差权重和非线性加速度范数差，实现快速和慢速运动中的高精度估计。

Comments 8 pages, 4 figures, 1 table

详情

AI中文摘要

传感器到肢体的校准是惯性运动跟踪中的关键步骤。当两个肢体通过铰链关节连接时，例如在人体膝关节和手指关节以及许多机械臂中，必须在内在传感器坐标系中确定关节轴向量。存在通过求解基于运动关节约束的优化问题来确定这些坐标的办法，这些约束涉及测量的加速度或角速度。在本文中，我们证明仅使用其中一个约束会导致在快速或慢速运动中估计不准确。我们提出了一种基于结合两种约束的成本函数的新方法。通过仅使用加速度计和陀螺仪读数，避免了对均匀磁场的严格假设。为了结合两种传感器的优点，残差权重根据估计信号方差和非线性加速度范数差自动调整。该方法使用来自上肢外骨骼九种不同运动的真实数据进行评估。结果表明，与以往方法不同，所提出的方法在仅五秒后即可对所有快速和慢速运动实现准确的关节轴估计。

英文摘要

Sensor-to-segment calibration is a crucial step in inertial motion tracking. When two segments are connected by a hinge joint, for example in human knee and finger joints as well as in many robotic limbs, then the joint axis vector must be identified in the intrinsic sensor coordinate systems. There exist methods that identify these coordinates by solving an optimization problem that is based on kinematic joint constraints, which involve either the measured accelerations or the measured angular rates. In the current paper we demonstrate that using only one of these constraints leads to inaccurate estimates at either fast or slow motions. We propose a novel method based on a cost function that combines both constraints. The restrictive assumption of a homogeneous magnetic field is avoided by using only accelerometer and gyroscope readings. To combine the advantages of both sensor types, the residual weights are adjusted automatically based on the estimated signal variances and a nonlinear weighting of the acceleration norm difference. The method is evaluated using real data from nine different motions of an upper limb exoskeleton. Results show that, unlike previous approaches, the proposed method yields accurate joint axis estimation after only five seconds for all fast and slow motions.

URL PDF HTML ☆

赞 0 踩 0

1807.06614 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Rapid Trajectory Optimization Using C-FROST with Illustration on a Cassie-Series Dynamic Walking Biped

利用C-FROST实现快速轨迹优化及在Cassie系列动态步行双足机器人上的示例

Ayonga Hereid, Omar Harib, Ross Hartley, Yukai Gong, Jessy W. Grizzle

发表机构 * College of Engineering and the Robotics Institute, University of Michigan（密歇根大学工程学院和机器人研究所）

AI总结本文提出了一种无需去除或合并自由度的方法，通过C-FROST和多线程技术快速确定人类oids的解决方案，并在20自由度的Cassie系列双足机器人浮基模型上进行数值计算和物理实验验证

1903.04706 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Control Barrier Functions for Systems with High Relative Degree

用于高相对次数系统的控制屏障函数

Wei Xiao, Calin Belta

发表机构 * Boston University（波士顿大学）

AI总结本文扩展了控制屏障函数（CBFs）到高阶控制屏障函数（HOCBFs），用于处理高相对次数约束。提出HOCBFs比最近提出的（指数）HOCBFs更通用。我们介绍了高阶屏障函数（HOBF），并展示了其满足Lyapunov-like条件会使得一系列集合的交集向前不变。然后引入HOCBF，并展示任何满足HOCBF约束的控制输入会使一系列集合的交集向前不变。我们提出了具有HOCBF和控制Lyapunov函数（CLF）约束的优化控制问题，并分析了在定义HOCBF时所用的$\mathcal{K}$类函数的选择对可行控制区域大小的影响。我们还提供了一种有前途的方法来解决HOCBF约束与控制限制之间的冲突，通过惩罚$\mathcal{K}$类函数。我们通过自适应巡航控制问题展示了所提出的方法。

Comments 9 pages, 7 figures, submitted to CDC19

详情

AI中文摘要

本文扩展了控制屏障函数（CBFs）到高阶控制屏障函数（HOCBFs），用于处理高相对次数约束。所提出的HOCBFs比最近提出的（指数）HOCBFs更通用。我们介绍了高阶屏障函数（HOBF），并展示了其满足Lyapunov-like条件会使得一系列集合的交集向前不变。然后引入HOCBF，并展示任何满足HOCBF约束的控制输入会使一系列集合的交集向前不变。我们提出了具有HOCBF和控制Lyapunov函数（CLF）约束的优化控制问题，并分析了在定义HOCBF时所用的$\mathcal{K}$类函数的选择对可行控制区域大小的影响。我们还提供了一种有前途的方法来解决HOCBF约束与控制限制之间的冲突，通过惩罚$\mathcal{K}$类函数。我们通过自适应巡航控制问题展示了所提出的方法。

英文摘要

This paper extends control barrier functions (CBFs) to high order control barrier functions (HOCBFs) that can be used for high relative degree constraints. The proposed HOCBFs are more general than recently proposed (exponential) HOCBFs. We introduce high order barrier functions (HOBF), and show that their satisfaction of Lyapunov-like conditions implies the forward invariance of the intersection of a series of sets. We then introduce HOCBF, and show that any control input that satisfies the HOCBF constraints renders the intersection of a series of sets forward invariant. We formulate optimal control problems with constraints given by HOCBF and control Lyapunov functions (CLF) and analyze the influence of the choice of the class $\mathcal{K}$ functions used in the definition of the HOCBF on the size of the feasible control region. We also provide a promising method to address the conflict between HOCBF constraints and control limitations by penalizing the class $\mathcal{K}$ functions. We illustrate the proposed method on an adaptive cruise control problem.

URL PDF HTML ☆

赞 0 踩 0

1903.05355 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

A Framework for On-line Learning of Underwater Vehicles Dynamic Models

在线学习水下机器人动态模型的框架

Bilal Wehbe, Marc Hildebrandt, Frank Kirchner

发表机构 * DFKI - Robotic Innovation Center（DFKI机器人创新中心）

AI总结本文提出了一种在线学习水下机器人动态模型的框架，通过增量支持向量回归方法从数据流中逐步学习模型，并结合增量学习策略来改进模型在整体状态空间上的泛化能力。

Comments 8 pages, 6 figures, ICRA 2019 authors preprint

1903.03318 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Development of an Autonomous Sanding Robot with Structured-Light Technology

基于结构光技术的自主打磨机器人开发

Yingxin Huo, Diancheng Chen, Xiang Li, Peng Li, Yun-Hui Liu

发表机构 * CUHK T Stone Robotics Institute（CUHK T Stone机器人研究所）； Innovation and Technology Commission of Hong Kong（香港创新及科技委员会）； Harbin Institute of Technology（哈尔滨工业大学）

AI总结本文提出了一种能够自主完成未知物体打磨工作的机器人，通过结构光相机扫描建模、优化运动规划和阻抗模型控制，实现了无需人工干预的自主打磨。

Comments 7 pages, 11 figures, IEEE/RSJ International Conference on Intelligent Robots and Systems 2019

详情

AI中文摘要

机器人和自动化的大需求在打磨工作中得到了体现，因为当前的手动操作是劳动密集型的，缺乏一致的质量，且存在安全和健康问题。虽然已经开发出几种自动化打磨工作的机器，但现有解决方案的自主能力相对较低，仍需要大量的人工协助或监督来校准目标物体或规划机器人运动和任务。本文提出了一种自主打磨机器人，能够自动对未知物体进行打磨，无需任何先前的校准或人工干预。该机器人工作流程如下：首先，使用结构光相机扫描并建模目标物体。其次，规划机器人运动以覆盖物体的所有表面，采用优化的过渡序列。第三，控制机器人在期望的阻抗模型下对物体进行打磨。制造了一个打磨机器人的原型，并在打磨一批木箱的任务中验证了其性能。凭借足够的自由度（DOFs）和末端执行器的模块化设计，该机器人能够为许多其他不同物体的自主打磨提供通用解决方案。

英文摘要

结构机械系统学习的通用框架

Jayesh K. Gupta, Kunal Menda, Zachary Manchester, Mykel J. Kochenderfer

发表机构 * Stanford University（斯坦福大学）

AI总结本文提出了一种通用框架，用于结构化学习机械系统，通过结合先验知识和训练表达式近似器来提高模型的准确性和效率。

Comments 10 pages, 7 figures. First two authors contributed equally. Submitted to IROS/RA-L. Code at https://github.com/sisl/mechamodlearn/

详情

AI中文摘要

学习准确的动力学模型对于优化和顺应性控制机器人系统至关重要。当前使用解析参数化进行白盒建模或使用神经网络进行黑盒建模的方法可能会产生高偏差或高方差。我们提出了一个灵活的灰盒模型，可以无缝地结合可用的先验知识，并在没有时训练具有表达能力的函数近似器。我们提出使用神经网络参数化机械系统，以建模其拉格朗日量和作用在其上的广义力。我们在模拟的驱动双摆上测试了我们的方法。我们展示了我们的方法在数据效率以及基于模型的强化学习中的性能优于朴素的黑盒模型。我们还系统地研究了我们的方法在结合可用的系统先验知识以提高数据效率方面的能力。

英文摘要

Learning accurate dynamics models is necessary for optimal, compliant control of robotic systems. Current approaches to white-box modeling using analytic parameterizations, or black-box modeling using neural networks, can suffer from high bias or high variance. We address the need for a flexible, gray-box model of mechanical systems that can seamlessly incorporate prior knowledge where it is available, and train expressive function approximators where it is not. We propose to parameterize a mechanical system using neural networks to model its Lagrangian and the generalized forces that act on it. We test our method on a simulated, actuated double pendulum. We show that our method outperforms a naive, black-box model in terms of data-efficiency, as well as performance in model-based reinforcement learning. We also conduct a systematic study of our method's ability to incorporate available prior knowledge about the system to improve data efficiency.

URL PDF HTML ☆

赞 0 踩 0

1903.00220 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Approximate Robust Control of Uncertain Dynamical Systems

不确定动力系统近似鲁棒控制

Edouard Leurent, Yann Blanco, Denis Efimov, Odalric-Ambrym Maillard

发表机构 * INRIA Lille（INRIA里尔）； Renault（雷诺）； Non-A team, INRIA Lille SequeL team, INRIA Lille（非A团队，INRIA里尔SequeL团队，INRIA里尔）

AI总结本文研究了在不确定环境中大型非线性系统安全控制策略的设计，提出两种可处理的鲁棒控制方法，应用于自动驾驶问题。

1806.07115 2026-06-04 cs.RO cs.SY eess.SY 版本更新

ConFusion: Sensor Fusion for Complex Robotic Systems using Nonlinear Optimization

ConFusion：利用非线性优化的复杂机器人系统传感器融合

Timothy Sandy, Lukas Stadelmann, Simon Kerscher, Jonas Buchli

发表机构 * Agile & Dexterous Robotics Lab, ETH Zurich（敏捷与灵活机器人实验室，苏黎世联邦理工学院）

AI总结本文提出ConFusion，一种用于机器人应用的开源传感器融合框架，通过非线性优化实现灵活的传感器融合设计，并展示了其在视觉惯性跟踪和移动机械臂上的性能。

1902.11015 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Mobile Formation Coordination and Tracking Control for Multiple Non-holonomic Vehicles

多非holonomic车辆的移动编队协调与跟踪控制

Xiuhui Peng, Zhiyong Sun, Kexin Guo, Zhiyong Geng

发表机构 * Department of Automatic Control, Lund University（自动化系，吕勒奥大学）

AI总结本文针对非holonomic车辆在SE(2)上的轨迹跟踪和移动编队协调问题，提出了一种基于中间姿态变量的分阶段控制方法，并证明了严格刚体运动的编队条件，同时通过仿真和实验验证了所提控制器的性能。

详情

AI中文摘要

本文针对在SE(2)上进行轨迹跟踪和移动编队协调的非holonomic车辆的前进运动控制问题。首先，通过构建一个包含车辆位置信息和期望姿态的中间姿态变量，设计了分阶段的平移和旋转控制输入以解决轨迹跟踪问题。其次，深入探讨了非holonomic车辆之间的相对位置和航向的协调关系，以维持具有刚体运动约束的移动编队。证明了除了平行编队和纯 translation 直线编队的情况外，只要每个车辆的线速度与角速度的比值为常数，就可以实现严格刚体运动的编队。还展示了具有弱刚体运动的移动编队的运动特性。之后，基于所提的轨迹跟踪方法，在有向树图上设计了分布式移动编队控制律。所提控制器的性能通过数值仿真和实验进行了验证。

英文摘要

This paper addresses forward motion control for trajectory tracking and mobile formation coordination for a group of non-holonomic vehicles on SE(2). Firstly, by constructing an intermediate attitude variable which involves vehicles' position information and desired attitude, the translational and rotational control inputs are designed in two stages to solve the trajectory tracking problem. Secondly, the coordination relationships of relative positions and headings are explored thoroughly for a group of non-holonomic vehicles to maintain a mobile formation with rigid body motion constraints. We prove that, except for the cases of parallel formation and translational straight line formation, a mobile formation with strict rigid-body motion can be achieved if and only if the ratios of linear speed to angular speed for each individual vehicle are constants. Motion properties for mobile formation with weak rigid-body motion are also demonstrated. Thereafter, based on the proposed trajectory tracking approach, a distributed mobile formation control law is designed under a directed tree graph. The performance of the proposed controllers is validated by both numerical simulations and experiments.

URL PDF HTML ☆

赞 0 踩 0

1810.03749 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Balancing Global Exploration and Local-connectivity Exploitation with Rapidly-exploring Random disjointed-Trees

在快速探索随机断树中平衡全局探索与局部连通性利用

Tin Lai, Fabio Ramos, Gilad Francis

发表机构 * NVIDIA, USA（美国NVIDIA公司）

AI总结本文提出了一种名为RRdT*的增量最优多查询规划器，通过使用多个断树来利用空间的局部连通性，通过马尔可夫链随机采样，并在局部连通性利用失败时主动探索全局空间，将局部利用与全局探索的平衡转化为多臂老虎机问题，从而提高采样效率。

Comments Submitted to IEEE International Conference on Robotics and Automation (ICRA) 2019

详情

AI中文摘要

在高度受限的环境中，采样效率长期以来一直是采样规划器的主要挑战。在本文中，我们提出了快速探索随机断树（RRdT*），一种增量最优多查询规划器。RRdT*使用多个断树来通过马尔可夫链随机采样来利用空间的局部连通性，该方法利用来自先前成功和失败样本的邻居信息。为了平衡局部利用，当局部连通性利用失败时，RRdT*会主动探索未见过的全局空间。局部利用与全局探索之间的主动权衡被公式化为一个多臂老虎机问题。我们主张，主动平衡全局探索与局部利用是提高采样规划器采样效率的关键。我们为这一新方法提供了严谨的完整性和最优收敛性证明。此外，我们通过实验展示了RRdT*的局部探索树在规划中提供改进可见性的有效性。因此，RRdT*在高度受限的环境中优于现有的最先进增量规划器。

英文摘要

Sampling efficiency in a highly constrained environment has long been a major challenge for sampling-based planners. In this work, we propose Rapidly-exploring Random disjointed-Trees* (RRdT*), an incremental optimal multi-query planner. RRdT* uses multiple disjointed-trees to exploit local-connectivity of spaces via Markov Chain random sampling, which utilises neighbourhood information derived from previous successful and failed samples. To balance local exploitation, RRdT* actively explore unseen global spaces when local-connectivity exploitation is unsuccessful. The active trade-off between local exploitation and global exploration is formulated as a multi-armed bandit problem. We argue that the active balancing of global exploration and local exploitation is the key to improving sample efficient in sampling-based motion planners. We provide rigorous proofs of completeness and optimal convergence for this novel approach. Furthermore, we demonstrate experimentally the effectiveness of RRdT*'s locally exploring trees in granting improved visibility for planning. Consequently, RRdT* outperforms existing state-of-the-art incremental planners, especially in highly constrained environments.

URL PDF HTML ☆

赞 0 踩 0

1902.10320 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A New Simulation Metric to Determine Safe Environments and Controllers for Systems with Unknown Dynamics

一种新的仿真度量用于确定具有未知动态系统的安全环境和控制器

Shromona Ghosh, Somil Bansal, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia, Claire J. Tomlin

发表机构 * University of California, Berkeley（加州大学伯克利分校）

AI总结本文提出了一种基于规范的仿真度量（SPEC），用于在系统动态未知的情况下，通过更严格的规范修改来合成安全控制器，从而扩大安全环境集。

Comments 22nd ACM International Conference on Hybrid Systems: Computation and Control (2019)

详情

DOI: 10.1145/3302504.3311795

AI中文摘要

我们考虑了在已知状态和控制空间但动态未知的情况下，提取安全环境和控制器以满足可达-避免目标的问题。在给定的环境中，通常通过从抽象或系统模型（可能从数据中学习）中合成控制器。然而，在许多情况下，模型的动力学与实际系统的关系并不明确，因此难以为系统提供安全保证。在这种情况下，标准仿真度量（SSM）定义为模型与系统输出轨迹之间最坏情况的范数距离，可以用来将系统的可达-避免规范修改为更严格的抽象规范。然而，获得的距离以及修改后的规范可能相当保守，这限制了能够获得安全控制器的环境集。我们提出SPEC，一种基于规范的仿真度量，通过仅计算违反系统规范的轨迹来克服这些限制。我们证明，使用SPEC修改可达-避免规范可以比SSM合成更大环境集的安全控制器。我们还提出了一种概率方法来计算一般系统的SPEC。使用四旋翼和自动驾驶汽车的仿真器进行的案例研究展示了所提出度量在确定安全环境集和控制器方面的优势。

英文摘要

We consider the problem of extracting safe environments and controllers for reach-avoid objectives for systems with known state and control spaces, but unknown dynamics. In a given environment, a common approach is to synthesize a controller from an abstraction or a model of the system (potentially learned from data). However, in many situations, the relationship between the dynamics of the model and the \textit{actual system} is not known; and hence it is difficult to provide safety guarantees for the system. In such cases, the Standard Simulation Metric (SSM), defined as the worst-case norm distance between the model and the system output trajectories, can be used to modify a reach-avoid specification for the system into a more stringent specification for the abstraction. Nevertheless, the obtained distance, and hence the modified specification, can be quite conservative. This limits the set of environments for which a safe controller can be obtained. We propose SPEC, a specification-centric simulation metric, which overcomes these limitations by computing the distance using only the trajectories that violate the specification for the system. We show that modifying a reach-avoid specification with SPEC allows us to synthesize a safe controller for a larger set of environments compared to SSM. We also propose a probabilistic method to compute SPEC for a general class of systems. Case studies using simulators for quadrotors and autonomous cars illustrate the advantages of the proposed metric for determining safe environment sets and controllers.

URL PDF HTML ☆

赞 0 踩 0

1902.09626 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Learning Extreme Hummingbird Maneuvers on Flapping Wing Robots

在扑翼机器人上学习极端蜂鸟动作

Fan Fei, Zhan Tu, Jian Zhang, Xinyan Deng

AI总结研究通过模仿蜂鸟的极端机动动作，开发了一种混合控制策略，利用模型驱动的非线性控制和模型无关的强化学习，实现了在12克仿生蜂鸟机器人上实现快速逃避机动。

Comments 6 pages, accepted at ICRA 2019

详情

AI中文摘要

生物学研究表明，蜂鸟在快速逃避时可以执行极端空战动作。在悬停时突然出现的视觉刺激下，蜂鸟会启动快速的后退平移并伴随180度的偏转，随后在不到10次振翅之间完成瞬间姿态稳定。考虑到振翅频率为40Hz，这种激进的动作仅在0.2秒内完成。受蜂鸟在这些极端动作中接近最大性能的启发，我们开发了一种飞行控制系统，并实验表明，这种机动性可通过配备两个执行器的12克仿生蜂鸟机器人实现。所提出的混合控制策略结合了基于模型的非线性控制和无模型强化学习。我们使用基于模型的非线性控制进行正常飞行控制，因为这些条件下的动态模型相对准确。然而，在极端机动中，建模误差变得无法控制。通过在仿真中训练的无模型强化学习策略被优化以'破坏'系统并最大化机动期间的性能。混合策略表现出接近蜂鸟观察到的机动动作。直接仿真到现实的转移得以实现，证明了仿生蜂鸟机器人上蜂鸟式的快速逃避机动。

英文摘要

Biological studies show that hummingbirds can perform extreme aerobatic maneuvers during fast escape. Given a sudden looming visual stimulus at hover, a hummingbird initiates a fast backward translation coupled with a 180-degree yaw turn, which is followed by instant posture stabilization in just under 10 wingbeats. Consider the wingbeat frequency of 40Hz, this aggressive maneuver is carried out in just 0.2 seconds. Inspired by the hummingbirds' near-maximal performance during such extreme maneuvers, we developed a flight control strategy and experimentally demonstrated that such maneuverability can be achieved by an at-scale 12-gram hummingbird robot equipped with just two actuators. The proposed hybrid control policy combines model-based nonlinear control with model-free reinforcement learning. We use model-based nonlinear control for nominal flight control, as the dynamic model is relatively accurate for these conditions. However, during extreme maneuver, the modeling error becomes unmanageable. A model-free reinforcement learning policy trained in simulation was optimized to 'destabilize' the system and maximize the performance during maneuvering. The hybrid policy manifests a maneuver that is close to that observed in hummingbirds. Direct simulation-to-real transfer is achieved, demonstrating the hummingbird-like fast evasive maneuvers on the at-scale hummingbird robot.

URL PDF HTML ☆

赞 0 踩 0

1812.06120 2026-06-04 eess.SY cs.AI cs.RO cs.SY 版本更新

Simulation to Scaled City: Zero-Shot Policy Transfer for Traffic Control via Autonomous Vehicles

模拟到缩放城市：通过自动驾驶车辆实现交通控制的零样本策略迁移

Kathy Jang, Eugene Vinitsky, Behdad Chalaki, Ben Remer, Logan Beaver, Andreas Malikopoulos, Alexandre Bayen

发表机构 * University of California, Berkeley（加州大学伯克利分校）； University of Delaware（德克萨斯大学）

AI总结本文通过深度强化学习训练自动驾驶车辆在环形交叉口的控制策略，并将训练好的策略迁移至缩放智能城市进行测试，发现注入噪声的策略在迁移后表现更佳，实现了交通流的优化。

Comments To be published at the International Conference on Cyber Physical Systems (ICCPS) 2019. 10 pages, 9 figures

详情

AI中文摘要

使用深度强化学习，我们训练了自动驾驶车辆在车队中通过环形交叉口的控制策略。使用Flow库，我们在微仿真器中训练了两种策略：一种在状态和动作空间中注入噪声，另一种则没有。在模拟中，自动驾驶车辆为两种策略都学习出一种涌现的引导行为，即减速以实现更流畅的合并。随后，我们将该策略直接迁移至德雷克塞尔大学缩放智能城市（UDSSC）测试平台，该平台是连接和自动化车辆的1:25比例测试场。我们对两种策略在缩放城市中的性能进行了表征。结果显示，无噪声策略经常导致碰撞，仅偶尔实现引导；而注入噪声的策略则始终表现出引导行为且无碰撞，表明噪声有助于零样本策略迁移。此外，迁移后的噪声注入策略在UDSSC中使平均行程时间减少了5%，最大行程时间减少了22%。控制器的视频可在https://sites.google.com/view/iccps-policy-transfer查看。

英文摘要

Using deep reinforcement learning, we train control policies for autonomous vehicles leading a platoon of vehicles onto a roundabout. Using Flow, a library for deep reinforcement learning in micro-simulators, we train two policies, one policy with noise injected into the state and action space and one without any injected noise. In simulation, the autonomous vehicle learns an emergent metering behavior for both policies in which it slows to allow for smoother merging. We then directly transfer this policy without any tuning to the University of Delaware Scaled Smart City (UDSSC), a 1:25 scale testbed for connected and automated vehicles. We characterize the performance of both policies on the scaled city. We show that the noise-free policy winds up crashing and only occasionally metering. However, the noise-injected policy consistently performs the metering behavior and remains collision-free, suggesting that the noise helps with the zero-shot policy transfer. Additionally, the transferred, noise-injected policy leads to a 5% reduction of average travel time and a reduction of 22% in maximum travel time in the UDSSC. Videos of the controllers can be found at https://sites.google.com/view/iccps-policy-transfer.

URL PDF HTML ☆

赞 0 踩 0

1711.09048 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

A Compression-Inspired Framework for Macro Discovery

一种受压缩启发的宏发现框架

Francisco M. Garcia, Bruno C. da Silva, Philip S. Thomas

发表机构 * College of Information and Computer Sciences（信息与计算机科学学院）； Department of Computer Science（计算机科学系）； University of Massachusetts Amherst（马萨诸塞大学阿默斯特分校）； Federal University Rio Grande do Sul（里约格朗德杜斯阿鲁斯联邦大学）

AI总结本文提出了一种受压缩启发的宏发现框架，通过识别高性能策略获得的轨迹中的重复模式，帮助强化学习代理利用早期经验快速解决相关新任务。

Comments Accepted as Extended Abstract, AAMAS, 2019

1902.07708 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Stability Analysis for the Acceleration-based Robust Position Control of Robot Manipulators via Disturbance Observer

基于扰动观测器的机器人操作臂加速度鲁棒位置控制的稳定性分析

Emre Sariyildiz, Hiromu Sekiguchi, Takahiro Nozaki, Barkan Ugurlu, Kouhei Ohnishi

发表机构 * University of Wollongong（沃林戈大学）； Keio University（庆应大学）

AI总结本文提出了一种新的非线性稳定性分析方法，用于通过扰动观测器（DOb）实现机器人操作臂的加速度鲁棒位置控制，证明了在适当调节名义惯性矩阵时，位置误差在调节控制中渐近趋于零，在轨迹跟踪控制中是均匀最终有界的。随着DOb的带宽和名义惯性矩阵的增加，误差的界减小，即位置控制系统的鲁棒稳定性和性能得到改善。然而，由于实际设计限制，DOb的带宽和名义惯性矩阵不能随意增加，例如，当它们增加时，鲁棒位置控制器会变得更加噪声敏感。所提出的稳定性分析为DOb基于鲁棒运动控制系统动态行为提供了见解。理论和实验证明了名义惯性矩阵的非对角元素对稳定性和调整鲁棒性与噪声敏感性之间的权衡是有帮助的。该提议的有效性通过仿真和实验结果得到验证。

Comments 9 pages, 9 figures, Journal

详情

DOI: 10.1109/TMECH.2018.2854844
Journal ref: IEEE/ASME Transactions On Mechatronics, Vol. 23, No. 5, October 2018

AI中文摘要

本文提出了一种新的非线性稳定性分析方法，用于通过扰动观测器（DOb）实现机器人操作臂的加速度鲁棒位置控制。证明了如果在DOb的设计中适当调节名义惯性矩阵，则在调节控制中位置误差渐近趋于零，在轨迹跟踪控制中是均匀最终有界的。随着DOb的带宽和名义惯性矩阵的增加，误差的界减小，即位置控制系统的鲁棒稳定性和性能得到改善。然而，由于实际设计限制，DOb的带宽和名义惯性矩阵不能随意增加，例如，当它们增加时，鲁棒位置控制器会变得更加噪声敏感。所提出的稳定性分析为DOb基于鲁棒运动控制系统动态行为提供了见解。理论和实验证明了名义惯性矩阵的非对角元素对稳定性和调整鲁棒性与噪声敏感性之间的权衡是有帮助的。该提议的有效性通过仿真和实验结果得到验证。

英文摘要

This paper proposes a new nonlinear stability analysis for the acceleration-based robust position control of robot manipulators by using Disturbance Observer (DOb). It is shown that if the nominal inertia matrix is properly tuned in the design of DOb, then the position error asymptotically goes to zero in regulation control and is uniformly ultimately bounded in trajectory tracking control. As the bandwidth of DOb and the nominal inertia matrix are increased, the bound of error shrinks, i.e., the robust stability and performance of the position control system are improved. However, neither the bandwidth of DOb nor the nominal inertia matrix can be freely increased due to practical design constraints, e.g., the robust position controller becomes more noise sensitive when they are increased. The proposed stability analysis provides insights regarding the dynamic behavior of DOb-based robust motion control systems. It is theoretically and experimentally proved that non-diagonal elements of the nominal inertia matrix are useful to improve the stability and adjust the trade-off between the robustness and noise sensitivity. The validity of the proposal is verified by simulation and experimental results.

URL PDF HTML ☆

赞 0 踩 0

1812.07084 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Learning Constraints from Demonstrations

从示范中学习约束

Glen Chou, Dmitry Berenson, Necmiye Ozay

发表机构 * Dept. of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI, 48109, USA（电气工程与计算机科学系，密歇根大学，安娜堡，MI，48109，美国）

AI总结该研究提出了一种从示范中学习未知约束的方法，通过任务示范、成本函数和系统动力学与控制约束，利用hit-and-run采样获取低成本但不安全的轨迹，并通过整数规划获得一致的不安全集表示，同时理论分析了可从安全示范中学习的约束子集。

Comments Presented at the Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018, Mérida, Mexico

1902.06133 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Fleet of Miniature Cars for Experiments in Cooperative Driving

用于合作驾驶实验的微型汽车车队

Nicholas Hyldmar, Yijun He, Amanda Prorok

发表机构 * University of Cambridge（剑桥大学）

AI总结本文介绍了一种由16辆微型Ackermann转向车辆组成的独特实验平台，旨在解决多车导航和轨迹规划研究与教育中低成本平台不足的问题，通过实验展示合作驾驶在多车道道路地形中的优势。

Comments Accepted to ICRA 2019

1902.05343 2026-06-04 cs.CV cs.RO cs.SY eess.SY 版本更新

Study of dynamical system based obstacle avoidance via manipulating orthogonal coordinates

基于操纵正交坐标的动态系统障碍避障研究

Weiya Ren

发表机构 * Artificial Intelligence Research Center of National Innovation Institute of Defense Technology（国家创新技术研究院人工智能研究中心）； Tianjin Artificial Intelligence Innovation Center（天津人工智能创新中心）

AI总结本文研究了基于动态系统的障碍避障问题，通过引入正交坐标开发了调制矩阵，使调制矩阵更加合理。新轨迹的方向可通过正交坐标的线性组合表示。提出了一种通过引入旋转矩阵来解决局部最小问题，并在三维或更高维空间中提供更合理运动的正交坐标操纵方法。该方法还为围绕凸形体巡逻提供了解决方案。实验结果表明所提出方法的有效性。

1809.04539 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Frequency-Aware Model Predictive Control

频率感知模型预测控制

Ruben Grandia, Farbod Farshidian, Alexey Dosovitskiy, René Ranftl, Marco Hutter

发表机构 * Robotic Systems Lab, ETH Zurich（机器人系统实验室，苏黎世联邦理工学院）； Intel Labs, Munich, Germany（英特尔实验室，德国慕尼黑）

AI总结本文提出频率形状成本函数，用于在腿足机器人最优控制中实现鲁棒解决方案，通过仿真和硬件实验展示了运动计划与执行器带宽限制的兼容性，并在未建模合规性地形上实现了稳健行走。

详情

DOI: 10.1109/LRA.2019.2895882
Journal ref: IEEE Robotics and Automation Letters 2019

AI中文摘要

将轨迹优化得到的解决方案转移到机器人硬件仍是一个具有挑战性的问题。当优化充分利用提供的模型执行动态任务时，未建模的动力学会使运动在现实系统中不可行。模型误差可能是由于模型简化，也自然出现在在无结构和非确定性环境中部署机器人时。主要的是，顺应性接触和执行器动力学导致带宽限制。虽然经典控制方法提供了合成对一类模型误差鲁棒的控制器的工具，但现代轨迹优化中缺少这种概念，该问题是在时域中解决的。我们提出频率形状成本函数，以在腿足机器人的最优控制中实现鲁棒解决方案。通过仿真和硬件实验，我们展示了运动计划可以与由执行器和接触动力学设定的带宽限制相兼容。模型预测解决方案的平滑度可以连续调节而不影响问题的可行性。与由高度顺应性串联弹性执行器驱动的四足机器人ANYmal的实验显示，计划的运动、扭矩和力轨迹的跟踪性能显著提高，并使机器在具有未建模顺应性的地形上稳健行走。

英文摘要

Transferring solutions found by trajectory optimization to robotic hardware remains a challenging task. When the optimization fully exploits the provided model to perform dynamic tasks, the presence of unmodeled dynamics renders the motion infeasible on the real system. Model errors can be a result of model simplifications, but also naturally arise when deploying the robot in unstructured and nondeterministic environments. Predominantly, compliant contacts and actuator dynamics lead to bandwidth limitations. While classical control methods provide tools to synthesize controllers that are robust to a class of model errors, such a notion is missing in modern trajectory optimization, which is solved in the time domain. We propose frequency-shaped cost functions to achieve robust solutions in the context of optimal control for legged robots. Through simulation and hardware experiments we show that motion plans can be made compatible with bandwidth limits set by actuators and contact dynamics. The smoothness of the model predictive solutions can be continuously tuned without compromising the feasibility of the problem. Experiments with the quadrupedal robot ANYmal, which is driven by highly-compliant series elastic actuators, showed significantly improved tracking performance of the planned motion, torque, and force trajectories and enabled the machine to walk robustly on terrain with unmodeled compliance.

URL PDF HTML ☆

赞 0 踩 0

1812.06325 2026-06-04 eess.SY cs.LG cs.RO cs.SY 版本更新

Data-efficient Auto-tuning with Bayesian Optimization: An Industrial Control Study

数据高效自动调优与贝叶斯优化：一项工业控制研究

Matthias Neumann-Brosig, Alonso Marco, Dieter Schwarzmann, Sebastian Trimpe

发表机构 * IAV GmbH（IAV集团）； Max Planck Society（马克斯·普朗克学会）； Cyber Valley initiative（Cyber Valley倡议）； Max Planck Institute for Intelligent Systems（智能系统研究所）

AI总结本文提出利用贝叶斯优化自动学习最优控制器参数，通过概率模型（高斯过程）建模控制器参数到用户定义成本的未知函数，并通过实验数据迭代优化，以高效找到全局最优参数，实验表明其在 throttle valve 控制中优于手动校准。

Comments 11 pages, 7 figures and 4 tables. To appear in IEEE Transactions on Control Systems Technology

详情

DOI: 10.1109/TCST.2018.2886159

AI中文摘要

贝叶斯优化被提出用于从实验数据自动学习最优控制器参数。通过概率描述（高斯过程）建模控制器参数到用户定义成本的未知函数。概率模型通过在物理系统上测试一组参数并评估成本来更新。为加快学习速度，贝叶斯优化算法系统地选择下一步评估的参数，例如通过最大化关于最优解的信息增益。因此，该算法通过少量实验迭代找到全局最优参数。以节流阀控制为例，所提出的自动调优方法在低实验次数下 consistently 实现更好的性能，优于手动校准。所提出的自动调优框架具有灵活性，可处理不同的控制结构和目标。

英文摘要

Bayesian optimization is proposed for automatic learning of optimal controller parameters from experimental data. A probabilistic description (a Gaussian process) is used to model the unknown function from controller parameters to a user-defined cost. The probabilistic model is updated with data, which is obtained by testing a set of parameters on the physical system and evaluating the cost. In order to learn fast, the Bayesian optimization algorithm selects the next parameters to evaluate in a systematic way, for example, by maximizing information gain about the optimum. The algorithm thus iteratively finds the globally optimal parameters with only few experiments. Taking throttle valve control as a representative industrial control example, the proposed auto-tuning method is shown to outperform manual calibration: it consistently achieves better performance with a low number of experiments. The proposed auto-tuning framework is flexible and can handle different control structures and objectives.

URL PDF HTML ☆

赞 0 踩 0

1309.7666 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Dynamic Sliding Mode Control based on Fractional calculus subject to uncertain delay based chaotic pneumatic robot

基于不确定延迟混沌气动机器人的分数 calculus 动态滑模控制

Sara Gholipour P., Heydar Toosian Shandiz, Mobin Alizadeh, Sara Minagar, Seyed Javad Kazemitabar

发表机构 * Farabina Noshirvani’s smart Co.（法拉比亚·诺希拉维智能公司）； Robotic Research Lab.（机器人研究实验室）； Faculty of Electrical and Robotic Engineering, Shahrood University of Technology（沙霍尔德大学电气与机器人工程学院）

AI总结本文针对机器人操作臂中由于延迟引起混沌的滑模控制颤振问题，提出了一种基于分数 calculus 的动态滑模控制方法，通过动态本质消除颤振，并在主从配置的混沌系统中实现控制，利用李雅普诺夫稳定性理论保证闭环系统稳定性，同时对延迟机器人运动进行定性与定量研究。

Comments 8 pages, 9 figures, will be submitted in journal

1708.05004 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

RodFIter: Attitude Reconstruction from Inertial Measurement by Functional Iteration

RodFIter：通过函数迭代从惯性测量中重建姿态

Yuanxin Wu

发表机构 * Shanghai Jiao Tong University（上海交通大学）

AI总结本文提出了一种基于罗德里格斯向量的函数迭代方法（RodFIter），用于从陀螺仪测量中精确重建姿态，该方法在理论上能够准确重构增量姿态，并在姿态摆动运动中表现出优于主流姿态算法的精度。

Comments IEEE TAES, 2018

详情

AI中文摘要

刚体运动的计算或估计是许多领域中的基石。通过整合陀螺仪测量的角速度可以实现姿态计算，其准确性对惯性导航中的死 reckoning至关重要。目前的姿态算法通常依赖于旋转向量的简化微分方程来获得姿态。本文提出了一种名为RodFIter的方法，该方法通过罗德里格斯向量进行函数迭代，以解析地从陀螺仪测量中重建姿态。RodFIter方法只要角速度是准确的，就能在理论上准确地重构增量姿态。值得注意的是，罗德里格斯向量可以解析地获得，并可用于在考虑的时间区间内更新姿态。所提出的方法产生了一种终极姿态算法方案，可以自然地扩展到一般的刚体运动计算。该方法在姿态摆动运动中进行了广泛评估，并在精度上优于主流姿态算法。这项工作被认为已经消除了从惯性测量中进行精确运动积分的长期理论障碍。

英文摘要

Rigid motion computation or estimation is a cornerstone in numerous fields. Attitude computation can be achieved by integrating the angular velocity measured by gyroscopes, the accuracy of which is crucially important for the dead-reckoning inertial navigation. The state-of-the-art attitude algorithms have unexceptionally relied on the simplified differential equation of the rotation vector to obtain the attitude. This paper proposes a Functional Iteration technique with the Rodrigues vector (named the RodFIter method) to analytically reconstruct the attitude from gyroscope measurements. The RodFIter method is provably exact in reconstructing the incremental attitude as long as the angular velocity is exact. Notably, the Rodrigues vector is analytically obtained and can be used to update the attitude over the considered time interval. The proposed method gives birth to an ultimate attitude algorithm scheme that can be naturally extended to the general rigid motion computation. It is extensively evaluated under the attitude coning motion and compares favorably in accuracy with the mainstream attitude algorithms. This work is believed having eliminated the long-standing theoretical barrier in exact motion integration from inertial measurements.

URL PDF HTML ☆

赞 0 踩 0

1712.03913 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

A Non-Cooperative Game Approach to Autonomous Racing

一种非合作博弈方法用于自动驾驶赛车

Alexander Liniger, John Lygeros

AI总结本文提出了一种非合作非零和博弈方法来解决自动驾驶赛车决策问题，通过设计三种不同的博弈模型，考虑静态赛道约束和车辆间碰撞避免，探讨了博弈策略对赛车行为建模的影响。

Comments 14 pages, 5 figuers

详情

AI中文摘要

我们考虑两辆车的自动驾驶赛车问题，并提出了一种将赛车决策建模为非合作非零和博弈的方法。我们设计了三种不同的博弈，玩家旨在满足静态赛道约束并避免相互碰撞；后者约束依赖于两个玩家的联合行动。三种博弈的区别在于碰撞约束和支付函数。在第一个博弈中，仅由跟随者考虑碰撞避免，每个玩家最大化自身向终点线的进度。我们证明，由于该博弈的顺序结构，可以通过高效的顺序最大化方法计算均衡。进一步，我们证明这些动作，如果可行，也是第二个博弈中纯策略的Stackelberg和Nash均衡，其中两个玩家都考虑碰撞约束。第三个博弈的支付函数被设计为促进阻塞，通过在时间 horizon 结束时奖励车辆保持领先。我们证明这会改变Stackelberg均衡，但对Nash均衡影响较小。对于在线实现，我们提出以移动时间窗的方式进行博弈，并讨论了保证由此产生的耦合重复博弈可行性的两种方法。最后，我们研究了所提出方法在模拟中的性能，以复制在苏黎世联邦理工学院自动控制实验室测试的微型赛车设置。模拟研究显示，所提出的博弈能够成功建模不同的赛车行为并生成有趣的赛车情境。

英文摘要

We consider autonomous racing of two cars and present an approach to formulate racing decisions as a non-cooperative non-zero-sum game. We design three different games where the players aim to fulfill static track constraints as well as avoid collision with each other; the latter constraint depends on the combined actions of the two players. The difference between the games are the collision constraints and the payoff. In the first game collision avoidance is only considered by the follower, and each player maximizes their own progress towards the finish line. We show that, thanks to the sequential structure of this game, equilibria can be computed through an efficient sequential maximization approach. Further, we show these actions, if feasible, are also a Stackelberg and Nash equilibrium in pure strategies of our second game where both players consider the collision constraints. The payoff of our third game is designed to promote blocking, by additionally rewarding the cars for staying ahead at the end of the horizon. We show that this changes the Stackelberg equilibrium, but has a minor influence on the Nash equilibria. For online implementation, we propose to play the games in a moving horizon fashion, and discuss two methods for guaranteeing feasibility of the resulting coupled repeated games. Finally, we study the performance of the proposed approaches in simulation for a set-up that replicates the miniature race car tested at the Automatic Control Laboratory of ETH Zurich. The simulation study shows that the presented games can successfully model different racing behaviors and generate interesting racing situations.

URL PDF HTML ☆

赞 0 踩 0

1710.11088 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Robust Cooperative Manipulation without Force/Torque Measurements: Control Design and Experiments

无需力/扭矩测量的鲁棒协同操作：控制设计与实验

Christos K. Verginis, Matteo Mastellaro, Dimos V. Dimarogonas

AI总结本文提出两种新型控制方法，用于由N个机器人代理协同操作物体，通过四元数反馈避免表示奇异，同时保证物体轨迹的暂态和稳态性能，且具有去中心化和抗扰动特性，无需力/扭矩测量，并通过仿真和实验验证理论结果。

1707.09718 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Adaptive Second-order Sliding Mode Control of UAVs for Civil Applications

无人机民用应用中的自适应二阶滑模控制

Van Truong Hoang, Ansu Man Singh, Manh Duong Phung, Quang Ha

发表机构 * Faculty of Engineering and Information Technology（工程与信息技术学院）； University of Technology Sydney, Australia（悉尼技术大学，澳大利亚）

AI总结本文提出了一种自适应二阶准连续滑模控制方案，用于无人机在基础设施监测中的鲁棒姿态控制，通过数学建模和稳定性分析，实现了对噪声和扰动的鲁棒性，并在仿真中验证了其优于实际无人机任务的跟踪性能。

Comments in Proceeding of The 34th International Symposium on Automation and Robotics in Construction (ISARC), pp. 823-829, Taipei, Taiwan, 2017

详情

DOI: 10.22260/ISARC2017/0114

AI中文摘要

四旋翼作为一种无人驾驶航空器（UAV），在测绘、建筑监控和基础设施状况评估等民用应用中具有巨大潜力。然而，四旋翼对噪声和扰动较为敏感，因此在控制不足、系统不确定性和/或外部扰动的情况下，其性能可能迅速下降。在本研究中，我们通过提出一种名为自适应二阶准连续滑模控制（自适应2-QCSM）的鲁棒方案，来处理四旋翼的低层控制。最终目标是实现无人机在基础设施监测和检查中的鲁棒姿态控制。首先，考虑非线性、强耦合、不确定动态和外部扰动，推导了四旋翼的数学模型。控制设计包括滑动面的选择和开发具有自适应增益的准连续二阶滑模控制器。通过使用全局李雅普诺夫函数分析整个控制系统的稳定性，以保证滑动动态和自适应方案的收敛性。进行了大量的仿真以进行评估。结果表明，所提出的控制器能够对扰动或参数变化具有鲁棒性，并在与实际无人机实时监控任务的实验响应相比，具有更好的跟踪性能。

英文摘要

Quadcopters, as unmanned aerial vehicles (UAVs), have great potential in civil applications such as surveying, building monitoring, and infrastructure condition assessment. Quadcopters, however, are relatively sensitive to noises and disturbances so that their performance may be quickly downgraded in the case of inadequate control, system uncertainties and/or external disturbances. In this study, we deal with the quadrotor low-level control by proposing a robust scheme named the adaptive second-order quasi-continuous sliding mode control (adaptive 2-QCSM). The ultimate objective is for robust attitude control of the UAV in monitoring and inspection of built infrastructure. First, the mathematical model of the quadcopter is derived considering nonlinearity, strong coupling, uncertain dynamics and external disturbances. The control design includes the selection of the sliding manifold and the development of quasi-continuous second-order sliding mode controller with an adaptive gain. Stability of the overall control system is analysed by using a global Lyapunov function for convergence of both the sliding dynamics and adaptation scheme. Extensive simulations have been carried out for evaluation. Results show that the proposed controller can achieve robustness against disturbances or parameter variations and has better tracking performance in comparison with experimental responses of a UAV in a real-time monitoring task.

URL PDF HTML ☆

赞 0 踩 0

1707.09715 2026-06-04 eess.SY cs.CV cs.RO cs.SY 版本更新

Automatic Crack Detection in Built Infrastructure Using Unmanned Aerial Vehicles

使用无人机自动检测建筑基础设施裂缝

Manh Duong Phung, Van Truong Hoang, Tran Hiep Dinh, Quang Ha

发表机构 * School of Electrical Mechanical and Mechatronic Systems, University of Technology Sydney, Australia（电气机械与机电系统学院，悉尼技术大学，澳大利亚）

AI总结本文提出了一种利用无人机采集数据并结合直方图分析进行建筑基础设施裂缝检测的方法，通过自动化流程提高检测效率并降低安全隐患。

Comments In proceeding of The 34th International Symposium on Automation and Robotics in Construction (ISARC), pp. 823-829, Taipei, Taiwan, 2017

详情

DOI: 10.22260/ISARC2017/0115

AI中文摘要

本文针对建筑基础设施健康监测中至关重要的裂缝检测问题，提出了一种包含两个阶段的方法：使用无人机（UAV）进行数据采集和利用直方图分析进行裂缝检测。首先，利用激光扫描仪创建结构的3D模型，然后提取几何属性以生成用于导航无人机拍摄结构图像的路径点。接着，将从重叠视野中获取的图像拼接在一起，通过直方图分析和峰值检测进行聚类，最后利用局部自适应阈值识别潜在裂缝。整个过程自动化进行，从而显著提高了检查时间并最小化了安全风险。已开发出原型系统进行评估，并包含实验结果。

英文摘要

This paper addresses the problem of crack detection which is essential for health monitoring of built infrastructure. Our approach includes two stages, data collection using unmanned aerial vehicles (UAVs) and crack detection using histogram analysis. For the data collection, a 3D model of the structure is first created by using laser scanners. Based on the model, geometric properties are extracted to generate way points necessary for navigating the UAV to take images of the structure. Then, our next step is to stick together those obtained images from the overlapped field of view. The resulting image is then clustered by histogram analysis and peak detection. Potential cracks are finally identified by using locally adaptive thresholds. The whole process is automatically carried out so that the inspection time is significantly improved while safety hazards can be minimised. A prototypical system has been developed for evaluation and experimental results are included.

URL PDF HTML ☆

赞 0 踩 0

1812.07879 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Modelling and Fast Terminal Sliding Mode Control for Mirror-based Pointing Systems

基于镜面的指向系统建模与快速终端滑模控制

Ansu Man Singh, Manh Duong Phung, Quang Ha

发表机构 * University of Technology Sydney（悉尼技术大学）

AI总结本文提出了一种新的离散时间快速终端滑模控制器，用于基于镜面的指向系统，通过非线性最小二乘识别方法估计参数，并基于推导的模型设计了连续域的滑模面，利用欧拉离散化合成离散时间控制器，并通过添加线性项改进滑模面的暂态动态，最后基于Sarpturk到达条件证明了控制器的稳定性。

Comments In Proceedings of the 15th International Conference on Control, Automation, Robotics and Vision (ICARCV 2018)

详情

DOI: 10.1109/ICARCV.2018.8581075

AI中文摘要

通过旋转识别进行相机-机械臂系统的外校准

Steve McGuire, Christoffer Heckman, Daniel Szafir, Simon Julier, Nisar Ahmed

发表机构 * Department of Aerospace Engineering Sciences, University of Colorado at Boulder（科罗拉多大学波尔德分校航空航天工程科学系）

AI总结本文提出了一种基于观察臂的结构运动来恢复外校准参数的方法，结合已知的机械臂运动学和图像平面中的二次曲线观测，通过最大似然估计计算校准外参数，该方法在仿真中得到验证并在现实模型中测试，结果与尺子估计一致。

1812.06132 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Bernstein approximation of optimal control problems

伯恩斯坦逼近在最优控制问题中的应用

Venanzio Cichella, Isaac Kaminer, Claire Walton, Naira Hovakimyan, Antonio Pascoal

发表机构 * Department of Mechanical Engineering, University of Iowa（伊利诺伊大学厄巴纳-香槟分校机械科学与工程系）； Department of Mechanical and Aerospace Engineering, Naval Postgraduate School（海军研究生院机械与航空航天工程系）； Institute for Systems and Robotics (ISR), Instituto Superior Tecnico (IST), Univ. Lisbon, Portugal（葡萄牙里斯本大学系统与机器人研究所）

AI总结本文提出了一种基于伯恩斯坦多项式逼近的直接方法，用于解决具有混合输入和状态约束的非线性最优控制问题，并展示了该方法在连续时间最优控制问题中的一致性以及在最优控制问题共轭变量估计中的应用，从而推导出伯恩斯坦多项式逼近的共向量映射定理。

1802.08678 2026-06-04 eess.SY cs.LG cs.RO cs.SY stat.ML 版本更新

Verifying Controllers Against Adversarial Examples with Bayesian Optimization

通过贝叶斯优化验证控制器对抗示例

Shromona Ghosh, Felix Berkenkamp, Gireeja Ranade, Shaz Qadeer, Ashish Kapoor

发表机构 * Microsoft Research, Redmond（微软研究院（红mond））

AI总结本文提出基于贝叶斯优化的主动测试框架，用于验证控制器的安全性，通过逻辑定义安全约束并高效搜索行为空间以发现违反安全规范的对抗示例。

Comments Proc. of the IEEE International Conference on Robotics and Automation, 2018

详情

DOI: 10.1109/ICRA.2018.8460635

AI中文摘要

最近强化学习的成功促使开发了用于现实世界机器人的复杂控制器。由于这些机器人被部署在安全关键应用中并与人类交互，确保安全性以避免造成伤害变得至关重要。为此方向的一个初步步骤是测试控制器在仿真中的表现。为了做到这一点，我们需要明确安全的定义，然后高效地搜索所有行为空间以确定其安全性。在本文中，我们提出了一种基于贝叶斯优化的主动测试框架。我们使用逻辑指定安全约束，并利用问题中的结构来测试系统，以发现违反安全规范的对抗示例。这些规范被定义为轨迹上的光滑函数的复杂布尔组合，与强化学习中的奖励函数不同，它们是表达性强且对系统施加硬约束。在我们的框架中，我们利用单个函数的正则性假设，形式化为高斯过程（GP）先验。我们结合这些内容到一个连贯的优化框架中，利用问题结构。所得到的算法能够证明验证复杂的安全规范或找到对抗示例。实验结果表明，所提出的方法能够快速发现对抗示例。

英文摘要

Recent successes in reinforcement learning have lead to the development of complex controllers for real-world robots. As these robots are deployed in safety-critical applications and interact with humans, it becomes critical to ensure safety in order to avoid causing harm. A first step in this direction is to test the controllers in simulation. To be able to do this, we need to capture what we mean by safety and then efficiently search the space of all behaviors to see if they are safe. In this paper, we present an active-testing framework based on Bayesian Optimization. We specify safety constraints using logic and exploit structure in the problem in order to test the system for adversarial counter examples that violate the safety specifications. These specifications are defined as complex boolean combinations of smooth functions on the trajectories and, unlike reward functions in reinforcement learning, are expressive and impose hard constraints on the system. In our framework, we exploit regularity assumptions on individual functions in form of a Gaussian Process (GP) prior. We combine these into a coherent optimization framework using problem structure. The resulting algorithm is able to provably verify complex safety specifications or alternatively find counter examples. Experimental results show that the proposed method is able to find adversarial examples quickly.

URL PDF HTML ☆

赞 0 踩 0

1812.03216 2026-06-04 cs.LG cs.RO cs.SY eess.SY 版本更新

Zero-shot Deep Reinforcement Learning Driving Policy Transfer for Autonomous Vehicles based on Robust Control

基于鲁棒控制的零样本深度强化学习驾驶策略迁移用于自动驾驶车辆

Zhuo Xu, Chen Tang, Masayoshi Tomizuka

发表机构 * University of California, Berkeley（加州大学伯克利分校）

AI总结本文提出了一种基于鲁棒控制的零样本深度强化学习驾驶策略迁移方法，通过转移具体的运动学量来解决自动驾驶中源域与目标域之间的建模差距问题，采用可转移的分层强化学习轨迹规划器和基于扰动观测器的鲁棒跟踪控制器，验证了该方法在多个驾驶场景中的零样本迁移能力。

Comments Published at IEEE ITSC 2018

详情

AI中文摘要

尽管深度强化学习（深度RL）方法在应用于自动驾驶时具有诸多优势，但真实应用却受到源域（训练）与目标域（部署）之间建模差距的限制。与当前的策略迁移方法不同，本文提出转移具体的自动驾驶运动学量。所提出的基于鲁棒控制的（RC）通用迁移架构，称为RL-RC，结合了可转移的分层强化学习轨迹规划器和基于扰动观测器（DOB）的鲁棒跟踪控制器。通过训练已知的名义动力学模型的深度RL策略直接转移到目标域，DOB基于的鲁棒跟踪控制用于处理建模差距，包括车辆动力学误差和外部扰动如侧向力。我们提供了模拟验证所提出方法在多个驾驶场景如车道保持、车道变更和障碍物避让中的迁移能力。

英文摘要

Although deep reinforcement learning (deep RL) methods have lots of strengths that are favorable if applied to autonomous driving, real deep RL applications in autonomous driving have been slowed down by the modeling gap between the source (training) domain and the target (deployment) domain. Unlike current policy transfer approaches, which generally limit to the usage of uninterpretable neural network representations as the transferred features, we propose to transfer concrete kinematic quantities in autonomous driving. The proposed robust-control-based (RC) generic transfer architecture, which we call RL-RC, incorporates a transferable hierarchical RL trajectory planner and a robust tracking controller based on disturbance observer (DOB). The deep RL policies trained with known nominal dynamics model are transfered directly to the target domain, DOB-based robust tracking control is applied to tackle the modeling gap including the vehicle dynamics errors and the external disturbances such as side forces. We provide simulations validating the capability of the proposed method to achieve zero-shot transfer across multiple driving scenarios such as lane keeping, lane changing and obstacle avoidance.

URL PDF HTML ☆

赞 0 踩 0

1804.01013 2026-06-04 math.OC cs.RO cs.SY eess.SY stat.ML 版本更新

Resilient Non-Submodular Maximization over Matroid Constraints

基于Matroid约束的抗扰非子模最大化

Vasileios Tzoumas, Ali Jadbabaie, George J. Pappas

AI总结本文研究了在传感器和执行器失效情况下，基于Matroid约束的控制与传感问题，提出了一种具有系统级鲁棒性、可扩展性和可证明近似界的新算法。

Comments arXiv admin note: substantial text overlap with arXiv:1803.07954. Correction on problem statement (Problem 1), and change in authors' info

详情

AI中文摘要

大规模系统的控制和传感导致了不仅在传感器和执行器布置上，而且在调度或可观测性/可控性上都出现组合问题。此类系统设计和实现中的组合约束可以利用一种称为Matroid的结构来捕捉。特别是，Matroid的代数结构可以被用来开发可扩展的算法用于传感器和执行器选择，以及具有可量化近似界。然而，在大规模系统中，传感器和执行器可能失效或可能被（网络-）攻击。本文的目标是关注在存在传感器和执行器失效情况下的Matroid约束问题。一般来说，鲁棒Matroid约束问题在计算上是困难的。与非鲁棒情况（无故障）相反，尽管它们通常涉及单调或子模目标函数，但尚无已知的可扩展近似算法。在本文中，我们提供了第一个算法，其具有以下特性：首先，它实现了系统级鲁棒性，即该算法适用于任何数量的拒绝服务攻击或故障。其次，它是可扩展的，因为我们的算法终止时的运行时间与最先进的非鲁棒Matroid约束优化算法相同。第三，它提供了对系统性能的可证明近似界，因为对于单调目标函数，我们的算法保证了接近最优的解。我们使用单调（不一定子模）集合函数的曲率概念来量化我们的算法的近似性能。最后，我们通过考虑一个控制感知的传感器选择场景，即受传感约束的机器人导航，来支持我们的理论分析。

英文摘要

The control and sensing of large-scale systems results in combinatorial problems not only for sensor and actuator placement but also for scheduling or observability/controllability. Such combinatorial constraints in system design and implementation can be captured using a structure known as matroids. In particular, the algebraic structure of matroids can be exploited to develop scalable algorithms for sensor and actuator selection, along with quantifiable approximation bounds. However, in large-scale systems, sensors and actuators may fail or may be (cyber-)attacked. The objective of this paper is to focus on resilient matroid-constrained problems arising in control and sensing but in the presence of sensor and actuator failures. In general, resilient matroid-constrained problems are computationally hard. Contrary to the non-resilient case (with no failures), even though they often involve objective functions that are monotone or submodular, no scalable approximation algorithms are known for their solution. In this paper, we provide the first algorithm, that also has the following properties: First, it achieves system-wide resiliency, i.e., the algorithm is valid for any number of denial-of-service attacks or failures. Second, it is scalable, as our algorithm terminates with the same running time as state-of-the-art algorithms for (non-resilient) matroid-constrained optimization. Third, it provides provable approximation bounds on the system performance, since for monotone objective functions our algorithm guarantees a solution close to the optimal. We quantify our algorithm's approximation performance using a notion of curvature for monotone (not necessarily submodular) set functions. Finally, we support our theoretical analyses with numerical experiments, by considering a control-aware sensor selection scenario, namely, sensing-constrained robot navigation.

URL PDF HTML ☆

赞 0 踩 0

1803.00444 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY stat.ML 版本更新

Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling

通过非参数时空子目标建模实现逆强化学习

Adrian Šošić, Elmar Rueckert, Jan Peters, Abdelhak M. Zoubir, Heinz Koeppl

发表机构 * Signal Processing Group（信号处理组）； Institute for Robotics and Cognitive Systems（机器人与认知系统研究所）； Autonomous Systems Labs（自主系统实验室）； Bioinspired Communication Systems（生物启发通信系统）

AI总结本文提出了一种基于非参数时空子目标建模的逆强化学习方法，通过局部上下文更高效地解释单条轨迹，实现更紧凑的行为表示，并构建隐式意图模型以预测未观察到的情况，从而在处理意图变化和主动学习场景中表现出色。

Comments 45 pages, 14 figures; ### Version 3 ### published in the Journal of Machine Learning Research

详情

AI中文摘要

逆强化学习（IRL）领域的发展导致了更复杂的推理框架，这些框架放宽了原始建模假设，即观察到的代理行为仅反映单一意图。相反于学习全局行为模型，最近的IRL方法将演示数据分成部分，以考虑不同轨迹可能对应不同意图，例如因为它们由不同领域专家生成。在本工作中，我们进一步采用子目标的直观概念，建立一个前提：即使单条轨迹在特定上下文中局部解释也比全局更高效，从而实现更紧凑的行为表示。基于这一假设，我们构建了代理目标的隐式意图模型，以预测未观察到的情况。结果是一种集成的贝叶斯预测框架，显著优于现有IRL解决方案，并提供与专家计划一致的平滑策略估计。最值得注意的是，我们的框架自然处理代理意图随时间变化的情况，而经典IRL算法失败。此外，由于其概率性质，该模型可以轻松应用于主动学习场景，以指导专家的演示过程。

英文摘要

Advances in the field of inverse reinforcement learning (IRL) have led to sophisticated inference frameworks that relax the original modeling assumption of observing an agent behavior that reflects only a single intention. Instead of learning a global behavioral model, recent IRL methods divide the demonstration data into parts, to account for the fact that different trajectories may correspond to different intentions, e.g., because they were generated by different domain experts. In this work, we go one step further: using the intuitive concept of subgoals, we build upon the premise that even a single trajectory can be explained more efficiently locally within a certain context than globally, enabling a more compact representation of the observed behavior. Based on this assumption, we build an implicit intentional model of the agent's goals to forecast its behavior in unobserved situations. The result is an integrated Bayesian prediction framework that significantly outperforms existing IRL solutions and provides smooth policy estimates consistent with the expert's plan. Most notably, our framework naturally handles situations where the intentions of the agent change over time and classical IRL algorithms fail. In addition, due to its probabilistic nature, the model can be straightforwardly applied in active learning scenarios to guide the demonstration process of the expert.

URL PDF HTML ☆

赞 0 踩 0

1811.11573 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Distributed Impedance Control of Latency-Prone Robotic Systems with Series Elastic Actuation

具有串联弹性执行器的延迟敏感机器人系统的分布式阻抗控制

Ye Zhao, Luis Sentis

AI总结本文研究了具有串联弹性执行器的延迟敏感机器人系统的分布式阻抗控制问题，提出了一种关键阻尼增益设计方法，用于优化SEA级联控制架构的阻抗控制器设计，并通过频率域方法分析了时间延迟、滤波和负载惯量对SEA阻抗性能的影响。

Comments 24 pages, 16 figures. arXiv admin note: text overlap with arXiv:1501.02854

详情

AI中文摘要

机器人系统越来越多地依赖分布式反馈控制器来解决复杂且延迟敏感的传感和决策问题。这些需求带来了计算负担的增加，从而导致更大的控制器延迟。为了最大化对机械扰动的鲁棒性和实现高性能控制，我们强调需要在靠近控制对象的位置执行阻尼反馈，并在延迟敏感的集中式控制过程中分配刚度反馈。此外，串联弹性执行器（SEAs）近年来在力控机器人中变得越来越普遍，以实现与环境和人类的顺应性交互。然而，设计最优的阻抗控制器和表征具有时间延迟和滤波的SEAs的阻抗性能仍然是未充分研究的问题。本文通过设计一种关键阻尼增益设计方法，针对一类SEA级联控制架构（由外阻抗和内扭矩反馈环组成）解决最优控制器设计问题。通过所提出的控制器设计准则，我们采用频域方法对时间延迟、滤波和负载惯量对SEA阻抗性能的影响进行了深入分析。这些结果通过在高性能执行器和 omnidirectional 移动基座上的分析、仿真和实验测试进一步验证。

英文摘要

Robotic systems are increasingly relying on distributed feedback controllers to tackle complex and latency-prone sensing and decision problems. These demands come at the cost of a growing computational burden and, as a result, larger controller latencies. To maximize robustness to mechanical disturbances and achieve high control performance, we emphasize the necessity for executing damping feedback in close proximity to the control plant while allocating stiffness feedback in a latency-prone centralized control process. Additionally, series elastic actuators (SEAs) are becoming prevalent in torque-controlled robots during recent years to achieve compliant interactions with environments and humans. However, designing optimal impedance controllers and characterizing impedance performance for SEAs with time delays and filtering are still under-explored problems. The presented study addresses the optimal controller design problem by devising a critically-damped gain design method for a class of SEA cascaded control architectures, which is composed of outer-impedance and inner-torque feedback loops. Via the proposed controller design criterion, we adopt frequency-domain methods to thoroughly analyze the effects of time delays, filtering and load inertia on SEA impedance performance. These results are further validated through the analysis, simulation, and experimental testing on high-performance actuators and on an omnidirectional mobile base.

URL PDF HTML ☆

赞 0 踩 0

1809.07916 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Decentralized Optimal Merging Control for Connected and Automated Vehicles

去中心化最优合并控制用于联网自动化车辆

Wei Xiao, Christos G. Cassandras

发表机构 * Boston University（波士顿大学）

AI总结本文研究了联网自动化车辆在合并点的最优控制问题，旨在共同最小化每辆车的行驶时间和能耗，同时保证速度相关的安全约束在合并点及控制区内的持续满足。通过分析无主动约束的情况，证明在特定条件下安全约束保持非活跃，从而简化了显式去中心化解的确定。当这些条件不适用时，仍能获得包含安全约束活跃区间的显式解。分析结果有助于研究行驶时间和控制区内的能耗之间的权衡。

Comments 16 pages, 2nd version, 20 figures

详情

AI中文摘要

本文针对联网自动化车辆（CAVs）从两条道路汇入合并点的最优控制问题，目标是共同最小化每辆CAV的行驶时间和能耗。解决方案保证在合并点及控制区内的速度相关安全约束始终满足。我们首先分析无主动约束的情况，证明在特定条件下安全约束保持非活跃，从而显著简化了显式去中心化解的确定。当这些条件不适用时，仍能获得包含安全约束活跃区间的显式解。我们的分析使我们能够研究行驶时间和控制区内的能耗之间的权衡。仿真示例用于比较最优控制器与由人工驾驶车辆组成的基线的性能，结果显示在两个指标上均有改进。

英文摘要

This paper addresses the optimal control of Connected and Automated Vehicles (CAVs) arriving from two roads at a merging point where the objective is to jointly minimize the travel time and energy consumption of each CAV. The solution guarantees that a speed-dependent safety constraint is always satisfied, both at the merging point and everywhere within a control zone which precedes it. We first analyze the case of no active constraints and prove that under certain conditions the safety constraint remains inactive, thus significantly simplifying the determination of an explicit decentralized solution. When these conditions do not apply, an explicit solution is still obtained that includes intervals over which the safety constraint is active. Our analysis allows us to study the tradeoff between the two objective function components (travel time and energy within the control zone). Simulation examples are included to compare the performance of the optimal controller to a baseline with human-driven vehicles with results showing improvements in both metrics.

URL PDF HTML ☆

赞 0 踩 0

1809.00367 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Momentum Model-based Minimal Parameter Identification of a Space Robot

基于动量模型的太空机器人最小参数识别

B. Naveen, Suril V. Shah, Arun K. Misra

发表机构 * Indian Institute of Technology, Jodhpur, Rajasthan, India（印度理工学院，朱达普尔，拉贾斯坦邦，印度）； McGill University, Montreal, Quebec H3A 0C3, Canada（麦吉尔大学，蒙特利尔，魁北克 H3A 0C3，加拿大）

AI总结本文提出了一种基于动量模型的最小参数识别方法，用于在轨识别太空机器人的最小参数，这些参数能够唯一确定动量和动力学模型，从而支持卫星及其安装的机械臂的运动规划和控制。

Comments Accepted for publication in AIAA Journal of Guidance, Control, and Dynamics

详情

DOI: 10.2514/1.G003541

AI中文摘要

准确的惯性参数信息对空间机器人的运动规划和控制至关重要。在发射前，仅能通过实验和计算机辅助设计（CAD）模型获得惯性参数的粗略估计。在发射后，轨道操作会显著改变惯性参数的值。本文提出了一种新的基于动量模型的方法，在轨识别太空机器人的最小参数。最小参数是链接的惯性参数的组合，并唯一定义动量和动力学模型。因此，它们对于卫星及其安装的机械臂的运动规划和控制是足够的。所提出框架的关键在于将动量模型以最小参数的线性形式进行唯一建模。进一步，为了估计最小参数，我们提出了一种基于关节速度方向组合的新型关节轨迹规划和优化技术。该识别框架的有效性在具有12个自由度的空间双臂机器人上得到验证。该方法适用于树型空间机器人，仅需要姿态和扭转数据，并且随着关节数量的增加而可扩展。

英文摘要

Accurate information of inertial parameters is critical to motion planning and control of space robots. Before the launch, only a rudimentary estimate of the inertial parameters is available from experiments and computer-aided design (CAD) models. After the launch, on-orbit operations substantially alter the value of inertial parameters. In this work, we propose a new momentum model-based method for identifying the minimal parameters of a space robot while on orbit. Minimal parameters are combinations of the inertial parameters of the links and uniquely define the momentum and dynamic models. Consequently, they are sufficient for motion planning and control of both the satellite and robotic arms mounted on it. The key to the proposed framework is the unique formulation of momentum model in the linear form of minimal parameters. Further, to estimate the minimal parameters, we propose a novel joint trajectory planning and optimization technique based on direction combinations of joints' velocity. The efficacy of the identification framework is demonstrated on a 12 degrees-of-freedom, spatial, dual-arm space robot. The methodology is developed for tree-type space robots, requires just the pose and twist data, and scalable with increasing number of joints.

URL PDF HTML ☆

赞 0 踩 0

1811.09914 2026-06-04 eess.SY cs.AI cs.MA cs.RO cs.SY 版本更新

RADMPC: A Fast Decentralized Approach for Chance-Constrained Multi-Vehicle Path-Planning

RADMPC：一种用于机会约束多车辆路径规划的快速去中心化方法

Aaron Huang, Benjamin J. Ayton, Brian C. Williams

发表机构 * Computer Science and Artificial Intelligence Laboratory（计算机科学与人工智能实验室）； Massachusetts Institute of Technology（麻省理工学院）

AI总结本文提出了一种基于去中心化路径规划方法RADMPC的快速机会约束多车辆路径规划方法，通过评估车辆交互来确定需要耦合规划的车辆集，并利用IRA在较小的车辆集上快速规划安全路径，从而显著提高计算效率。

详情

AI中文摘要

鲁棒的多车辆路径规划对于确保运输、搜索救援和机器人探索等应用中的多车辆系统安全性至关重要。迭代风险分配（IRA）等机会约束方法已被开发用于环境扰动无界的场景。然而，多车辆情况下的机会约束方法通常采用集中策略，其中所有车辆对之间存在耦合关系。随着车队规模的增加，这种策略变得不可行，因为计算时间与规划的车辆数呈指数增长，由于车辆对之间的耦合约束呈多项式增长。我们提出了一种更快的机会约束多车辆路径规划方法，该方法依赖于一种称为风险意识去中心化模型预测控制（RADMPC）的去中心化路径规划方法，以快速近似集中IRA方法。RADMPC近似通过评估车辆交互来确定应耦合规划的车辆集。将IRA应用于由RADMPC近似确定的较小车辆集上，能够快速为整个车队规划安全路径。蒙特卡洛模拟分析证明了我们方法的正确性，并与集中IRA方法相比显示出显著的计算时间改进。

英文摘要

Robust multi-vehicle path-planning is important for ensuring the safety of multi-vehicle systems in applications like transportation, search and rescue, and robotic exploration. Chance-constrained methods like Iterative Risk Allocation (IRA)\cite{IRA} have been developed for situations where environmental disturbances are unbounded. However, chance-constrained methods for the multi-vehicle case generally use centralized strategies where the vehicle set is planned with couplings between all vehicle pairs. This approach is intractable as fleet size increases because computation time is exponential with respect to the number of vehicles being planned over due to a polynomial increase in coupling constraints between vehicle pairs. We present a faster approach for chance-constrained multi-vehicle path-planning that relies upon a decentralized path-planning method called Risk-Aware Decentralized Model Predictive Control (RADMPC) to rapidly approximate a centralized IRA approach. The RADMPC approximation is evaluated for vehicle interactions to determine the vehicle sets that should be planned in a coupled manner. Applying IRA to the smaller vehicle sets determined from the RADMPC approximation rapidly plans safe paths for the entire fleet. A Monte Carlo simulation analysis demonstrates the correctness of our approach and a significant improvement in computation time compared to a centralized IRA approach.

URL PDF HTML ☆

赞 0 踩 0

1811.06350 2026-06-04 eess.SY cs.MA cs.RO cs.SC cs.SY math.OC 版本更新

Temporal viability regulation for control affine systems with applications to mobile vehicle coordination under time-varying motion constraints

时间可行性调节用于控制仿射系统及其在时间变化运动约束下移动车辆协调中的应用

Marcus Greiff, Zhiyong Sun, Anders Robertsson, Rolf Johansson

发表机构 * LCCC Linnaeus Center（LCCC 林纳厄中心）； ELLIIT Excellence Center（ELLIIT 卓越中心）； Lund University（Lund 大学）

AI总结本文提出了一种时间可行性调节理论，用于一般动态控制系统，特别是控制仿射系统，通过时间变化约束函数参数化时间变化可行集，以确保动态控制系统在时间变化可行集中不变，从而执行时间依赖状态约束。同时，本文还给出了控制仿射系统可行控制输入存在的充分条件，并将该理论应用于移动车辆协调。

Comments 7 pages, 3 figures. Submitted to a conference for publication

1808.00113 2026-06-04 eess.SY cs.LG cs.RO cs.SY math.OC 版本更新

Learning Stabilizable Dynamical Systems via Control Contraction Metrics

通过控制收缩度量学习可稳定化的动态系统

Sumeet Singh, Vikas Sindhwani, Jean-Jacques E. Slotine, Marco Pavone

发表机构 * Dept. of Aeronautics and Astronautics, Stanford University（航空航天系，斯坦福大学）； Google Brain Robotics, New York（谷歌大脑机器人，纽约）； Dept. of Mechanical Engineering, Massachusetts Institute of Technology（机械工程系，麻省理工学院）

AI总结本文提出了一种新的框架，用于学习可稳定化的非线性动态系统，以实现机器人连续控制任务。核心方法是开发一种基于稳定性的控制理论正则化器，以确保学习到的系统可以配备一个稳健的控制器，能够稳定任何系统生成的开环轨迹。通过利用收缩理论、统计学习和凸优化工具，我们提供了一个通用且可操作的半监督算法来学习可稳定化的动态系统，可以应用于复杂的欠驱动系统。在模拟平面四旋翼系统上验证了所提算法，并观察到与传统回归技术学习的模型相比，使用控制理论正则化模型在轨迹生成和跟踪性能上有显著改进，尤其是在使用少量示范示例时。结果展示了将标准基于模型的强化学习算法与非线性控制理论概念结合的必要性，以提高可靠性。

Comments To appear at WAFR 2018. v2: re-structured Sections 3 & 4 to improve clarity; expanded discussion on limitations & future work in Section 5; added details on training & validation, significantly expanded experiments

详情

AI中文摘要

我们提出了一种新的框架，用于学习可稳定化的非线性动态系统，以实现机器人连续控制任务。关键思想是开发一种基于稳定性的控制理论正则化器，用于动态拟合，该正则化器保证所学习的系统可以配备一个稳健的控制器，能够稳定任何系统可能生成的开环轨迹。通过利用收缩理论、统计学习和凸优化工具，我们提供了一个通用且可操作的半监督算法来学习可稳定化的动态系统，可以应用于复杂的欠驱动系统。我们在模拟平面四旋翼系统上验证了所提算法，并观察到与传统回归技术学习的模型相比，使用控制理论正则化模型在轨迹生成和跟踪性能上有显著改进，尤其是在使用少量示范示例时。所呈现的结果展示了将标准基于模型的强化学习算法与非线性控制理论概念结合的必要性，以提高可靠性。

英文摘要

We propose a novel framework for learning stabilizable nonlinear dynamical systems for continuous control tasks in robotics. The key idea is to develop a new control-theoretic regularizer for dynamics fitting rooted in the notion of stabilizability, which guarantees that the learned system can be accompanied by a robust controller capable of stabilizing any open-loop trajectory that the system may generate. By leveraging tools from contraction theory, statistical learning, and convex optimization, we provide a general and tractable semi-supervised algorithm to learn stabilizable dynamics, which can be applied to complex underactuated systems. We validated the proposed algorithm on a simulated planar quadrotor system and observed notably improved trajectory generation and tracking performance with the control-theoretic regularized model over models learned using traditional regression techniques, especially when using a small number of demonstration examples. The results presented illustrate the need to infuse standard model-based reinforcement learning algorithms with concepts drawn from nonlinear control theory for improved reliability.

URL PDF HTML ☆

赞 0 踩 0

1803.08287 2026-06-04 eess.SY cs.AI cs.LG cs.RO cs.SY 版本更新

Learning-based Model Predictive Control for Safe Exploration

基于学习的模型预测控制用于安全探索

Torsten Koller, Felix Berkenkamp, Matteo Turchetta, Andreas Krause

发表机构 * Vector Institute（向量研究所）； Max Planck ETH Center for Learning Systems（马克斯·普朗克-ETH学习系统中心）

AI总结本文提出了一种基于学习的模型预测控制方法，通过高斯过程先验假设构建可证明准确的轨迹置信区间，从而提供可证明的高概率安全保证，用于动态系统的安全高效探索和学习。

Comments Proc. of the Conference on Decision and Control, 2018

详情

AI中文摘要

基于学习的方法在没有显著系统先验知识的情况下成功解决了复杂控制任务。然而，这些方法通常不提供任何安全保证，这限制了它们在安全关键的现实应用中的使用。在本文中，我们提出了一种基于学习的模型预测控制方案，可以提供可证明的高概率安全保证。为此，我们利用高斯过程先验对动态特性进行正则性假设，以构建可证明准确的预测轨迹置信区间。与以往的方法不同，我们不假设模型不确定性是独立的。基于这些预测，我们保证轨迹满足安全约束。此外，我们使用终端集约束递归地保证在每个迭代中都存在安全的控制动作。在我们的实验中，我们展示了所提出算法可以安全且高效地探索和学习动态系统。

英文摘要

Learning-based methods have been successful in solving complex control tasks without significant prior knowledge about the system. However, these methods typically do not provide any safety guarantees, which prevents their use in safety-critical, real-world applications. In this paper, we present a learning-based model predictive control scheme that can provide provable high-probability safety guarantees. To this end, we exploit regularity assumptions on the dynamics in terms of a Gaussian process prior to construct provably accurate confidence intervals on predicted trajectories. Unlike previous approaches, we do not assume that model uncertainties are independent. Based on these predictions, we guarantee that trajectories satisfy safety constraints. Moreover, we use a terminal set constraint to recursively guarantee the existence of safe control actions at every iteration. In our experiments, we show that the resulting algorithm can be used to safely and efficiently explore and learn about dynamic systems.

URL PDF HTML ☆

赞 0 踩 0

1811.01774 2026-06-04 cs.SE cs.RO cs.SY eess.SY 版本更新

SCAV'18: Report of the 2nd International Workshop on Safe Control of Autonomous Vehicles

SCAV'18: 第二届安全控制自主车辆国际研讨会报告

Mario Gleirscher, Sven Linker, Stefan Kugele

发表机构 * University of York, UK（英国约克大学）； Technical University of Munich, Germany（德国慕尼黑技术大学）； University of Liverpool, UK（英国利兹大学）

AI总结本文总结了第二届SCAV研讨会的讨论、开放问题、关键信息和结论，探讨了自主车辆安全控制的核心挑战与未来方向。

Comments 3 pages, 1 table

1806.06498 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Conditional Affordance Learning for Driving in Urban Environments

面向城市环境的条件性 affordance 学习

Axel Sauer, Nikolay Savinov, Andreas Geiger

发表机构 * Chair of Robotics Science and System Intelligence, Technical University of Munich（慕尼黑技术大学机器人科学与系统智能系）

AI总结本文提出了一种直接感知方法，通过将视频输入映射到适合复杂城市环境自主导航的中间表示，结合高层方向输入，实现了比现有强化学习和条件模仿学习方法更高的目标导向导航性能，并首次通过图像级标签处理交通灯和速度标志，显著减少模拟中的交通事故。

Comments Accepted for Conference on Robot Learning (CoRL) 2018

详情

AI中文摘要

大多数现有的自动驾驶方法分为两类：模块化流水线，通过构建环境的详尽模型，以及模仿学习方法，直接将图像映射到控制输出。最近提出的一种第三范式，直接感知，旨在通过神经网络学习适当的低维中间表示来结合两者的优点。然而，现有的直接感知方法仅限于简单的高速公路场景，缺乏在交叉路口导航、在交通灯前停止或遵守速度限制的能力。在本文中，我们提出了一种直接感知方法，将视频输入映射到适合复杂城市环境自主导航的中间表示，给定高层方向输入。与最先进的强化学习和条件模仿学习方法相比，在具有挑战性的CARLA模拟基准上，我们实现了高达68%的目标导向导航改进。此外，我们的方法是首次通过仅使用图像级标签来处理交通灯和速度标志，从而在模拟中显著减少交通事故。

英文摘要

Most existing approaches to autonomous driving fall into one of two categories: modular pipelines, that build an extensive model of the environment, and imitation learning approaches, that map images directly to control outputs. A recently proposed third paradigm, direct perception, aims to combine the advantages of both by using a neural network to learn appropriate low-dimensional intermediate representations. However, existing direct perception approaches are restricted to simple highway situations, lacking the ability to navigate intersections, stop at traffic lights or respect speed limits. In this work, we propose a direct perception approach which maps video input to intermediate representations suitable for autonomous navigation in complex urban environments given high-level directional inputs. Compared to state-of-the-art reinforcement and conditional imitation learning approaches, we achieve an improvement of up to 68 % in goal-directed navigation on the challenging CARLA simulation benchmark. In addition, our approach is the first to handle traffic lights and speed signs by using image-level labels only, as well as smooth car-following, resulting in a significant reduction of traffic accidents in simulation.

URL PDF HTML ☆

赞 0 踩 0

1811.00426 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Improving the Modularity of AUV Control Systems using Behaviour Trees

使用行为树提高水下机器人控制系统的模块化程度

Christopher Iliffe Sprague, Özer Özkahraman, Andrea Munafo, Rachel Marlow, Alexander Phillips, Petter Ögren

发表机构 * Robotics, Perception and Learning Lab（机器人、感知与学习实验室）； Royal Institute of Technology（皇家理工学院）； National Oceanography Centre（国家海洋学研究中心）

AI总结本文展示如何利用行为树设计模块化、多功能且稳健的控制架构，用于关键任务系统，特别针对自主水下机器人。研究强调了系统安全的稳健性、执行多种任务的多功能性以及模块化在结合稳健性和多功能性中的重要性。

Comments Submitted to 2018 IEEE OES Autonomous Underwater Vehicle Symposium

1810.13087 2026-06-04 cs.RO cs.FL cs.SY eess.SY math.OC 版本更新

Multirobot Coordination with Counting Temporal Logics

多机器人协调与计数时序逻辑

Yunus Emre Sahin, Petter Nilsson, Necmiye Ozay

发表机构 * Department of Electrical Engineering and Computer Science（电气工程与计算机科学系）； University of Michigan（密歇根大学）； Department of Mechanical and Civil Engineering（机械与土木工程系）； California Institute of Technology（加州理工学院）

AI总结本文提出了一种基于计数时序逻辑的多机器人协调方法，通过优化算法生成轨迹以保证同步执行时满足给定的逻辑公式，并展示了在机器人动力学相同的情况下，使用计数线性时序逻辑（cLTL）能更高效地解决规划问题，同时讨论了异步执行下保持逻辑规范性的方法及鲁棒轨迹生成。

Comments Under submission for a journal

详情

AI中文摘要

在许多多机器人应用中，规划轨迹以确保机器人集体行为满足某种高层规范至关重要。受此问题启发，我们引入了计数时序逻辑——一种能够简洁表达多机器人任务规范的正式语言，适用于可能无限的时域。我们首先介绍了一种通用逻辑，称为计数线性时序逻辑加（cLTL+），并提出了一种基于优化的方法，生成个体轨迹，使得在这些轨迹同步执行时满足给定的cLTL+公式。我们随后介绍cLTL+的一个片段，称为计数线性时序逻辑（cLTL），并展示当所有机器人具有相同动力学时，使用cLTL约束的规划问题的解决方案可以更高效地获得。在论文的第二部分，我们放松同步假设，讨论如何生成可以在异步执行下保持所需cLTL+规范性的轨迹。特别是，我们证明当机器人之间的异步性受限制时，本文提出的方法可以修改以生成鲁棒轨迹。我们通过实验演示这些想法，并提供数值结果，展示该方法的可扩展性。

英文摘要

In many multirobot applications, planning trajectories in a way to guarantee that the collective behavior of the robots satisfies a certain high-level specification is crucial. Motivated by this problem, we introduce counting temporal logics---formal languages that enable concise expression of multirobot task specifications over possibly infinite horizons. We first introduce a general logic called counting linear temporal logic plus (cLTL+), and propose an optimization-based method that generates individual trajectories such that satisfaction of a given cLTL+ formula is guaranteed when these trajectories are synchronously executed. We then introduce a fragment of cLTL+, called counting linear temporal logic (cLTL), and show that a solution to planning problem with cLTL constraints can be obtained more efficiently if all robots have identical dynamics. In the second part of the paper, we relax the synchrony assumption and discuss how to generate trajectories that can be asynchronously executed, while preserving the satisfaction of the desired cLTL+ specification. In particular, we show that when the asynchrony between robots is bounded, the method presented in this paper can be modified to generate robust trajectories. We demonstrate these ideas with an experiment and provide numerical results that showcase the scalability of the method.

URL PDF HTML ☆

赞 0 踩 0

1810.13072 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Formal Verification of Neural Network Controlled Autonomous Systems

神经网络控制自主系统的形式验证

Xiaowu Sun, Haitham Khedr, Yasser Shoukry

发表机构 * Department of Electrical ； Computer Engineering University of Maryland, College Park

AI总结本文研究了如何形式验证配备神经网络控制器的自主机器人在LiDAR图像处理中安全性的核心问题，通过构建有限状态抽象并利用可达性分析计算安全的初始条件，提出了一种多项式时间算法来分区工作空间并计算对应的仿射成像函数，同时利用SMC编码分析神经网络行为，通过数值模拟验证了算法的效率。

详情

AI中文摘要

在本文中，我们考虑了正式验证配备神经网络（NN）控制器的自主机器人在处理LiDAR图像以产生控制动作时的安全性问题。给定一个由一组多边形障碍物特征化的工作空间，我们的目标是计算一组安全的初始条件，使得从这些初始条件出发的机器人轨迹能够保证避开障碍物。我们的方法是构建系统的有限状态抽象，并利用标准的可达性分析在有限状态抽象上计算安全的初始状态集。计算有限状态抽象的第一个技术问题是数学建模将机器人位置映射到LiDAR图像的成像函数。为此，我们引入了成像适应集的概念，作为工作空间的分区，在这些分区中，成像函数被保证为仿射的。我们开发了一种多项式时间算法，用于将工作空间划分为成像适应集并计算相应的仿射成像函数。给定这种工作空间分区，机器人的离散时间线性动力学以及一个预训练的具有修正线性单元（ReLU）非线性的神经网络控制器，第二个技术挑战是分析神经网络的行为。为此，我们利用满足模凸（SMC）编码来枚举所有可能的ReLU段落。SMC求解器随后使用布尔可满足性求解器和凸优化求解器，将问题分解为更小的子问题。为了加速这个过程，我们开发了一种预处理算法，可以快速修剪可行的ReLU段落。最后，我们通过数值模拟验证了所提出算法的效率，模拟中神经网络控制器的复杂性逐渐增加。

英文摘要

In this paper, we consider the problem of formally verifying the safety of an autonomous robot equipped with a Neural Network (NN) controller that processes LiDAR images to produce control actions. Given a workspace that is characterized by a set of polytopic obstacles, our objective is to compute the set of safe initial conditions such that a robot trajectory starting from these initial conditions is guaranteed to avoid the obstacles. Our approach is to construct a finite state abstraction of the system and use standard reachability analysis over the finite state abstraction to compute the set of the safe initial states. The first technical problem in computing the finite state abstraction is to mathematically model the imaging function that maps the robot position to the LiDAR image. To that end, we introduce the notion of imaging-adapted sets as partitions of the workspace in which the imaging function is guaranteed to be affine. We develop a polynomial-time algorithm to partition the workspace into imaging-adapted sets along with computing the corresponding affine imaging functions. Given this workspace partitioning, a discrete-time linear dynamics of the robot, and a pre-trained NN controller with Rectified Linear Unit (ReLU) nonlinearity, the second technical challenge is to analyze the behavior of the neural network. To that end, we utilize a Satisfiability Modulo Convex (SMC) encoding to enumerate all the possible segments of different ReLUs. SMC solvers then use a Boolean satisfiability solver and a convex programming solver and decompose the problem into smaller subproblems. To accelerate this process, we develop a pre-processing algorithm that could rapidly prune the space feasible ReLU segments. Finally, we demonstrate the efficiency of the proposed algorithms using numerical simulations with increasing complexity of the neural network controller.

URL PDF HTML ☆

赞 0 踩 0

1706.08932 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Iterative Sequential Action Control for Stable, Model-Based Control of Nonlinear Systems

迭代序列动作控制用于非线性系统的稳定模型控制

Emmanouil Tzorakoleftherakis, Todd Murphey

发表机构 * Neuroscience and Robotics Laboratory (N×R)（神经科学与机器人实验室）

AI总结本文提出了一种迭代序列动作控制(iSAC)方法，用于非线性系统的控制，该方法通过在时间步之间迭代更新常数控制值来获得闭环渐近稳定性，并探讨了渐近衰减扰动对系统轨迹的影响。

详情

Journal ref: IEEE Transactions on Automatic Control, 2018

AI中文摘要

本文提出了迭代序列动作控制(iSAC)，一种用于非线性系统的递推时间窗口控制方法。iSAC方法具有闭式开环解，通过在时间步之间迭代更新，引入常数控制值用于短时间应用。在成本上应用收缩约束被证明在温和假设下可以实现闭环渐近稳定性。还研究了渐近衰减扰动对系统轨迹的影响。为了展示iSAC在各种系统和条件下的适用性，我们采用了五个不同的系统，包括一个基于四元数的13维四旋翼。每个系统在不同的场景中进行测试，从可行和不可行的轨迹跟踪到设定点稳定化，有或没有外部扰动的存在。最后讨论了该工作的局限性。

英文摘要

This paper presents iterative Sequential Action Control (iSAC), a receding horizon approach for control of nonlinear systems. The iSAC method has a closed-form open-loop solution, which is iteratively updated between time steps by introducing constant control values applied for short duration. Application of a contractive constraint on the cost is shown to lead to closed-loop asymptotic stability under mild assumptions. The effect of asymptotically decaying disturbances on system trajectories is also examined. To demonstrate the applicability of iSAC to a variety of systems and conditions, we employ five different systems, including a 13-dimensional quaternion-based quadrotor. Each system is tested in different scenarios, ranging from feasible and infeasible trajectory tracking, to setpoint stabilization, with or without the presence of external disturbances. Finally, limitations of this work are discussed.

URL PDF HTML ☆

赞 0 踩 0

1802.00285 2026-06-04 cs.CV cs.RO cs.SY eess.SY 版本更新

Virtual-to-Real: Learning to Control in Visual Semantic Segmentation

虚拟到现实：学习在视觉语义分割中的控制

Zhang-Wei Hong, Chen Yu-Ming, Shih-Yang Su, Tzu-Yun Shann, Yi-Hsiang Chang, Hsuan-Kung Yang, Brian Hsi-Lin Ho, Chih-Chieh Tu, Yueh-Chuan Chang, Tsu-Ching Hsiao, Hsin-Wei Hsiao, Sih-Pin Lai, Chun-Yi Lee

发表机构 * Elsa Lab（Elsa实验室）； Department of Computer Science（计算机科学系）； National Tsing Hua University（国立清华大学）

AI总结本文提出了一种模块化架构，通过将感知模块和控制策略模块结合，利用语义图像分割作为元表示，解决虚拟到现实的迁移问题，并在障碍避让和目标跟随任务中展示了优越的性能。

Comments 7 pages, accepted by IJCAI-18

详情

AI中文摘要

在欧几里得空间中系统控制器设计方法

Dong Eui Chang

发表机构 * Electrical Engineering（电气工程）； Korea Advanced Institute of Science（韩国科学技术院）

AI总结本文提出了一种在欧几里得空间中为定义在流形上的系统设计控制器的新方法，通过将状态空间流形嵌入到欧几里得空间中，并在该空间中扩展系统以增加最终动态中的横截稳定性，从而设计出适用于原始系统的控制器。

Comments International Journal of Robust and Nonlinear Control (Accepted July 2018

详情

DOI: 10.1002/rnc.4294
Journal ref: International J of Robust and Nonlinear Control, 28(16), 4981--4998, 2018

AI中文摘要

本文提出了一种在欧几里得空间中为定义在流形上的系统设计控制器的新方法。该方法的思路是将给定控制系统的状态空间流形M嵌入到某个欧几里得空间R^n中，将系统从M扩展到环境空间R^n，并在该空间中修改系统以在R^n中的最终动态中增加M的横截稳定性。在环境空间R^n中为最终系统设计控制器，然后将其限制到M上，从而得到原始系统在M上的控制器。该方法的优点是仅使用一个单一的全局笛卡尔坐标系在环境空间R^n中进行控制器合成，并且任何在R^n中的控制器设计方法，如线性化方法，都可以全局应用于控制器合成。所提出的方法成功应用于以下两个基准系统的跟踪问题：完全驱动的刚体系统和四旋翼无人机系统。

英文摘要

A new method is developed to design controllers in Euclidean space for systems defined on manifolds. The idea is to embed the state-space manifold $M$ of a given control system into some Euclidean space $\mathbb R^n$, extend the system from $M$ to the ambient space $\mathbb R^n$, and modify it outside $M$ to add transversal stability to $M$ in the final dynamics in $\mathbb R^n$. Controllers are designed for the final system in the ambient space $\mathbb R^n$. Then, their restriction to $M$ produces controllers for the original system on $M$. This method has the merit that only one single global Cartesian coordinate system in the ambient space $\mathbb R^n$ is used for controller synthesis, and any controller design method in $\mathbb R^n$, such as the linearization method, can be globally applied for the controller synthesis. The proposed method is successfully applied to the tracking problem for the following two benchmark systems: the fully actuated rigid body system and the quadcopter drone system.

URL PDF HTML ☆

赞 0 踩 0

1706.05104 2026-06-04 cs.RO cs.ET cs.SY eess.SY 版本更新

Personal Food Computer: A new device for controlled-environment agriculture

个人食物计算机：一种用于受控环境农业的新设备

Eduardo Castelló Ferrer, Jake Rye, Gordon Brander, Tim Savas, Douglas Chambers, Hildreth England, Caleb Harper

发表机构 * MIT Media Lab（麻省理工学院媒体实验室）

AI总结本文提出了一种低成本的桌面平台OpenAg Personal Food Computer（PFC），旨在为植物物候研究、爱好者、制作者和K-12教师提供工具，支持集体数据共享和植物生长分析。

Comments 9 pages, 11 figures, Accepted at the 2017 Future Technologies Conference (FTC)

详情

DOI: 10.1007/978-3-030-02683-7_79

AI中文摘要

由于其跨学科性质，受控环境农业设备有可能成为研究植物物候和在各种学科中创建课程的理想工具。受控环境设备正在增加其功能并改进其可访问性。传统上，从头开始建造这些设备需要机械工程、数字电子、编程和能源管理方面的知识。然而，为个人使用设计的有效受控环境设备带来了新的约束和挑战。本文提出了OpenAg Personal Food Computer（PFC）；一种低成本的桌面平台，不仅针对植物物候研究人员，还针对爱好者、制作者和K-12级别的教师。PFC完全开源，并旨在成为可用于集体数据共享和植物生长分析的工具。得益于其模块化设计，PFC可以用于广泛活动。

英文摘要

Due to their interdisciplinary nature, devices for controlled-environment agriculture have the possibility to turn into ideal tools not only to conduct research on plant phenology but also to create curricula in a wide range of disciplines. Controlled-environment devices are increasing their functionalities as well as improving their accessibility. Traditionally, building one of these devices from scratch implies knowledge in fields such as mechanical engineering, digital electronics, programming, and energy management. However, the requirements of an effective controlled environment device for personal use brings new constraints and challenges. This paper presents the OpenAg Personal Food Computer (PFC); a low cost desktop size platform, which not only targets plant phenology researchers but also hobbyists, makers, and teachers from elementary to high-school levels (K-12). The PFC is completely open-source and it is intended to become a tool that can be used for collective data sharing and plant growth analysis. Thanks to its modular design, the PFC can be used in a large spectrum of activities.

URL PDF HTML ☆

赞 0 踩 0

1810.09365 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Coupled Longitudinal and Lateral Control of a Vehicle using Deep Learning

使用深度学习进行车辆纵向和横向控制的耦合控制

Guillaume Devineau, Philip Polack, Florent Altché, Fabien Moutarde

发表机构 * Center for Robotics, MINES ParisTech（机器人中心，巴黎综合理工学院）； PSL Research University（巴黎综合理工大学）

AI总结本文研究了深度神经网络在捕捉车辆动力学关键特性及执行耦合纵向和横向控制方面的潜力，通过高保真车辆动力学模拟数据集训练两种不同的人工神经网络，评估多层感知机和卷积神经网络在复杂测试赛道上的性能，与传统解耦控制器进行比较。

Comments Published in the IEEE 2018 International Conference on Intelligent Transportation Systems (ITSC 2018)

1810.09000 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Safe Adaptive Cruise Control with Road Grade Preview and V2V Communication

安全自适应巡航控制系统与道路坡度预览及车对车通信

Roya Firoozi, Shima Nazari, Jacopo Guanetti, Ryan O'Gorman, Francesco Borrelli

发表机构 * University of Michigan（密歇根大学）

AI总结本文提出了一种安全自适应巡航控制系统，利用道路坡度和前车运动预览，通过模型预测控制框架优化舒适性、安全性和能耗，采用新颖的方法计算鲁棒不变终端集以确保车辆间安全距离，仿真结果验证了该控制算法的有效性。

1803.11247 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Scalable Integrated Task and Motion Planning from Signal Temporal Logic Specifications

可扩展的信号时序逻辑规范下的集成任务和运动规划

Rafael Rodrigues da Silva, Hai Lin

AI总结本文提出了一种可扩展且可证明完整的算法，直接合成连续轨迹以满足非凸的信号时序逻辑规范，通过分离离散任务规划和连续运动规划，并利用高效的求解器找到高维系统中满足非凸STL规范的动态可行轨迹。

Comments 13 pages, report

详情

AI中文摘要

许多安全关键系统必须实现高阶任务规范，并保证安全性和正确性。为实现这一目标，最近通过从信号时序逻辑（STL）规范中合成控制器取得了许多进展。然而，现有方法要么考虑了状态空间的先验离散化，要么只处理STL的凸片段，或无法证明是完整的。我们提出了一种可扩展、可证明完整的算法，直接合成连续轨迹以满足非凸STL规范。我们在线分离离散任务规划和连续运动规划，并利用高效的可满足性模理论（SMT）和线性规划（LP）求解器，为高维系统找到满足非凸STL规范的动态可行轨迹。所提出的设计算法已被证明是正确且完整的，仿真结果展示了我们方法的可扩展性。

英文摘要

Many safety-critical systems must achieve high-level task specifications with guaranteed safety and correctness. Much recent progress towards this goal has been made through controller synthesis from signal temporal logic (STL) specifications. Existing approaches, however, either consider some a priori discretization of the state-space, deal only with a convex fragment of STL, or are not provably complete. We propose a scalable, provably complete algorithm that directly synthesizes continuous trajectories to satisfy non-convex STL specifications. We separate discrete task planning and continuous motion planning on the fly and harness highly efficient satisfiability modulo theories (SMT) and linear programming (LP) solvers to find dynamically feasible trajectories for high dimensional systems that satisfies non-convex STL specifications. The proposed design algorithms are proved sound and complete, and simulation results demonstrate the scalability of our approach.

URL PDF HTML ☆

赞 0 踩 0

1506.02438 2026-06-04 cs.LG cs.RO cs.SY eess.SY 版本更新

High-Dimensional Continuous Control Using Generalized Advantage Estimation

利用广义优势估计进行高维连续控制

John Schulman, Philipp Moritz, Sergey Levine, Michael Jordan, Pieter Abbeel

发表机构 * Department of Electrical Engineering and Computer Science（电气工程与计算机科学系）； University of California, Berkeley（加州大学伯克利分校）

AI总结本文提出了一种基于广义优势估计的方法，通过减少策略梯度估计的方差来解决高维连续控制中的样本需求问题，并通过信任区域优化提高稳定性和收敛性，从而在复杂的3D运动任务中实现了高效的政策学习。

详情

AI中文摘要

策略梯度方法在强化学习中受到青睐，因为它们直接优化累积奖励，并且可以方便地与非线性函数近似器如神经网络结合使用。主要挑战是通常需要大量的样本，以及在输入数据非平稳性下获得稳定和持续改进的难度。我们通过使用价值函数来显著减少策略梯度估计的方差（尽管引入了偏差），并利用类似于TD(λ)的指数加权优势函数估计来解决第一个挑战。我们通过使用信任区域优化过程来解决第二个挑战，该过程用于策略和价值函数，它们由神经网络表示。我们的方法在高度具有挑战性的3D运动任务中表现出强大的经验结果，包括学习双足和四足仿真实体的行走姿态，以及学习使双足机器人从地面平躺状态站立的策略。与使用手工制定政策表示的先前工作相比，我们的神经网络策略直接从原始运动学映射到关节扭矩。我们的算法是完全模型无关的，并且在3D双足机器人上的学习任务所需的模拟经验时间相当于1-2周的真实时间。

英文摘要

Policy gradient methods are an appealing approach in reinforcement learning because they directly optimize the cumulative reward and can straightforwardly be used with nonlinear function approximators such as neural networks. The two main challenges are the large number of samples typically required, and the difficulty of obtaining stable and steady improvement despite the nonstationarity of the incoming data. We address the first challenge by using value functions to substantially reduce the variance of policy gradient estimates at the cost of some bias, with an exponentially-weighted estimator of the advantage function that is analogous to TD(lambda). We address the second challenge by using trust region optimization procedure for both the policy and the value function, which are represented by neural networks. Our approach yields strong empirical results on highly challenging 3D locomotion tasks, learning running gaits for bipedal and quadrupedal simulated robots, and learning a policy for getting the biped to stand up from starting out lying on the ground. In contrast to a body of prior work that uses hand-crafted policy representations, our neural network policies map directly from raw kinematics to joint torques. Our algorithm is fully model-free, and the amount of simulated experience required for the learning tasks on 3D bipeds corresponds to 1-2 weeks of real time.

URL PDF HTML ☆

赞 0 踩 0

1810.05683 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Long-Duration Autonomy for Small Rotorcraft UAS including Recharging

小型旋翼机无人机的长期自主性包括充电

Christian Brommer, Danylo Malyuta, Daniel Hentzen, Roland Brockers

发表机构 * Autonomous Controls Laboratory, University of Washington（华盛顿大学自主控制实验室）； Jet Propulsion Laboratory, California Institute of Technology（加州理工学院喷气推进实验室）

AI总结本研究提出了一种完全自主的小型旋翼机无人机，能够在无人干预的情况下执行长期观测任务，通过全平台自主性和基于视觉的精确着陆技术实现自动能源补充，实验结果展示了其在室内和室外环境中的11小时自主操作能力。

Comments 7 pages

1807.09904 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

A Data-Efficient Approach to Precise and Controlled Pushing

一种数据高效且精确可控的推动作方法

Maria Bauza, Francois R. Hogan, Alberto Rodriguez

发表机构 * Department of Mechanical Engineering — Massachusetts Institute of Technology（机械工程系——麻省理工学院）

AI总结本文提出了一种数据高效的方法，通过学习动态模型来控制复杂机械系统，仅需10个数据点即可完成复杂的推动作轨迹。

Comments Maria Bauza and Francois R. Hogan contributed equally to this work. 10 pages, 5 figures

1803.01940 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Tactile Regrasp: Grasp Adjustments via Simulated Tactile Transformations

触觉重抓：通过模拟触觉变换进行抓取调整

Francois R. Hogan, Maria Bauza, Oleguer Canal, Elliott Donlon, Alberto Rodriguez

发表机构 * Massachusetts Institute of Technology（麻省理工学院）

AI总结本文提出了一种新的重抓控制策略，利用触觉传感进行局部抓取调整。该方法通过虚拟搜索局部触觉测量变换来提高抓取质量。首先，使用深度卷积神经网络构建基于触觉的抓取质量度量，该网络在超过2800次抓取上进行训练。每个抓取的质量是一个介于0和1之间的连续值，通过实验测量其对外部扰动的抵抗性来确定。其次，通过刚体变换模拟机器人运动相对于初始抓取的触觉印记，新生成的触觉印记与学习的抓取质量网络进行评估，选择最大化抓取质量的重抓动作。结果表明，抓取质量网络在已知物体上的平均准确率为85%，在12个物体的交叉验证集上的准确率为75%。重抓控制策略在8个物体的测试集上将抓取动作的成功率提高了70%。

Comments Francois R. Hogan and Maria Bauza contributed equally to this work. 8 pages, 7 figures

详情

Journal ref: IROS 2018

AI中文摘要

本文提出了一种新颖的重抓控制策略，利用触觉传感进行局部抓取调整。我们的方法通过虚拟搜索局部触觉测量变换来确定重抓动作。首先，我们使用深度卷积神经网络构建基于触觉的抓取质量度量，该网络在超过2800次抓取上进行训练。每个抓取的质量是一个介于0和1之间的连续值，通过实验测量其对外部扰动的抵抗性来确定。其次，我们通过执行刚体变换，模拟机器人运动相对于初始抓取的触觉印记。新生成的触觉印记与学习的抓取质量网络进行评估，重抓动作被选择以最大化抓取质量。结果表明，抓取质量网络在已知物体上的平均准确率为85%，在12个物体的交叉验证集上的准确率为75%。重抓控制策略在8个物体的测试集上将抓取动作的成功率提高了70%。

英文摘要

This paper presents a novel regrasp control policy that makes use of tactile sensing to plan local grasp adjustments. Our approach determines regrasp actions by virtually searching for local transformations of tactile measurements that improve the quality of the grasp. First, we construct a tactile-based grasp quality metric using a deep convolutional neural network trained on over 2800 grasps. The quality of each grasp, a continuous value between 0 and 1, is determined experimentally by measuring its resistance to external perturbations. Second, we simulate the tactile imprints associated with robot motions relative to the initial grasp by performing rigid-body transformations of the given tactile measurements. The newly generated tactile imprints are evaluated with the learned grasp quality network and the regrasp action is chosen to maximize the grasp quality. Results show that the grasp quality network can predict the outcome of grasps with an average accuracy of 85% on known objects and 75% on a cross validation set of 12 objects. The regrasp control policy improves the success rate of grasp actions by an average relative increase of 70% on a test set of 8 objects.

URL PDF HTML ☆

赞 0 踩 0

1802.04205 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Efficient Hierarchical Robot Motion Planning Under Uncertainty and Hybrid Dynamics

在不确定性和混合动力学下的高效机器人运动规划

Ajinkya Jain, Scott Niekum

发表机构 * Department of Mechanical Engineering（机械工程系）； Department of Computer Science（计算机科学系）； University of Texas at Austin, USA（得克萨斯大学奥斯汀分校）

AI总结本文提出了一种分层POMDP规划器，用于在存在不确定性的情况下为混合动力学模型生成成本优化的运动计划，通过将非线性动力学分解为离散的局部动力学模型，从而有效减少状态不确定性。

Comments 2nd Conference on Robot Learning (CoRL 2018), Zurich, Switzerland

详情

AI中文摘要

嘈杂的观测与非线性动力学是机器人运动规划中最大的挑战之一。通过将非线性动力学分解为一组离散的局部动力学模型，混合动力学提供了一种自然的方式来建模非线性动力学，尤其是在由于接触等因素导致动力学突然不连续的系统中。我们提出了一种分层POMDP规划器，该规划器为混合动力学模型开发成本优化的运动计划。分层规划器首先开发一个高层运动计划，以确定要访问的局部动力学模型的顺序，然后将其转换为详细的连续状态计划。这种分层规划方法将POMDP规划问题分解为更小的子部分，这些子部分可以以显著降低的计算成本解决。能够按顺序访问局部动力学模型的能力也提供了一种强大的方法，利用混合动力学来减少状态不确定性。我们在模拟领域导航任务和具有机械臂的装配任务上评估了所提出的规划器，证明了我们的方法可以有效解决具有高观测噪声和非线性动力学的任务，且计算成本显著低于直接规划方法。

英文摘要

Noisy observations coupled with nonlinear dynamics pose one of the biggest challenges in robot motion planning. By decomposing nonlinear dynamics into a discrete set of local dynamics models, hybrid dynamics provide a natural way to model nonlinear dynamics, especially in systems with sudden discontinuities in dynamics due to factors such as contacts. We propose a hierarchical POMDP planner that develops cost-optimized motion plans for hybrid dynamics models. The hierarchical planner first develops a high-level motion plan to sequence the local dynamics models to be visited and then converts it into a detailed continuous state plan. This hierarchical planning approach results in a decomposition of the POMDP planning problem into smaller sub-parts that can be solved with significantly lower computational costs. The ability to sequence the visitation of local dynamics models also provides a powerful way to leverage the hybrid dynamics to reduce state uncertainty. We evaluate the proposed planner on a navigation task in the simulated domain and on an assembly task with a robotic manipulator, showing that our approach can solve tasks having high observation noise and nonlinear dynamics effectively with significantly lower computational costs compared to direct planning approaches.

URL PDF HTML ☆

赞 0 踩 0

1810.03074 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Hierarchical Optimization for Whole-Body Control of Wheeled Inverted Pendulum Humanoids

为轮式倒立摆人形机器人的全身控制设计分层优化

Munzir Zafar, Seth Hutchinson, Evangelos A. Theodorou

发表机构 * Institute of Robotics and Intelligent Machines（机器人与智能机构研究所）

AI总结本文提出了一种用于轮式倒立摆人形机器人的全身控制框架，通过分层优化方法实现多任务的同时执行，同时考虑关节角度和扭矩限制，以提高整体性能。

详情

AI中文摘要

本文提出了一种用于轮式倒立摆人形机器人的全身控制框架。轮式倒立摆人形机器人是一种具有多个自由度的冗余操作臂，能够动态地在轮子上保持平衡。这些机器人能够同时执行多种任务，如平衡、维持身体姿态、控制视线、举重物或维持末端执行器在操作空间中的配置。全身控制问题旨在在指定优先级下，通过最优利用所有自由度的同时执行这些任务。控制还必须遵守每个关节的角度和扭矩限制。所提出的方法是分层的，包括一个低级控制器用于身体关节的操作，以及一个高级控制器定义用于低级控制器的质心（CoM）目标，以控制系统的零动力学，从而驱动轮子。低级控制器在考虑系统更完整的动力学时计划较短的规划范围，而高级控制器则基于对机器人近似模型的规划来提高计算效率，以规划更长的规划范围。

英文摘要

In this paper, we present a whole-body control framework for Wheeled Inverted Pendulum (WIP) Humanoids. WIP Humanoids are redundant manipulators dynamically balancing themselves on wheels. Characterized by several degrees of freedom, they have the ability to perform several tasks simultaneously, such as balancing, maintaining a body pose, controlling the gaze, lifting a load or maintaining end-effector configuration in operation space. The problem of whole-body control is to enable simultaneous performance of these tasks with optimal participation of all degrees of freedom at specified priorities for each objective. The control also has to obey constraint of angle and torque limits on each joint. The proposed approach is hierarchical with a low level controller for body joints manipulation and a high-level controller that defines center of mass (CoM) targets for the low-level controller to control zero dynamics of the system driving the wheels. The low-level controller plans for shorter horizons while considering more complete dynamics of the system, while the high-level controller plans for longer horizon based on an approximate model of the robot for computational efficiency.

URL PDF HTML ☆

赞 0 踩 0

1810.01577 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Moment-Sum-Of-Squares Approach For Fast Risk Estimation In Uncertain Environments

基于矩-平方和方法的不确定环境中的快速风险估计

Ashkan Jasour, Andreas Hofmann, Brian C. Williams

发表机构 * MIT（麻省理工学院）； Computer Science and Artificial Intelligence Laboratory（计算机科学与人工智能实验室）

AI总结本文提出了一种基于矩-平方和的方法，用于在存在有界不确定性的环境中快速估计机器人安全约束违反的概率。该方法利用多项式水平集描述不安全集，通过求解平方和优化问题获得单变量切比雪夫多项式的系数，从而在实时条件下利用有限的矩估计风险。

Comments 57th IEEE Conference on Decision and Control 2018

详情

网络化移动机器人故障容错控制

Pietro Pierpaoli, Dominique Sauter, Magnus Egerstedt

AI总结本文提出了一种两阶段技术，用于网络化移动机器人中偏置测量代理的识别与补偿，通过单个观测者代理部署的故障识别滤波器估计网络中的单个故障，并在检测到故障后启动最优领导者基于的补偿策略。

Comments 7 pages, 7 figures, conference

详情

AI中文摘要

网络化自主代理已被用于许多应用，如移动传感器网络和智能交通系统。然而，在此类系统中，一个或多个子系统的故障或误差效应可能迅速扩散到整个网络，快速降低系统整体性能。在以共识驱动的动力学中，故障效应尤为重要，因为系统传递函数中存在无约束刚性模式。本文提出了一种两阶段技术，用于网络化移动机器人中偏置测量代理的识别与补偿，在时间不变的交互拓扑下进行。我们假设这些交互仅以相对位置测量的形式发生。一个部署在单个观测者代理上的故障识别滤波器用于估计网络中任何位置发生的单个故障。一旦检测到故障，就会启动最优领导者基于的补偿策略。结果通过数值模拟和机器人实验呈现。

英文摘要

Teams of networked autonomous agents have been used in a number of applications, such as mobile sensor networks and intelligent transportation systems. However, in such systems, the effect of faults and errors in one or more of the sub-systems can easily spread throughout the network, quickly degrading the performance of the entire system. In consensus-driven dynamics, the effects of faults are particularly relevant because of the presence of unconstrained rigid modes in the transfer function of the system. Here, we propose a two-stage technique for the identification and accommodation of a biased-measurements agent, in a network of mobile robots with time invariant interaction topology. We assume these interactions to only take place in the form of relative position measurements. A fault identification filter deployed on a single observer agent is used to estimate a single fault occurring anywhere in the network. Once the fault is detected, an optimal leader-based accommodation strategy is initiated. Results are presented by means of numerical simulations and robot experiments.

URL PDF HTML ☆

赞 0 踩 0

1809.10012 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Using Neural Networks to Generate Information Maps for Mobile Sensors

用神经网络为移动传感器生成信息图

Louis Dressel, Mykel J. Kochenderfer

AI总结本文提出利用卷积神经网络实时生成移动传感器的信息图，以提高轨迹生成的效率和准确性。

Comments Accepted to the 2018 IEEE Conference on Decision and Control (CDC)

1809.09716 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Sampling-based Polytopic Trees for Approximate Optimal Control of Piecewise Affine Systems

基于采样的多面体树用于分段仿射系统的近似最优控制

Sadra Sadraddini, Russ Tedrake

AI总结本文提出了一种基于采样的多面体树方法，用于分段仿射系统的近似最优控制，通过混合整数凸规划将轨迹优化与反馈控制结合，以满足硬约束条件。

Comments Under Review

详情

AI中文摘要

分段仿射（PWA）系统广泛用于建模高度非线性行为，如机器人运动和操作中的接触动力学。现有PWA系统的控制技术在离线设计和在线实现中都有计算上的缺点。本文介绍了一种方法，用于获得反馈控制策略和对应的可接受初始条件，使得所有闭环轨迹都能达到目标多面体，同时优化成本函数。该方法的概念类似于LQR-trees [1]，包括三个步骤：（1）开环轨迹优化，（2）反馈控制用于计算轨迹周围的“漏斗”状态，以及（3）以树状方式重复（1）和（2），使漏斗从目标向后生长并尽可能填充状态空间。我们展示了PWA动力学可以被利用将步骤（1）和（2）结合为一个步骤，该步骤通过混合整数凸规划解决，使该方法适用于处理硬约束。接触动力学的示例进行了说明。

英文摘要

Piecewise affine (PWA) systems are widely used to model highly nonlinear behaviors such as contact dynamics in robot locomotion and manipulation. Existing control techniques for PWA systems have computational drawbacks, both in offline design and online implementation. In this paper, we introduce a method to obtain feedback control policies and a corresponding set of admissible initial conditions for discrete-time PWA systems such that all the closed-loop trajectories reach a goal polytope, while a cost function is optimized. The idea is conceptually similar to LQR-trees \cite{tedrake2010lqr}, which consists of 3 steps: (1) open-loop trajectory optimization, (2) feedback control for computation of "funnels" of states around trajectories, and (3) repeating (1) and (2) in a way that the funnels are grown backward from the goal in a tree fashion and fill the state-space as much as possible. We show PWA dynamics can be exploited to combine step (1) and (2) into a single step that is tackled using mixed-integer convex programming, which makes the method suitable for dealing with hard constraints. Illustrative examples on contact-based dynamics are presented.

URL PDF HTML ☆

赞 0 踩 0

1809.08819 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Oscillation Damping Control of Pendulum-like Manipulation Platform using Moving Masses

使用移动质量抑制摆动式操作平台的振动

Min Jun Kim, Jianjie Lin, Konstantin Kondak, Dongheui Lee, Christian Ott

发表机构 * Institute of Robotics and Mechatronics, German Aerospace Center (DLR), Wessling, Germany（机器人与机电研究所，德国航空航天中心（DLR），德国韦斯林）； Chair of Automatic Control Engineering, Technical University of Munich (TUM), Munich, Germany（自动控制工程系，慕尼黑技术大学（TUM），德国慕尼黑）； Fortiss Institute, Munich, Germany（Fortiss研究所，德国慕尼黑）

AI总结本文提出了一种通过在平台上安装移动质量来抑制机器人操作臂悬挂平台振动的方法，通过合理设计移动质量的参考加速度实现平台的渐近稳定性，同时克服了欠驱动带来的挑战。

Comments IFAC Symposium on Robot Control (SYROCO) 2018

1602.01891 2026-06-04 cs.RO cs.MA cs.SY eess.SY math.OC 版本更新

Distributed Estimation of State and Parameters in Multi-Agent Cooperative Load Manipulation

多智能体协同负载操作中状态与参数的分布式估计

Antonio Franchi, Antonio Petitti, Alessandro Rizzo

发表机构 * CNRS, LAAS（法国国家科学研究中心（CNRS）、拉瓦尔大学（LAAS））

AI总结本文提出两种分布式方法，用于估计未知平面体的运动学参数、动力学参数和运动学状态，利用刚体运动学和动力学、非线性观测理论和一致性算法，通过智能体对负载施加二维力矩、测量接触点速度以及通信图连通性来实现，理论分析和收敛证明均提供，第一种方法假设参数恒定，第二种方法可处理时变参数并可并行应用于任何任务导向的控制律，对于无控制律的情况，提出了一种分布式且安全的控制策略以满足可观测性条件，通过现实的蒙特卡洛模拟展示了估计策略的有效性和鲁棒性。

Comments Accepted for publication to the IEEE Transactions on Control of Network Systems

详情

AI中文摘要

我们提出了两种分布式方法，用于估计未知平面体的运动学参数、动力学参数和运动学状态，这些方法依赖于刚体运动学和动力学、非线性观测理论和一致性算法。唯一三个要求是每个智能体可以对负载施加二维力矩，可以测量其接触点的速度，并且通信图是连通的。提供了理论非线性可观测性分析和收敛证明。第一种方法假设参数恒定，而第二种方法可以处理时变参数，并且可以并行应用于任何任务导向的控制律。对于没有提供控制律的情况，我们提出了一种分布式且安全的控制策略，以满足可观测性条件。通过现实的蒙特卡洛模拟展示了估计策略的有效性和鲁棒性。

英文摘要

We present two distributed methods for the estimation of the kinematic parameters, the dynamic parameters, and the kinematic state of an unknown planar body manipulated by a decentralized multi-agent system. The proposed approaches rely on the rigid body kinematics and dynamics, on nonlinear observation theory, and on consensus algorithms. The only three requirements are that each agent can exert a 2D wrench on the load, it can measure the velocity of its contact point, and that the communication graph is connected. Both theoretical nonlinear observability analysis and convergence proofs are provided. The first method assumes constant parameters while the second one can deal with time-varying parameters and can be applied in parallel to any task-oriented control law. For the cases in which a control law is not provided, we propose a distributed and safe control strategy satisfying the observability condition. The effectiveness and robustness of the estimation strategy is showcased by means of realistic MonteCarlo simulations.

URL PDF HTML ☆

赞 0 踩 0

1809.08022 2026-06-04 cs.RO cs.SY eess.SY 版本更新

The Urban Last Mile Problem: Autonomous Drone Delivery to Your Balcony

城市最后一公里问题：自主无人机送货到您的阳台

Gino Brunner, Bence Szebedy, Simon Tanner, Roger Wattenhofer

发表机构 * Computer Engineering and Networks Laboratory（计算机工程与网络实验室）

AI总结本文提出了一种基于商用无人机的城市最后一公里自主送货方法，通过GPS定位和视觉导航实现对阳台或门廊等非集中地点的精准配送，并开源代码以促进未来研究。

详情

AI中文摘要

无人机送货在过去几年中已成为行业热点。然而，现有方法要么专注于农村地区，要么依赖于集中式配送点来执行最后一公里配送。在本文中，我们使用商用无人机解决城市环境中自主最后一公里配送的问题。我们构建了一个原型系统，该系统能够利用GPS飞向近似配送位置，然后使用视觉导航找到精确的配送位置。配送位置可能例如在阳台或门廊上，并且只需在墙上或窗户上用视觉标记指示即可。我们测试了我们的系统组件在模拟环境中的表现，包括视觉导航和避障。最后，我们在现实环境中部署了我们的无人机，并展示了它如何在阳台上找到配送点。为了促进该主题的未来研究，我们开源了我们的代码。

英文摘要

Drone delivery has been a hot topic in the industry in the past few years. However, existing approaches either focus on rural areas or rely on centralized drop-off locations from where the last mile delivery is performed. In this paper we tackle the problem of autonomous last mile delivery in urban environments using an off-the-shelf drone. We build a prototype system that is able to fly to the approximate delivery location using GPS and then find the exact drop-off location using visual navigation. The drop-off location could, e.g., be on a balcony or porch, and simply needs to be indicated by a visual marker on the wall or window. We test our system components in simulated environments, including the visual navigation and collision avoidance. Finally, we deploy our drone in a real-world environment and show how it can find the drop-off point on a balcony. To stimulate future research in this topic we open source our code.

URL PDF HTML ☆

赞 0 踩 0

1803.02238 2026-06-04 cs.RO cs.CL cs.SY eess.SY 版本更新

Precise but Natural Specification for Robot Tasks

机器人任务的精确但自然的规范

Ivan Gavran, Brendon Boldt, Eva Darulova, Rupak Majumdar

发表机构 * Max Planck Institute for Software Systems, Germany（德国马克斯·普朗克软件研究所）

AI总结 Flipper通过自然语言接口实现机器人高阶任务规范，结合形式化核心语言与语义解析器，提供可视化反馈并支持自然语言扩展，提升任务描述效率。

详情

AI中文摘要

我们提出了Flipper，一种自然语言接口，用于描述机器人高阶任务规范并编译为机器人动作。Flipper始于形式化核心语言，允许表达丰富的时序规范，并通过语义解析器提供自然语言接口。Flipper通过在图形用户界面中执行自动构建的计划提供即时视觉反馈，允许用户解决潜在的歧义解释。Flipper通过自然化扩展自身：用户可以添加 utterances 的定义，Flipper 由此诱导新规则并将其添加到核心语言中，逐渐形成更加自然的任务规范语言。Flipper通过泛化用户提供的定义来改进自然化。与其他任务规范系统不同，Flipper在保持编程语言的表达力和形式精确性的同时，实现了自然语言交互。我们通过初始用户研究证明，自然语言交互和泛化可以显著简化任务描述。此外，随着时间推移，用户会使用更多超出初始核心语言的概念。这些扩展可供Flipper社区使用，用户可以使用其他人定义的概念。

英文摘要

We present Flipper, a natural language interface for describing high-level task specifications for robots that are compiled into robot actions. Flipper starts with a formal core language for task planning that allows expressing rich temporal specifications and uses a semantic parser to provide a natural language interface. Flipper provides immediate visual feedback by executing an automatically constructed plan of the task in a graphical user interface. This allows the user to resolve potentially ambiguous interpretations. Flipper extends itself via naturalization: its users can add definitions for utterances, from which Flipper induces new rules and adds them to the core language, gradually growing a more and more natural task specification language. Flipper improves the naturalization by generalizing the definition provided by users. Unlike other task-specification systems, Flipper enables natural language interactions while maintaining the expressive power and formal precision of a programming language. We show through an initial user study that natural language interactions and generalization can considerably ease the description of tasks. Moreover, over time, users employ more and more concepts outside of the initial core language. Such extensions are available to the Flipper community, and users can use concepts that others have defined.

URL PDF HTML ☆

赞 0 踩 0

1710.06537 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Sim-to-Real Transfer of Robotic Control with Dynamics Randomization

机器人控制的仿真到现实转移与动力学随机化

Xue Bin Peng, Marcin Andrychowicz, Wojciech Zaremba, Pieter Abbeel

发表机构 * OpenAI

AI总结本文提出一种简单方法通过随机化仿真动力学来弥合现实与仿真的差距，使策略能适应不同动态，从而在无需真实系统训练的情况下实现现实世界泛化。

详情

DOI: 10.1109/ICRA.2018.8460528

AI中文摘要

仿真环境为训练智能体提供了丰富的数据源，并在训练过程中减少了某些安全方面的担忧。但智能体在仿真中开发的行为往往特定于模拟器的特性。由于建模误差，仿真中表现良好的策略可能无法转移到现实世界。本文提出了一种简单的方法来弥合这一“现实差距”。通过在训练过程中随机化模拟器的动力学，我们能够开发出能够适应非常不同动力学的策略，包括那些与策略训练所基于的动力学有显著差异的动力学。这种适应性使策略能够在没有对物理系统进行训练的情况下泛化到现实世界的动力学。我们的方法在使用机械臂的物体推动任务上进行了演示。尽管策略仅在仿真中进行训练，但部署在真实机器人上时，其性能仍能保持相似水平，能够可靠地将物体从随机初始配置移动到目标位置。我们探讨了各种设计决策的影响，并展示了所得到的策略对显著校准误差具有鲁棒性。

英文摘要

Simulations are attractive environments for training agents as they provide an abundant source of data and alleviate certain safety concerns during the training process. But the behaviours developed by agents in simulation are often specific to the characteristics of the simulator. Due to modeling error, strategies that are successful in simulation may not transfer to their real world counterparts. In this paper, we demonstrate a simple method to bridge this "reality gap". By randomizing the dynamics of the simulator during training, we are able to develop policies that are capable of adapting to very different dynamics, including ones that differ significantly from the dynamics on which the policies were trained. This adaptivity enables the policies to generalize to the dynamics of the real world without any training on the physical system. Our approach is demonstrated on an object pushing task using a robotic arm. Despite being trained exclusively in simulation, our policies are able to maintain a similar level of performance when deployed on a real robot, reliably moving an object to a desired location from random initial configurations. We explore the impact of various design decisions and show that the resulting policies are robust to significant calibration error.

URL PDF HTML ☆

赞 0 踩 0

1809.06179 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Learning of Multi-Context Models for Autonomous Underwater Vehicles

多情境模型学习用于自主水下车辆

Bilal Wehbe, Octavio Arriaga, Mario Michael Krell, Frank Kirchner

发表机构 * DFKI - Robotic Innovation Center（DFKI机器人创新中心）； Robotics Research Group（机器人研究组）

AI总结本文提出利用LSTM网络学习自主水下车辆的多情境模型，通过实验数据构建仿真模型，生成不同情境并提高分类准确性，展现对噪声的鲁棒性和大数据集的扩展能力。

Comments 6 pages, 7 figures, AUV 2018 author copy

1806.06161 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning

BaRC：机器人强化学习中的逆向可达性课程

Boris Ivanovic, James Harrison, Apoorva Sharma, Mo Chen, Marco Pavone

发表机构 * Department of Mechanical Engineering, Stanford University（斯坦福大学机械工程系）； School of Computing Science, Simon Fraser University（西蒙弗雷泽大学计算机科学学院）

AI总结本文提出BaRC方法，利用物理先验知识设计课程方案，通过逆向可达性策略加速连续控制MDP中模型无关RL算法的训练，提升性能并减少探索需求。

详情

AI中文摘要

模型无关强化学习（RL）为高维系统学习控制策略提供了有吸引力的方法，但其相对差的样本复杂性通常迫使在模拟环境中进行训练。即使在模拟中，具有稀疏自然奖励函数的目标导向任务仍难以被最先进的模型无关算法处理。这些任务的瓶颈在于从系统初始状态获取学习信号所需的大量探索。本文利用物理先验知识（以近似系统动力学模型的形式）设计了一种课程方案，用于模型无关策略优化算法。我们的逆向可达性课程（BaRC）从需要少量动作完成任务的状态开始策略训练，并在策略优化算法表现出足够性能后，以动态一致的方式扩展初始状态分布。BaRC具有通用性，可以加速任何模型无关RL算法在广泛目标导向连续控制MDP上的训练。其课程策略具有物理直观性、易于调节，并允许将物理先验整合到训练中，而不会影响模型无关RL算法的性能、灵活性和适用性。我们在两个代表性的动态机器人学习问题上评估了我们的方法，并发现相对于先前的课程生成技术和朴素探索策略，有显著的性能提升。

英文摘要

Model-free Reinforcement Learning (RL) offers an attractive approach to learn control policies for high-dimensional systems, but its relatively poor sample complexity often forces training in simulated environments. Even in simulation, goal-directed tasks whose natural reward function is sparse remain intractable for state-of-the-art model-free algorithms for continuous control. The bottleneck in these tasks is the prohibitive amount of exploration required to obtain a learning signal from the initial state of the system. In this work, we leverage physical priors in the form of an approximate system dynamics model to design a curriculum scheme for a model-free policy optimization algorithm. Our Backward Reachability Curriculum (BaRC) begins policy training from states that require a small number of actions to accomplish the task, and expands the initial state distribution backwards in a dynamically-consistent manner once the policy optimization algorithm demonstrates sufficient performance. BaRC is general, in that it can accelerate training of any model-free RL algorithm on a broad class of goal-directed continuous control MDPs. Its curriculum strategy is physically intuitive, easy-to-tune, and allows incorporating physical priors to accelerate training without hindering the performance, flexibility, and applicability of the model-free RL algorithm. We evaluate our approach on two representative dynamic robotic learning problems and find substantial performance improvement relative to previous curriculum generation techniques and naive exploration strategies.

URL PDF HTML ☆

赞 0 踩 0

1701.03913 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Modeling and control of a cable-driven series elastic actuator

缆驱动系列弹性执行器的建模与控制

Wulin Zou, Ningbo Yu

发表机构 * Institute of Robotics and Automatic Information Systems, Nankai University（机器人与自动化信息系统研究所，南开大学）； Tianjin Key Laboratory of Intelligent Robotics, Nankai University（天津智能机器人重点实验室，南开大学）

AI总结本文研究缆驱动系列弹性执行器的建模与控制，采用双自由度控制方法提升鲁棒性，验证了其在人机交互中的应用潜力。

1804.01031 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Provably Robust Learning-Based Approach for High-Accuracy Tracking Control of Lagrangian Systems

具有证明鲁棒性的基于学习的方法用于拉格朗日系统高精度跟踪控制

Mohamed K. Helwa, Adam Heins, Angela P. Schoellig

发表机构 * Dynamic Systems Lab（动态系统实验室）； Institute for Aerospace Studies（航空航天研究院）； University of Toronto（多伦多大学）

AI总结本文提出基于高斯过程的新型学习控制方法，确保系统稳定性与高精度跟踪，通过不确定性界保证鲁棒性，并在仿真和实验中验证有效性。

Comments 8 pages, 4 figures, 2 tables, submitted to IEEE Robotics and Automation Letters (RA-L) and the 2019 International Conference on Robotics and Automation (ICRA) (created: March 2018; updated: September 2018)

详情

AI中文摘要

拉格朗日系统涵盖了多种机器人系统，包括机械臂、轮式和腿部机器人以及四旋翼。通常使用逆动力学控制和前馈线性化技术将复杂非线性动力学转换为解耦的二阶积分器，然后使用标准外环控制器计算线性化系统的期望加速度。然而，这些方法通常依赖于非常准确的系统模型，这在实践中往往不可用。尽管文献中使用了不同的学习方法来解决这一挑战，但大多数方法在学习控制系统稳定性方面缺乏安全保证。本文提出了一种基于高斯过程（GPs）的新学习控制方法，确保闭环系统的稳定性和高精度跟踪。我们使用GPs近似命令加速度与系统实际加速度之间的误差，并利用GP预测的均值和方差计算线性化模型不确定性的上界。此不确定性界随后用于鲁棒的外环控制器以确保整个系统的稳定性。此外，我们证明跟踪误差收敛到一个半径可任意小的球体。进一步，我们通过在2自由度平面机械臂上的仿真和6自由度工业机械臂上的实验验证了我们方法的有效性。

英文摘要

Lagrangian systems represent a wide range of robotic systems, including manipulators, wheeled and legged robots, and quadrotors. Inverse dynamics control and feedforward linearization techniques are typically used to convert the complex nonlinear dynamics of Lagrangian systems to a set of decoupled double integrators, and then a standard, outer-loop controller can be used to calculate the commanded acceleration for the linearized system. However, these methods typically depend on having a very accurate system model, which is often not available in practice. While this challenge has been addressed in the literature using different learning approaches, most of these approaches do not provide safety guarantees in terms of stability of the learning-based control system. In this paper, we provide a novel, learning-based control approach based on Gaussian processes (GPs) that ensures both stability of the closed-loop system and high-accuracy tracking. We use GPs to approximate the error between the commanded acceleration and the actual acceleration of the system, and then use the predicted mean and variance of the GP to calculate an upper bound on the uncertainty of the linearized model. This uncertainty bound is then used in a robust, outer-loop controller to ensure stability of the overall system. Moreover, we show that the tracking error converges to a ball with a radius that can be made arbitrarily small. Furthermore, we verify the effectiveness of our approach via simulations on a 2 degree-of-freedom (DOF) planar manipulator and experimentally on a 6 DOF industrial manipulator.

URL PDF HTML ☆

赞 0 踩 0

1809.03826 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Real-time force control of an SEA-based body weight support unit with the 2-DOF control structure

基于SEA的体位支撑装置的实时力控研究：采用2自由度控制结构

Yubo Sun, Yuqi Lei, Wulin Zou, Jianmin Li, Ningbo Yu

AI总结本文提出一种基于SEA的体位支撑装置，采用2自由度控制结构实现实时力控，通过仿真和实验验证了其在康复中的有效性。

Comments In proceedings of the IEEE International Conference on Real-time Computing and Robotics 2018

详情

AI中文摘要

体位支撑（BWS）是康复领域的重要技术。随着康复科学与工程的快速发展，BWS正迅速发展并受到广泛关注。本文构建了一种新型重力卸载系统，允许患者在三维笛卡尔空间内自由移动并受到重力支撑。因此，对于患有神经损伤（如中风或脊髓损伤）的患者，可以将残余运动控制能力专注于平衡和行走的治疗训练。实时力控性能对BWS单元提供合适支撑并避免干扰至关重要。在本文中，我们重新设计了BWS单元，采用一系列弹性执行机构结构以提高人机交互性能。进一步，采用了2自由度（2-DOF）控制方法以实现精确且鲁棒的BWS力控。仿真和实验结果验证了BWS设计和实时控制方法的有效性。

英文摘要

Body weight support (BWS) is a fundamental technique in rehabilitation. Along with the dramatic progressing of rehabilitation science and engineering, BWS is quickly evolving with new initiatives and has attracted deep research effort in recent years. We have built up a novel gravity offloading system, in which the patient is allowed to move freely in the three-dimensional Cartesian space and receives support against gravity. Thus, the patients, especially for those that suffer from neurological injury such as stroke or spinal cord injury, can focus their residual motor control capabilities on essential therapeutic trainings of balance and gait. The real-time force control performance is critical for the BWS unit to provide suitable support and avoid disturbance. In this work, we have re-designed our BWS unit with a series elastic actuation structure to improve the human-robot interaction performance. Further, the 2 degrees of freedom (2-DOF) control approach was taken for accurate and robust BWS force control. Both simulation and experimental results have validated the efficacy of the BWS design and real-time control methods.

URL PDF HTML ☆

赞 0 踩 0

1809.02867 2026-06-04 eess.SY cs.IT cs.RO cs.SY math.IT 版本更新

A Review on Cooperative Adaptive Cruise Control (CACC) Systems: Architectures, Controls, and Applications

协同自适应巡航控制系统（CACC）综述：架构、控制与应用

Ziran Wang, Guoyuan Wu, Matthew Barth

发表机构 * Department of Mechanical Engineering（机械工程系）

AI总结本文综述了全球范围内关于CACC系统不同方面的研究进展，涵盖架构、控制方法及应用，分析了当前机遇与挑战，并展望了未来发展方向。

1709.06196 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Online algorithms for POMDPs with continuous state, action, and observation spaces

在线算法用于具有连续状态、动作和观察空间的POMDPs

Zachary Sunberg, Mykel Kochenderfer

发表机构 * Aeronautics and Astronautics Dept. Stanford University（航空航天系斯坦福大学）

AI总结本文提出POMCPOW和PFT-DPW算法，通过加权粒子过滤解决连续状态空间POMDPs的求解问题，验证了改进方法的有效性。

Comments Added Multilane section

1804.11278 2026-06-04 eess.SY cs.RO cs.SY 版本更新

On the Interaction between Autonomous Mobility-on-Demand and Public Transportation Systems

自动驾驶出行即服务与公共交通系统的交互

Mauro Salazar, Federico Rossi, Maximilian Schiffer, Christopher H. Onder, Marco Pavone

发表机构 * Stanford University（斯坦福大学）； Technical University of Munich（慕尼黑技术大学）

AI总结本文研究了自动驾驶出行即服务与公共交通系统的耦合模型及协调策略，通过网络流模型最大化社会福利，并设计定价与收费方案实现社会最优，以纽约市为例验证了协同效应。

Comments 9 pages, 8 figures, ITSC 2018

1809.00037 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Estimation for Quadrotors

四旋翼估计

Stefanie Tellex, Andy Brown, Sergei Lupashin

发表机构 * Brown University（布朗大学）； Udacity, Inc.（Udacity公司）； Fotokite（Fotokite公司）

AI总结本文基于四旋翼模型，推导了扩展卡尔曼滤波器的推导过程，提供EKF、贝叶斯滤波和无迹卡尔曼滤波的伪代码，旨在解决四旋翼状态估计中的噪声和计算限制问题。

详情

AI中文摘要

本文描述了四旋翼滤波和估计的标准方法，适用于Udacity飞行汽车课程。本文假设具备概率知识和一些线性代数知识，不假设卡尔曼滤波或贝叶斯滤波的先前知识。本文推导了不同无人机模型在1D、2D和3D中的EKF。本文使用Thrun等人[13]定义的EKF和符号，并提供了贝叶斯滤波、EKF和无迹卡尔曼滤波[14]的伪代码。本文的动机是缺乏提供四旋翼直升机推导的逐步EKF教程。估计的目标是从传感器值和控制输入推断无人机的状态（姿态、速度、加速度和偏差）。这个问题具有挑战性，因为传感器噪声很大。此外，由于重量和成本问题，许多无人机具有有限的机载计算能力，因此希望快速估计这些值。标准方法是扩展卡尔曼滤波，它是卡尔曼滤波的非线性扩展，通过在当前状态附近线性化非线性转换和测量模型。然而，无迹卡尔曼滤波在几乎所有方面都更好：更容易实现，估计更准确，运行时间相当。

英文摘要

This document describes standard approaches for filtering and estimation for quadrotors, created for the Udacity Flying Cars course. We assume previous knowledge of probability and some knowledge of linear algebra. We do not assume previous knowledge of Kalman filters or Bayes filters. This document derives an EKF for various models of drones in 1D, 2D, and 3D. We use the EKF and notation as defined in Thrun et al. [13]. We also give pseudocode for the Bayes filter, the EKF, and the Unscented Kalman filter [14]. The motivation behind this document is the lack of a step-by-step EKF tutorial that provides the derivations for a quadrotor helicopter. The goal of estimation is to infer the drone's state (pose, velocity, acceleration, and biases) from its sensor values and control inputs. This problem is challenging because sensors are noisy. Additionally, because of weight and cost issues, many drones have limited on-board computation so we want to estimate these values as quickly as possible. The standard method for performing this method is the Extended Kalman filter, a nonlinear extension of the Kalman filter which linearizes a nonlinear transition and measurement model around the current state. However the Unscented Kalman filter is better in almost every respect: simpler to implement, more accurate to estimate, and comparable runtimes.

URL PDF HTML ☆

赞 0 踩 0

1808.06900 2026-06-04 cs.NI cs.DS cs.RO cs.SY eess.SY 版本更新

Defending against Intrusion of Malicious UAVs with Networked UAV Defense Swarms

用网络化无人机防御群抵御恶意无人机入侵

Matthias R. Brust, Grégoire Danoy, Pascal Bouvry, Dren Gashi, Himadri Pathak, Mike P. Gonçalves

发表机构 * Interdisciplinary Centre for Security, Reliability and Trust (SnT), University of Luxembourg, Luxembourg（安全、可靠与信任 interdisciplinary 中心（SnT），卢森堡大学，卢森堡）； Faculty of Science, Technology and Communication (FSTC), University of Luxembourg, Luxembourg（科学、技术与通信学院（FSTC），卢森堡大学，卢森堡）

AI总结本文提出了一种无人机防御系统，通过自组织防御编队拦截并护送恶意无人机至飞行区外，采用模块化设计和自平衡聚类算法，实现通信损失下的鲁棒性。

Comments IEEE Conference on Local Computer Networks (LCN), 2017

详情

DOI: 10.1109/LCN.Workshops.2017.71

AI中文摘要

如今，亚马逊、阿里巴巴等公司正推动使用无人机（UAVs）提供服务，如包裹和食品配送。随着政府希望利用无人机带来的巨大经济利益，城市规划者正在将所谓的无人机飞行区和无人机高速公路纳入智能城市设计。然而，无人机的高速移动和行为动态需要监控以检测并处理入侵者、 rogue drones 和恶意无人机。本文提出了一种无人机防御系统，旨在拦截并护送恶意无人机至飞行区外。所提出的无人机防御系统由一个能够自行组织防御编队的防御无人机群组成，并在检测到入侵者时形成拦截和捕获编队。我们采用了模块化设计原则，开发了一种创新的自平衡聚类过程来实现拦截和捕获编队。结果表明，所得到的网络化防御无人机群对通信损失具有鲁棒性。最后，实现了一个原型无人机模拟器。通过广泛的模拟，我们展示了该方法的可行性和性能。

英文摘要

Nowadays, companies such as Amazon, Alibaba, and even pizza chains are pushing forward to use drones, also called UAVs (Unmanned Aerial Vehicles), for service provision, such as package and food delivery. As governments intend to use these immense economic benefits that UAVs have to offer, urban planners are moving forward to incorporate so-called UAV flight zones and UAV highways in their smart city designs. However, the high-speed mobility and behavior dynamics of UAVs need to be monitored to detect and, subsequently, to deal with intruders, rogue drones, and UAVs with a malicious intent. This paper proposes a UAV defense system for the purpose of intercepting and escorting a malicious UAV outside the flight zone. The proposed UAV defense system consists of a defense UAV swarm, which is capable to self-organize its defense formation in the event of intruder detection, and chase the malicious UAV as a networked swarm. Modular design principles have been used for our fully localized approach. We developed an innovative auto-balanced clustering process to realize the intercept- and capture-formation. As it turned out, the resulting networked defense UAV swarm is resilient against communication losses. Finally, a prototype UAV simulator has been implemented. Through extensive simulations, we show the feasibility and performance of our approach.

URL PDF HTML ☆

赞 0 踩 0

1806.05220 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Decentralized Ergodic Control: Distribution-Driven Sensing and Exploration for Multi-Agent Systems

去中心化恒定控制：面向多智能体系统的分布驱动感知与探索

Ian Abraham, Todd D. Murphey

发表机构 * Neuroscience and Robotics Laboratory（神经科学与机器人实验室）

AI总结本文提出一种去中心化恒定控制策略，用于解决多智能体非线性动态系统的时间变化区域覆盖问题，通过共识实现完全去中心化的多智能体控制政策，并展示了其在多智能体地形映射和目标定位中的应用。

Comments 8 pages, Accepted for publication in IEEE Robotics and Automation Letters

1808.08309 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Trajectory Tracking Control of a Flexible Spine Robot, With and Without a Reference Input

具有和不具有参考输入的柔性脊柱机器人类轨迹跟踪控制

Andrew P. Sabelhaus, Shirley Huajing Zhao, Mallory C. Daly, Ellande Tang, Edward Zhu, Abishek K. Akella, Zeerek A. Ahmad, Vytas SunSpiral, Alice M. Agogino

发表机构 * NASA Ames Intelligent Robotics Group（美国航空航天局阿姆斯研究中心智能机器人组）； Dynamic Tensegrity Robotics Lab（动态张力机器人实验室）； Levant Power Corp.（Levant Power公司）； Velo3D Inc.（Velo3D公司）； SGT Inc.（SGT公司）

AI总结本文提出两种控制器，一种不使用参考输入但包含平滑约束，另一种使用参考输入但无平滑，用于柔性脊柱机器人轨迹跟踪控制。

1808.06652 2026-06-04 eess.SY cs.RO cs.SY 版本更新

On the Optimality of Ergodic Trajectories for Information Gathering Tasks

关于信息采集任务中ergodic轨迹的最优性

Louis Dressel, Mykel J. Kochenderfer

AI总结本文研究了在重复测量同一状态下信息衰减线性假设条件下，最优信息采集轨迹为ergodic轨迹的问题，通过实验验证了ergodic性与最优信息采集及submodularity之间的联系。

Comments Presented at 2018 American Control Conference (ACC)

详情

空载合规控制的空中机械臂

Min Jun Kim, Ribin Balachandran, Marco De Stefano, Konstantin Kondak, Christian Ott

发表机构 * German Aerospace Center (DLR)（德国航空航天中心）

AI总结本文提出了一种空载合规控制方法，通过合理选择末端执行器坐标和时间域被动技术，确保空中机械臂在无动力驱动情况下实现稳定环境交互。

Comments IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018

1808.02393 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Control of Multi-Agent Systems with Finite Time Control Barrier Certificates and Temporal Logic

多智能体系统的有限时间控制屏障证书与时序逻辑控制

Mohit Srinivasan, Samuel Coogan, Magnus Egerstedt

发表机构 * Georgia Institute of Technology（佐治亚理工学院）

AI总结本文提出利用有限时间收敛控制屏障函数和线性时序逻辑规范合成多智能体连续时间动态系统的控制器，确保系统在有限时间内收敛到目标集并保持前向不变性，解决连续时间可达性问题。

Comments To appear in the 57th IEEE Conference on Decision and Control, Miami Beach, FL, USA, 2018

详情

AI中文摘要

本文提出了一种方法，用于利用有限时间收敛控制屏障函数和线性时序逻辑规范合成连续时间多智能体动态系统的控制器。在存在合适有限时间收敛控制屏障函数的情况下，保证系统在状态空间中有限时间收敛到目标集。此外，这些屏障函数在系统收敛到目标集后也保证前向不变性。这使得我们能够建立一个理论框架，用于合成多智能体系统的控制器。这些性质还使我们能够通过多个有限时间收敛控制屏障函数的组合定理来解决连续时间的可达性问题。该方法比现有方法更灵活，并允许更广泛的可行控制律。线性时序逻辑用于指定多智能体系统需要满足的复杂任务规范。通过这种方法，可以合成满足给定时序逻辑任务规范的控制律。提供的机器人实验是在佐治亚理工学院的Robotarium多机器人测试平台上进行的。

英文摘要

In this paper, a method to synthesize controllers using finite time convergence control barrier functions guided by linear temporal logic specifications for continuous time multi-agent dynamical systems is proposed. Finite time convergence to a desired set in the state space is guaranteed under the existence of a suitable finite time convergence control barrier function. In addition, these barrier functions also guarantee forward invariance once the system converges to the desired set. This allows us to formulate a theoretical framework which synthesizes controllers for the multi-agent system. These properties also enable us to solve the reachability problem in continuous time by formulating a theorem on the composition of multiple finite time convergence control barrier functions. This approach is more flexible than existing methods and also allows for a greater set of feasible control laws. Linear temporal logic is used to specify complex task specifications that need to be satisfied by the multi-agent system. With this solution methodology, a control law is synthesized that satisfies the given temporal logic task specification. Robotic experiments are provided which were performed on the Robotarium multi-robot testbed at Georgia Tech.

URL PDF HTML ☆

赞 0 踩 0

1808.02000 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Bionic Reflex Control Strategy for Robotic Finger with Kinematic Constraints

仿生反射控制策略用于具有运动学约束的机械手指

Narkhede Kunal Sanjay, Shyamanta M. Hazarika

发表机构 * Department of Mechanical Engineering（机械工程系）； Indian Institute of Technology Kharagpur（印度理工学院卡里格普分校）； Indian Institute of Technology Guwahati（印度理工学院古瓦哈蒂）

AI总结本文提出了一种用于具有运动学约束的机械手指的仿生反射控制策略，通过力跟踪阻抗控制策略实现仿生反射，减少手指的动力学模型并讨论了允许精确力跟踪的阻抗控制策略，展示了单指在平面上支撑矩形物体的仿真结果，反射响应时间在毫秒级。

Comments 5 pages, 7 figures

1712.07249 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Probabilistic Learning of Torque Controllers from Kinematic and Force Constraints

基于概率学习的扭矩控制器从运动学和力约束中学习

João Silvério, Yanlong Huang, Leonel Rozo, Sylvain Calinon, Darwin G. Caldwell

发表机构 * Department of Advanced Robotics, Istituto Italiano di Tecnologia（意大利先进机器人研究所机器人部）； Idiap Research Institute（Idiap研究 institute）

AI总结本文提出一种概率方法，同时学习和合成扭矩控制命令，考虑任务空间、关节空间和力约束，通过概率学习不同扭矩控制器的相关性，结合高斯分布特性生成满足任务特征的新扭矩命令。

Comments Accepted for publication at 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

1808.00869 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Developing Robot Driver Etiquette Based on Naturalistic Human Driving Behavior

基于自然人类驾驶行为发展机器人驾驶礼仪

Xianan Huang, Songan Zhang, Huei Peng

AI总结本文研究机器人驾驶礼仪问题，通过分析自然驾驶数据库提取人类驾驶行为关键参数，为未来高自动化车辆算法设计和仿真中的人类驾驶行为建模提供指导。

1807.05290 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Adaptive Model Predictive Control for High-Accuracy Trajectory Tracking in Changing Conditions

自适应模型预测控制在变化条件下高精度轨迹跟踪中的应用

Karime Pereida, Angela Schoellig

发表机构 * Dynamic Systems Lab（动态系统实验室）； University of Toronto Institute for Aerospace Studies（多伦多大学航空航天研究 institute）

AI总结本文提出一种结合模型预测控制与L1自适应控制器的自适应模型预测控制器，用于在未知和变化的扰动环境下提高系统轨迹跟踪性能。通过实验验证，该方法在四旋翼无人机上表现出更低的轨迹跟踪误差。

详情

AI中文摘要

机器人和自动化系统越来越多地被引入未知和动态环境，这些环境要求它们能够处理扰动、未建模动力学和参数不确定性。为在这些动态环境中实现高性能，需要鲁棒和自适应的控制策略。本文提出了一种新颖的自适应模型预测控制器，结合模型预测控制（MPC）与底层的L1自适应控制器，以提高受未知和变化扰动影响的系统轨迹跟踪性能。L1自适应控制器迫使系统以参考模型指定的方式运行。更高层的模型预测控制器则基于成本函数计算最优参考输入，同时考虑输入和状态约束。我们专注于所提出方法的实验验证，并在四旋翼无人机上展示了其有效性。我们表明，所提出的方法在外部风扰动下，其轨迹跟踪误差比非预测性自适应方法和预测性非自适应方法更低。

英文摘要

Robots and automated systems are increasingly being introduced to unknown and dynamic environments where they are required to handle disturbances, unmodeled dynamics, and parametric uncertainties. Robust and adaptive control strategies are required to achieve high performance in these dynamic environments. In this paper, we propose a novel adaptive model predictive controller that combines model predictive control (MPC) with an underlying $\mathcal{L}_1$ adaptive controller to improve trajectory tracking of a system subject to unknown and changing disturbances. The $\mathcal{L}_1$ adaptive controller forces the system to behave in a predefined way, as specified by a reference model. A higher-level model predictive controller then uses this reference model to calculate the optimal reference input based on a cost function, while taking into account input and state constraints. We focus on the experimental validation of the proposed approach and demonstrate its effectiveness in experiments on a quadrotor. We show that the proposed approach has a lower trajectory tracking error compared to non-predictive, adaptive approaches and a predictive, non-adaptive approach, even when external wind disturbances are applied.

URL PDF HTML ☆

赞 0 踩 0

1807.11578 2026-06-04 eess.SP cs.RO cs.SY eess.SY 版本更新

Trajectory Optimization for Cooperative Dual-band UAV Swarms

协同双频无人机编队轨迹优化

Hakim Ghazzai, Mahdi Ben Ghorbel, Andreas Kassler, Md. Jahangir Hossain

发表机构 * Stevens Institute of Technology（史蒂文斯理工学院）； The University of British Columbia（不列颠哥伦比亚大学）； Karlstad University（卡尔斯塔德大学）

AI总结本文针对带宽需求大且容忍延迟的应用，提出双频无人机轨迹优化框架，通过混合非线性规划问题优化无人机路径和停靠点，以最小化总服务时间。

Comments 8 pages, 5 figures, conference Globecom 2018

详情

AI中文摘要

无人驾驶航空器（UAVs）在多样无线通信领域中获得了广泛 popularity。由于其能够提供视距链路的能力，它们可以作为高空飞行中继器，支持地面节点之间的通信。随着物联网的蓬勃发展，多种新型应用不断涌现。本文聚焦于带宽需求大且容忍延迟的应用，其中多个收发器对需要无人机的支持来完成其传输。为此，无人机有可能使用两种不同的频段，即典型的微波频段和高速毫米波频段。本文开发了一个通用框架，将无人机分配给支持的收发器并优化其轨迹，以最小化总服务时间的加权函数。考虑到中继消息所需的时间和无人机飞行时间，一个混合非线性规划问题被提出，旨在找到无人机停靠点以转发数据给接收器。然后开发了一种迭代方法来解决该问题。首先，通过混合线性规划问题最优解决来确定每架可用无人机的路径。然后执行分层迭代搜索以提高无人机停靠点位置并减少服务时间。所提框架的行为和优势在选定场景中进行了展示。

英文摘要

Unmanned aerial vehicles (UAVs) have gained a lot of popularity in diverse wireless communication fields. They can act as high-altitude flying relays to support communications between ground nodes due to their ability to provide line-of-sight links. With the flourishing Internet of Things, several types of new applications are emerging. In this paper, we focus on bandwidth hungry and delay-tolerant applications where multiple pairs of transceivers require the support of UAVs to complete their transmissions. To do so, the UAVs have the possibility to employ two different bands namely the typical microwave and the high-rate millimeter wave bands. In this paper, we develop a generic framework to assign UAVs to supported transceivers and optimize their trajectories such that a weighted function of the total service time is minimized. Taking into account both the communication time needed to relay the message and the flying time of the UAVs, a mixed non-linear programming problem aiming at finding the stops at which the UAVs hover to forward the data to the receivers is formulated. An iterative approach is then developed to solve the problem. First, a mixed linear programming problem is optimally solved to determine the path of each available UAV. Then, a hierarchical iterative search is executed to enhance the UAV stops' locations and reduce the service time. The behavior of the UAVs and the benefits of the proposed framework are showcased for selected scenarios.

URL PDF HTML ☆

赞 0 踩 0

1807.11553 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Reach-Avoid Problems via Sum-of-Squares Optimization and Dynamic Programming

通过平方和优化与动态规划解决可达-回避问题

Benoit Landry, Mo Chen, Scott Hemley, Marco Pavone

AI总结本文结合平方和优化与动态规划，提出一种保守解法，保证系统在可达与回避之间的切换，适用于多项式动力学和一般问题设置，并在计算效率上优于传统方法。

Comments International Conference on Intelligent Robots & Systems (IROS), 2018

详情

AI中文摘要

可达-回避问题涉及将系统驱动到期望配置集的同时避免不利配置。提供数学保证具有挑战性，但有广泛应用。由于挑战，分析可达-回避问题需要在系统动力学的通用性、问题设置的通用性、解的最优性和计算复杂性之间做出权衡。本文结合平方和优化与动态规划，提出一种保守解法，保持可达性和回避保证。我们的方法适用于多项式系统动力学和一般问题设置，并比以前的相关方法更具计算可扩展性。通过涉及两个单积分器的数值示例，我们验证了所提理论，并将其与哈密顿-雅可比可达性进行比较。在验证理论后，我们通过计算涉及两个运动学汽车系统的可达-回避集，展示了我们方法的计算可扩展性。

英文摘要

Reach-avoid problems involve driving a system to a set of desirable configurations while keeping it away from undesirable ones. Providing mathematical guarantees for such scenarios is challenging but have numerous potential practical applications. Due to the challenges, analysis of reach-avoid problems involves making trade-offs between generality of system dynamics, generality of problem setups, optimality of solutions, and computational complexity. In this paper, we combine sum-of-squares optimization and dynamic programming to address the reach-avoid problem, and provide a conservative solution that maintains reaching and avoidance guarantees. Our method is applicable to polynomial system dynamics and to general problem setups, and is more computationally scalable than previous related methods. Through a numerical example involving two single integrators, we validate our proposed theory and compare our method to Hamilton-Jacobi reachability. Having validated our theory, we demonstrate the computational scalability of our method by computing the reach-avoid set of a system involving two kinematic cars.

URL PDF HTML ☆

赞 0 踩 0

1807.03515 2026-06-04 eess.SY cs.NI cs.RO cs.SY 版本更新

A Reinforcement Learning Approach to Jointly Adapt Vehicular Communications and Planning for Optimized Driving

一种联合适应车载通信与规划的强化学习方法

Mayank K. Pal, Rupali Bhati, Anil Sharma, Sanjit K. Kaul, Saket Anand, P. B. Sujit

发表机构 * IIIT-Delhi（印度德里印度理工学院）

AI总结本文提出一种强化学习方法，用于联合优化自动驾驶车辆的通信与运动规划，通过模拟验证了该方法在提升驾驶效益方面的有效性。

Comments 7 pages, 7 figures; Accepted as a conference paper at IEEE ITSC 2018

详情

AI中文摘要

我们的前提是自动驾驶车辆必须联合优化通信和运动规划。具体而言，车辆必须在考虑通信速率相关约束的情况下调整其运动计划，并在考虑道路环境可能施加的运动规划限制的情况下调整通信使用。为此，我们提出了一个强化学习问题，其中自动驾驶车辆同时选择（a）在道路上执行的运动规划动作和（b）查询基础设施感知信息的通信动作。目标是优化自动驾驶车辆的驾驶效益。我们应用Q学习算法使车辆学习最优策略，该策略在任何给定时间都能做出最优的规划和通信动作选择。我们通过模拟验证了最优策略在智能适应通信和规划动作方面的能力，同时实现了较高的驾驶效益。

英文摘要

Our premise is that autonomous vehicles must optimize communications and motion planning jointly. Specifically, a vehicle must adapt its motion plan staying cognizant of communications rate related constraints and adapt the use of communications while being cognizant of motion planning related restrictions that may be imposed by the on-road environment. To this end, we formulate a reinforcement learning problem wherein an autonomous vehicle jointly chooses (a) a motion planning action that executes on-road and (b) a communications action of querying sensed information from the infrastructure. The goal is to optimize the driving utility of the autonomous vehicle. We apply the Q-learning algorithm to make the vehicle learn the optimal policy, which makes the optimal choice of planning and communications actions at any given time. We demonstrate the ability of the optimal policy to smartly adapt communications and planning actions, while achieving large driving utilities, using simulations.

URL PDF HTML ☆

赞 0 踩 0

1807.09905 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Toward Efficient and Robust Biped Walking Optimization

迈向高效且稳健的双足行走优化

Nihar Talele, Katie Byl

AI总结本文研究双足机器人行走的高效与稳健优化，探讨步态的能耗与鲁棒性量化，以及运动轨迹与机器人参数的联合优化，通过五连杆平面行走模型验证效率与稳健性的平衡。

1803.09022 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Controller Synthesis for Discrete-Time Polynomial Systems via Occupation Measures

通过占用测度方法为离散时间多项式系统设计控制器

Weiqiao Han, Russ Tedrake

发表机构 * Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology（计算机科学与人工智能实验室，麻省理工学院）

AI总结本文通过占用测度方法为离散时间多项式动力系统设计非线性状态反馈控制器，将控制器合成问题转化为无限维线性规划问题，并通过松弛为有限维半正定规划问题，从而提取非线性控制器，该方法具有计算复杂度低且可扩展性强的优势。

详情

AI中文摘要

本文中，我们通过占用测度方法为离散时间多项式动力系统设计非线性状态反馈控制器。我们提出了离散时间受控李雅普诺夫方程，并利用该方程将控制器合成问题转化为关于测度的无限维线性规划问题，随后将其松弛为关于测度矩及其对偶的有限维半正定规划问题。非线性控制器可以从松弛问题的解中提取。占用测度方法的优势在于我们解决的是凸问题而非通常非凸问题，且计算复杂度与状态和输入维度呈多项式关系，因此该方法更具可扩展性。此外，我们还展示了该方法可以应用于近似离散时间自主多项式系统的后向可达集以及在已知状态反馈控制律下的离散时间多项式系统的可控集。我们还在多个动力系统上展示了我们的方法。

英文摘要

In this paper, we design nonlinear state feedback controllers for discrete-time polynomial dynamical systems via the occupation measure approach. We propose the discrete-time controlled Liouville equation, and use it to formulate the controller synthesis problem as an infinite-dimensional linear programming problem on measures, which is then relaxed as finite-dimensional semidefinite programming problems on moments of measures and their duals on sums-of-squares polynomials. Nonlinear controllers can be extracted from the solutions to the relaxed problems. The advantage of the occupation measure approach is that we solve convex problems instead of generally non-convex problems, and the computational complexity is polynomial in the state and input dimensions, and hence the approach is more scalable. In addition, we show that the approach can be applied to over-approximating the backward reachable set of discrete-time autonomous polynomial systems and the controllable set of discrete-time polynomial systems under known state feedback control laws. We illustrate our approach on several dynamical systems.

URL PDF HTML ☆

赞 0 踩 0

1803.07725 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Semidefinite Outer Approximation of the Backward Reachable Set of Discrete-time Autonomous Polynomial Systems

半正定外近似法用于离散时间自治多项式系统的逆向可达集

Weiqiao Han, Russ Tedrake

AI总结本文提出利用半正定规划对离散时间自治多项式系统的逆向可达集进行半正定外近似，通过构造测度优化问题并求解得到逼近结果。

Comments merged with the paper arXiv:1803.09022

1807.08855 2026-06-04 stat.ML cs.LG cs.RO cs.SY eess.SP eess.SY 版本更新

Weak in the NEES?: Auto-tuning Kalman Filters with Bayesian Optimization

在NEES中薄弱：基于贝叶斯优化的自动调节卡尔曼滤波器

Zhaozhong Chen, Christoffer Heckman, Simon Julier, Nisar Ahmed

发表机构 * Department of Computer Science（计算机科学系）； University of Colorado Boulder（科罗拉多大学博尔德分校）； University College London（伦敦大学学院）； Smead Aerospace Engineering Sciences（Smead航空航天工程科学系）

AI总结本文提出一种基于贝叶斯优化的自动调节卡尔曼滤波器方法，通过智能采样参数空间，利用非参数高斯过程代理函数，高效识别多个局部极小值并提供结果不确定性量化。

Comments Final version presented at FUSION 2018 Conference, Cambridge, UK, July 2018 (submitted June 1, 2018)

详情

AI中文摘要

卡尔曼滤波器被广泛用于数据融合应用，包括导航、跟踪和同时定位与建图问题。然而，调整各种卡尔曼滤波器模型参数需要大量时间和努力，例如过程噪声协方差、非白噪声预白化滤波器模型等。传统优化技术在调整时容易陷入较差的局部极小值，并且使用真实传感器数据实施成本较高。为了解决这些问题，本文开发了一种新的“黑箱”贝叶斯优化策略，用于自动调节卡尔曼滤波器。在该方法中，性能由两种随机目标函数之一来表征：当可用真实状态模型时为归一化估计误差平方（NEES），当只有传感器数据可用时为归一化创新误差平方（NIS）。通过智能采样参数空间，学习和利用非参数高斯过程代理函数，贝叶斯优化可以高效地识别多个局部极小值，并对其结果提供不确定性量化。

英文摘要

Kalman filters are routinely used for many data fusion applications including navigation, tracking, and simultaneous localization and mapping problems. However, significant time and effort is frequently required to tune various Kalman filter model parameters, e.g. process noise covariance, pre-whitening filter models for non-white noise, etc. Conventional optimization techniques for tuning can get stuck in poor local minima and can be expensive to implement with real sensor data. To address these issues, a new "black box" Bayesian optimization strategy is developed for automatically tuning Kalman filters. In this approach, performance is characterized by one of two stochastic objective functions: normalized estimation error squared (NEES) when ground truth state models are available, or the normalized innovation error squared (NIS) when only sensor data is available. By intelligently sampling the parameter space to both learn and exploit a nonparametric Gaussian process surrogate function for the NEES/NIS costs, Bayesian optimization can efficiently identify multiple local minima and provide uncertainty quantification on its results.

URL PDF HTML ☆

赞 0 踩 0

1807.08604 2026-06-04 eess.SY cs.RO cs.SY eess.SP math.OC 版本更新

A Frequency-Domain Characterization of Optimal Error Covariance for the Kalman-Bucy Filter

卡尔曼-布西滤波器最优误差协方差的频域特性

Song Fang, Hideaki Ishii, Jie Chen, Karl Henrik Johansson

AI总结本文通过频域积分特性，揭示卡尔曼-布西滤波器最优输出估计误差协方差与噪声协方差的比值迹可由系统动态和噪声统计表达。采用解析函数理论分析代数里卡蒂方程，并与布德积分相关联。

1807.08048 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Baidu Apollo EM Motion Planner

百度 Apollo EM 运动规划器

Haoyang Fan, Fan Zhu, Changchun Liu, Liangliang Zhang, Li Zhuang, Dong Li, Weicheng Zhu, Jiangtao Hu, Hongye Li, Qi Kong

发表机构 * Baidu USA LLC（百度美国有限公司）

AI总结本文提出基于百度 Apollo 开源自动驾驶平台的实时运动规划系统，解决工业级4级运动规划问题，兼顾安全性、舒适性和可扩展性，通过分层结构实现多车道和单车道自动驾驶。

详情

AI中文摘要

本文介绍了一种基于百度 Apollo（开源）自动驾驶平台的实时运动规划系统。该系统旨在解决工业级4级运动规划问题，同时考虑安全性、舒适性和可扩展性。系统采用分层结构处理多车道和单车道自动驾驶：（1）系统顶层为多车道策略，通过并行计算的车道级轨迹进行比较以处理变道场景。（2）在车道级轨迹生成器中，基于弗伦兹框架迭代求解路径和速度优化。（3）对于路径和速度优化，提出结合动态规划和基于样条的二次规划的方法，构建可扩展且易于调节的框架，同时处理交通规则、障碍物决策和平滑性。该规划器可扩展至高速公路和低速城市驾驶场景。我们通过场景示例和道路测试结果展示了该算法。本文描述的系统自2017年9月Apollo v1.5发布以来已部署到数十辆百度Apollo自动驾驶车辆。截至2018年5月16日，该系统已在各种城市场景下进行了3,380小时和约68,000公里（42,253英里）的闭环自动驾驶测试。本文描述的算法可在https://github.com/ApolloAuto/apollo/tree/master/modules/planning上获得。

英文摘要

In this manuscript, we introduce a real-time motion planning system based on the Baidu Apollo (open source) autonomous driving platform. The developed system aims to address the industrial level-4 motion planning problem while considering safety, comfort and scalability. The system covers multilane and single-lane autonomous driving in a hierarchical manner: (1) The top layer of the system is a multilane strategy that handles lane-change scenarios by comparing lane-level trajectories computed in parallel. (2) Inside the lane-level trajectory generator, it iteratively solves path and speed optimization based on a Frenet frame. (3) For path and speed optimization, a combination of dynamic programming and spline-based quadratic programming is proposed to construct a scalable and easy-to-tune framework to handle traffic rules, obstacle decisions and smoothness simultaneously. The planner is scalable to both highway and lower-speed city driving scenarios. We also demonstrate the algorithm through scenario illustrations and on-road test results. The system described in this manuscript has been deployed to dozens of Baidu Apollo autonomous driving vehicles since Apollo v1.5 was announced in September 2017. As of May 16th, 2018, the system has been tested under 3,380 hours and approximately 68,000 kilometers (42,253 miles) of closed-loop autonomous driving under various urban scenarios. The algorithm described in this manuscript is available at https://github.com/ApolloAuto/apollo/tree/master/modules/planning.

URL PDF HTML ☆

赞 0 踩 0

1807.05289 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Transfer Learning for High-Precision Trajectory Tracking Through $\mathcal{L}_1$ Adaptive Feedback and Iterative Learning

通过L1自适应反馈和迭代学习实现高精度轨迹跟踪的迁移学习

Karime Pereida, Dave Kooijman, Rikky R. P. R. Duivenvoorden, Angela P. Schoellig

发表机构 * Institute for Aerospace Studies, University of Toronto, North York, ON M3H 5T6, Canada（多伦多大学航空航天研究 institute, 北York, ON M3H 5T6, 加拿大）

AI总结本文提出结合L1自适应控制与迭代学习控制的框架，用于在未知动态环境中实现高精度轨迹跟踪，通过迁移学习实现不同系统间的经验传递。

详情

DOI: 10.1002/acs.2887

AI中文摘要

当机器人或自动化系统被引入未知和动态环境时，需要鲁棒且适应性的控制策略以应对干扰、未建模动力学和参数不确定性。本文展示了一种结合L1自适应控制与迭代学习控制（ILC）的框架，用于在存在未知和变化的干扰时实现高精度轨迹跟踪。L1自适应控制器使系统接近参考模型，但无法保证完美轨迹跟踪，而ILC则通过以前的迭代改进轨迹跟踪性能。本文的综合框架使用L1自适应控制作为底层控制器，实现鲁棒且可重复的行为，而ILC则作为高层适应方案，主要补偿系统跟踪误差。我们证明了该框架能够在动态不同的系统间实现迁移学习，其中一个系统的学习经验可对另一个系统有益。两种不同四旋翼的实验结果表明，与使用PID控制器的ILC方法相比，该综合L1-ILC框架具有优越性能。结果表明，当初始输入基于自适应控制器的参考模型生成时，我们的L1-ILC框架能够实现精确的轨迹跟踪，即使在存在未知和变化的干扰时，也能实现系统间的学习经验迁移。

英文摘要

Robust and adaptive control strategies are needed when robots or automated systems are introduced to unknown and dynamic environments where they are required to cope with disturbances, unmodeled dynamics, and parametric uncertainties. In this paper, we demonstrate the capabilities of a combined $\mathcal{L}_1$ adaptive control and iterative learning control (ILC) framework to achieve high-precision trajectory tracking in the presence of unknown and changing disturbances. The $\mathcal{L}_1$ adaptive controller makes the system behave close to a reference model; however, it does not guarantee that perfect trajectory tracking is achieved, while ILC improves trajectory tracking performance based on previous iterations. The combined framework in this paper uses $\mathcal{L}_1$ adaptive control as an underlying controller that achieves a robust and repeatable behavior, while the ILC acts as a high-level adaptation scheme that mainly compensates for systematic tracking errors. We illustrate that this framework enables transfer learning between dynamically different systems, where learned experience of one system can be shown to be beneficial for another different system. Experimental results with two different quadrotors show the superior performance of the combined $\mathcal{L}_1$-ILC framework compared with approaches using ILC with an underlying proportional-derivative controller or proportional-integral-derivative controller. Results highlight that our $\mathcal{L}_1$-ILC framework can achieve high-precision trajectory tracking when unknown and changing disturbances are present and can achieve transfer of learned experience between dynamically different systems. Moreover, our approach is able to achieve precise trajectory tracking in the first attempt when the initial input is generated based on the reference model of the adaptive controller.

URL PDF HTML ☆

赞 0 踩 0

1806.00627 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Fast Rigid 3D Registration Solution: A Simple Method Free of SVD and Eigen-Decomposition

快速刚性三维配准解决方案：一种无需SVD和特征分解的简单方法

Jin Wu, Ming Liu, Zebo Zhou, Rui Li

发表机构 * School of Aeronautics and Astronautics, University of Electronic Science and Technology of China, Chengdu, 611731, China（电子科技大学航空宇航学院）

AI总结本文提出一种无需SVD和特征分解的快速刚性三维配准方法，通过点交叉协方差矩阵的最优特征向量计算实现高效配准，验证了其在噪声点云中的鲁棒性和速度优势。

详情

AI中文摘要

本文提出了一种新的解决方案来解决刚性三维配准问题，受之前基于特征分解的方法启发。与现有求解器不同，所提算法不需要复杂的矩阵运算，如奇异值分解或特征值分解。相反，点交叉协方差矩阵的最优特征向量可以在几次迭代中计算出来。此外，证明了在不需要四元数的情况下可以直接计算最优旋转矩阵。该简单框架提供了在嵌入式平台上整数实现的非常简便方法。对噪声污染点云的仿真验证了所提方法的鲁棒性和计算速度。最终结果表明，所提算法准确、鲁棒，并且比代表方法计算时间减少了60%至80%。它还已应用于实际世界，以实现更快的相对机器人导航。

英文摘要

A novel solution is obtained to solve the rigid 3D registration problem, motivated by previous eigen-decomposition approaches. Different from existing solvers, the proposed algorithm does not require sophisticated matrix operations e.g. singular value decomposition or eigenvalue decomposition. Instead, the optimal eigenvector of the point cross-covariance matrix can be computed within several iterations. It is also proven that the optimal rotation matrix can be directly computed for cases without need of quaternion. The simple framework provides very easy approach of integer-implementation on embedded platforms. Simulations on noise-corrupted point clouds have verified the robustness and computation speed of the proposed method. The final results indicate that the proposed algorithm is accurate, robust and owns over $60\% \sim 80\%$ less computation time than representatives. It has also been applied to real-world applications for faster relative robotic navigation.

URL PDF HTML ☆

赞 0 踩 0

1806.09849 2026-06-04 eess.SY cs.RO cs.SY 版本更新

SENSE: Abstraction-Based Synthesis of Networked Control Systems

SENSE：基于抽象的网络控制系统合成

Mahmoud Khaled, Matthias Rungger, Majid Zamani

发表机构 * Technical University of Munich（慕尼黑技术大学）

AI总结 SENSE通过符号模型构建和自动控制器合成，解决网络控制系统复杂规格的满足问题，支持VHDL/Verilog或C/C++代码生成。

Comments In Proceedings MeTRiD 2018, arXiv:1806.09330

详情

DOI: 10.4204/EPTCS.272.6
Journal ref: EPTCS 272, 2018, pp. 65-78

AI中文摘要

SENSE通过符号模型构建和自动控制器合成，解决网络控制系统复杂规格的满足问题，支持VHDL/Verilog或C/C++代码生成。

英文摘要

While many studies and tools target the basic stabilizability problem of networked control systems (NCS), nowadays modern systems require more sophisticated objectives such as those expressed as formulae in linear temporal logic or as automata on infinite strings. One general technique to achieve this is based on so-called symbolic models, where complex systems are approximated by finite abstractions, and then, correct-by-construction controllers are automatically synthesized for them. We present tool SENSE for the construction of finite abstractions for NCS and the automated synthesis of controllers. Constructed controllers enforce complex specifications over plants in NCS by taking into account several non-idealities of the communication channels. Given a symbolic model of the plant and network parameters, SENSE can efficiently construct a symbolic model of the NCS, by employing operations on binary decision diagrams (BDDs). Then, it synthesizes symbolic controllers satisfying a class of specifications. It has interfaces for the simulation and the visualization of the resulting closed-loop systems using OMNETPP and MATLAB. Additionally, SENSE can generate ready-to-implement VHDL/Verilog or C/C++ codes from the synthesized controllers.

URL PDF HTML ☆

赞 0 踩 0

1806.08810 2026-06-04 cs.LO cs.RO cs.SY eess.SY 版本更新

Self-Driving Vehicle Verification Towards a Benchmark

自动驾驶车辆验证：一个基准

Nima Roohi, Ramneet Kaur, James Weimer, Oleg Sokolsky, Insup Lee

发表机构 * University of Pennsylvania（宾夕法尼亚大学）

AI总结本文提出一个简单的形式化模型用于自动驾驶车辆，尽管经过简化后其安全性已手动证明，但目前尚无自动形式化验证工具支持其动态特性，旨在为形式化验证工具提供挑战。

Comments 7 pages

1709.04940 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Robust Model Predictive Control Approach for Autonomous Underwater Vehicles Operating in a Constrained workspace

一种用于在受限工作空间中运行的自主水下机器人鲁棒模型预测控制方法

Shahab Heshmati-alamdari, George C. Karras, Panos Marantos, Kostas J. Kyriakopoulos

AI总结本文提出了一种新型非线性模型预测控制方案，用于在存在静态障碍物的受限工作空间中引导水下机器人到达特定路径点，通过考虑障碍物、工作空间边界、推进器饱和度和预定义的车辆速度上限等约束条件，提高控制性能。

Comments IEEE International Conference on Robotics and Automation (ICRA-2018), Accepted

详情

AI中文摘要

本文提出了一种新型非线性模型预测控制（NMPC）方案，用于在存在静态障碍物的受限工作空间中运行的水下机器人。控制器的目标是引导车辆朝着特定路径点前进。各种限制，如障碍物、工作空间边界、推进器饱和度和预定义的车辆速度上限，被作为状态和输入约束，并在控制设计中得到保证。所提出的方案结合了车辆的全部动力学特性，包括海洋 currents。因此，所提出方案计算出的控制输入以一种方式制定，使车辆能够利用有利的海洋 currents，从而减少推进器的能耗。所提出控制策略的性能通过在带有障碍物的受限测试水槽中使用4自由度水下机器人进行实验验证。

英文摘要

This paper presents a novel Nonlinear Model Predictive Control (NMPC) scheme for underwater robotic vehicles operating in a constrained workspace including static obstacles. The purpose of the controller is to guide the vehicle towards specific way points. Various limitations such as: obstacles, workspace boundary, thruster saturation and predefined desired upper bound of the vehicle velocity are captured as state and input constraints and are guaranteed during the control design. The proposed scheme incorporates the full dynamics of the vehicle in which the ocean currents are also involved. Hence, the control inputs calculated by the proposed scheme are formulated in a way that the vehicle will exploit the ocean currents, when these are in favor of the way-point tracking mission which results in reduced energy consumption by the thrusters. The performance of the proposed control strategy is experimentally verified using a $4$ Degrees of Freedom (DoF) underwater robotic vehicle inside a constrained test tank with obstacles.

URL PDF HTML ☆

赞 0 踩 0

1804.01926 2026-06-04 cs.RO cs.SY eess.SY stat.ML 版本更新

Scalable Magnetic Field SLAM in 3D Using Gaussian Process Maps

基于高斯过程地图的可扩展三维磁场SLAM

Manon Kok, Arno Solin

发表机构 * Delft University of Technology（代尔夫特理工大学）； Aalto University（阿尔托大学）； University of Cambridge（剑桥大学）

AI总结本文提出一种利用磁场局部异常进行三维磁场SLAM的方法，采用高斯过程模型和六边形分块映射，结合降维高斯过程回归与 Rao-Blackwellised 粒子滤波，实现高效计算和存储的SLAM算法。

Comments 11 pages, 5 figures

1711.03449 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Optimization-Based Collision Avoidance

基于优化的避障方法

Xiaojing Zhang, Alexander Liniger, Francesco Borrelli

发表机构 * Model Predictive Control Laboratory, Department of Mechanical Engineering, University of California, Berkeley, USA（加州大学伯克利分校机械工程系模型预测控制实验室）； Automatic Control Laboratory, Department of Information Technology and Electrical Engineering, ETH Zurich, Switzerland（苏黎世联邦理工学院信息科技与电气工程系自动控制实验室）

AI总结本文提出一种将非光滑避障约束转化为光滑非线性约束的方法，利用凸优化的强对偶性。该方法适用于在n维空间中移动的受控物体，能处理一般障碍物和可表示为有限凸集并集的受控物体，并结合了传统轨迹生成算法中常用的有号距离概念。

Comments 27 pages, 9 figures, 2 tables

详情

AI中文摘要

本文提出了一种将非光滑避障约束转化为光滑非线性约束的方法，利用凸优化的强对偶性。我们关注一个受控物体在n维空间中移动时避免障碍物的目标。所提出的改写方法不引入近似，并适用于一般障碍物和可在n维空间中表示为有限凸集并集的受控物体。此外，我们结果与传统轨迹生成算法中常用的有号距离概念相连接。我们的方法可以用于通用的导航和轨迹规划任务，光滑性属性允许使用通用的梯度和Hessian基于优化算法。最后，在无法避免碰撞的情况下，我们的框架允许找到

英文摘要

This paper presents a novel method for reformulating non-differentiable collision avoidance constraints into smooth nonlinear constraints using strong duality of convex optimization. We focus on a controlled object whose goal is to avoid obstacles while moving in an n-dimensional space. The proposed reformulation does not introduce approximations, and applies to general obstacles and controlled objects that can be represented in an n-dimensional space as the finite union of convex sets. Furthermore, we connect our results with the notion of signed distance, which is widely used in traditional trajectory generation algorithms. Our method can be used in generic navigation and trajectory planning tasks, and the smoothness property allows the use of general-purpose gradient- and Hessian-based optimization algorithms. Finally, in case a collision cannot be avoided, our framework allows us to find "least-intrusive" trajectories, measured in terms of penetration. We demonstrate the efficacy of our framework on a quadcopter navigation and automated parking problem, and our numerical experiments suggest that the proposed methods enable real-time optimization-based trajectory planning problems in tight environments. Source code of our implementation is provided at https://github.com/XiaojingGeorgeZhang/OBCA.

URL PDF HTML ☆

赞 0 踩 0

1704.06053 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Using Inertial Sensors for Position and Orientation Estimation

利用惯性传感器进行位置和姿态估计

Manon Kok, Jeroen D. Hol, Thomas B. Schön

发表机构 * Delft Center for Systems and Control, Delft University of Technology, the Netherlands（荷兰代尔夫特理工大学系统与控制中心）； Xsens Technologies B.V., Enschede, the Netherlands（荷兰恩schede市Xsens技术公司）； Department of Information Technology, Uppsala University, Sweden（瑞典乌普萨拉大学信息科技系）

AI总结本文探讨了惯性传感器在位置和姿态估计中的信号处理方法，分析了不同建模选择和关键算法，如优化平滑滤波、扩展卡尔曼滤波和互补滤波，并通过实验和模拟数据验证其性能。

Comments 90 pages, 38 figures

详情

DOI: 10.1561/2000000094
Journal ref: Foundations and Trends in Signal Processing, Vol. 11: No. 1-2, Pages 1-153, 2017

AI中文摘要

近年来，由于体积小、成本低，MEMS惯性传感器（3D加速度计和3D陀螺仪）已广泛可用。惯性传感器以高采样率获取数据，可通过积分获得位置和姿态信息。这些估计在短时间尺度上准确，但长时间尺度上会因积分漂移而产生误差。为克服此问题，惯性传感器通常与其它传感器和模型结合。本文教程聚焦于惯性传感器用于位置和姿态估计的信号处理方面，讨论了不同的建模选择和若干重要的算法。这些算法包括基于优化的平滑和滤波方法，以及计算成本更低的扩展卡尔曼滤波和互补滤波实现。通过实验和模拟数据展示了这些算法的估计质量。

英文摘要

In recent years, MEMS inertial sensors (3D accelerometers and 3D gyroscopes) have become widely available due to their small size and low cost. Inertial sensor measurements are obtained at high sampling rates and can be integrated to obtain position and orientation information. These estimates are accurate on a short time scale, but suffer from integration drift over longer time scales. To overcome this issue, inertial sensors are typically combined with additional sensors and models. In this tutorial we focus on the signal processing aspects of position and orientation estimation using inertial sensors. We discuss different modeling choices and a selected number of important algorithms. The algorithms include optimization-based smoothing and filtering as well as computationally cheaper extended Kalman filter and complementary filter implementations. The quality of their estimates is illustrated using both experimental and simulated data.

URL PDF HTML ☆

赞 0 踩 0

1806.01777 2026-06-04 eess.SP cs.RO cs.SY eess.SY 版本更新

Safe Driving Capacity of Autonomous Vehicles

自动驾驶车辆的安全驾驶能力

Yuan-Ying Wang, Hung-Yu Wei

发表机构 * Department of Electrical Engineering（电气工程系）； National Taiwan University（国立台湾大学）

AI总结本文通过线性时序逻辑定义道路和车辆的安全状态，提出安全驾驶吞吐量和容量概念，分析不同因素对安全驾驶吞吐量的影响，并比较基于感知和协作车辆的道路安全驾驶容量差异。

Comments 5 pages, VTC 2018

1806.01623 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Adaptive twisting sliding mode control for quadrotor unmanned aerial vehicles

自适应扭曲滑模控制用于四旋翼无人机

V. T. Hoang, M. D. Phung, Q. P. Ha

AI总结本文提出自适应扭曲滑模控制算法，用于四旋翼无人机在非线性、扰动和参数变化下的姿态控制，通过改进滑模律和增益自适应方案提升控制性能。

Comments 2017 11th Asian Control Conference (ASCC)

1806.00727 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Closed-loop Bayesian Semantic Data Fusion for Collaborative Human-Autonomy Target Search

闭环贝叶斯语义数据融合用于协同人机目标搜索

Luke Burks, Ian Loefgren, Luke Barbier, Jeremy Muesing, Jamison McGinley, Sousheel Vunnam, Nisar Ahmed

AI总结本文提出一种闭环贝叶斯语义数据融合方法，通过CPOMDP规划生成最优轨迹，结合不完美传感器数据和人类提供的语义观察，提升动态目标搜索效率。

Comments Final version accepted and submitted to 2018 FUSION Conference (Cambridge, UK, July 2018)

详情

AI中文摘要

在搜索应用中，自主无人车辆必须能够高效重新获取和定位长时间可能处于视线外的大空间中移动目标。为此，本文开发并验证了一种新的协同人机感知解决方案。我们的方法利用连续部分可观测马尔可夫决策过程（CPOMDP）规划，生成最优利用不完美传感器数据和可请求的语义自然语言观察的车辆轨迹。关键创新是可扩展的层次高斯混合模型形式，用于在连续动态状态空间中高效求解包含语义观察的CPOMDPs。该方法在定制测试平台上通过真实的人机团队在动态室内目标搜索和捕捉场景中进行了演示和验证。

英文摘要

In search applications, autonomous unmanned vehicles must be able to efficiently reacquire and localize mobile targets that can remain out of view for long periods of time in large spaces. As such, all available information sources must be actively leveraged -- including imprecise but readily available semantic observations provided by humans. To achieve this, this work develops and validates a novel collaborative human-machine sensing solution for dynamic target search. Our approach uses continuous partially observable Markov decision process (CPOMDP) planning to generate vehicle trajectories that optimally exploit imperfect detection data from onboard sensors, as well as semantic natural language observations that can be specifically requested from human sensors. The key innovation is a scalable hierarchical Gaussian mixture model formulation for efficiently solving CPOMDPs with semantic observations in continuous dynamic state spaces. The approach is demonstrated and validated with a real human-robot team engaged in dynamic indoor target search and capture scenarios on a custom testbed.

URL PDF HTML ☆

赞 0 踩 0

1806.00678 2026-06-04 cs.RO cs.SY eess.SY 版本更新

AutoRally An open platform for aggressive autonomous driving

AutoRally：一个用于激进自动驾驶的开放平台

Brian Goldfain, Paul Drews, Changxi You, Matthew Barulic, Orlin Velev, Panagiotis Tsiotras, James M. Rehg

发表机构 * Georgia Tech Autonomous Racing Facility（佐治亚理工学院自动驾驶赛车中心）

AI总结本文介绍了一个1:5比例的机器人测试平台AutoRally，旨在提供稳健、易用和可重复的自动驾驶研究环境，使非专业人员也能收集真实世界的数据。

1805.12170 2026-06-04 eess.SY cs.RO cs.SY 版本更新

An Improved Active Disturbance Rejection Control for a Differential Drive Mobile Robot with Mismatched Disturbances and Uncertainties

一种改进的主动扰动拒绝控制用于有差驱动移动机器人以应对不匹配扰动和不确定性

Ibraheem Kasim Ibraheem, Wameedh Riyadh Abdul-Adeem

发表机构 * Electrical Engineering Department（电气工程系）； College of Engineering, Baghdad University（巴格达大学工程学院）

AI总结本文提出了一种基于扰动和不确定性估计与抑制技术的改进主动扰动拒绝控制方法，用于非线性运动学模型的有差驱动移动机器人，通过消除扰动和不确定性提高动态性能。

详情

AI中文摘要

本文提出了一种基于扰动和不确定性估计与抑制技术的改进主动扰动拒绝控制方法，用于非线性运动学模型的有差驱动移动机器人。所提出的方法是J.Han建议的主动扰动拒绝控制（ADRC）的改进版本。ADRC用于主动抵消由未知外源信号和系统模型的匹配不确定性引起的扰动，这些扰动被合并为总扰动。在本工作中，假设系统为仿射系统，总扰动和输入被认为是不同通道。为处理不匹配扰动和不确定性，总扰动已被转换为匹配扰动。然后基于改进的ADRC（IADRC），通过估计总扰动并将其从系统中抵消，提高了DDMR的动态性能。通过数字仿真，应用了不同的性能指标，所有结果都表明所提出的IADRC的有效性，几乎消除了抖振现象，并在闭环系统中提供了对扭矩扰动的高鲁棒性。

英文摘要

In this paper a new strategy based on disturbance and uncertainty (DU) estimation and attenuation technique is proposed and tested on the nonlinear kinematic model of the differential drive mobile robot (DDMR). The proposed technique is an improved version of the Active Disturbance Rejection Control (ADRC) strategy suggested by J. Han. The ADRC is used to actively reject disturbances caused by the unknown exogenous signals and the matched uncertainties of the system model, which are lumped all together and attributed as a total disturbance. In this work, the considered system is assumed to be affine and the total disturbance and the input are considered to be on different channels. To deal with the mismatched disturbances and uncertainties, the total disturbance has been converted into a matched one. Then, based on the improved ADRC (IADRC), the dynamic performance of the DDMR has been enhanced by estimating the total disturbance and canceling it from the system. Through digital simulations, different performance measures are applied, and they all indicate the effectiveness of the proposed IADRC by almost removing the chattering phenomenon and providing a high immunity in the closed-loop system against torque disturbance.

URL PDF HTML ☆

赞 0 踩 0

1709.05746 2026-06-04 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY 版本更新

Adversarial Discriminative Sim-to-real Transfer of Visuo-motor Policies

对抗性判别仿真到现实的视觉-运动策略转移

Fangyi Zhang, Jürgen Leitner, Zongyuan Ge, Michael Milford, Peter Corke

发表机构 * Australian Centre for Robotic Vision (ACRV)（澳大利亚机器人视觉中心）； Queensland University of Technology (QUT)（昆士兰技术大学）； Monash University（墨尔本大学）

AI总结本文提出对抗性判别仿真到现实转移方法，减少现实数据标注成本，在桌面上物体抓取任务中，通过视觉观测控制7自由度机械臂在障碍物中抓取蓝色立方体，仅需93个标注和186个未标注图像即可实现97.8%的成功率和1.8厘米的控制精度。

Comments Under review for the International Journal of Robotics Research

详情

AI中文摘要

各种方法已被提出以学习用于现实世界机器人应用的视觉-运动策略。一种解决方案是首先在仿真中学习然后转移到现实世界。在转移过程中，大多数现有方法需要带有标签的真实图像。然而，在许多机器人应用中，标注过程往往昂贵甚至不实际。在本文中，我们提出了一种对抗性判别仿真到现实转移方法，以减少标注真实数据的成本。通过模块化网络在桌面物体抓取任务中验证了该方法的有效性，其中7自由度的机械臂以速度模式控制在障碍物中抓取蓝色立方体。对抗性转移方法将标注真实数据的需求减少了50%。策略可以仅使用93个标注和186个未标注的真实图像转移到现实环境。转移的视觉-运动策略对训练中未见过的物体和移动目标具有鲁棒性，实现了97.8%的成功率和1.8厘米的控制精度。

英文摘要

Various approaches have been proposed to learn visuo-motor policies for real-world robotic applications. One solution is first learning in simulation then transferring to the real world. In the transfer, most existing approaches need real-world images with labels. However, the labelling process is often expensive or even impractical in many robotic applications. In this paper, we propose an adversarial discriminative sim-to-real transfer approach to reduce the cost of labelling real data. The effectiveness of the approach is demonstrated with modular networks in a table-top object reaching task where a 7 DoF arm is controlled in velocity mode to reach a blue cuboid in clutter through visual observations. The adversarial transfer approach reduced the labelled real data requirement by 50%. Policies can be transferred to real environments with only 93 labelled and 186 unlabelled real images. The transferred visuo-motor policies are robust to novel (not seen in training) objects in clutter and even a moving target, achieving a 97.8% success rate and 1.8 cm control accuracy.

URL PDF HTML ☆

赞 0 踩 0

1805.09875 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Autonomous Thermalling as a Partially Observable Markov Decision Process (Extended Version)

自主热力上升作为部分可观测马尔可夫决策过程（扩展版本）

Iain Guilliard, Richard Rogahn, Jim Piavis, Andrey Kolobov

发表机构 * Australian National University（澳大利亚国立大学）； Microsoft Research（微软研究院）

AI总结本文提出将自主热力上升建模为POMDP，并设计基于此的递推地平线控制器，通过在ArduPlane中实现并对比现有方法，验证了其在多架sUAV同时热力上升时的显著优势。

1805.09613 2026-06-04 stat.ML cs.AI cs.LG cs.RO cs.SY eess.SY 版本更新

A0C: Alpha Zero in Continuous Action Space

A0C：在连续动作空间中的Alpha Zero

Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker

发表机构 * Dep. of Computer Science, Delft University of Technology, The Netherlands（代尔夫特理工大学计算机科学系，荷兰）； Dep. of Computer Science, Leiden University, The Netherlands（莱顿大学计算机科学系，荷兰）

AI总结本文提出将Alpha Zero扩展到连续动作空间的理论方法，并在倒摆任务中验证了其可行性，为连续动作空间中的迭代搜索与学习应用奠定了基础。

1805.08551 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robust Model Predictive Control for Autonomous Vehicles/Self Driving Cars

自主车辆/自动驾驶汽车的鲁棒模型预测控制

Che Kun Law, Darshit Dalal, Stephen Shearrow

AI总结本文提出一种用于自主车辆前轮控制的鲁棒模型预测控制方法，通过权重调整和在线线性化技术提升控制鲁棒性，并讨论各方法在精度和计算负载上的有效性。

Comments 12 pages,9 figures

1510.07380 2026-06-04 cs.RO cs.SY eess.SY 版本更新

SLAP: Simultaneous Localization and Planning Under Uncertainty for Physical Mobile Robots via Dynamic Replanning in Belief Space: Extended version

SLAP：通过信念空间中的动态重新规划实现物理移动机器人的同时定位与规划（在不确定性下）：扩展版

Ali-akbar Agha-mohammadi, Saurav Agarwal, Sung-Kyun Kim, Suman Chakravorty, Nancy M. Amato

发表机构 * NASA-JPL, Caltech（NASA-喷气推进中心，加州理工学院）； Dept. of Aerospace Eng. and Amato is with the Dept. of Computer Science（航空航天工程系和计算机科学系）； Dept. of Computer Science（计算机科学系）

AI总结本文提出一种在不确定性环境下通过信念空间动态重新规划实现物理移动机器人同时定位与规划的方法，通过在线重新规划循环改进离线策略，有效应对环境变化和大定位误差，优于FIRM方法。

Comments 20 pages, updated figures, extended theory and simulation results

详情

AI中文摘要

同时定位与规划（SLAP）是自主机器人在不确定性环境下至关重要的能力。在最一般的形式下，SLAP诱导出一个连续的POMDP（部分可观测马尔可夫决策过程），需要在线不断求解。本文针对此问题提出一种在信念空间中的动态重新规划方案。该连续的POMDP在状态、动作和观测空间中通过采样方法进行离线近似，但通过在线重新规划循环实现局部改进。这种构造使所提方法能够应对环境变化和大定位误差，即使环境变化改变了最优轨迹的同调类。此外，本文方法优于当前最先进的FIRM（反馈信息路标）方法，通过消除不必要的稳定步骤。将信念空间规划应用于物理系统带来了诸多挑战。本文的重点是将所提规划器应用于物理机器人，并展示在不确定性、变化环境和存在大干扰（如被绑架机器人情况）下的SLAP解决方案性能。

英文摘要

Simultaneous localization and Planning (SLAP) is a crucial ability for an autonomous robot operating under uncertainty. In its most general form, SLAP induces a continuous POMDP (partially-observable Markov decision process), which needs to be repeatedly solved online. This paper addresses this problem and proposes a dynamic replanning scheme in belief space. The underlying POMDP, which is continuous in state, action, and observation space, is approximated offline via sampling-based methods, but operates in a replanning loop online to admit local improvements to the coarse offline policy. This construct enables the proposed method to combat changing environments and large localization errors, even when the change alters the homotopy class of the optimal trajectory. It further outperforms the state-of-the-art FIRM (Feedback-based Information RoadMap) method by eliminating unnecessary stabilization steps. Applying belief space planning to physical systems brings with it a plethora of challenges. A key focus of this paper is to implement the proposed planner on a physical robot and show the SLAP solution performance under uncertainty, in changing environments and in the presence of large disturbances, such as a kidnapped robot situation.

URL PDF HTML ☆

赞 0 踩 0

1805.04201 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Learning to Grasp Without Seeing

无需视觉的抓取学习

Adithyavairavan Murali, Yin Li, Dhiraj Gandhi, Abhinav Gupta

发表机构 * The Robotics Institute, Carnegie Mellon University（卡内基梅隆大学机器人研究所）

AI总结本文提出基于触觉感知的抓取方法，通过触觉信号表征和迭代重抓取提升抓取稳定性，实验表明在无视觉信息下可有效抓取新型物体。

详情

AI中文摘要

能否在不看到物体的情况下让机器人抓取未知物体？本文提出了一种基于触觉感知的解决方案，结合触觉信号定位与触觉反馈重抓取。我们创建了一个大规模抓取数据集，包含超过30帧RGB图像和280万条触觉样本。提出了一种无监督自编码方案，显著提升了触觉感知任务的性能。系统分为两个步骤：首先，触觉定位模型通过粒子滤波聚合目标信息，输出物体位置估计以建立初始抓取；其次，重抓取模型基于学习特征逐步改进抓取，估计抓取稳定性并预测下一步调整。最终通过大量实验验证了在无视觉信息下抓取新型物体的有效性，并在视觉策略基础上提升了整体准确率10.6%。

英文摘要

Can a robot grasp an unknown object without seeing it? In this paper, we present a tactile-sensing based approach to this challenging problem of grasping novel objects without prior knowledge of their location or physical properties. Our key idea is to combine touch based object localization with tactile based re-grasping. To train our learning models, we created a large-scale grasping dataset, including more than 30 RGB frames and over 2.8 million tactile samples from 7800 grasp interactions of 52 objects. To learn a representation of tactile signals, we propose an unsupervised auto-encoding scheme, which shows a significant improvement of 4-9% over prior methods on a variety of tactile perception tasks. Our system consists of two steps. First, our touch localization model sequentially 'touch-scans' the workspace and uses a particle filter to aggregate beliefs from multiple hits of the target. It outputs an estimate of the object's location, from which an initial grasp is established. Next, our re-grasping model learns to progressively improve grasps with tactile feedback based on the learned features. This network learns to estimate grasp stability and predict adjustment for the next grasp. Re-grasping thus is performed iteratively until our model identifies a stable grasp. Finally, we demonstrate extensive experimental results on grasping a large set of novel objects using tactile sensing alone. Furthermore, when applied on top of a vision-based policy, our re-grasping model significantly boosts the overall accuracy by 10.6%. We believe this is the first attempt at learning to grasp with only tactile sensing and without any prior object knowledge.

URL PDF HTML ☆

赞 0 踩 0

1805.03358 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Collision-Free Path Planning Algorithm for Unmanned Aerial Vehicle Delivery

一种用于无人机送货的无碰撞路径规划算法

Ziji Shi, Wee Keong Ng

AI总结本文提出一种基于A*算法的无碰撞路径规划方法，通过引入考虑等待时间的启发函数，解决动态环境中无人机送货的碰撞问题，并通过仿真验证了算法的有效性。

Comments Accepted by 2018 International Conference on Unmanned Aircraft Systems (ICUAS)

详情

AI中文摘要

路径规划对于无人机的自主性至关重要，特别是在调度无人机送货时。然而，无人机的操作环境通常是不确定和动态的。没有适当的规划，多个无人机拥堵时可能发生碰撞。此外，当局可能设置临时禁飞区，使空域无法使用。因此，需要适当的预出发规划以避免这些地方。本文将该问题转化为约束满足问题，以在动态图中找到无碰撞的最短路径。我们提出了一种基于A*算法的无碰撞路径规划算法。主要创新点是发明了一个同时考虑等待时间的启发函数。我们后来证明，添加等待惩罚后，所提出算法是最优的，因为启发函数是可接受的。该算法的实现使用了新加坡的空域结构进行无人机送货仿真。我们的仿真展示了良好的运行时间性能。使用所提出的算法，碰撞自由路线的百分比随着单位面积请求数量的增加而减少，并在边界值处显著下降。我们的实证分析可以辅助禁飞区政策和无人机送货基础设施的决策。

英文摘要

Path planning is important for the autonomy of Unmanned Aerial Vehicle (UAV), especially for scheduling UAV delivery. However, the operating environment of UAVs is usually uncertain and dynamic. Without proper planning, collisions may happen where multiple UAVs are congested. Besides, there may also be temporary no-fly zone setup by authorities that makes airspace unusable. Thus, proper pre-departure planning that avoids such places is needed. In this paper, we formulate this problem into a Constraint Satisfaction Problem to find a collision-free shortest path on a dynamic graph. We propose a collision-free path planning algorithm that is based on A* algorithm. The main novelty is that we invent a heuristic function that also considers waiting time. We later show that, with added waiting penalty, the proposed algorithm is optimal because the heuristic is admissible. Implementation of this algorithm simulates UAV delivery using Singapore's airspace structure. Our simulation exhibits desirable runtime performance. Using the proposed algorithm, the percentage of collision-free routes decreases as number of requests per unit area increases, and this percentage drops significantly at boundary value. Our empirical analysis could aid the decision-making of no-fly zone policy and infrastructure of UAV delivery.

URL PDF HTML ☆

赞 0 踩 0

1805.02508 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Generic Self-Evolving Neuro-Fuzzy Controller based High-performance Hexacopter Altitude Control System

一种基于通用自进化神经模糊控制器的高性能六旋翼无人机高度控制系统

Md Meftahul Ferdaus, Mahardhika Pratama, Sreenatha G. Anavatti, Matthew A. Garratt

AI总结本文提出一种无需领域知识的通用自进化神经模糊控制器，用于六旋翼无人机的高度控制，通过自适应规则调整和滑模控制理论保证系统稳定性。

Comments submitted in the 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC2018)

详情

AI中文摘要

如今，全自主系统如旋翼无人机（UAV）的应用迅速增加。由于非线性动态复杂，开发基于学习机器的智能自组织进化控制器成为研究热点。本文提出名为通用控制器（G-controller）的进化控制器，用于控制六旋翼无人机的高度。该控制器可与极少的领域知识一起工作。其进化架构基于先进的增量学习算法，即通用进化神经模糊推理系统（GENEFIS）。控制器不需要离线训练，因为它从空规则集开始运行，然后根据需求添加或删除规则。后件参数的适应律来源于滑模控制（SMC）理论。Lyapunov理论用于保证所提控制器的稳定性。此外，还实施了辅助鲁棒化控制项，以获得跟踪误差趋于零的均匀渐近收敛。最后，通过六旋翼无人机在各种轨迹上的高度跟踪来评估G控制器的性能。

英文摘要

Nowadays, the application of fully autonomous system like rotary wing unmanned air vehicles (UAVs) is increasing sharply. Due to the complex nonlinear dynamics, a huge research interest is witnessed in developing learning machine based intelligent, self-organizing evolving controller for these vehicles notably to address the system's dynamic characteristics. In this work, such an evolving controller namely Generic-controller (G-controller) is proposed to control the altitude of a rotary wing UAV namely hexacopter. This controller can work with very minor expert domain knowledge. The evolving architecture of this controller is based on an advanced incremental learning algorithm namely Generic Evolving Neuro-Fuzzy Inference System (GENEFIS). The controller does not require any offline training, since it starts operating from scratch with an empty set of fuzzy rules, and then add or delete rules on demand. The adaptation laws for the consequent parameters are derived from the sliding mode control (SMC) theory. The Lyapunov theory is used to guarantee the stability of the proposed controller. In addition, an auxiliary robustifying control term is implemented to obtain a uniform asymptotic convergence of tracking error to zero. Finally, the G-controller's performance evaluation is observed through the altitude tracking of a UAV namely hexacopter for various trajectories.

URL PDF HTML ☆

赞 0 踩 0

1801.08995 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Trajectory Generation using Sharpness Continuous Dubins-like Paths with Applications in Control of Heavy Duty Vehicles

利用连续尖锐度的Dubins-like路径生成轨迹，应用于重载车辆控制

Rui Oliveira, Pedro F. Lima, Marcello Cirillo, Jonas Mårtensson, Bo Wahlberg

发表机构 * Scania, Autonomous Transport Solutions（斯堪尼亚，自主运输解决方案）

AI总结本文提出了一种考虑转向执行器速率和扭矩限制的轨迹生成框架，通过连续尖锐度曲线提升重载车辆的自主驾驶性能。

Comments 18 pages, 7 figures, accepted for publication at ECC 2018 - European Control Conference

详情

AI中文摘要

我们提出了一种用于控制轮式车辆的轨迹生成框架，考虑转向执行器的速率和扭矩限制。关键思想是直接考虑转向执行器的速率和扭矩限制，而传统方法仅考虑曲率速率限制。我们提出新的尖锐度连续曲线概念，结合三次和Sigmoid曲率轨迹以及圆弧来引导车辆。所获得的轨迹具有平滑且连续可微的转向角剖面。这些轨迹为低层控制器提供更易跟踪的参考信号，从而提高性能。所获得的转向剖面的平滑性也提高了乘客舒适度。该方法具有快速的计算时间，可通过简单预计算进一步加速。我们详细讨论了该方法的路径规划应用，并通过仿真展示了其优势和实时能力。

英文摘要

We present a trajectory generation framework for control of wheeled vehicles under steering actuator constraints. The motivation is smooth autonomous driving of heavy vehicles. The key idea is to take into account rate, and additionally, torque limitations of the steering actuator directly. Previous methods only take into account curvature rate limitations, which deal indirectly with steering rate limitations. We propose the new concept of Sharpness Continuous curves, which uses cubic and sigmoid curvature trajectories together with circular arcs to steer the vehicle. The obtained trajectories are characterized by a smooth and continuously differentiable steering angle profile. These trajectories provide low-level controllers with reference signals which are easier to track, resulting in improved performance. The smoothness of the obtained steering profiles also results in increased passenger comfort. The method is characterized by a fast computation time, which can be further speeded up through the use of simple pre-computations. We detail possible path planning applications of the method, and conduct simulations that show its advantages and real time capabilities.

URL PDF HTML ☆

赞 0 踩 0

1804.08676 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

Gesture based Human-Swarm Interactions for Formation Control using interpreters

基于手势的人群-蜂群交互的编队控制使用解释器

Aamodh Suresh, Sonia Martinez

发表机构 * Department of Mechanical and Aerospace Engineering, University of California at San Diego, La Jolla, CA 92093, USA（机械与航空航天工程系，加州大学圣地亚哥分校，拉古拉，CA 92093，美国）

AI总结本文提出了一种新颖的人群-蜂群交互框架，通过手势控制蜂群形状和编队。该框架利用可穿戴臂带记录手势，通过解释器将手势转化为蜂群控制指令，结合机器学习和最优控制技术实现编队控制。

详情

AI中文摘要

我们提出了一种新颖的人群-蜂群交互（HSI）框架，使用户能够通过简单的手臂手势和动作控制蜂群的形状和编队。用户通过可穿戴的臂带记录手势，该框架引入了一种新颖的解释器系统，作为用户和蜂群之间的中介，简化用户的交互角色。解释器接收用户通过手势绘制的高层次输入，并将其转化为低层次的蜂群控制指令。该解释器利用机器学习、卡尔曼滤波和最优控制技术将用户输入转化为蜂群控制参数。引入了人类可解释的动力学概念，用于解释器的规划以及向用户提供反馈。蜂群的动力学通过基于分布式线性迭代和动态平均一致的新型去中心化编队控制器进行控制。该框架在二维环境中理论和实验上均得到了验证，展示了人类实时控制模拟机器人蜂群的能力。

英文摘要

We propose a novel Human-Swarm Interaction (HSI) framework which enables the user to control a swarm shape and formation. The user commands the swarm utilizing just arm gestures and motions which are recorded by an off-the-shelf wearable armband. We propose a novel interpreter system, which acts as an intermediary between the user and the swarm to simplify the user's role in the interaction. The interpreter takes in a high level input drawn using gestures by the user, and translates it into low level swarm control commands. This interpreter employs machine learning, Kalman filtering and optimal control techniques to translate the user input into swarm control parameters. A notion of Human Interpretable dynamics is introduced, which is used by the interpreter for planning as well as to provide feedback to the user. The dynamics of the swarm are controlled using a novel decentralized formation controller based on distributed linear iterations and dynamic average consensus. The framework is demonstrated theoretically as well as experimentally in a 2D environment, with a human controlling a swarm of simulated robots in real time.

URL PDF HTML ☆

赞 0 踩 0

1804.06586 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Composite Adaptive Control for Bilateral Teleoperation Systems without Persistency of Excitation

双向往复操作系统无持续激励的复合自适应控制

Yuling Li, Yixin Yin, Sen Zhang, Jie Dong, Rolf Johansson

发表机构 * School of Automation and Electrical Engineering, University of Science and Technology Beijing（自动化与电气工程学院，北京科技大学）； Department of Automatic Control, Lund University, P.O. Box 118, 22100 Lund, Sweden（自动控制系，卢德大学）

AI总结本文提出一种无需持续激励条件的复合自适应控制方法，用于解决非线性双向往复操作系统中参数收敛问题，通过线性矩阵不等式给出闭环系统稳定性准则，并通过仿真验证了方法的有效性。

Comments 21 pages, 9 figures, submitted to Journal of The Franklin Institute

1804.04696 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Efficient Model Identification for Tensegrity Locomotion

高效 tensegrity 机器人运动的模型识别

Shaojun Zhu, David Surovik, Kostas E. Bekris, Abdeslam Boularias

发表机构 * Department of Computer Science, Rutgers University（计算机科学系，罗格斯大学）

AI总结本文提出一种高效方法，利用物理引擎和贝叶斯优化框架，用于识别高维顺应性tensegrity机器人中的未知机械参数，提升运动控制精度。

1804.04349 2026-06-04 eess.SY cs.RO cs.SE cs.SY 版本更新

On the Application of ISO 26262 in Control Design for Automated Vehicles

在自动驾驶车辆控制设计中应用ISO 26262

Georg Schildbach

发表机构 * Institute for Electrical Engineering in Medicine, University of Luebeck（医学电气工程研究所，吕贝克大学）

AI总结本文探讨了ISO 26262标准在自动驾驶车辆控制设计中的应用，分析了其在高自动化车辆中的适用性与争议，并总结了该标准的安全设计步骤。

Comments In Proceedings SCAV 2018, arXiv:1804.03406

详情

DOI: 10.4204/EPTCS.269.7
Journal ref: EPTCS 269, 2018, pp. 74-82

AI中文摘要

过去十年间，自动驾驶车辆的研究经历了爆炸性增长。然而，其实际应用的主要障碍是一个令人信服的安全概念。随着算法日益复杂和车辆自动化水平提高，这一问题愈发重要。功能安全领域提供了一种系统方法，用于识别潜在风险源并提高车辆安全性。该领域基于航空航天、过程和其他行业数十年的实践经验。这些经验汇总成汽车领域的功能安全标准ISO 26262，已被广泛采用。然而，其在高自动化车辆中的适用性和相关性却存在争议。本文对这一讨论进行了批判性分析，并总结了ISO 26262的主要步骤，以实现自动驾驶车辆的安全控制设计。

英文摘要

Research on automated vehicles has experienced an explosive growth over the past decade. A main obstacle to their practical realization, however, is a convincing safety concept. This question becomes ever more important as more sophisticated algorithms are used and the vehicle automation level increases. The field of functional safety offers a systematic approach to identify possible sources of risk and to improve the safety of a vehicle. It is based on practical experience across the aerospace, process and other industries over multiple decades. This experience is compiled in the functional safety standard for the automotive domain, ISO 26262, which is widely adopted throughout the automotive industry. However, its applicability and relevance for highly automated vehicles is subject to a controversial debate. This paper takes a critical look at the discussion and summarizes the main steps of ISO 26262 for a safe control design for automated vehicles.

URL PDF HTML ☆

赞 0 踩 0

1804.04347 2026-06-04 cs.RO cs.SE cs.SY eess.SY 版本更新

The CAT Vehicle Testbed: A Simulator with Hardware in the Loop for Autonomous Vehicle Applications

CAT车辆测试平台：用于自动驾驶应用的具有闭环硬件的模拟器

Rahul Kumar Bhadani, Jonathan Sprinkle, Matthew Bunting

发表机构 * Department of Electrical and Computer Engineering（电气与计算机工程系）； University of Arizona（亚利桑那大学）； Tucson, USA（美国图森市）

AI总结本文提出CAT车辆测试平台，通过闭环硬件模拟验证仿真结果，支持自动驾驶技术研究。平台基于ROS和物理车辆模型，支持多车交互和实时数据回放，可快速验证算法性能。

Comments In Proceedings SCAV 2018, arXiv:1804.03406

详情

DOI: 10.4204/EPTCS.269.4
Journal ref: EPTCS 269, 2018, pp. 32-47

AI中文摘要

本文介绍了CAT车辆（认知与自主测试车辆）测试平台：一个由分布式仿真为基础的自动驾驶车辆组成的研发测试平台，能够轻松过渡到闭环硬件测试和执行，以支持自动驾驶技术的研究。自动驾驶技术从主动安全功能和高级驾驶辅助系统发展到完全传感器引导的自动驾驶，需要测试所有可能的场景。然而，研究人员若没有自己的机器人平台，想要在物理平台上展示新成果将面临困难。因此，需要一个研究测试平台，使基于仿真的结果能够通过闭环仿真快速验证，以便在物理平台上测试软件。CAT车辆测试平台提供了这样的测试平台，可以在仿真中模拟真实车辆的动力学，然后无缝过渡到使用案例的硬件再现。该模拟器使用机器人操作系统（ROS）和基于物理的车辆模型，包括具有可配置参数的模拟传感器和执行器。该测试平台允许多车仿真以支持车辆间交互。我们的测试平台还支持实时数据记录和捕获，可以回放以检查特定场景或使用案例，并用于回归测试。作为可行性演示的一部分，我们介绍了CAT车辆挑战，全球各地的学生研究人员能够在少于2天的物理平台接口时间内重现他们的仿真结果。

英文摘要

This paper presents the CAT Vehicle (Cognitive and Autonomous Test Vehicle) Testbed: a research testbed comprised of a distributed simulation-based autonomous vehicle, with straightforward transition to hardware in the loop testing and execution, to support research in autonomous driving technology. The evolution of autonomous driving technology from active safety features and advanced driving assistance systems to full sensor-guided autonomous driving requires testing of every possible scenario. However, researchers who want to demonstrate new results on a physical platform face difficult challenges, if they do not have access to a robotic platform in their own labs. Thus, there is a need for a research testbed where simulation-based results can be rapidly validated through hardware in the loop simulation, in order to test the software on board the physical platform. The CAT Vehicle Testbed offers such a testbed that can mimic dynamics of a real vehicle in simulation and then seamlessly transition to reproduction of use cases with hardware. The simulator utilizes the Robot Operating System (ROS) with a physics-based vehicle model, including simulated sensors and actuators with configurable parameters. The testbed allows multi-vehicle simulation to support vehicle to vehicle interaction. Our testbed also facilitates logging and capturing of the data in the real time that can be played back to examine particular scenarios or use cases, and for regression testing. As part of the demonstration of feasibility, we present a brief description of the CAT Vehicle Challenge, in which student researchers from all over the globe were able to reproduce their simulation results with fewer than 2 days of interfacing with the physical platform.

URL PDF HTML ☆

赞 0 踩 0

1804.03036 2026-06-04 eess.IV cs.RO cs.SY eess.SY 版本更新

Image Moment Models for Extended Object Tracking

图像矩模型用于扩展目标跟踪

Gang Yao, Ashwin Dani

发表机构 * Department of Electrical and Computer Engineering, University of Connecticut Storrs（电气与计算机工程系，康涅狄格大学斯托尔斯分校）

AI总结本文提出基于图像矩的新型模型，用于估计和跟踪复杂轨迹下的目标形状。通过无迹卡尔曼滤波-交互多模型算法估计目标形状及位置速度，结合IOU和RMSE指标进行验证。

详情

Journal ref: IEEE Transactions on Aerospace and Electronic Systems, 2018

AI中文摘要

本文提出了一种基于图像矩的新型模型，用于估计和跟踪具有复杂轨迹的物体形状。假设相机静止注视移动物体，内部点特征作为测量值。假设椭圆近似为基本形状，利用图像矩组合估计椭圆形状。推导了在恒速或协调转弯运动模型下图像矩的动态模型，作为物体形状估计的函数。应用无迹卡尔曼滤波-交互多模型（UKF-IMM）滤波算法估计目标形状（近似为椭圆）并跟踪其位置和速度。基于平均对数似然推导IMM滤波器的似然函数。展示了所提UKF-IMM算法与图像矩模型的仿真结果，显示了在复杂轨迹中移动物体的估计结果。通过交并比（IOU）和位置和速度均方根误差（RMSE）等指标与文献中的基准算法进行比较。还展示了从四旋翼无人机捕获的真实图像数据结果。

英文摘要

In this paper, a novel image moments based model for shape estimation and tracking of an object moving with a complex trajectory is presented. The camera is assumed to be stationary looking at a moving object. Point features inside the object are sampled as measurements. An ellipsoidal approximation of the shape is assumed as a primitive shape. The shape of an ellipse is estimated using a combination of image moments. Dynamic model of image moments when the object moves under the constant velocity or coordinated turn motion model is derived as a function for the shape estimation of the object. An Unscented Kalman Filter-Interacting Multiple Model (UKF-IMM) filter algorithm is applied to estimate the shape of the object (approximated as an ellipse) and track its position and velocity. A likelihood function based on average log-likelihood is derived for the IMM filter. Simulation results of the proposed UKF-IMM algorithm with the image moments based models are presented that show the estimations of the shape of the object moving in complex trajectories. Comparison results, using intersection over union (IOU), and position and velocity root mean square errors (RMSE) as metrics, with a benchmark algorithm from literature are presented. Results on real image data captured from the quadcopter are also presented.

URL PDF HTML ☆

赞 0 踩 0

1804.02814 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Nonlinear Unknown Input and State Estimation Algorithm in Mobile Robots

移动机器人中的非线性未知输入和状态估计算法

Pinyao Guo, Hunmin Kim, Nurali Virani, Jun Xu, Minghui Zhu, Peng Liu

发表机构 * College of Information Sciences（信息科学学院）； School of Electrical Engineering（电气工程学院）； GE Global Research（GE全球研究）

AI总结本文提出一种适用于非线性动态模型和随机噪声的移动机器人未知输入和状态估计算法，通过传感器数据和控制指令检测并量化传感器和执行器的异常。

Comments arXiv admin note: text overlap with arXiv:1708.01834

1612.07139 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

A Survey of Deep Network Solutions for Learning Control in Robotics: From Reinforcement to Imitation

深度网络在机器人学习控制中的应用综述：从强化到模仿

Lei Tai, Jingwei Zhang, Ming Liu, Joschka Boedecker, Wolfram Burgard

发表机构 * University of Freiburg（弗赖堡大学）

AI总结本文综述了深度学习在机器人学习控制中的应用，探讨了深度强化学习和模仿学习两大主流方法，分析了其在导航、 manipulation 任务中的应用及现实差距挑战。

Comments 19 pages, 1 figures

详情

AI中文摘要

深度学习技术已广泛应用于各种研究领域，取得了最先进的成果。本文综述了针对机器人应用的学习控制策略的深度学习解决方案。我们讨论了深度学习在学习控制中的两大主要范式：深度强化学习和模仿学习。对于深度强化学习（DRL），我们从传统强化学习算法开始，展示了如何将其扩展到深度领域，并介绍了在机器人导航和 manipulation 任务中使用 DRL 的代表性工作。我们继续讨论了解决现实差距挑战的方法，即如何将仿真中训练的 DRL 策略转移到现实世界场景，并总结了用于 DRL 研究的机器人仿真平台。对于模仿学习，我们探讨了其三个主要类别：行为克隆、逆强化学习和生成对抗模仿学习，介绍了它们的公式及其在机器人应用中的对应情况。最后，我们讨论了开放挑战和研究前沿。

英文摘要

Deep learning techniques have been widely applied, achieving state-of-the-art results in various fields of study. This survey focuses on deep learning solutions that target learning control policies for robotics applications. We carry out our discussions on the two main paradigms for learning control with deep networks: deep reinforcement learning and imitation learning. For deep reinforcement learning (DRL), we begin from traditional reinforcement learning algorithms, showing how they are extended to the deep context and effective mechanisms that could be added on top of the DRL algorithms. We then introduce representative works that utilize DRL to solve navigation and manipulation tasks in robotics. We continue our discussion on methods addressing the challenge of the reality gap for transferring DRL policies trained in simulation to real-world scenarios, and summarize robotics simulation platforms for conducting DRL research. For imitation leaning, we go through its three main categories, behavior cloning, inverse reinforcement learning and generative adversarial imitation learning, by introducing their formulations and their corresponding robotics applications. Finally, we discuss the open challenges and research frontiers.

URL PDF HTML ☆

赞 0 踩 0

1804.00311 2026-06-04 eess.SY cs.RO cs.SY eess.SP 版本更新

Trajectory Optimization of Robots with Regenerative Drive Systems: Numerical and Experimental Results

具有再生驱动系统的机器人轨迹优化：数值和实验结果

Poya Khalaf, Hanz Richter

AI总结本文研究了具有超级电容器再生驱动系统的机器人能量最优控制，通过数值和实验验证了轨迹优化方法，展示了13%的能耗降低效果。

1803.10371 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system

基于非操控操作的强化学习：从仿真到物理系统的迁移

Kendall Lowrey, Svetoslav Kolev, Jeremy Dao, Aravind Rajeswaran, Emanuel Todorov

发表机构 * University of Washington（华盛顿大学）； Roboti LLC（Roboti公司）

AI总结本文提出了一种基于仿真的强化学习方法，用于非操控操作任务，通过在仿真环境中训练策略，成功迁移到物理系统中，且在模型集合训练下提升了策略的鲁棒性。

Comments Accepted at IEEE SIMPAR 2018. Project page: https://sites.google.com/view/phantomsim2real

详情

AI中文摘要

强化学习已作为一种有前途的方法用于训练机器人控制器。然而，大多数结果受限于仿真，因为需要大量样本且缺乏自动且安全的数据收集方法。基于模型的强化学习方法提供了一种途径来克服这些挑战，但传统关注的是仿真与现实世界之间的不匹配。这里，我们展示在仿真中学习的控制策略可以成功迁移到由三个Phantom机器人推动物体到各种目标位置的物理系统中。我们使用修改的自然策略梯度算法进行学习，应用于精心识别的仿真模型。所得到的策略在仿真中完全训练后，在物理系统中无需额外训练即可有效工作。此外，我们还表明，使用模型集合训练使学习的策略对建模误差更鲁棒，从而补偿系统识别的困难。

英文摘要

Reinforcement learning has emerged as a promising methodology for training robot controllers. However, most results have been limited to simulation due to the need for a large number of samples and the lack of automated-yet-safe data collection methods. Model-based reinforcement learning methods provide an avenue to circumvent these challenges, but the traditional concern has been the mismatch between the simulator and the real world. Here, we show that control policies learned in simulation can successfully transfer to a physical system, composed of three Phantom robots pushing an object to various desired target positions. We use a modified form of the natural policy gradient algorithm for learning, applied to a carefully identified simulation model. The resulting policies, trained entirely in simulation, work well on the physical system without additional training. In addition, we show that training with an ensemble of models makes the learned policies more robust to modeling errors, thus compensating for difficulties in system identification.

URL PDF HTML ☆

赞 0 踩 0

1803.09792 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Min-Max Tours for Task Allocation to Heterogeneous Agents

为异构智能体分配任务的最优化路线

Amritha Prasad, Han-Lim Choi, Shreyas Sundaram

发表机构 * School of Electrical and Computer Engineering at Purdue University（普渡大学电气与计算机工程学院）

AI总结研究如何为异构移动智能体分配任务，以最小化任何智能体完成任务并返回仓库的最大成本，提出了一种三阶段算法，提供5倍近似比。

详情

AI中文摘要

我们考虑一个场景，其中一组异构移动智能体位于仓库，一组任务分散在地理区域中。智能体分为不同类型，任务分为专用任务（只能由特定类型的智能体完成）和通用任务（可由任何智能体完成）。每对任务之间的距离已指定，并满足三角不等式。给定此场景，我们研究如何将这些任务分配给可用智能体（受类型兼容性约束），以最小化任何智能体完成任务并返回仓库的最大成本。该问题属于NP难问题，我们提出了一种三阶段算法来解决该问题，无论总智能体数量和每种类型智能体数量如何，该算法提供5倍的近似比。我们还证明，在仅有一种类型智能体的情况下，该算法的近似因子为4。

英文摘要

We consider a scenario consisting of a set of heterogeneous mobile agents located at a depot, and a set of tasks dispersed over a geographic area. The agents are partitioned into different types. The tasks are partitioned into specialized tasks that can only be done by agents of a certain type, and generic tasks that can be done by any agent. The distances between each pair of tasks are specified, and satisfy the triangle inequality. Given this scenario, we address the problem of allocating these tasks among the available agents (subject to type compatibility constraints) while minimizing the maximum cost to tour the allocation by any agent and return to the depot. This problem is NP-hard, and we give a three phase algorithm to solve this problem that provides 5-factor approximation, regardless of the total number of agents and the number of agents of each type. We also show that in the special case where there is only one agent of each type, the algorithm has an approximation factor of 4.

URL PDF HTML ☆

赞 0 踩 0

1802.04929 2026-06-04 eess.SY cs.LG cs.RO cs.SY 版本更新

Context-Specific Validation of Data-Driven Models

基于情境的驱动模型验证

Somil Bansal, Shromona Ghosh, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia, Claire J. Tomlin

AI总结本文提出了一种基于情境的驱动模型验证框架，通过计算闭环实际系统与学习模型之间的距离来评估模型质量，并采用主动采样方案高效计算距离上界，用于验证实际系统控制器设计。

详情

AI中文摘要

随着数据驱动模型在机器人系统控制中的广泛应用，开发一种在部署前验证此类模型的方法变得至关重要。具体而言，必须确保为学习模型设计的控制器在实际物理系统上能按预期运行。本文提出了一种基于情境的验证框架，通过闭环实际系统与学习模型之间的距离度量来量化学习模型的质量。随后，我们提出了一种主动采样方案，以在样本高效的方式下计算该距离的概率上界。所提出的框架仅验证与模型预期用途相关的行为，且不需要任何先验系统动力学知识。几个模拟示例展示了该框架在验证真实系统模型及控制器合成中的实用性。

英文摘要

With an increasing use of data-driven models to control robotic systems, it has become important to develop a methodology for validating such models before they can be deployed to design a controller for the actual system. Specifically, it must be ensured that the controller designed for a learned model would perform as expected on the actual physical system. We propose a context-specific validation framework to quantify the quality of a learned model based on a distance measure between the closed-loop actual system and the learned model. We then propose an active sampling scheme to compute a probabilistic upper bound on this distance in a sample-efficient manner. The proposed framework validates the learned model against only those behaviors of the system that are relevant for the purpose for which we intend to use this model, and does not require any a priori knowledge of the system dynamics. Several simulations illustrate the practicality of the proposed framework for validating the models of real-world systems, and consequently, for controller synthesis.

URL PDF HTML ☆

赞 0 踩 0

1709.05363 2026-06-04 cs.RO cs.GT cs.SY eess.SY 版本更新

Synthesis of surveillance strategies via belief abstraction

通过信念抽象合成监视策略

Suda Bharadwaj, Rayna Dimitrova, Ufuk Topcu

发表机构 * University of Leicester（莱斯特大学）

AI总结本文研究了合成具有监视目标的机器人控制器问题，通过将问题建模为单侧部分信息游戏，并利用抽象技术减少状态空间爆炸，从而实现监视策略的合成。

详情

AI中文摘要

我们研究了合成具有监视目标的机器人控制器问题，即机器人需维持对移动目标位置的了解。我们将该问题建模为单侧部分信息游戏，其中代理的胜利条件由时序逻辑公式指定。该规范形式化了用户提供的监视需求，包括额外的非监视任务。为了合成满足规范的监视策略，我们将部分信息游戏转换为完美信息游戏，利用抽象技术缓解此类转换通常导致的指数级状态空间爆炸。这使得可以使用现成的工具进行反应合成。我们使用反例引导的细化技术自动实现足够的抽象精度以合成监视策略。我们在两个案例研究中评估了所提出的方法，展示了其在大规模状态空间和多样化需求中的适用性。

英文摘要

We study the problem of synthesizing a controller for a robot with a surveillance objective, that is, the robot is required to maintain knowledge of the location of a moving, possibly adversarial target. We formulate this problem as a one-sided partial-information game in which the winning condition for the agent is specified as a temporal logic formula. The specification formalizes the surveillance requirement given by the user, including additional non-surveillance tasks. In order to synthesize a surveillance strategy that meets the specification, we transform the partial-information game into a perfect-information one, using abstraction to mitigate the exponential blow-up typically incurred by such transformations. This enables the use of off-the-shelf tools for reactive synthesis. We use counterexample-guided refinement to automatically achieve abstraction precision that is sufficient to synthesize a surveillance strategy. We evaluate the proposed method on two case-studies, demonstrating its applicability to large state-spaces and diverse requirements.

URL PDF HTML ☆

赞 0 踩 0

1703.02660 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY 版本更新

Towards Generalization and Simplicity in Continuous Control

连续控制中的泛化与简洁性

Aravind Rajeswaran, Kendall Lowrey, Emanuel Todorov, Sham Kakade

发表机构 * University of Washington（华盛顿大学）

AI总结本文展示简单线性与RBF参数化策略可解决多种连续控制任务，性能可与更复杂网络相媲美，且多样初始化提升泛化能力。

Comments NIPS 2017, Project page: https://sites.google.com/view/simple-pol

1709.04407 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

An Inversion-Based Learning Approach for Improving Impromptu Trajectory Tracking of Robots with Non-Minimum Phase Dynamics

基于逆向学习的方法用于改进具有非最小相位动态的机器人即兴轨迹跟踪

Siqi Zhou, Mohamed K. Helwa, Angela P. Schoellig

发表机构 * Dynamic Systems Lab（动态系统实验室）； Institute for Aerospace Studies（航空航天研究 institute）； University of Toronto（多伦多大学）； Cairo University（开罗大学）

AI总结本文提出一种基于学习的方法，用于改进非最小相位系统的即兴轨迹跟踪，通过直接从输入输出数据学习稳定近似逆向，验证了方法的稳定性与高精度跟踪效果。

Comments Accepted for publication in the IEEE Robotics and Automation Letters (RA-L), July 2018

详情

DOI: 10.1109/LRA.2018.2801471

AI中文摘要

本文提出了一种基于学习的方法，用于非最小相位系统的即兴轨迹跟踪。逆向前馈方法常用于提高跟踪性能，但无法直接应用于非最小相位系统，因为其固有不稳定。为解决此问题，现有方法假设系统模型已知，并使用预动作或逆向近似技术。本文提出了一种从输入输出数据直接学习稳定近似逆向的方法。通过理论讨论、模拟和两种不同平台的实验，展示了所提方法的稳定性及其在高精度即兴跟踪中的有效性。此外，本文还表明，在训练中包含更多信息，尽管通常被认为有用，但未必能提高性能，反而可能引发不稳定性并影响整体方法的效果。

英文摘要

This paper presents a learning-based approach for impromptu trajectory tracking for non-minimum phase systems, i.e., systems with unstable inverse dynamics. Inversion-based feedforward approaches are commonly used for improving tracking performance; however, these approaches are not directly applicable to non-minimum phase systems due to their inherent instability. In order to resolve the instability issue, existing methods have assumed that the system model is known and used pre-actuation or inverse approximation techniques. In this work, we propose an approach for learning a stable, approximate inverse of a non-minimum phase baseline system directly from its input-output data. Through theoretical discussions, simulations, and experiments on two different platforms, we show the stability of our proposed approach and its effectiveness for high-accuracy, impromptu tracking. Our approach also shows that including more information in the training, as is commonly assumed to be useful, does not lead to better performance but may trigger instability and impact the effectiveness of the overall approach.

URL PDF HTML ☆

赞 0 踩 0

1803.01579 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Motion and Cooperative Transportation Planning for Multi-Agent Systems under Temporal Logic Formulas

多智能体系统在时序逻辑公式下的运动与协作运输规划

Christos K. Verginis, Dimos V. Dimarogonas

发表机构 * KTH Center for Autonomous Systems（KTH 自主系统中心）

AI总结本文提出一种混合控制框架，用于在高阶目标表达为线性时序逻辑（LTL）公式的情况下，多智能体系统的运动规划。设计控制协议以实现智能体在预定义兴趣区域间的过渡及协作运输物体。通过抽象智能体与物体的耦合行为为有限状态转移系统，设计满足智能体和物体规格的高层多智能体计划。

Comments Submitted to IEEE Transactions on Automation Science and Engineering. arXiv admin note: text overlap with arXiv:1611.05186

1802.08327 2026-06-04 cs.SE cs.RO cs.SY eess.SY 版本更新

From Hazard Analysis to Hazard Mitigation Planning: The Automated Driving Case

从危险分析到危险缓解规划：自动驾驶案例

Mario Gleirscher, Stefan Kugele

AI总结本文提出一个框架，用于分析和设计能够实时识别和缓解危险的规划器，同时介绍构建规划模型的增量算法及基于给定控制系统架构设计故障容错控制器的示例应用。

详情

DOI: 10.1007/978-3-319-57288-8_23
Journal ref: Barrett C., Davies M., Kahsai T. (eds) NASA Formal Methods. LNCS, vol 10227. Springer

AI中文摘要

车辆安全依赖于识别的危险范围和缓解这些危险的运营情况，以降低风险。随着自动化程度的提高，风险责任可能增加，因此高自动化车辆必须配备经过验证的控制器，能够可靠地在所有可能的运营情况下识别和缓解危险。为此，可用的方法必须通过危险分析和缓解的模型来支持自动化车辆控制器的设计和验证。本文描述了（1）一个用于分析和设计规划器（即高层控制器）的框架，该规划器能够实时识别和缓解危险；（2）一个增量算法，用于从危险分析中构建规划模型；以及（3）一个基于给定控制系统架构设计故障容错控制器的示例应用。我们的方法使安全工程师能够（2a）详细阐述危险场景并（2b）设计缓解此类场景的操作策略。

英文摘要

Vehicle safety depends on (a) the range of identified hazards and (b) the operational situations for which mitigations of these hazards are acceptably decreasing risk. Moreover, with an increasing degree of autonomy, risk ownership is likely to increase for vendors towards regulatory certification. Hence, highly automated vehicles have to be equipped with verified controllers capable of reliably identifying and mitigating hazards in all possible operational situations. To this end, available methods for the design and verification of automated vehicle controllers have to be supported by models for hazard analysis and mitigation. In this paper, we describe (1) a framework for the analysis and design of planners (i.e., high-level controllers) capable of run-time hazard identification and mitigation, (2) an incremental algorithm for constructing planning models from hazard analysis, and (3) an exemplary application to the design of a fail-operational controller based on a given control system architecture. Our approach equips the safety engineer with concepts and steps to (2a) elaborate scenarios of endangerment and (2b) design operational strategies for mitigating such scenarios.

URL PDF HTML ☆

赞 0 踩 0

1802.07346 2026-06-04 cs.RO cs.SY eess.SP eess.SY stat.AP 版本更新

Cooperative Robot Localization Using Event-triggered Estimation

基于事件触发估计的协作机器人定位

Michael Ouimet, David Iglesias, Nisar Ahmed, Sonia Martinez

发表机构 * SPAWAR Systems Center Pacific（SPAWAR太平洋系统中心）； University of Colorado Boulder（科罗拉多大学博尔德分校）； University of California San Diego（加州大学圣地亚哥分校）

AI总结本文提出一种低通信开销的协作定位算法，通过事件触发机制仅在状态估计创新度高时发送测量，结合协方差交叠机制实现网络状态同步，实验验证了其在多种动态模型下的高效定位性能。

Comments Revised submission in review with AIAA Journal of Aerospace Information Systems (JAIS), submitted February 17, 2018

详情

AI中文摘要

本文描述了一种新颖的低通信开销协作定位算法，适用于移动无人机器人团队。利用基于事件的估计范式，机器人仅在状态估计创新度较高时向邻居发送测量。由于代理已知触发测量的条件，缺少的测量信息也被融合到状态估计中。机器人使用协方差交叠（CI）机制偶尔同步其对完整网络状态的局部估计。此外，启发式平衡动态确保在大直径网络中，局部误差协方差始终保持在预期范围内。线性和非线性动态/测量模型的仿真表明，事件触发方法在广泛的操作条件下实现了接近最优的状态估计性能，即使仅使用传统全数据共享所需通信开销的小部分。还检验了所提方法对丢包通信的鲁棒性以及网络拓扑与基于CI的同步需求之间的关系。

英文摘要

This paper describes a novel communication-spare cooperative localization algorithm for a team of mobile unmanned robotic vehicles. Exploiting an event-based estimation paradigm, robots only send measurements to neighbors when the expected innovation for state estimation is high. Since agents know the event-triggering condition for measurements to be sent, the lack of a measurement is thus also informative and fused into state estimates. The robots use a Covariance Intersection (CI) mechanism to occasionally synchronize their local estimates of the full network state. In addition, heuristic balancing dynamics on the robots' CI-triggering thresholds ensure that, in large diameter networks, the local error covariances remains below desired bounds across the network. Simulations on both linear and nonlinear dynamics/measurement models show that the event-triggering approach achieves nearly optimal state estimation performance in a wide range of operating conditions, even when using only a fraction of the communication cost required by conventional full data sharing. The robustness of the proposed approach to lossy communications, as well as the relationship between network topology and CI-based synchronization requirements, are also examined.

URL PDF HTML ☆

赞 0 踩 0

1802.07199 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Parametric MPC Approach to Balancing the Cost of Abstraction for Differential-Drive Mobile Robots

一种参数MPC方法用于平衡抽象成本的差分驱动移动机器人

Paul Glotfelter, Magnus Egerstedt

发表机构 * Georgia Institute of Technology（佐治亚理工学院）

AI总结本文提出了一种参数MPC方法，用于优化差分驱动移动机器人的抽象成本，通过分析精度和机动性成本，改进了单积分器模型的控制策略。

Comments ICRA 2018

详情

AI中文摘要

在为差分驱动移动机器人设计控制策略时，一种标准工具是考虑一条垂直于车轮轴线的固定距离的点，而不是车辆的完整姿态。这种抽象支持将非holonomic、三状态unicycle模型替换为更简单的两状态单积分器模型（即速度控制点）。然而，这种转换会带来性能代价，通过机器人的精度和机动性。本文推导了这些精度和机动性成本的表达式，以转换参数为变量。此外，这些成本表明，在应用过程中仅一次选择参数可能导致精度损失。模型预测控制（MPC）代表一种缓解此状况的方法。然而，MPC通常产生控制信号而不是参数，因此本文还提出了一种参数模型预测控制（PMPC）方法，用于参数和采样时间窗优化。实验结果展示了参数化对实际差分驱动移动机器人上开发的单积分器模型算法部署的影响。

英文摘要

When designing control strategies for differential-drive mobile robots, one standard tool is the consideration of a point at a fixed distance along a line orthogonal to the wheel axis instead of the full pose of the vehicle. This abstraction supports replacing the non-holonomic, three-state unicycle model with a much simpler two-state single-integrator model (i.e., a velocity-controlled point). Yet this transformation comes at a performance cost, through the robot's precision and maneuverability. This work contains derivations for expressions of these precision and maneuverability costs in terms of the transformation's parameters. Furthermore, these costs show that only selecting the parameter once over the course of an application may cause an undue loss of precision. Model Predictive Control (MPC) represents one such method to ameliorate this condition. However, MPC typically realizes a control signal, rather than a parameter, so this work also proposes a Parametric Model Predictive Control (PMPC) method for parameter and sampling horizon optimization. Experimental results are presented that demonstrate the effects of the parameterization on the deployment of algorithms developed for the single-integrator model on actual differential-drive mobile robots.

URL PDF HTML ☆

赞 0 踩 0

1702.03258 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Adaptive and Resilient Soft Tensegrity Robots

自适应且具有韧性的软 tensegrity 机器人

John Rieffel, Jean-Baptiste Mouret

发表机构 * Union College（联合学院）； Inria Nancy Grand - Est ； CNRS, Loria, UMR 7503（法国国家科学研究中心（CNRS）、洛里亚实验室（Loria）、UMR 7503）

AI总结本文提出一种易于组装的基于 tensegrity 的软机器人，能产生高动态运动步态，并在物理损伤下表现出结构和行为韧性，通过机器学习算法实现有效步态发现。

Comments video: https://youtu.be/SuLQDhrk9tQ

详情

AI中文摘要

生物体结合软（如肌肉）和硬（如骨骼）材料，赋予其内在灵活性和韧性，这在传统刚性机器人中往往缺失。新兴的软机器人领域试图利用这些特性创造韧性机器。然而，软材料的性质给设计、制造和控制带来了重大挑战，迄今为止，大多数软机器人的步态都是通过经验试错法手动设计的。本文描述了一种易于组装的基于 tensegrity 的软机器人，能够产生高度动态的运动步态，并在面对物理损伤时表现出结构和行为的韧性。使这一成果成为可能的是使用一种机器学习算法，能够以最少的物理试验发现有效的步态。这些结果进一步支持了软机器人方法，旨在利用复杂材料动力学的相互作用，以生成丰富的动态行为。

英文摘要

Living organisms intertwine soft (e.g., muscle) and hard (e.g., bones) materials, giving them an intrinsic flexibility and resiliency often lacking in conventional rigid robots. The emerging field of soft robotics seeks to harness these same properties in order to create resilient machines. The nature of soft materials, however, presents considerable challenges to aspects of design, construction, and control -- and up until now, the vast majority of gaits for soft robots have been hand-designed through empirical trial-and-error. This manuscript describes an easy-to-assemble tensegrity-based soft robot capable of highly dynamic locomotive gaits and demonstrating structural and behavioral resilience in the face of physical damage. Enabling this is the use of a machine learning algorithm able to discover effective gaits with a minimal number of physical trials. These results lend further credence to soft-robotic approaches that seek to harness the interaction of complex material dynamics in order to generate a wealth of dynamical behaviors.

URL PDF HTML ☆

赞 0 踩 0

1802.06314 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Autonomous Vehicle Speed Control for Safe Navigation of Occluded Pedestrian Crosswalk

自动驾驶车辆速度控制：安全通过遮挡人行横道

Sarah Thornton

发表机构 * Dynamic Design Lab（动态设计实验室）

AI总结本文提出基于部分可观测马尔可夫决策过程的速度控制方法，用于安全通过遮挡人行横道，通过动态规划计算控制策略以应对感知限制。

Comments 6 pages, 9 figures

1802.05730 2026-06-04 cs.RO cs.SY eess.SY 版本更新

关于可观测性格拉姆矩阵在部分观测机器人路径规划问题中的应用

Mohammadhussein Rafieisakhaei, Suman Chakravorty, P. R. Kumar

发表机构 * Department of Electrical and Computer Engineering（电气与计算机工程系）

AI总结本文探讨了利用可观测性格拉姆矩阵作为估计性能代理进行优化的局限性，指出其可能产生误导性的路径规划结果。

Comments 6 pages, 9 figures. CDC 2017

1609.09861 2026-06-04 eess.SY cs.RO cs.SY 版本更新

An Efficient Optimal Planning and Control Framework For Quadrupedal Locomotion

一种高效的四足机器人运动规划与控制框架

Farbod Farshidian, Michael Neunert, Alexander W. Winkler, Gonzalo Rey, Jonas Buchli

AI总结本文提出一种高效的动态规划框架，用于四足机器人的最优规划与控制，通过多级优化方法高效求解切换时间和连续控制输入，并采用连续时间约束LQR算法优化前馈与反馈控制器。

Comments 8 Pages

详情

DOI: 10.1109/ICRA.2017.7989016

AI中文摘要

本文提出了一种高效的动态规划框架，用于四足机器人最优规划与控制。首先将问题表述为切换系统的最优控制问题。然后提出多级优化方法，以求得最优切换时间和最优连续控制输入。通过此方案，分解优化可能比综合方法更高效。最后，我们提出一种连续时间约束LQR算法，同时优化前馈和反馈控制器，具有O(n)的时间复杂度。为了验证我们的方法，我们在四足机器人上展示了框架的性能。我们选择质心动力学和完整运动学公式作为切换系统模型，其中切换时间以及接触力和关节速度针对不同的运动任务（如跨越间隙、行走和小跑）进行优化。

英文摘要

In this paper, we present an efficient Dynamic Programing framework for optimal planning and control of legged robots. First we formulate this problem as an optimal control problem for switched systems. Then we propose a multi--level optimization approach to find the optimal switching times and the optimal continuous control inputs. Through this scheme, the decomposed optimization can potentially be done more efficiently than the combined approach. Finally, we present a continuous-time constrained LQR algorithm which simultaneously optimizes the feedforward and feedback controller with $O(n)$ time-complexity. In order to validate our approach, we show the performance of our framework on a quadrupedal robot. We choose the Center of Mass dynamics and the full kinematic formulation as the switched system model where the switching times as well as the contact forces and the joint velocities are optimized for different locomotion tasks such as gap crossing, walking and trotting.

URL PDF HTML ☆

赞 0 踩 0

1801.09361 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Safe and Efficient Intersection Control of Connected and Autonomous Intersection Traffic

连接与自动驾驶交叉口交通的安全高效交叉控制

Qiang Lu

发表机构 * Daniel Felix Ritchie School of Engineering and Computer Science（丹尼尔·费利克斯·里奇工程与计算机科学学院）

AI总结本文提出DICA算法，用于协调自动驾驶车辆在交叉口的安全高效通行，同时通过Reactive DICA算法优化紧急车辆通行优先级，确保应急车辆快速通过而最小化其他车辆的行程影响。

Comments 104 pages, 23 figures, PhD comprehensive thesis

详情

AI中文摘要

本论文针对自动驾驶和连接地面交通在交叉口的安全高效通行问题，提出了一种名为离散时间占用轨迹基于交叉口交通协调算法（DICA）的算法。所有系统中的车辆均为连接和自动驾驶车辆（CAVs），并具备无线车与交叉口通信能力。在所提出的框架中，交叉口基于车辆提出的DTOTs协调其运动，以高效通过交叉口并避免碰撞。在车辆DTOTs发生碰撞时，交叉口修改冲突的DTOTs以避免碰撞，并请求CAVs根据修改后的DTOTs接近和通过交叉口。我们证明了基本DICA是无死锁和无饥饿的，并且在计算复杂度上具有保守性，通过几种计算方法进行了改进。接着，本文还解决了通过自动驾驶和连接交叉口交通快速疏散紧急车辆的问题。所提出的交叉口控制算法Reactive DICA旨在确定一个高效的车辆通行序列，使紧急车辆能够尽快通过交叉口，同时最小化其他车辆的行程时间。当没有紧急车辆在交叉口区域内时，车辆由DICA控制。当有紧急车辆进入通信范围时，我们通过优化车辆的排序优先处理紧急车辆。提出了一种遗传算法来解决优化问题，找到最优的车辆序列，使紧急车辆获得最高优先级。

英文摘要

In this dissertation, we address a problem of safe and efficient intersection crossing traffic management of autonomous and connected ground traffic. Toward this objective, an algorithm that is called the Discrete-time occupancies trajectory based Intersection traffic Coordination Algorithm (DICA) is proposed. All vehicles in the system are Connected and Autonomous Vehicles (CAVs) and capable of wireless Vehicle-to-Intersection communication. In the proposed framework, an intersection coordinates the motions of CAVs based on their proposed DTOTs to let them cross the intersection efficiently while avoiding collisions. In case when there is a collision between vehicles' DTOTs, the intersection modifies conflicting DTOTs to avoid the collision and requests CAVs to approach and cross the intersection according to the modified DTOTs. We then prove that the basic DICA is deadlock free and also starvation free. We also show that the basic DICA is conservative in computational complexity and improve it by several computational approaches. Next, we addressed the problem of evacuating emergency vehicles as quickly as possible through autonomous and connected intersection traffic in this dissertation. The proposed intersection control algorithm Reactive DICA aims to determine an efficient vehicle-passing sequence which allows the emergency vehicle to cross an intersection as soon as possible while the travel times of other vehicles are minimally affected. When there are no emergency vehicles within the intersection area, the vehicles are controlled by DICA. When there are emergency vehicles entering communication range, we prioritize emergency vehicles through optimal ordering of vehicles. A genetic algorithm is proposed to solve the optimization problem which finds the optimal vehicle sequence that gives the emergency vehicles the highest priority.

URL PDF HTML ☆

赞 0 踩 0

1801.07132 2026-06-04 cs.CR cs.RO cs.SY eess.SY 版本更新

SecSens: Secure State Estimation with Application to Localization and Time Synchronization

SecSens: 带有定位和时间同步应用的安全状态估计

Amr Alanwar, Bernhard Etzlinger, Henrique Ferraz, Joao Hespanha, Mani Srivastava

发表机构 * University of California, Los Angeles（加州大学洛杉矶分校）； University of California, Santa Barbara（加州大学圣芭芭拉分校）； Johannes Kepler University（约翰·凯撒大学）

AI总结本文提出SecSens，一种用于对抗模型和测量噪声的新型安全非线性状态估计方法，通过SecEKF和SecOPT算法实现安全状态估计，无需专用硬件或加密技术，在强攻击下性能优于现有方案。

详情

AI中文摘要

针对网络物理系统中信息技术和物理传感耦合带来的安全漏洞，本文提出SecSens，一种基于扩展卡尔曼滤波和最大似然估计的新型安全非线性状态估计方法。SecSens包含SecEKF和SecOPT两个独立算法，通过整体方法引入安全意识，无需专用硬件或加密技术。本文将SecSens应用于网络化移动设备的安全定位和时间同步，实验表明其在强攻击下性能优于现有方案。

英文摘要

Research evidence in Cyber-Physical Systems (CPS) shows that the introduced tight coupling of information technology with physical sensing and actuation leads to more vulnerability and security weaknesses. But, the traditional security protection mechanisms of CPS focus on data encryption while neglecting the sensors which are vulnerable to attacks in the physical domain. Accordingly, researchers attach utmost importance to the problem of state estimation in the presence of sensor attacks. In this work, we present SecSens, a novel approach for secure nonlinear state estimation in the presence of modeling and measurement noise. SecSens consists of two independent algorithms, namely, SecEKF and SecOPT, which are based on Extended Kalman Filter and Maximum Likelihood Estimation, respectively. We adopt a holistic approach to introduce security awareness among state estimation algorithms without requiring specialized hardware, or cryptographic techniques. We apply SecSens to securely localize and time synchronize networked mobile devices. SecSens provides good performance at run-time several order of magnitude faster than the state of art solutions under the presence of powerful attacks. Our algorithms are evaluated on a testbed with static nodes and a mobile quadrotor all equipped with commercial ultra-wide band wireless devices.

URL PDF HTML ☆

赞 0 踩 0

1801.04816 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Localizability-Constrained Deployment of Mobile Robotic Networks with Noisy Range Measurements

具有噪声测距的移动机器人网络的局部化约束部署

Jerome Le Ny, Simon Chauvière

发表机构 * Polytechnique Montreal（蒙特利尔理工学院）； GERAD

AI总结本文研究了在噪声测距下如何部署移动机器人网络以实现高精度定位，通过局部化函数优化网络几何结构，结合梯度下降法进行分布式控制。

Comments 7 pages, 3 figures

详情

AI中文摘要

当移动网络中的节点使用相对于邻居的噪声测距来估计位置时，整体连通性和测量网络的几何结构对可实现的定位精度有关键影响。本文考虑了部署一个仅基于测距的协作定位移动机器人网络的问题，旨在维持有利于高精度估计机器人位置的网络几何结构。网络几何的质量通过一个称为“局部化性”的函数来衡量，该函数作为机器人运动规划的潜在场。该函数基于Cramér-Rao界，为给定的几何结构提供任何无偏位置估计器的协方差矩阵的下界。我们描述了基于梯度下降的机器人运动规划器，试图优化或约束网络局部化函数的不同变体，并讨论了以分布式方式实现这些控制器的方法。最后，本文还建立了统计观点与维护捕捉相对测距的图的加权刚性形式之间的正式联系。

英文摘要

When nodes in a mobile network use relative noisy measurements with respect to their neighbors to estimate their positions, the overall connectivity and geometry of the measurement network has a critical influence on the achievable localization accuracy. This paper considers the problem of deploying a mobile robotic network implementing a cooperative localization scheme based on range measurements only, while attempting to maintain a network geometry that is favorable to estimating the robots' positions with high accuracy. The quality of the network geometry is measured by a "localizability" function serving as potential field for robot motion planning. This function is built from the Cramér-Rao bound, which provides for a given geometry a lower bound on the covariance matrix achievable by any unbiased position estimator that the robots might implement using their relative measurements. We describe gradient descent-based motion planners for the robots that attempt to optimize or constrain different variations of the network's localizability function, and discuss ways of implementing these controllers in a distributed manner. Finally, the paper also establishes formal connections between our statistical point of view and maintaining a form of weighted rigidity for the graph capturing the relative range measurements.

URL PDF HTML ☆

赞 0 踩 0

1611.04706 2026-06-04 cs.RO cs.SY eess.SY 版本更新

High-Dimensional Stochastic Optimal Control using Continuous Tensor Decompositions

高维随机最优控制的连续张量分解应用

Alex A. Gorodetsky, Sertac Karaman, Youssef M. Marzouk

AI总结本文提出基于连续张量分解的动态规划算法，解决高维状态空间中的随机最优控制问题，通过压缩表示实现多项式时间复杂度，显著提升计算效率。

Comments 32 pages, 20 figures

详情

AI中文摘要

运动规划和控制问题在几乎所有机器人应用中都是嵌入和关键的。这些问题通常被公式化为随机最优控制问题，并通过动态规划算法解决。不幸的是，大多数保证收敛到最优解的现有算法受到维度诅咒的影响：算法的运行时间随着系统状态空间维度的增长呈指数级增长。我们提出新的动态规划算法，以减轻在具有某些低秩结构的问题中的维度诅咒。所提出的算法基于最近由作者开发的连续张量分解。本质上，这些算法以压缩格式表示高维函数（例如价值函数），并在这种格式中直接执行动态规划计算（例如价值迭代、策略迭代）。在某些技术假设下，新算法保证能够以任意精度收敛到最优解。此外，新算法的运行时间与状态维度和价值函数的秩呈多项式增长。这种方法在

英文摘要

Motion planning and control problems are embedded and essential in almost all robotics applications. These problems are often formulated as stochastic optimal control problems and solved using dynamic programming algorithms. Unfortunately, most existing algorithms that guarantee convergence to optimal solutions suffer from the curse of dimensionality: the run time of the algorithm grows exponentially with the dimension of the state space of the system. We propose novel dynamic programming algorithms that alleviate the curse of dimensionality in problems that exhibit certain low-rank structure. The proposed algorithms are based on continuous tensor decompositions recently developed by the authors. Essentially, the algorithms represent high-dimensional functions (e.g., the value function) in a compressed format, and directly perform dynamic programming computations (e.g., value iteration, policy iteration) in this format. Under certain technical assumptions, the new algorithms guarantee convergence towards optimal solutions with arbitrary precision. Furthermore, the run times of the new algorithms scale polynomially with the state dimension and polynomially with the ranks of the value function. This approach realizes substantial computational savings in "compressible" problem instances, where value functions admit low-rank approximations. We demonstrate the new algorithms in a wide range of problems, including a simulated six-dimensional agile quadcopter maneuvering example and a seven-dimensional aircraft perching example. In some of these examples, we estimate computational savings of up to ten orders of magnitude over standard value iteration algorithms. We further demonstrate the algorithms running in real time on board a quadcopter during a flight experiment under motion capture.

URL PDF HTML ☆

赞 0 踩 0

1501.05151 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Recursive Bayesian Filtering in Circular State Spaces

循环状态空间中的递归贝叶斯滤波

Gerhard Kurz, Igor Gilitschenski, Uwe D. Hanebeck

发表机构 * Intelligent Sensor-Actuator-Systems Laboratory (ISAS)（智能传感器-执行器系统实验室）； Institute for Anthropomatics and Robotics（人机学与机器人研究所）； Karlsruhe Institute of Technology (KIT), Germany（卡尔斯鲁厄理工学院（KIT），德国）

AI总结本文提出了一种基于循环统计的递归滤波框架，利用循环分布（如环绕正态分布和von Mises分布）估计循环状态，并通过高效确定性采样技术处理非线性系统和测量函数，引入了分布无关预测算法和改进的环绕正态密度乘法公式。

详情

DOI: 10.1109/MAES.2016.150083

AI中文摘要

针对基于循环统计的递归循环滤波，我们介绍了一个通用框架，用于基于不同循环分布（特别是环绕正态分布和von Mises分布）估计循环状态。我们提出了一种用于具有非线性系统和测量函数的循环系统的估计方法，通过依赖高效的确定性采样技术实现。此外，我们展示了在多种重要特殊情况下计算如何简化，例如具有加性噪声的系统以及身份系统或测量函数。我们引入了几个新的关键组件，特别是分布无关的预测算法、新的和更优的环绕正态密度乘法公式，以及处理非加性系统噪声的能力。所有提出的方法都经过彻底评估，并与几种最先进的解决方案进行了比较。

英文摘要

For recursive circular filtering based on circular statistics, we introduce a general framework for estimation of a circular state based on different circular distributions, specifically the wrapped normal distribution and the von Mises distribution. We propose an estimation method for circular systems with nonlinear system and measurement functions. This is achieved by relying on efficient deterministic sampling techniques. Furthermore, we show how the calculations can be simplified in a variety of important special cases, such as systems with additive noise as well as identity system or measurement functions. We introduce several novel key components, particularly a distribution-free prediction algorithm, a new and superior formula for the multiplication of wrapped normal densities, and the ability to deal with non-additive system noise. All proposed methods are thoroughly evaluated and compared to several state-of-the-art solutions.

URL PDF HTML ☆

赞 0 踩 0

1605.06645 2026-06-04 math.OC cs.RO cs.SY eess.SY math.DS 版本更新

Full-Pose Tracking Control for Aerial Robotic Systems with Laterally-Bounded Input Force

具有横向受限输入力的空中机器人系统的全姿态跟踪控制

Antonio Franchi, Ruggero Carli, Davide Bicego, Markus Ryll

发表机构 * Department of Information Engineering, University of Padova（意大利帕多瓦大学信息工程系）

AI总结本文提出了一种新的几何控制策略，用于实现具有横向受限输入力的空中机器人系统在SE(3)中的全姿态轨迹跟踪，通过Lyapunov方法证明了可行全姿态参考轨迹的指数跟踪，并展示了该方法在欠驱动和全驱动平台中的应用效果。

详情

DOI: 10.1109/TRO.2017.2786734
Journal ref: IEEE Trascation on Robotics, 2018

AI中文摘要

本文定义了一类通用的抽象空中机器人系统，称为横向受限力（LBF）车辆，其中大部分控制权沿主推力方向表达，而在横向方向上可能利用较小甚至为零的力来实现全姿态跟踪。此类系统能够很好地近似具有非共面/非共线旋翼的平台，这些平台可以使用倾斜螺旋桨略微改变总推力相对于机体框架的方向。对于此类广泛系统，我们引入了一种新的几何控制策略，以在力约束允许的情况下实现位置加姿态轨迹的独立跟踪。使用SE(3)中的Lyapunov技术证明了可行的全姿态参考轨迹的指数跟踪。该方法可以无缝处理欠驱动和全驱动的LBF平台。控制器在提供不可行的全姿态参考轨迹时，至少保证位置部分的跟踪。本文提供了几个实验测试，清晰展示了该方法的实用性以及与现有方法相比的显著改进。

英文摘要

In this paper, we define a general class of abstract aerial robotic systems named Laterally Bounded Force (LBF) vehicles, in which most of the control authority is expressed along a principal thrust direction, while in the lateral directions a (smaller and possibly null) force may be exploited to achieve full-pose tracking. This class approximates well platforms endowed with non-coplanar/non-collinear rotors that can use the tilted propellers to slightly change the orientation of the total thrust w.r.t. the body frame. For this broad class of systems, we introduce a new geometric control strategy in SE(3) to achieve, whenever made possible by the force constraints, the independent tracking of position-plus-orientation trajectories. The exponential tracking of a feasible full-pose reference trajectory is proven using a Lyapunov technique in SE(3). The method can deal seamlessly with both under- and fully-actuated LBF platforms. The controller guarantees the tracking of at least the positional part in the case that an unfeasible full-pose reference trajectory is provided. The paper provides several experimental tests clearly showing the practicability of the approach and the sharp improvement with respect to state of-the-art approaches.

URL PDF HTML ☆

赞 0 踩 0

1610.06781 2026-06-04 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY 版本更新

Modular Deep Q Networks for Sim-to-real Transfer of Visuo-motor Policies

模块化深度Q网络用于视觉-运动策略的仿真到现实迁移

Fangyi Zhang, Jürgen Leitner, Michael Milford, Peter Corke

发表机构 * Australian Centre for Robotic Vision (ACRV)（澳大利亚机器人视觉中心）； Queensland University of Technology (QUT)（昆士兰理工大学）

AI总结本文提出模块化深度强化学习方法，通过在感知与控制之间引入瓶颈，实现仿真到现实的迁移，提升机器人视觉-运动协调能力。

Comments Australasian Conference on Robotics and Automation (ACRA) 2017, Student Paper Award Finalist

详情

Journal ref: The proceedings of the Australasian Conference on Robotics and Automation (ACRA) 2017

AI中文摘要

尽管深度学习在计算机视觉中因大量视觉数据而取得显著成功，但为机器人学习收集足够大的现实世界数据集成本较高。为提高这些技术在真实机器人上的实用性，我们提出了一种模块化深度强化学习方法，能够将仿真训练的模型迁移到现实世界机器人任务中。我们引入了感知与控制之间的瓶颈，使网络能够独立训练，然后在端到端方式下合并和微调，以进一步提高视觉-运动协调性。在经典的平面视觉引导机器人抓取任务中，微调后的准确度达到1.6像素，显著优于直接迁移（17.5像素），显示出在更复杂和广泛的应用中的潜力。我们的方法提供了一种更高效学习和迁移视觉-运动策略的技术，无需完全依赖大规模现实世界机器人数据集。

英文摘要

While deep learning has had significant successes in computer vision thanks to the abundance of visual data, collecting sufficiently large real-world datasets for robot learning can be costly. To increase the practicality of these techniques on real robots, we propose a modular deep reinforcement learning method capable of transferring models trained in simulation to a real-world robotic task. We introduce a bottleneck between perception and control, enabling the networks to be trained independently, but then merged and fine-tuned in an end-to-end manner to further improve hand-eye coordination. On a canonical, planar visually-guided robot reaching task a fine-tuned accuracy of 1.6 pixels is achieved, a significant improvement over naive transfer (17.5 pixels), showing the potential for more complicated and broader applications. Our method provides a technique for more efficient learning and transfer of visuo-motor policies for real robotic systems without relying entirely on large real-world robot datasets.

URL PDF HTML ☆

赞 0 踩 0

1603.07567 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Dynamics, Control, and Estimation for Aerial Robots Tethered by Cables or Bars

缆绳或刚性杆连接的空中机器人动力学、控制与估计

Marco Tognon, Antonio Franchi

AI总结研究缆绳或刚性杆连接的空中机器人动力学特性，提出基于加速度计和陀螺仪的非线性观测器和控制器，实现对系统状态的高精度估计与轨迹跟踪。

详情

DOI: 10.1109/TRO.2017.2677915
Journal ref: IEEE Transaction on Robotics Volume: 33, Issue: 4, Aug. 2017

AI中文摘要

我们考虑了通过被动缆绳或被动刚性连接器连接到地面的空中机器人控制问题。我们详细阐述了该非线性机械系统的特性，包括微分平坦性、可控性和可观测性。我们证明该机器人系统相对于两个输出对微分平坦：连接器的高度和车辆的姿态；连接器的高度和纵向连接力（如缆绳张力或杆压缩）。我们展示了仅使用机载加速度计和陀螺仪设计的近全球收敛非线性观测器，用于估计完整状态。我们还设计了两个近全球收敛的非线性控制器，以跟踪任何足够平滑的时间变化轨迹。最后，我们通过数值测试在远离名义条件下的鲁棒性：非线性交叉耦合效应、参数偏差、测量噪声和非理想执行器。

英文摘要

We consider the problem of controlling an aerial robot connected to the ground by a passive cable or a passive rigid link. We provide a thorough characterization of this nonlinear dynamical robotic system in terms of fundamental properties such as differential flatness, controllability, and observability. We prove that the robotic system is differentially flat with respect to two output pairs: elevation of the link and attitude of the vehicle; elevation of the link and longitudinal link force (e.g., cable tension, or bar compression). We show the design of an almost globally convergent nonlinear observer of the full state that resorts only to an onboard accelerometer and a gyroscope. We also design two almost globally convergent nonlinear controllers to track any sufficiently smooth time-varying trajectory of the two output pairs. Finally we numerically test the robustness of the proposed method in several far-from-nominal conditions: nonlinear cross-coupling effects, parameter deviations, measurements noise and non ideal actuators.

URL PDF HTML ☆

赞 0 踩 0

1712.06021 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Bendable Cuboid Robot Path Planning with Collision Avoidance using Generalized $L_p$ Norms

可弯曲立方体机器人路径规划与碰撞避免使用广义L_p范数

Nak-seung P. Hyun, Patricio A. Vela, Erik I. Verriest

发表机构 * Georgia Institute of Technology（佐治亚理工学院）

AI总结本文提出基于广义L_p范数的立方体机器人路径规划方法，通过隐式表面近似和双重优化问题解决碰撞避免问题，验证了方法的有效性。

Comments 12 pages, 6 figures

详情

AI中文摘要

本文考虑了刚性及可变形（可弯曲）立方体机器人的最优路径规划问题，通过提供基于广义L_p范数的解析安全约束来实现。对于常规立方体机器人，加权L_p范数的等高线生成其表面的隐式近似。对于可弯曲立方体机器人，极坐标中的加权L_p范数通过指定等高线隐式近似表面边界。环境中的障碍物体积被假设为加权L_p范数的子等高线近似。利用这些近似表面模型，最优安全路径规划问题被重新表述为一个两阶段优化问题，其中安全约束依赖于机器人上最接近障碍物的点。通过推导一组等式和不等式约束来替代最近点问题，从而为原始路径规划问题定义了额外的解析约束。将所有解析约束通过逻辑与操作结合，得到一个通用的最优安全路径规划问题。数值求解该问题涉及将其转换为非线性规划问题。对刚性和可弯曲立方体机器人进行了仿真验证。

英文摘要

Optimal path planning problems for rigid and deformable (bendable) cuboid robots are considered by providing an analytic safety constraint using generalized $L_p$ norms. For regular cuboid robots, level sets of weighted $L_p$ norms generate implicit approximations of their surfaces. For bendable cuboid robots a weighted $L_p$ norm in polar coordinates implicitly approximates the surface boundary through a specified level set. Obstacle volumes, in the environment to navigate within, are presumed to be approximately described as sub-level sets of weighted $L_p$ norms. Using these approximate surface models, the optimal safe path planning problem is reformulated as a two stage optimization problem, where the safety constraint depends on a point on the robot which is closest to the obstacle in the obstacle's distance metric. A set of equality and inequality constraints are derived to replace the closest point problem, which is then defines additional analytic constraints on the original path planning problem. Combining all the analytic constraints with logical AND operations leads to a general optimal safe path planning problem. Numerically solving the problem involve conversion to a nonlinear programing problem. Simulations for rigid and bendable cuboid robot verify the proposed method.

URL PDF HTML ☆

赞 0 踩 0

1711.11006 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

A Family of Iterative Gauss-Newton Shooting Methods for Nonlinear Optimal Control

非线性最优控制中的一种迭代高斯-牛顿射击方法族

Markus Giftthaler, Michael Neunert, Markus Stäuble, Jonas Buchli, Moritz Diehl

AI总结本文提出了一种非线性最优控制的迭代算法族，扩展了经典的iLQR算法，结合了简单初始化和闭环前向积分的优势，具有线性复杂度，适用于高维欠驱动机器人等模拟场景。

Comments 8 pages

1701.08735 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Real-Time Control for Autonomous Racing Based on Viability Theory

基于可行性理论的实时自主赛车控制

Alexander Liniger, John Lygeros

发表机构 * Automatic Control Laboratory, ETH Zurich（自动控制实验室，苏黎世联邦理工学院）

AI总结本文基于可行性理论生成有限步前瞻轨迹，结合低层模型预测控制器实现实时自主赛车，通过游戏理论方法改进可行性核计算，提升安全性并减少 lap 时间。

Comments 26 pages, 11 figures

详情

DOI: 10.1109/TCST.2017.2772903

AI中文摘要

本文研究了微型赛车的自主驾驶问题，利用可行性核生成递归可行的有限步前瞻轨迹，以最大化进度并保持在静态障碍物内。该方法结合低层模型预测控制器实现实时自主赛车。可行性核计算基于空间离散化，提出基于游戏理论的新型数值方案，特别是辨别核，以提高计算鲁棒性。结果显示，使用辨别核的保守近似方法在安全性上有所提升，但略微增加 lap 时间。

英文摘要

In this paper we consider autonomous driving of miniature race cars. The viability kernel is used to efficiently generate finite look-ahead trajectories that maximize progress while remaining recursively feasible with respect to static obstacles (e.g., stay inside the track). Together with a low-level model predictive controller, this method makes real-time autonomous racing possible. The viability kernel computation is based on space discretization. To make the calculation robust against discretization errors, we propose a novel numerical scheme based on game theoretical methods, in particular the discriminating kernel. We show that the resulting algorithm provides an inner approximation of the viability kernel and guarantees that, for all states in the cell surrounding a viable grid point, there exists a control that keeps the system within the kernel. The performance of the proposed control method is studied in simulation where we determine the effects of various design choices and parameters and in experiments on an autonomous racing set-up maintained at the Automatic Control Laboratory of ETH Zurich. Both simulation and experimental results suggest that the more conservative approximation using the discriminating kernel results in safer driving style at the cost of a small increase in lap time.

URL PDF HTML ☆

赞 0 踩 0

1712.02923 2026-06-04 cs.RO cs.SY eess.SY 版本更新

SPRK: A Low-Cost Stewart Platform For Motion Study In Surgical Robotics

SPRK：一种低成本的斯图尔特平台用于外科机器人中的运动研究

Vatsal Patel, Sanjay Krishnan, Aimee Goncalves, Ken Goldberg

发表机构 * UC Berkeley（加州大学伯克利分校）

AI总结本文提出了一种低成本的斯图尔特平台，用于模拟呼吸、心跳和蠕动运动，其成本仅为250美元，比商用平台低两个数量级，具备精确的位移和旋转能力。

详情

AI中文摘要

为了模拟由于呼吸、心跳或蠕动运动导致的身体器官运动，我们设计了一种低成本、微型化的SPRK（斯图尔特平台研究套件），用于翻译和旋转假体组织。该平台尺寸为20厘米x20厘米x10厘米，可适配达芬奇研究套件（DVRK）外科机器人的工作空间，并成本仅为250美元，比商用斯图尔特平台低两个数量级。该平台在x、y和z方向上的位移范围为±1.27厘米，并具有正弦运动和呼吸启发式运动模式。还设计了模块化平台支架用于图案切割和去痂实验。该平台的定位控制器具有0.2秒的时间常数，其均方误差在x、y和z方向分别为1.22毫米、1.07毫米和0.20毫米。所有细节、CAD模型和控制软件均可在github.com/BerkeleyAutomation/sprk上找到。

英文摘要

To simulate body organ motion due to breathing, heart beats, or peristaltic movements, we designed a low-cost, miniaturized SPRK (Stewart Platform Research Kit) to translate and rotate phantom tissue. This platform is 20cm x 20cm x 10cm to fit in the workspace of a da Vinci Research Kit (DVRK) surgical robot and costs $250, two orders of magnitude less than a commercial Stewart platform. The platform has a range of motion of +/- 1.27 cm in translation along x, y, and z directions and has motion modes for sinusoidal motion and breathing-inspired motion. Modular platform mounts were also designed for pattern cutting and debridement experiments. The platform's positional controller has a time-constant of 0.2 seconds and the root-mean-square error is 1.22 mm, 1.07 mm, and 0.20 mm in x, y, and z directions respectively. All the details, CAD models, and control software for the platform is available at github.com/BerkeleyAutomation/sprk.

URL PDF HTML ☆

赞 0 踩 0

1712.02917 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Using Intermittent Synchronization to Compensate for Rhythmic Body Motion During Autonomous Surgical Cutting and Debridement

利用间歇同步来补偿自主手术切割和清创中的节律性身体运动

Vatsal Patel, Sanjay Krishnan, Aimee Goncalves, Carolyn Chen, Walter Doug Boyd, Ken Goldberg

AI总结本文提出利用间歇同步技术来补偿手术中因呼吸、心跳和节律性运动导致的不稳定性，通过实验验证该方法在切割和清创任务中具有更高的鲁棒性和精度。

1712.00634 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY math.OC 版本更新

PFAx: Predictable Feature Analysis to Perform Control

PFAx：可预测特征分析用于控制

Stefan Richthofer, Laurenz Wiskott

AI总结 PFAx通过整合补充信息提升预测性能，并透明展示补充信息对特征选择的影响，应用于强化学习环境中的智能体控制优化。

详情

AI中文摘要

可预测特征分析（PFA）（Richthofer, Wiskott, ICMLA 2015）是一种对高维输入信号进行降维的算法，提取最可预测的子信号。本文扩展了PFA，考虑补充信息以提高预测。补充信息不参与特征提取，特征仅从主输入中提取。PFAx透明地展示补充信息如何提升预测质量，并可生成补充信息以实现主信号的特定目标。该方法应用于强化学习环境，使智能体局部优化状态，接近目标。后续论文将扩展此方法以实现全局优化。

英文摘要

Predictable Feature Analysis (PFA) (Richthofer, Wiskott, ICMLA 2015) is an algorithm that performs dimensionality reduction on high dimensional input signal. It extracts those subsignals that are most predictable according to a certain prediction model. We refer to these extracted signals as predictable features. In this work we extend the notion of PFA to take supplementary information into account for improving its predictions. Such information can be a multidimensional signal like the main input to PFA, but is regarded external. That means it won't participate in the feature extraction - no features get extracted or composed of it. Features will be exclusively extracted from the main input such that they are most predictable based on themselves and the supplementary information. We refer to this enhanced PFA as PFAx (PFA extended). Even more important than improving prediction quality is to observe the effect of supplementary information on feature selection. PFAx transparently provides insight how the supplementary information adds to prediction quality and whether it is valuable at all. Finally we show how to invert that relation and can generate the supplementary information such that it would yield a certain desired outcome of the main signal. We apply this to a setting inspired by reinforcement learning and let the algorithm learn how to control an agent in an environment. With this method it is feasible to locally optimize the agent's state, i.e. reach a certain goal that is near enough. We are preparing a follow-up paper that extends this method such that also global optimization is feasible.

URL PDF HTML ☆

赞 0 踩 0

1712.00390 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Gain Scheduling LPV Control Scheme for the Autonomous Guidance Problem using a Dynamic Modelling Approach

基于动态建模方法的自主引导问题的增益调度LPV控制方案

Eugenio Alcalá, Vicenç Puig, Joseba Quevedo, Teresa Escobet

发表机构 * 1 Advanced Control Systems Group, Automatic Control Department, Universitat Polit\` e cnica de Catalunya (UPC), Rambla Sant Nebridi, 10, 08222, Terrassa, Spain 11pt 13.2pt This paper is a preprint of a paper submitted to IET Control Theory \& Applications. If accepted, copy of record will be available at the IET Digital Library.

AI总结本文提出了一种用于城市自动驾驶车辆纵向和横向控制的增益调度LPV控制方法，通过动力学和运动学模型构建LPV表示，并采用级联控制方法同时控制车辆行为，通过解决两个LPV LMI-LQR问题实现性能优化。

1711.11018 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

PDE-Based Optimization for Stochastic Mapping and Coverage Strategies using Robotic Ensembles

基于偏微分方程的机器人群体映射与覆盖策略优化

Karthik Elamvazhuthi, Hendrik Kuiper, Spring Berman

发表机构 * School for Engineering of Matter, Transport and Energy, Arizona State University（物质、传输与能量工程学院，亚利桑那州立大学）； School of Mathematical and Statistical Sciences, Arizona State University（数学与统计科学学院，亚利桑那州立大学）

AI总结本文提出基于偏微分方程的机器人群体控制框架，用于解决具有随机行为的机器人在有限感知与驱动能力下的映射与覆盖任务，通过凸优化和最优控制方法实现区域空间分布的重建与活动率调控。

详情

AI中文摘要

本文提出了一种基于偏微分方程（PDE）的新型框架，用于控制具有有限感知和驱动能力且表现出随机行为的机器人群体，以执行映射和覆盖任务。我们将机器人群体的动态建模为一个输运-扩散-反应PDE模型，并将映射和覆盖任务建模为该模型的识别与控制问题。在映射任务中，机器人被部署在封闭域内以收集无定位信息的数据，用于重建目标区域的未知空间分布。我们将该任务建模为一个凸优化问题，其解表示为PDE模型中的空间依赖系数。然后考虑覆盖问题，其中机器人必须以可编程的概率率执行预期活动以实现目标区域的活动分布。我们将该任务建模为一个最优控制问题，其中PDE模型被表示为双线性控制系统，机器人覆盖活动率和速度场定义为控制输入。我们通过在两个环境中进行联合映射与覆盖场景的模拟来验证我们的方法。

英文摘要

This paper presents a novel partial differential equation (PDE)-based framework for controlling an ensemble of robots, which have limited sensing and actuation capabilities and exhibit stochastic behaviors, to perform mapping and coverage tasks. We model the ensemble population dynamics as an advection-diffusion-reaction PDE model and formulate the mapping and coverage tasks as identification and control problems for this model. In the mapping task, robots are deployed over a closed domain to gather data, which is unlocalized and independent of robot identities, for reconstructing the unknown spatial distribution of a region of interest. We frame this task as a convex optimization problem whose solution represents the region as a spatially-dependent coefficient in the PDE model. We then consider a coverage problem in which the robots must perform a desired activity at a programmable probability rate to achieve a target spatial distribution of activity over the reconstructed region of interest. We formulate this task as an optimal control problem in which the PDE model is expressed as a bilinear control system, with the robots' coverage activity rate and velocity field defined as the control inputs. We validate our approach with simulations of a combined mapping and coverage scenario in two environments with three target coverage distributions.

URL PDF HTML ☆

赞 0 踩 0

1711.07300 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Optimization-Based Autonomous Racing of 1:43 Scale RC Cars

基于优化的1:43比例遥控赛车自主赛车

Alexander Liniger, Alexander Domahidi, Manfred Morari

AI总结本文基于数学优化实现遥控赛车自主赛车，通过动态模型和递推控制策略，结合路径规划和非线性模型预测控制，解决赛道跟踪与对手避让问题，实验验证了优化方法在实时性和性能上的有效性。

详情

DOI: 10.1002/oca.2123
Journal ref: Optimal Control Applications and Methods, 36(5), 2015, pp.628-647

AI中文摘要

本文描述了基于数学优化的遥控赛车自主赛车方法。利用车辆动态模型，通过递推控制计算控制输入，目标是最大化赛道进展，同时确保车辆在赛道上并避开对手。提出了两种不同的控制方法：第一种采用两级结构，包括路径规划器和非线性模型预测控制器（NMPC）用于跟踪；第二种将两者结合为一个非线性优化问题（NLP），借鉴轮廓控制思想。通过线性化获得线性时变模型，构建局部控制NLP的凸二次规划（QP）形式。所得到的QP具有典型的MPC结构，可通过最新结构利用求解器在毫秒级内解决，是整体控制方案实时可行性的关键。通过动态规划的高阶走廊规划器实现障碍物避让，根据对手当前位置和赛道布局生成凸约束。通过实验使用1:43比例遥控赛车，在超过3 m/s的速度和饱和后轮力矩（漂移）操作区域进行测试。算法在嵌入式计算平台上以50 Hz采样率运行，验证了基于优化的自主赛车方法在实时性和性能上的有效性。

英文摘要

This paper describes autonomous racing of RC race cars based on mathematical optimization. Using a dynamical model of the vehicle, control inputs are computed by receding horizon based controllers, where the objective is to maximize progress on the track subject to the requirement of staying on the track and avoiding opponents. Two different control formulations are presented. The first controller employs a two-level structure, consisting of a path planner and a nonlinear model predictive controller (NMPC) for tracking. The second controller combines both tasks in one nonlinear optimization problem (NLP) following the ideas of contouring control. Linear time varying models obtained by linearization are used to build local approximations of the control NLPs in the form of convex quadratic programs (QPs) at each sampling time. The resulting QPs have a typical MPC structure and can be solved in the range of milliseconds by recent structure exploiting solvers, which is key to the real-time feasibility of the overall control scheme. Obstacle avoidance is incorporated by means of a high-level corridor planner based on dynamic programming, which generates convex constraints for the controllers according to the current position of opponents and the track layout. The control performance is investigated experimentally using 1:43 scale RC race cars, driven at speeds of more than 3 m/s and in operating regions with saturated rear tire forces (drifting). The algorithms run at 50 Hz sampling rate on embedded computing platforms, demonstrating the real-time feasibility and high performance of optimization-based approaches for autonomous racing.

URL PDF HTML ☆

赞 0 踩 0

1702.05116 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Collective Motion under Beacon-referenced Cyclic Pursuit

基于信标参考的循环追捕运动

Kevin S. Galloway, Biswadip Dey

AI总结本文提出基于信标参考的循环追捕法则，通过引入静止信标作为额外参考，研究集体机器人在循环框架下的环形运动均衡及稳定性，推导了参数选择的必要条件，并探讨了保持集体纯形状的不变流形。

详情

AI中文摘要

循环追捕框架基于环形图中邻近代理之间的追捕交互，为自主机器人集体产生有用全局行为提供了有效方法。先前工作考虑了具有恒定方位（CB）追捕法则的循环追捕，并展示了相应动力学的环形均衡存在性。本文提出一种基于信标参考的CB追捕法则，其中静止信标为集体中的个体代理提供额外参考。在循环框架中实施时，我们显示所得到的动力学允许对应于围绕信标环形轨道的相对均衡，环形半径和代理沿轨道的分布由所提追捕法则的参数决定。我们还推导了环形均衡稳定性的必要条件，为参数选择提供了指导。最后，通过引入变量变换，我们证明了与围绕信标螺旋运动相关的不变流形家族的存在，这些流形保持集体的“纯形状”，并研究了代表性流形上的简化动力学。

英文摘要

Cyclic pursuit frameworks, which are built upon pursuit interactions between neighboring agents in a cycle graph, provide an efficient way to create useful global behaviors in a collective of autonomous robots. Previous work had considered cyclic pursuit with a constant bearing (CB) pursuit law, and demonstrated the existence of circling equilibria for the corresponding dynamics. In this work, we propose a beacon-referenced version of the CB pursuit law, wherein a stationary beacon provides an additional reference for the individual agents in a collective. When implemented in a cyclic framework, we show that the resulting dynamics admit relative equilibria corresponding to a circling orbit around the beacon, with the circling radius and the distribution of agents along the orbit determined by parameters of the proposed pursuit law. We also derive necessary conditions for stability of the circling equilibria, which provides a guide for parameter selection. Finally, by introducing a change of variables, we demonstrate the existence of a family of invariant manifolds related to spiraling motions around the beacon which preserve the "pure shape" of the collective, and study the reduced dynamics on a representative manifold.

URL PDF HTML ☆

赞 0 踩 0

1711.04683 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Tensor Decompositions for Modeling Inverse Dynamics

张量分解用于逆动力学建模

Stephan Baier, Volker Tresp

AI总结本文提出利用张量分解方法建模逆动力学，通过处理位置、速度和加速度的三重交互，实现对高非线性函数的近似，并在SARCOS机械臂数据集上验证了其优越性。

详情

AI中文摘要

建模逆动力学对于精确的前馈机器人控制至关重要。该模型计算所需的关节扭矩，以执行预期的运动。高度非线性的动态系统逆函数可以通过回归技术近似。我们提出了一种回归方法，即利用位置x速度x加速度的三重交互的张量分解模型。大多数张量分解工作都解决了密集张量的分解问题。本文在稀疏张量的分解基础上进行扩展，仅包含少量非零条目。稀疏张量的分解已成功应用于关系学习，例如大规模知识图谱的建模。最近，该方法已扩展到多类分类问题，涉及离散输入变量。在高维稀疏张量中表示数据可以近似复杂的高非线性函数。本文展示了稀疏张量分解如何应用于回归问题。此外，我们通过学习从连续输入到张量分解的潜在表示的映射，利用基函数将方法扩展到连续输入。我们在具有七自由度SARCOS机械臂轨迹的数据集上评估了所提出的模型。实验结果表明，所提出的功能张量模型相比挑战性的最新方法具有优越的性能。

英文摘要

Modeling inverse dynamics is crucial for accurate feedforward robot control. The model computes the necessary joint torques, to perform a desired movement. The highly non-linear inverse function of the dynamical system can be approximated using regression techniques. We propose as regression method a tensor decomposition model that exploits the inherent three-way interaction of positions x velocities x accelerations. Most work in tensor factorization has addressed the decomposition of dense tensors. In this paper, we build upon the decomposition of sparse tensors, with only small amounts of nonzero entries. The decomposition of sparse tensors has successfully been used in relational learning, e.g., the modeling of large knowledge graphs. Recently, the approach has been extended to multi-class classification with discrete input variables. Representing the data in high dimensional sparse tensors enables the approximation of complex highly non-linear functions. In this paper we show how the decomposition of sparse tensors can be applied to regression problems. Furthermore, we extend the method to continuous inputs, by learning a mapping from the continuous inputs to the latent representations of the tensor decomposition, using basis functions. We evaluate our proposed model on a dataset with trajectories from a seven degrees of freedom SARCOS robot arm. Our experimental results show superior performance of the proposed functional tensor model, compared to challenging state-of-the art methods.

URL PDF HTML ☆

赞 0 踩 0

1705.01426 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Nonlinear Model Predictive Control Scheme for Cooperative Manipulation with Singularity and Collision Avoidance

一种用于协同操作的非线性模型预测控制方案，包含奇异性和避障

Alexandros Nikou, Christos Verginis, Shahab Heshmati-alamdari, Dimos V. Dimarogonas

AI总结本文提出了一种非线性模型预测控制方案，用于在存在障碍物的有限工作空间中安全导航被N个机器人代理抓取的物体，同时避免碰撞和奇异配置。

Comments Simulation results with 3 agents added

1602.06667 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

A Motion Planning Strategy for the Active Vision-Based Mapping of Ground-Level Structures

一种用于主动视觉建图的地面结构运动规划策略

Manikandasriram Srinivasan Ramanagopal, André Phu-Van Nguyen, Jerome Le Ny

AI总结本文提出了一种指导配备摄像头或深度传感器的地面机器人自主建图有限三维结构可见部分的策略，通过运动规划算法确定合适视角并自动填补点云中的空洞，适用于建筑、施工和检测领域。

Comments Accepted for publication in IEEE Transactions on Automation Science and Engineering. Available in IEEE Xplore at http://ieeexplore.ieee.org/document/8093664

详情

DOI: 10.1109/TASE.2017.2762088

AI中文摘要

本文提出了一种策略，用于指导配备摄像头或深度传感器的地面机器人，以自主建图有限三维结构的可见部分。我们描述了确定合适连续视角的运动规划算法，并尝试自动填补由感知和感知层产生的点云中的空洞。重点是准确重建中等大小结构的3D模型，而非映射大型开放环境。所提出的算法不需要以网格模型或包围盒形式的初始化，生成的路径适用于视觉传感器同时用于建图和机器人局部化的情况，特别是在没有额外绝对定位系统时。我们分析了我们的策略的覆盖性质，并将其性能与经典前沿探索算法进行比较。我们展示了其在不同结构大小、局部化精度水平和深度传感器范围下的有效性，并在真实世界实验中验证了我们的设计。

英文摘要

This paper presents a strategy to guide a mobile ground robot equipped with a camera or depth sensor, in order to autonomously map the visible part of a bounded three-dimensional structure. We describe motion planning algorithms that determine appropriate successive viewpoints and attempt to fill holes automatically in a point cloud produced by the sensing and perception layer. The emphasis is on accurately reconstructing a 3D model of a structure of moderate size rather than mapping large open environments, with applications for example in architecture, construction and inspection. The proposed algorithms do not require any initialization in the form of a mesh model or a bounding box, and the paths generated are well adapted to situations where the vision sensor is used simultaneously for mapping and for localizing the robot, in the absence of additional absolute positioning system. We analyze the coverage properties of our policy, and compare its performance to the classic frontier based exploration algorithm. We illustrate its efficacy for different structure sizes, levels of localization accuracy and range of the depth sensor, and validate our design on a real-world experiment.

URL PDF HTML ☆

赞 0 踩 0

1711.03906 2026-06-04 cs.LG cs.DC cs.NI cs.RO cs.SY eess.SY 版本更新

D-SLATS: Distributed Simultaneous Localization and Time Synchronization

D-SLATS：分布式的同时定位与时间同步

Amr Alanwar, Henrique Ferraz, Kevin Hsieh, Rohit Thazhath, Paul Martin, Joao Hespanha, Mani Srivastava

AI总结本文提出D-SLATS框架，通过分布式扩展卡尔曼滤波和优化技术联合解决时间同步与定位问题，实现3微秒精度和30厘米误差。

详情

DOI: 10.1145/3084041.3084049

AI中文摘要

通过过去十年，我们见证了物联网（IoT）设备数量的激增，随之而来的是一次对时间和空间上协同行动的更大需求。尽管时间同步和定位这两个问题在许多方面有共同点，但传统上它们被分别处理或在集中式方法中结合，导致资源利用效率低下，或在设备数量方面不可扩展的解决方案。因此，我们提出D-SLATS，一个由三种不同且独立算法组成的框架，以分布式方式联合解决时间和定位问题。前两个算法主要基于分布式扩展卡尔曼滤波（EKF），而第三个算法使用优化技术。不需要融合中心，设备仅与邻居通信。所提出的方法在定制的超宽带通信测试平台和四旋翼无人机上进行了评估，代表了静态和移动节点的网络。我们的算法实现了高达三微秒的时间同步精度和30厘米的定位误差。

英文摘要

Through the last decade, we have witnessed a surge of Internet of Things (IoT) devices, and with that a greater need to choreograph their actions across both time and space. Although these two problems, namely time synchronization and localization, share many aspects in common, they are traditionally treated separately or combined on centralized approaches that results in an ineffcient use of resources, or in solutions that are not scalable in terms of the number of IoT devices. Therefore, we propose D-SLATS, a framework comprised of three different and independent algorithms to jointly solve time synchronization and localization problems in a distributed fashion. The First two algorithms are based mainly on the distributed Extended Kalman Filter (EKF) whereas the third one uses optimization techniques. No fusion center is required, and the devices only communicate with their neighbors. The proposed methods are evaluated on custom Ultra-Wideband communication Testbed and a quadrotor, representing a network of both static and mobile nodes. Our algorithms achieve up to three microseconds time synchronization accuracy and 30 cm localization error.

URL PDF HTML ☆

赞 0 踩 0

1711.03819 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Cooperative control of multi-agent systems to locate source of an odor

多智能体系统协作控制以定位气味源

Abhinav Sinha, Rishemjit Kaur, Ritesh Kumar, Amol P. Bhondekar

AI总结本文提出分层协作控制方法，通过多智能体系统定位气味源，结合粒子群算法和滑模控制器实现决策与控制层的协同，验证了分层分布式控制的有效性。

Comments 8 pages, initial results on our work

1711.03605 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Stability and Transparency Analysis of a Bilateral Teleoperation in Presence of Data Loss

双边远程操作中数据丢失情况下稳定性与透明度分析

A. Bakhshi, H. A. Talebi, A. A. Suratgar, M. Abdeetedal

AI总结本文提出一种新的方法，用于分析通信介质中数据丢失情况下双边远程操作的稳定性与透明度。通过周期连续脉冲的集合及其有限级数表示提出新的数据丢失模型，并利用波变量方法证明了系统整体的被动性。

1708.01930 2026-06-04 cs.AI cs.MA cs.RO cs.SY eess.SY 版本更新

Enhanced Emotion Enabled Cognitive Agent Based Rear End Collision Avoidance Controller for Autonomous Vehicles

增强型情感驱动认知代理基于后方碰撞避免控制器用于自动驾驶车辆

Faisal Riaz, Muaz A. Niazi

AI总结本文提出一种基于增强型情感驱动认知代理的后方碰撞避免控制器，通过引入恐惧情绪生成机制，提高自动驾驶车辆的碰撞避免效率和规则数量。

Comments 39 pages, 17 figures

详情

AI中文摘要

后方碰撞是自然中最致命的事故，导致大多数交通伤亡和伤害。现有研究提出了许多后方碰撞避免解决方案，但这些方案高度依赖精确的数学模型。然而，实际道路驾驶受非线性因素如路面状况、驾驶员反应时间、行人流量和车辆动力学影响，因此获得车辆控制系统精确数学模型具有挑战性。这个问题通过模糊逻辑解决了，但过多的模糊规则直接影响其效率。此外，这些基于模糊逻辑的控制器未使用适当的代理建模来模拟人工驾驶员执行这些模糊规则的功能。鉴于这些限制，我们提出了一种增强型情感驱动认知代理（EEEC_Agent）控制器，帮助自动驾驶车辆（AVs）以较少的规则进行后方碰撞避免，设计基于恐惧情绪，并具有高效率。为了在EEEC_Agent中引入恐惧情绪生成机制，采用了Orton, Clore & Collins（OCC）模型。EEEC_Agent的恐惧生成机制通过NetLogo模拟验证。此外，通过特别设计的原型AV平台对EEEC_Agent的功能进行了实际验证。最终，与现有最先进研究的定性比较研究表明，所提出的模型优于近期研究。

英文摘要

Rear end collisions are deadliest in nature and cause most of traffic casualties and injuries. In the existing research, many rear end collision avoidance solutions have been proposed. However, the problem with these proposed solutions is that they are highly dependent on precise mathematical models. Whereas, the real road driving is influenced by non-linear factors such as road surface situations, driver reaction time, pedestrian flow and vehicle dynamics, hence obtaining the accurate mathematical model of the vehicle control system is challenging. This problem with precise control based rear end collision avoidance schemes has been addressed using fuzzy logic, but the excessive number of fuzzy rules straightforwardly prejudice their efficiency. Furthermore, these fuzzy logic based controllers have been proposed without using proper agent based modeling that helps in mimicking the functions of an artificial human driver executing these fuzzy rules. Keeping in view these limitations, we have proposed an Enhanced Emotion Enabled Cognitive Agent (EEEC_Agent) based controller that helps the Autonomous Vehicles (AVs) to perform rear end collision avoidance with less number of rules, designed after fear emotion, and high efficiency. To introduce a fear emotion generation mechanism in EEEC_Agent, Orton, Clore & Collins (OCC) model has been employed. The fear generation mechanism of EEEC_Agent has been verified using NetLogo simulation. Furthermore, practical validation of EEEC_Agent functions has been performed using specially built prototype AV platform. Eventually, the qualitative comparative study with existing state of the art research works reflect that proposed model outperforms recent research.

URL PDF HTML ☆

赞 0 踩 0

1710.02555 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Model Predictive Path-Following for Constrained Differentially Flat Systems

基于约束微分平坦系统的模型预测路径跟踪

Melissa Greeff, Angela P. Schoellig

AI总结本文提出一种结合前馈线性化与基于路径的模型预测控制的新型预测路径跟踪方法，通过微分平坦性将非线性问题转化为凸优化问题，并通过动态路径参考提高系统鲁棒性，实验验证了在四旋翼上优于传统轨迹跟踪控制器的性能。

Comments 8 pages, submitted to ICRA 2018

详情

AI中文摘要

对于许多任务，预测路径跟踪控制可以通过优先考虑接近路径而非沿路径的时间进展，并提前考虑路径变化来显著提高自主机器人性能和鲁棒性。本文提出了一种新颖的预测路径跟踪方法，将前馈线性化与基于路径的模型预测控制相结合。我们的方法有几个关键优势。通过利用微分平坦性，我们将基于路径的模型预测控制问题从非线性问题转化为凸优化问题。通过动态路径参考，可以实现对干扰的鲁棒性，该参考根据机器人进展调整其速度。我们还考虑了关键系统约束。我们在四旋翼上进行了实验，展示了在正常条件、初始偏移和风扰情况下，相比传统轨迹跟踪控制器，保持四旋翼更接近期望路径的改进性能。

英文摘要

For many tasks, predictive path-following control can significantly improve the performance and robustness of autonomous robots over traditional trajectory tracking control. It does this by prioritizing closeness to the path over timed progress along the path and by looking ahead to account for changes in the path. We propose a novel predictive path-following approach that couples feedforward linearization with path-based model predictive control. Our approach has a few key advantages. By utilizing the differential flatness property, we reduce the path-based model predictive control problem from a nonlinear to a convex optimization problem. Robustness to disturbances is achieved by a dynamic path reference, which adjusts its speed based on the robot's progress. We also account for key system constraints. We demonstrate these advantages in experiment on a quadrotor. We show improved performance over a baseline trajectory tracking controller by keeping the quadrotor closer to the desired path under nominal conditions, with an initial offset and under a wind disturbance.

URL PDF HTML ☆

赞 0 踩 0

1710.11040 2026-06-04 cs.RO cs.AI cs.SY eess.SY math.OC 版本更新

How Should a Robot Assess Risk? Towards an Axiomatic Theory of Risk in Robotics

机器人应如何评估风险？迈向机器人学中的风险轴理论

Anirudha Majumdar, Marco Pavone

AI总结本文探讨了机器人风险评估的理论基础，提出风险度量应满足的公理，讨论了风险度量的表示定理及其在机器人应用中的实例，并分析了常用风险度量的局限性。

Comments Extended version of paper published in International Symposium on Robotics Research (ISRR) 2017

详情

AI中文摘要

赋予机器人评估风险和做出风险感知决策的能力被视为确保在不确定环境下运作的机器人安全的关键步骤。但，机器人应如何量化风险？一种自然且常见的方法是考虑一种框架，即随机结果被赋予成本——这种分配由一个成本随机变量捕捉。量化风险则对应于评估风险度量，即从成本随机变量到实数的映射。然而，什么是构成

英文摘要

Endowing robots with the capability of assessing risk and making risk-aware decisions is widely considered a key step toward ensuring safety for robots operating under uncertainty. But, how should a robot quantify risk? A natural and common approach is to consider the framework whereby costs are assigned to stochastic outcomes - an assignment captured by a cost random variable. Quantifying risk then corresponds to evaluating a risk metric, i.e., a mapping from the cost random variable to a real number. Yet, the question of what constitutes a "good" risk metric has received little attention within the robotics community. The goal of this paper is to explore and partially address this question by advocating axioms that risk metrics in robotics applications should satisfy in order to be employed as rational assessments of risk. We discuss general representation theorems that precisely characterize the class of metrics that satisfy these axioms (referred to as distortion risk metrics), and provide instantiations that can be used in applications. We further discuss pitfalls of commonly used risk metrics in robotics, and discuss additional properties that one must consider in sequential decision making tasks. Our hope is that the ideas presented here will lead to a foundational framework for quantifying risk (and hence safety) in robotics applications.

URL PDF HTML ☆

赞 0 踩 0

1709.03153 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY 版本更新

MBMF: Model-Based Priors for Model-Free Reinforcement Learning

MBMF：基于模型的先验用于无模型强化学习

Somil Bansal, Roberto Calandra, Kurtland Chua, Sergey Levine, Claire Tomlin

AI总结本文提出一种结合模型与无模型强化学习的方法，通过学习概率动力学模型作为先验，提升数据效率和成本效益。

Comments After we submitted the paper for consideration in CoRL 2017 we found a paper published in the recent past with a similar method (see related work for a discussion). Considering the similarities between the two papers, we have decided to retract our paper from CoRL 2017

1709.04889 2026-06-04 math.OC cs.LG cs.RO cs.SY eess.SY 版本更新

Control-Oriented Learning on the Fly

实时控制导向学习

Melkior Ornik, Arie Israel, Ufuk Topcu

AI总结本文提出一种实时控制导向学习方法，用于在系统动力学几乎未知的情况下实现控制目标，通过小扰动学习局部动态并保证近似最优方向，验证了其在受损飞机避撞和Van der Pol振荡器中的有效性。

Comments Extended version of M. Ornik, A. Israel, U. Topcu, "Myopic Control of Systems with Unknown Dynamics". Detailed list of differences from that paper given within the manuscript. Changes in v2 include a discussion of myopic control in an LTL context and a correction of the bound for suboptimality of the algorithm

详情

AI中文摘要

本文聚焦于开发一种策略，用于控制其动力学几乎完全未知的系统。这种情况自然出现在系统经历关键故障的场景中。在这种情况下，保留满足基本控制目标的能力以避免即将来临的灾难至关重要。一个典型的此类目标是可达避障问题，其中系统需要在受限的状态空间中移动到某个状态。为了应对对系统动力学知识的限制，我们开发了一种贪心控制理论。贪心控制的主要目标是在任何给定时间，仅根据到目前为止获得的系统信息，优化系统的轨迹方向。我们提出了一种算法，利用小扰动的控制努力来学习局部动态，同时确保系统朝着看似近似最优的方向移动，并为其次优性能提供硬性界限。我们还验证了该算法在受损飞机避撞模拟以及Van der Pol振荡器示例中的有效性。

英文摘要

This paper focuses on developing a strategy for control of systems whose dynamics are almost entirely unknown. This situation arises naturally in a scenario where a system undergoes a critical failure. In that case, it is imperative to retain the ability to satisfy basic control objectives in order to avert an imminent catastrophe. A prime example of such an objective is the reach-avoid problem, where a system needs to move to a certain state in a constrained state space. To deal with limitations on our knowledge of system dynamics, we develop a theory of myopic control. The primary goal of myopic control is to, at any given time, optimize the current direction of the system trajectory, given solely the information obtained about the system until that time. We propose an algorithm that uses small perturbations in the control effort to learn local dynamics while simultaneously ensuring that the system moves in a direction that appears to be nearly optimal, and provide hard bounds for its suboptimality. We additionally verify the usefulness of the algorithm on a simulation of a damaged aircraft seeking to avoid a crash, as well as on an example of a Van der Pol oscillator.

URL PDF HTML ☆

赞 0 踩 0

1710.04052 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robotic Control for Cognitive UWB Radar

具备认知能力的超宽频雷达机器人控制

Stefan Brüggenwirth, Fernando Rial

AI总结本文提出了一种用于6自由度机械臂的轨迹规划方法，该机械臂携带合成孔径超宽频雷达传感器，通过优化问题实现碰撞-free的末端执行器轨迹，以最小化观测时间，适用于三维重建和穿墙雷达扫描及IED检查。

Comments 4 pages, 9 figures, submitted to IEEE IRC 2018

1502.02860 2026-06-04 stat.ML cs.LG cs.RO cs.SY eess.SY 版本更新

Gaussian Processes for Data-Efficient Learning in Robotics and Control

高斯过程在机器人和控制中的数据高效学习

Marc Peter Deisenroth, Dieter Fox, Carl Edward Rasmussen

AI总结本文提出基于高斯过程的非参数转移模型，通过提取更多数据信息加速学习，减少模型误差影响，实现高效自主学习。

Comments 20 pages, 29 figures; fixed a typo in equation on page 8

详情

DOI: 10.1109/TPAMI.2013.218
Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, issue no 2, pages 408-423, February 2015

AI中文摘要

自主学习在控制和机器人领域已持续十多年，数据驱动学习可减少工程知识需求。然而，自主强化学习通常需要大量系统交互，这在实际系统中（如机器人）不现实。本文提出通过高斯过程转移模型提取更多数据信息，显式纳入模型不确定性以减少误差影响，相比现有RL方法，模型基于策略搜索方法实现了前所未有的学习速度，并在真实机器人和控制任务中展示了应用价值。

英文摘要

Autonomous learning has been a promising direction in control and robotics for more than a decade since data-driven learning allows to reduce the amount of engineering knowledge, which is otherwise required. However, autonomous reinforcement learning (RL) approaches typically require many interactions with the system to learn controllers, which is a practical limitation in real systems, such as robots, where many interactions can be impractical and time consuming. To address this problem, current learning approaches typically require task-specific knowledge in form of expert demonstrations, realistic simulators, pre-shaped policies, or specific knowledge about the underlying dynamics. In this article, we follow a different approach and speed up learning by extracting more information from data. In particular, we learn a probabilistic, non-parametric Gaussian process transition model of the system. By explicitly incorporating model uncertainty into long-term planning and controller learning our approach reduces the effects of model errors, a key problem in model-based learning. Compared to state-of-the art RL our model-based policy search method achieves an unprecedented speed of learning. We demonstrate its applicability to autonomous learning in real robot and control tasks.

URL PDF HTML ☆

赞 0 踩 0

1508.00952 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Graphical Newton

图牛顿法

Akshay Srinivasan, Emanuel Todorov

AI总结本文研究如何通过利用函数计算结构减少牛顿步计算复杂度，提出基于计算图线性规模和树宽立方的高效方法。

1710.00489 2026-06-04 cs.RO cs.AI cs.CV cs.NE cs.SY eess.SY 版本更新

SE3-Pose-Nets: Structured Deep Dynamics Models for Visuomotor Planning and Control

SE3-姿态网络：用于视觉-运动规划和控制的结构深度动力学模型

Arunkumar Byravan, Felix Leeb, Franziska Meier, Dieter Fox

AI总结本文提出了一种基于结构深度动力学模型的深度视觉-运动控制方法，通过编码器-解码器结构学习低维姿态嵌入，实现场景分割和姿态预测，并在现实世界中实现了闭环控制。

Comments 8 pages, Initial submission to IEEE International Conference on Robotics and Automation (ICRA) 2018

详情

AI中文摘要

本文提出了一种基于结构深度动力学模型的深度视觉-运动控制方法。我们的深度动力学模型是一种SE3-Nets的变体，通过编码器-解码器结构学习低维姿态嵌入用于视觉-运动控制。与以往工作不同，我们的动力学模型是结构化的：给定一个输入场景，我们的网络明确学习分割显著部分并预测其姿态嵌入以及其运动作为姿态空间中的变化。我们通过一对相隔动作的点云训练我们的模型，并展示在仅提供帧间点对数据关联的监督下，我们的网络能够学习有意义的场景分割以及一致的姿态。我们进一步展示我们的模型可以直接在学习的低维姿态空间中用于闭环控制，其中动作通过最小化姿态空间中的误差使用基于梯度的方法计算，类似于传统模型驱动控制。我们展示了在模拟和现实世界中控制Baxter机器人从原始深度数据的结果，并与两种基线深度网络进行了比较。我们的方法在实时运行，实现了良好的场景动态预测，并在多个控制运行中优于基线方法。视频结果可在：https://rse-lab.cs.washington.edu/se3-structured-deep-ctrl/

英文摘要

In this work, we present an approach to deep visuomotor control using structured deep dynamics models. Our deep dynamics model, a variant of SE3-Nets, learns a low-dimensional pose embedding for visuomotor control via an encoder-decoder structure. Unlike prior work, our dynamics model is structured: given an input scene, our network explicitly learns to segment salient parts and predict their pose-embedding along with their motion modeled as a change in the pose space due to the applied actions. We train our model using a pair of point clouds separated by an action and show that given supervision only in the form of point-wise data associations between the frames our network is able to learn a meaningful segmentation of the scene along with consistent poses. We further show that our model can be used for closed-loop control directly in the learned low-dimensional pose space, where the actions are computed by minimizing error in the pose space using gradient-based methods, similar to traditional model-based control. We present results on controlling a Baxter robot from raw depth data in simulation and in the real world and compare against two baseline deep networks. Our method runs in real-time, achieves good prediction of scene dynamics and outperforms the baseline methods on multiple control runs. Video results can be found at: https://rse-lab.cs.washington.edu/se3-structured-deep-ctrl/

URL PDF HTML ☆

赞 0 踩 0

1709.10237 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Beacon-referenced Mutual Pursuit in Three Dimensions

三维环境中基于信标的目标互追策略

Kevin S. Galloway, Biswadip Dey

AI总结本文提出一种三维环境中基于信标的互追控制律，通过引入额外项关注信标，分析了双体互追系统的闭环动态行为，证明在特定参数条件下，系统可形成具有共同半径的圆轨道稳定状态。

详情

AI中文摘要

受各种无人系统维持位置应用的启发，本文介绍了一种用于近固定信标工作的双体agent的转向控制律。该反馈律是对先前研究的三维恒向（CB）追捕律的修改，加入了额外项以关注信标。我们研究了双体互追系统闭环动态行为，其中每个agent使用针对另一个agent和静止信标的目标参考CB追捕律。在某些控制参数假设下，证明该问题存在环形平衡状态，其中agent在垂直于共同轴线并通过信标的平面内沿具有共同半径的圆轨道运动。由于共同半径和距离信标距离由反馈律中的参数选择决定，这种方法提供了一种在三维环境中设计所需编队的方法。

英文摘要

Motivated by station-keeping applications in various unmanned settings, this paper introduces a steering control law for a pair of agents operating in the vicinity of a fixed beacon in a three-dimensional environment. This feedback law is a modification of the previously studied three-dimensional constant bearing (CB) pursuit law, in the sense that it incorporates an additional term to allocate attention to the beacon. We investigate the behavior of the closed-loop dynamics for a two agent mutual pursuit system in which each agent employs the beacon-referenced CB pursuit law with regards to the other agent and a stationary beacon. Under certain assumptions on the associated control parameters, we demonstrate that this problem admits circling equilibria wherein the agents move on circular orbits with a common radius, in planes perpendicular to a common axis passing through the beacon. As the common radius and distances from the beacon are determined by choice of parameters in the feedback law, this approach provides a means to engineer desired formations in a three-dimensional setting.

URL PDF HTML ☆

赞 0 踩 0

1709.07610 2026-06-04 cs.CG cs.CC cs.RO cs.SY eess.SY 版本更新

Efficient Nearest-Neighbor Search for Dynamical Systems with Nonholonomic Constraints

动态系统非holonomic约束下的高效最近邻搜索

Valerio Varricchio, Brian Paden, Dmitry Yershov, Emilio Frazzoli

AI总结本文研究了非holonomic约束动态系统中最近邻搜索的复杂性，发现传统k-d树在子里曼度量下查询复杂度为Θ(N^p log N)，提出针对性改进策略提升效率。

Comments 16 pages, 3 figures, the 12th Workshop on the Algorithmic Foundations of Robotics (WAFR) 2016

1709.07310 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Constant Bearing Pursuit on Branching Graphs

在分支图上的恒定方位追捕

Kevin S. Galloway, Biswadip Dey

AI总结本文通过在单一循环上附加多个分支，扩展了恒定方位追捕框架，允许考虑任意弱连通的追捕图，其中每个节点出度为1，并研究了相对平衡、纯形状平衡和周期轨道，推导了三体集体的稳定性条件。

详情

AI中文摘要

循环追捕框架提供了一种通过群体中自主机器人对间相互作用生成有用全局行为的高效方法。先前工作研究了恒定方位（CB）追捕规则的循环追捕，并展示了相应动力学下各种有趣行为的存在。在本文中，通过在单一循环上附加多个分支，我们引入了该框架的修改版本，允许考虑任意弱连通的追捕图，其中每个节点的出度为1。这进一步扩展了循环追捕设置。然后，在展示相对平衡（直线或环形运动）、纯形状平衡（螺旋运动）和周期轨道存在后，我们还推导了三体集体稳定性的必要条件。通过为个体代理提供加入或离开集体而不扰动他人运动的途径，我们的方法提高了整体系统的可靠性。

英文摘要

Cyclic pursuit frameworks provide an efficient way to create useful global behaviors out of pairwise interactions in a collective of autonomous robots. Earlier work studied cyclic pursuit with a constant bearing (CB) pursuit law, and has demonstrated the existence of a variety of interesting behaviors for the corresponding dynamics. In this work, by attaching multiple branches to a single cycle, we introduce a modified version of this framework which allows us to consider any weakly connected pursuit graph where each node has an outdegree of 1. This provides a further generalization of the cyclic pursuit setting. Then, after showing existence of relative equilibria (rectilinear or circling motion), pure shape equilibria (spiraling motion) and periodic orbits, we also derive necessary conditions for stability of a 3-agent collective. By paving a way for individual agents to join or leave a collective without perturbing the motion of others, our approach leads to improved reliability of the overall system.

URL PDF HTML ☆

赞 0 踩 0

1709.07032 2026-06-04 cs.RO cs.MA cs.SY eess.SY stat.AP 版本更新

Data-Driven Model Predictive Control of Autonomous Mobility-on-Demand Systems

数据驱动的自动驾驶按需出行系统模型预测控制

Ramon Iglesias, Federico Rossi, Kevin Wang, David Hallac, Jure Leskovec, Marco Pavone

AI总结本文提出一种端到端的数据驱动框架，用于控制自动驾驶按需出行系统，通过时间扩展网络建模并设计MPC算法，利用历史数据预测短期需求，减少乘客等待时间达89.6%。

Comments Submitted to the International Conference on Robotics and Automation 2018

详情

AI中文摘要

本文旨在提出一种端到端的数据驱动框架，用于控制自动驾驶按需出行系统（AMoD，即自动驾驶车队）。我们首先使用时间扩展网络建模AMoD系统，并提出一种计算最优再平衡策略（即预置重新定位）和给定旅行需求的最小可行车队规模的公式。然后，我们适应此公式，设计出一种模型预测控制（MPC）算法，利用基于历史数据的短期需求预测来计算再平衡策略。我们使用最先进的LSTM神经网络和滴滴出行的真实客户数据测试该控制器的端到端性能，证明该方法在大规模系统中表现优异（MPC算法的计算复杂度不依赖于系统中的客户和车辆数量），并且在减少平均乘客等待时间方面优于现有再平衡策略，最高可减少89.6%。

英文摘要

The goal of this paper is to present an end-to-end, data-driven framework to control Autonomous Mobility-on-Demand systems (AMoD, i.e. fleets of self-driving vehicles). We first model the AMoD system using a time-expanded network, and present a formulation that computes the optimal rebalancing strategy (i.e., preemptive repositioning) and the minimum feasible fleet size for a given travel demand. Then, we adapt this formulation to devise a Model Predictive Control (MPC) algorithm that leverages short-term demand forecasts based on historical data to compute rebalancing strategies. We test the end-to-end performance of this controller with a state-of-the-art LSTM neural network to predict customer demand and real customer data from DiDi Chuxing: we show that this approach scales very well for large systems (indeed, the computational complexity of the MPC algorithm does not depend on the number of customers and of vehicles in the system) and outperforms state-of-the-art rebalancing strategies by reducing the mean customer wait time by up to to 89.6%.

URL PDF HTML ☆

赞 0 踩 0

1703.01250 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

虚拟与现实：在强化学习中权衡模拟与物理实验

Alonso Marco, Felix Berkenkamp, Philipp Hennig, Angela P. Schoellig, Andreas Krause, Stefan Schaal, Sebastian Trimpe

AI总结本文提出利用模拟数据优化强化学习，通过结合低成本但不准确的模拟信息与高成本但准确的物理实验，提高效率。

Comments 7 pages, 6 figures, to appear in IEEE 2017 International Conference on Robotics and Automation (ICRA)

详情

DOI: 10.1109/ICRA.2017.7989186

AI中文摘要

在实践中，控制策略的参数通常手动调整，这耗时且令人沮丧。强化学习是一种有前途的替代方法，旨在自动化此过程，但通常需要太多实验才实用。本文提出了一种解决方案，通过利用可用于大多数机器人平台的模拟先验知识。具体而言，我们扩展了熵搜索，一种最大化每次实验信息增益的贝叶斯优化算法，以处理多个信息源的情况。结果是一种原则性的方法，可以有效地将低成本但不准确的模拟信息与高成本且准确的物理实验结合起来。我们将其应用于摆杆系统，证明该算法可以在比仅使用物理系统标准贝叶斯优化更少的实验中找到良好的控制策略。

英文摘要

In practice, the parameters of control policies are often tuned manually. This is time-consuming and frustrating. Reinforcement learning is a promising alternative that aims to automate this process, yet often requires too many experiments to be practical. In this paper, we propose a solution to this problem by exploiting prior knowledge from simulations, which are readily available for most robotic platforms. Specifically, we extend Entropy Search, a Bayesian optimization algorithm that maximizes information gain from each experiment, to the case of multiple information sources. The result is a principled way to automatically combine cheap, but inaccurate information from simulations with expensive and accurate physical experiments in a cost-effective manner. We apply the resulting method to a cart-pole system, which confirms that the algorithm can find good control policies with fewer experiments than standard Bayesian optimization on the physical system only.

URL PDF HTML ☆

赞 0 踩 0

1605.01950 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Automatic LQR Tuning Based on Gaussian Process Global Optimization

基于高斯过程全局优化的自动LQR调优

Alonso Marco, Philipp Hennig, Jeannette Bohg, Stefan Schaal, Sebastian Trimpe

AI总结本文提出一种结合线性最优控制的自动控制器调优框架，利用贝叶斯优化算法提升控制器参数，通过实验数据优化性能目标，以七自由度机械臂平衡倒立杆为例验证方法有效性。

Comments 8 pages, 5 figures, to appear in IEEE 2016 International Conference on Robotics and Automation. Video demonstration of the experiments available at https://am.is.tuebingen.mpg.de/publications/marco_icra_2016

详情

DOI: 10.1109/ICRA.2016.7487144

AI中文摘要

本文提出一种基于线性最优控制与贝叶斯优化的自动控制器调优框架。该框架根据预定义的性能目标，利用实验数据自动改进初始控制器参数。所采用的贝叶斯优化算法为熵搜索，将潜在目标表示为高斯过程，并构建关于目标最小值位置的显式信念。通过最大化每次实验评估的信息增益，该框架能够在较少评估次数下获得改进的控制器。实验演示使用了七自由度机械臂平衡倒立杆的任务，二、四维调优问题的结果展示了该方法在机器人平台上的自动控制器调优潜力。

英文摘要

This paper proposes an automatic controller tuning framework based on linear optimal control combined with Bayesian optimization. With this framework, an initial set of controller gains is automatically improved according to a pre-defined performance objective evaluated from experimental data. The underlying Bayesian optimization algorithm is Entropy Search, which represents the latent objective as a Gaussian process and constructs an explicit belief over the location of the objective minimum. This is used to maximize the information gain from each experimental evaluation. Thus, this framework shall yield improved controllers with fewer evaluations compared to alternative approaches. A seven-degree-of-freedom robot arm balancing an inverted pole is used as the experimental demonstrator. Results of a two- and four-dimensional tuning problems highlight the method's potential for automatic controller tuning on robotic platforms.

URL PDF HTML ☆

赞 0 踩 0

1709.05843 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

Decentralized Collision-Free Control of Multiple Robots in 2D and 3D Spaces

多机器人在二维和三维空间中去碰撞的去中心化控制

Xiaotian Yang

AI总结本文提出在任意未知二维和三维区域中实现多机器人去中心化无碰撞控制的方法，通过网格选择和算法设计解决完全覆盖和搜索任务，算法在MATLAB中验证并与其他算法对比。

详情

AI中文摘要

去中心化机器人控制吸引了大量研究兴趣。然而，一些研究使用了不现实的假设而未考虑碰撞避免。本报告聚焦于在任意未知的二维和三维区域内实现多机器人无碰撞控制，以解决完全覆盖和搜索任务。所有算法均为去中心化，因为机器人能力有限，并且已数学证明。报告首先讨论了两种任务中的网格选择。网格模式简化了区域表示，机器人只需在邻居顶点之间直线移动。对于100%完全二维覆盖，提出了等边三角形网格。对于忽略边界效应的完全覆盖，每种情况都计算出在二维和三维区域中顶点最少的网格。第二部分针对二维和三维区域的完全覆盖，提出了一种去中心化无碰撞算法，驱动机器人前往离参考点最远的区域。该区域可以是静态或扩展的，并在MATLAB中模拟。第三部分提供了三种基于网格的去中心化随机算法，用于在二维或三维区域中搜索目标。目标数量可以是已知或未知的。在第一个算法中，机器人随机选择空闲邻居，优先选择未访问的邻居。第二个算法在机器人靠近时添加排斥力以分散机器人。第三个算法中，如果被已访问的顶点包围，机器人将使用广度优先搜索算法前往最近的未访问顶点。第二个搜索算法在Pioneer 3-DX机器人上验证。展示了生成公式以估计搜索时间的一般方法。算法在MATLAB中与其他五个算法比较，以展示其有效性。

英文摘要

Decentralized control of robots has attracted huge research interests. However, some of the research used unrealistic assumptions without collision avoidance. This report focuses on the collision-free control for multiple robots in both complete coverage and search tasks in 2D and 3D areas which are arbitrary unknown. All algorithms are decentralized as robots have limited abilities and they are mathematically proved. The report starts with the grid selection in the two tasks. Grid patterns simplify the representation of the area and robots only need to move straightly between neighbor vertices. For the 100% complete 2D coverage, the equilateral triangular grid is proposed. For the complete coverage ignoring the boundary effect, the grid with the fewest vertices is calculated in every situation for both 2D and 3D areas. The second part is for the complete coverage in 2D and 3D areas. A decentralized collision-free algorithm with the above selected grid is presented driving robots to sections which are furthest from the reference point. The area can be static or expanding, and the algorithm is simulated in MATLAB. Thirdly, three grid-based decentralized random algorithms with collision avoidance are provided to search targets in 2D or 3D areas. The number of targets can be known or unknown. In the first algorithm, robots choose vacant neighbors randomly with priorities on unvisited ones while the second one adds the repulsive force to disperse robots if they are close. In the third algorithm, if surrounded by visited vertices, the robot will use the breadth-first search algorithm to go to one of the nearest unvisited vertices via the grid. The second search algorithm is verified on Pioneer 3-DX robots. The general way to generate the formula to estimate the search time is demonstrated. Algorithms are compared with five other algorithms in MATLAB to show their effectiveness.

URL PDF HTML ☆

赞 0 踩 0

1709.03248 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Vector Field Guidance for Convoy Monitoring Using Elliptical Orbits

利用椭圆轨道的矢量场引导用于车队监控

Aseem V. Borkar, Vivek S. Borkar, Arpita Sinha

AI总结本文提出了一种基于矢量场的新型引导方案，用于无人机沿可能非线性轨迹跟踪和监控地面车队。通过回归算法计算时间变化的椭圆，确保无人机轨迹反复穿越该移动椭圆。

1709.03426 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Trajectory Synthesis for Fisher Information Maximization

信息最大化的轨迹合成

Andrew D. Wilson, Jarvis A. Schultz, Todd D. Murphey

AI总结本文提出一种连续时间优化方法，通过改进Fisher信息矩阵的范数来生成局部最优轨迹，用于动态系统参数估计，实验验证显示轨迹优化显著提升了参数估计精度。

Comments 12 pages

详情

DOI: 10.1109/TRO.2014.2345918
Journal ref: IEEE Transactions on Robotics, vol. 30, no. 6, pp. 1358-1370, 2014

AI中文摘要

动态系统模型参数估计可通过实验轨迹的选择得到显著提升。对于一般的非线性动态系统，找到全局最优轨迹通常不可行；然而，在给定初始模型参数估计和初始轨迹的情况下，本文提出了一种连续时间优化方法，生成在存在测量噪声下的局部最优轨迹。该优化算法旨在寻找能改进Fisher信息矩阵范数的系统轨迹。通过双臂小车装置进行数值和实验验证。在模拟中，优化轨迹使Fisher信息矩阵的最小特征值比初始轨迹提高了三个数量级。实验结果表明，优化轨迹在实践中使参数估计误差改善了一个数量级。

英文摘要

Estimation of model parameters in a dynamic system can be significantly improved with the choice of experimental trajectory. For general, nonlinear dynamic systems, finding globally "best" trajectories is typically not feasible; however, given an initial estimate of the model parameters and an initial trajectory, we present a continuous-time optimization method that produces a locally optimal trajectory for parameter estimation in the presence of measurement noise. The optimization algorithm is formulated to find system trajectories that improve a norm on the Fisher information matrix. A double-pendulum cart apparatus is used to numerically and experimentally validate this technique. In simulation, the optimized trajectory increases the minimum eigenvalue of the Fisher information matrix by three orders of magnitude compared to the initial trajectory. Experimental results show that this optimized trajectory translates to an order of magnitude improvement in the parameter estimate error in practice.

URL PDF HTML ☆

赞 0 踩 0

1709.02561 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Formal Verification of Station Keeping Maneuvers for a Planar Autonomous Hybrid System

平面自主混合系统的轨道保持 maneuver 的形式验证

Benjamin Martin, Khalil Ghorbal, Eric Goubault, Sylvie Putot

AI总结本文研究了平面自主混合系统轨道保持 maneuver 的形式验证，通过混合程序模型验证可达性和安全性属性，并利用Keymaera X自动生成不变量区域以满足验证需求。

Comments In Proceedings FVAV 2017, arXiv:1709.02126

1709.02560 2026-06-04 eess.SY cs.RO cs.SE cs.SY 版本更新

Run-Time Risk Mitigation in Automated Vehicles: A Model for Studying Preparatory Steps

自动驾驶车辆运行时的风险缓解：一种研究预备步骤的模型

Mario Gleirscher

AI总结本文研究自动驾驶车辆运行时的风险缓解，通过模型分析驾驶过程及控制回路，识别潜在危害并提出缓解措施，为安全控制器的自动化合成提供方法。

Comments In Proceedings FVAV 2017, arXiv:1709.02126

详情

DOI: 10.4204/EPTCS.257.8
Journal ref: EPTCS 257, 2017, pp. 75-90

AI中文摘要

本文假设自动驾驶或高度自动化驾驶（AD）将伴随着超越ISO 26262或SOTIF最新修订要求的严格保证义务。因此，汽车控制和安全工程师必须（i）全面分析驾驶过程及其控制回路，（ii）识别源自该回路的相关危害，（iii）建立可行的自动化措施以有效缓解这些危害或减轻其后果。通过研究一个例子，本文探讨了可用于形式验证 desired properties 的模型，这些 properties 源自潜在保证义务，如有效缓解策略的全局存在性。此外，所提出的方法旨在逐步细化，以实现 AD 安全控制器的自动化合成，这些控制器实现此类属性。

英文摘要

We assume that autonomous or highly automated driving (AD) will be accompanied by tough assurance obligations exceeding the requirements of even recent revisions of ISO 26262 or SOTIF. Hence, automotive control and safety engineers have to (i) comprehensively analyze the driving process and its control loop, (ii) identify relevant hazards stemming from this loop, (iii) establish feasible automated measures for the effective mitigation of these hazards or the alleviation of their consequences. By studying an example, this article investigates some achievements in the modeling for the steps (i), (ii), and (iii), amenable to formal verification of desired properties derived from potential assurance obligations such as the global existence of an effective mitigation strategy. In addition, the proposed approach is meant for step-wise refinement towards the automated synthesis of AD safety controllers implementing such properties.

URL PDF HTML ☆

赞 0 踩 0

1709.02456 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Attack-Aware Multi-Sensor Integration Algorithm for Autonomous Vehicle Navigation Systems

面向攻击的多传感器集成算法用于自动驾驶导航系统

Sangjun Lee, Yongbum Cho, Byung-Cheol Min

AI总结本文提出了一种基于故障检测与隔离的攻击感知多传感器集成算法，用于检测自动驾驶导航系统中的网络攻击。算法利用扩展卡尔曼滤波构建鲁棒残差，并通过参数统计工具识别攻击，通过软件在环仿真验证了其在动态系统中的快速检测和低误报率性能。

Comments "Copyright 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works."

1708.09347 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Sequential Action Control: Closed-Form Optimal Control for Nonlinear and Nonsmooth Systems

顺序动作控制：非线性与非光滑系统的闭环最优控制

Alex Ansari, Todd Murphey

AI总结本文提出一种基于模型的算法，用于在线闭环计算非线性系统的预测最优控制。通过高阶模型和轨迹目标控制混合脉冲、欠驱动和约束系统，采用闭式表达式优化短期控制动作，保证全局唯一性和稳定性。

Comments 19 pages

详情

DOI: 10.1109/TRO.2016.2596768
Journal ref: IEEE Transactions on Robotics, vol. 32, no. 5, pp. 1196-1214, 2016

AI中文摘要

本文提出了一种新的基于模型的算法，用于在线闭环计算传统上具有挑战性的非线性系统的预测最优控制。示例展示了该算法如何使用仅有的高层模型和轨迹目标来控制混合脉冲、欠驱动和约束系统。与迭代优化有限时间域控制序列不同，本文推导出个体控制动作的闭式表达式，即可以应用于短时间的控制值，以在长时间域内最优改善跟踪目标。在温和假设下，这些动作在平衡点附近变为线性反馈律，允许稳定性分析和基于性能的参数选择。全局上，最优动作保证存在性和唯一性。通过在线递推时间域的方式对这些动作进行序列化，所提出的控制器提供了一种最小-最大约束响应，避免了通常需要施加控制约束的开销。基准示例显示，该方法能够避免局部极小值，并在跟踪性能方面优于非线性最优控制器和最近的特定案例方法，且速度比传统方法快多个数量级。

英文摘要

This paper presents a new model-based algorithm that computes predictive optimal controls on-line and in closed loop for traditionally challenging nonlinear systems. Examples demonstrate the same algorithm controlling hybrid impulsive, underactuated, and constrained systems using only high-level models and trajectory goals. Rather than iteratively optimize finite horizon control sequences to minimize an objective, this paper derives a closed-form expression for individual control actions, i.e., control values that can be applied for short duration, that optimally improve a tracking objective over a long time horizon. Under mild assumptions, actions become linear feedback laws near equilibria that permit stability analysis and performance-based parameter selection. Globally, optimal actions are guaranteed existence and uniqueness. By sequencing these actions on-line, in receding horizon fashion, the proposed controller provides a min-max constrained response to state that avoids the overhead typically required to impose control constraints. Benchmark examples show the approach can avoid local minima and outperform nonlinear optimal controllers and recent, case-specific methods in terms of tracking performance, and at speeds orders of magnitude faster than traditionally achievable.

URL PDF HTML ☆

赞 0 踩 0

1708.09342 2026-06-04 eess.SY cs.LG cs.RO cs.SY math.OC 版本更新

Optimal and Learning Control for Autonomous Robots

自主机器人最优与学习控制

Jonas Buchli, Farbod Farshidian, Alexander Winkler, Timothy Sandy, Markus Giftthaler

AI总结本文基于最优控制与强化学习，从统一视角探讨自主机器人闭环控制问题，提供统一符号和术语对比，帮助理解不同领域方法。

Comments Lecture Notes, 101 pages

1708.06207 2026-06-04 cs.HC cs.CY cs.RO cs.SY eess.SY 版本更新

Givers & Receivers perceive handover tasks differently: Implications for Human-Robot collaborative system design

施与者与接收者对交接任务的认知不同：对人机协作系统设计的启示

Roy Someshwar, Yael Edan

AI总结研究通过三种方法探讨了超市瓶类交接任务中人类协作行为，揭示了角色差异对共同行动认知的影响，为短周期重复任务的人机协作系统设计提供系统分析方法。

Comments 16 pages, 8 figures, This manuscript is a prior version of the article accepted in the IJSR

1708.06345 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Robust Optimal Planning and Control of Non-Periodic Bipedal Locomotion with A Centroidal Momentum Model

非周期双足运动的鲁棒最优规划与控制：基于质心动量模型

Ye Zhao, Benito R. Fernandez, Luis Sentis

AI总结本文提出基于质心动量动力学的混合相空间规划与控制方法，通过非周期关键帧状态的鲁棒跟踪实现双足运动的稳健控制，重点解决非周期步态生成和扰动鲁棒性问题。

Comments 43 pages, 22 figures, journal, International Journal of Robotics Research, 2017. arXiv admin note: substantial text overlap with arXiv:1701.05929, arXiv:1511.04628

1708.06252 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Mixture Reduction on Matrix Lie Groups

矩阵李群上的混合减少

Josip Cesic, Ivan Markovic, Ivan Petrovic

AI总结本文提出在矩阵李群上用于混合滤波的集中高斯分布（CGDs）减少方法，通过重新参数化计算KL散度并合并混合组件，用于多目标跟踪滤波研究。

Comments IEEE Signal Processing Letters

详情

DOI: 10.1109/LSP.2017.2723765

AI中文摘要

许多物理系统在矩阵李群上演化，为这类流形设计的混合滤波是解决挑战性估计问题的必要工具。然而，混合滤波面临组件数量持续增长的问题，因此需要适当的混合减少技术。本文提出了一种用于矩阵李群上分布的混合减少方法，称为集中高斯分布（CGDs）。这包括适当重新参数化CGD参数以计算KL散度，选择并合并混合组件。此外，我们还引入了在李群上的多目标跟踪滤波器作为所提减少方法的混合滤波研究示例。特别是，我们实现了在矩阵李群上的概率假说密度滤波器。我们使用最优子图案分配度量在由100个随机生成多目标场景组成的合成数据集上验证了滤波器性能。

英文摘要

Many physical systems evolve on matrix Lie groups and mixture filtering designed for such manifolds represent an inevitable tool for challenging estimation problems. However, mixture filtering faces the issue of a constantly growing number of components, hence require appropriate mixture reduction techniques. In this letter we propose a mixture reduction approach for distributions on matrix Lie groups, called the concentrated Gaussian distributions (CGDs). This entails appropriate reparametrization of CGD parameters to compute the KL divergence, pick and merge the mixture components. Furthermore, we also introduce a multitarget tracking filter on Lie groups as a mixture filtering study example for the proposed reduction method. In particular, we implemented the probability hypothesis density filter on matrix Lie groups. We validate the filter performance using the optimal subpattern assignment metric on a synthetic dataset consisting of 100 randomly generated multitarget scenarios.

URL PDF HTML ☆

赞 0 踩 0

1708.01925 2026-06-04 cs.MA cs.AI cs.CY cs.RO cs.SY eess.SY 版本更新

Designing Autonomous Vehicles: Evaluating the Role of Human Emotions and Social Norms

设计自动驾驶车辆：评估人类情感与社会规范的作用

Faisal Riaz, Muaz A. Niazi

AI总结本文提出通过引入社会规范合规机制，使自动驾驶车辆遵循道路与社会规则，利用模糊逻辑和情绪计算提升决策能力，通过模拟验证其在减少碰撞方面的有效性。

Comments 42 pages, 12 figures

详情

AI中文摘要

人类即将在未来不久将驾驶权利委托给自动驾驶车辆。然而，为完成这一复杂任务，需要一种机制，迫使自动驾驶车辆遵守由良好驾驶者实践的道路和社会规则。此任务可通过在自动驾驶车辆中引入社会规范合规机制来实现。本文提出一个自动驾驶车辆的人工社会作为人类社会的类比。每个AV被分配了具有不同社会影响的社会性格。社会规范被引入，帮助AV在受情绪影响的情况下做出道路避障决策。此外，通过基于前景的情绪（即恐惧）的社交规范合规机制，利用模糊逻辑计算情绪，并通过SimConnect方法将恐惧的模糊值提供给Netlogo模拟环境，以模拟自动驾驶车辆的人工社会。通过行为空间工具进行了广泛的测试，以确定所提出方法在碰撞数量方面的性能。此外，还提出了基于随机漫步模型的人工社会作为比较。与随机漫步的比较证明，所提出的方法为未来自动驾驶车辆的自动驾驶系统提供了更好的选择，这些系统在安全道路旅行方面将更具社会接受性和信任度。

英文摘要

Humans are going to delegate the rights of driving to the autonomous vehicles in near future. However, to fulfill this complicated task, there is a need for a mechanism, which enforces the autonomous vehicles to obey the road and social rules that have been practiced by well-behaved drivers. This task can be achieved by introducing social norms compliance mechanism in the autonomous vehicles. This research paper is proposing an artificial society of autonomous vehicles as an analogy of human social society. Each AV has been assigned a social personality having different social influence. Social norms have been introduced which help the AVs in making the decisions, influenced by emotions, regarding road collision avoidance. Furthermore, social norms compliance mechanism, by artificial social AVs, has been proposed using prospect based emotion i.e. fear, which is conceived from OCC model. Fuzzy logic has been employed to compute the emotions quantitatively. Then, using SimConnect approach, fuzzy values of fear has been provided to the Netlogo simulation environment to simulate artificial society of AVs. Extensive testing has been performed using the behavior space tool to find out the performance of the proposed approach in terms of the number of collisions. For comparison, the random-walk model based artificial society of AVs has been proposed as well. A comparative study with a random walk, prove that proposed approach provides a better option to tailor the autopilots of future AVS, Which will be more socially acceptable and trustworthy by their riders in terms of safe road travel.

URL PDF HTML ☆

赞 0 踩 0

1708.04790 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Evaluation of Human-Robot Collaboration Models for Fluent Operations in Industrial Tasks

工业任务中流畅操作的人机协作模型评估

Lior Sayfeld, Ygal Peretz, Roy Someshwar, Yael Edan

AI总结本文评估了人机协作模型在工业任务中的性能，比较了基于定时和传感器的模型与自适应模型，发现自适应模型在总装配时间和空闲时间上有显著提升。

Comments Robotics: Science and Systems, Human-Robot Hand-Over Workshop 2015

1708.03055 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Optimal Control for Constrained Coverage Path Planning

受约束覆盖路径规划的最优控制

Ankit Manerikar, Debasmit Das, Pranay Banerjee

AI总结本文研究了在地图中存在障碍物约束下，如何通过改进线性扫掠覆盖方法实现最小能量/时间最优和最大面积覆盖，并分析不同参数对性能的影响。

Comments Report for AAE 568 (Applied Optimal Control) at Purdue

1708.00675 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Optical Target Tracking by Scheduled Range Measurements

通过计划范围测量进行光学目标跟踪

Mohammad Hossein Ferdowsi, Ebrahim Sabzikar

AI总结本文研究了通过定期目标方位测量和计划的范围测量率进行光学目标跟踪，利用目标范围估计误差方差作为调度准则，采用改进的球坐标系下目标动态状态向量，推导了近常速、近常加速度和协调转弯率运动模型的动力学方程，并利用UKF-IMM滤波器进行跟踪。

详情

DOI: 10.1117/1.OE.54.4.044101
Journal ref: Optical Engineering 54.4 (2015): 044101-044101

AI中文摘要

本文研究了通过定期目标方位测量和计划的范围测量率进行光学目标跟踪。利用目标范围估计误差方差作为调度准则。为此，将目标动态状态向量以改进的球坐标系表示，使得所有目标状态与范围相关的目标状态解耦。推导了近常速、近常加速度和协调转弯率运动模型在改进球坐标系下的目标状态动力学方程。对于所得到的状态动力学方程，采用具有范围测量调度的UKF-IMM滤波器作为跟踪滤波器。结果显示目标状态被正确估计，所用滤波器在机动目标跟踪中具有高性能。

英文摘要

In this paper, optical target tracking, by regular target bearing measurements and target range in a lower and scheduled measurement rate is considered. Variance of the target range estimation error is used as scheduling criterion. For this purpose, target dynamic state vector in modified spherical coordinates is stated in such a way that all target states be decoupled from range-related target state. Target state dynamic equations in modified spherical coordinates for nearly constant velocity, nearly constant acceleration and coordinated turn rate kinematic models, are analytically derived. For resulted state dynamic equations, a UKF-IMM filter with range measurement scheduling is utilized as a tracking filter. It is shown that target states are estimated properly and applied filter has high performance in maneuvering target tracking.

URL PDF HTML ☆

赞 0 踩 0

1605.05120 2026-06-04 cs.HC cs.RO cs.SY eess.SY 版本更新

Pushing the limits of the CyberGrasp for haptic rendering

突破CyberGrasp在触觉渲染中的极限

Manuel Aiple, André Schiele

AI总结本文提出ExHand Box，通过扩展带宽和定制控制器提升CyberGrasp的触觉反馈性能，实现在硬接触场景中更高的接触刚度和更稳定的反馈效果。

Comments 7 pages, 12 figures

详情

DOI: 10.1109/ICRA.2013.6631073
Journal ref: Robotics and Automation (ICRA), 2013 IEEE International Conference on, Karlsruhe, 2013, pp. 3541-3546

AI中文摘要

CyberGrasp是一种已知的数据手套-外骨骼设备组合，允许向人类手指渲染触觉反馈。然而，其设计限制了通过有限的控制带宽和位置传感器分辨率进行远程操作的实用性。因此，系统在触觉渲染中受限于低可实现的接触刚度和反馈增益幅度。此外，系统禁止其控制器实现的简单适应。在本文中，提出了ExHand Box，一种新设计的后端，以扩大CyberGrasp的带宽限制，并使其能够完全定制控制器实现。ExHand Box为否则未修改的CyberGlove和CyberGrasp手系统提供了新的计算机、接口电子和电机控制器。新系统的回路频率可以自由变化，最高可达2kHz，并且可以通过自动代码生成接口实现自定义控制器。系统性能识别实验展示了在不同采样周期范围内改进的硬接触行为。在稳定状态下演示了高达50kN/m的最大接触刚度，这显著高于非定制原始系统版本所能实现的水平。此外，进行了一项双侧控制实验，以展示新系统在通用远程操作研究中的实用性。在该实验中，引入了一种射线投射算法用于预接触检测，以补偿主从设备之间在以太网网络中出现的高延迟和抖动通信链路。证明了在稳定状态下接触刚度可以保持在系统性能识别的量级，演示了在稳定状态下达到41kN/m的刚度。

英文摘要

The CyberGrasp is a well known dataglove-exoskeleton device combination that allows to render haptic feedback to the human fingers. Its design, however, restricts its usability for teleoperation through a limited control bandwidth and position sensor resolution. Therefore the system is restricted to low achievable contact stiffness and feedback gain magnitudes in haptic rendering. Moreover, the system prohibits simple adaption of its controller implementation. In this paper, the ExHand Box is presented, a newly designed back-end to widen the CyberGrasp's bandwidth restrictions and to open it up for fully customized controller implementations. The ExHand Box provides a new computer, interface electronics and motor controllers for the otherwise unmodified CyberGlove and CyberGrasp hand systems. The loop frequency of the new system can be freely varied up to 2 kHz and custom controllers can be implemented through an automatic code generation interface. System performance identification experiments are presented that demonstrate improved behavior in hard contact situations over a range of sampling periods. Maximum contact stiffnesses of up to 50kN/m in a stable condition are demonstrated, which is significantly higher than what could be achieved with the non-customized original system version. Moreover, a bilateral control experiment is conducted to demonstrate the new system's usability for generic teleoperation research. In this experiment a raycasting algorithm is introduced for pre-contact detection in order to compensate for high delay and jitter communication links between master and slave as they appear in an Ethernet network. It is demonstrated that the contact stiffness can be maintained in the order of magnitude of the system performance identification with a demonstrated stiffness of 41kN/m in a stable condition.

URL PDF HTML ☆

赞 0 踩 0

1707.08689 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Multi-Robot Transfer Learning: A Dynamical System Perspective

多机器人迁移学习：动态系统视角

Mohamed K. Helwa, Angela P. Schoellig

AI总结本文从动态系统角度研究多机器人迁移学习中的最优转移映射性质，提出无需详细动力学知识的算法，通过实验验证该算法在四旋翼平台间迁移学习中减少60-70%的误差。

Comments 7 pages, 6 figures, accepted at the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems

详情

AI中文摘要

多机器人迁移学习允许一个机器人利用第二个相似机器人生成的数据来改进自身行为。潜在优势是减少训练时间并降低训练阶段不可避免的风险。迁移学习算法旨在找到不同机器人之间的最优转移映射。本文通过单输入单输出（SISO）系统的理论研究，探讨了此类最优转移映射的性质。我们首先证明最优迁移学习映射通常是一个动态系统。本文的主要贡献是提供一种确定该最优动态映射性质的算法，包括其阶数和回归器（即它所依赖的变量）。所提出的算法不需要详细的机器人动力学知识，但依赖于通过简单实验测试可获得的基本系统属性。我们通过两个不同四旋翼平台间的迁移学习示例验证了所提算法。实验结果表明，通过我们的算法获得的最优动态映射在减少迁移学习误差方面比直接转移数据或使用最优静态映射的情况减少了60-70%。

英文摘要

Multi-robot transfer learning allows a robot to use data generated by a second, similar robot to improve its own behavior. The potential advantages are reducing the time of training and the unavoidable risks that exist during the training phase. Transfer learning algorithms aim to find an optimal transfer map between different robots. In this paper, we investigate, through a theoretical study of single-input single-output (SISO) systems, the properties of such optimal transfer maps. We first show that the optimal transfer learning map is, in general, a dynamic system. The main contribution of the paper is to provide an algorithm for determining the properties of this optimal dynamic map including its order and regressors (i.e., the variables it depends on). The proposed algorithm does not require detailed knowledge of the robots' dynamics, but relies on basic system properties easily obtainable through simple experimental tests. We validate the proposed algorithm experimentally through an example of transfer learning between two different quadrotor platforms. Experimental results show that an optimal dynamic map, with correct properties obtained from our proposed algorithm, achieves 60-70% reduction of transfer learning error compared to the cases when the data is directly transferred or transferred using an optimal static map.

URL PDF HTML ☆

赞 0 踩 0

1707.08488 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A New Framework for Synthetic Aperture Sonar Micronavigation

合成孔径声纳微导航的新框架

Salvatore Caporale, Yvan Petillot

AI总结本文提出基于误差函数最小化的新型微导航方法，通过比较连续回波的向量空间交集实现高精度定位，通过仿真和受控环境实验验证其有效性。

Comments 12 pages, 19 figures

1610.06283 2026-06-04 cs.RO cs.LG cs.NE cs.SY eess.SY 版本更新

Deep Neural Networks for Improved, Impromptu Trajectory Tracking of Quadrotors

用于四旋翼机即时轨迹跟踪的深度神经网络

Qiyang Li, Jingxing Qian, Zining Zhu, Xuchan Bao, Mohamed K. Helwa, Angela P. Schoellig

AI总结本文提出基于深度神经网络的算法，通过提供定制参考输入提升经典反馈控制器的轨迹跟踪性能，实验表明该方法能有效减少跟踪误差，适用于实时轨迹跟踪应用。

Comments 7 pages, 8 figures. Accepted final version. To appear in the proc. of the 2017 IEEE International Conference on Robotics and Automation

详情

AI中文摘要

四旋翼机的轨迹跟踪控制对于应用范围从勘测和检查到影视制作都至关重要。然而，设计和调优经典控制器，如比例-积分-微分（PID）控制器，以实现高跟踪精度可能耗时且困难，由于隐藏动态和其他非理想因素。深度神经网络（DNN）凭借其卓越的近似抽象、非线性函数的能力，提出了一种增强轨迹跟踪控制的新方法。本文提出了一种基于DNN的算法作为附加模块，以提高经典反馈控制器的跟踪性能。给定期望轨迹，DNNs根据其获得的经验为控制器提供定制参考输入。输入旨在实现期望轨迹与输出轨迹之间的单位映射。这项工作的动机是交互式“画即飞”应用，用户在移动设备上绘制轨迹，四旋翼机即时飞越该轨迹，通过DNN增强的控制系统。实验结果表明，所提出的方法在DNNs在选定的周期轨迹上训练后，能够提高用户绘制轨迹的跟踪精度，表明该方法在现实应用中的潜力。跟踪误差在训练和测试轨迹上分别减少约40-50%，突显了DNNs在知识泛化方面的能力。

英文摘要

Trajectory tracking control for quadrotors is important for applications ranging from surveying and inspection, to film making. However, designing and tuning classical controllers, such as proportional-integral-derivative (PID) controllers, to achieve high tracking precision can be time-consuming and difficult, due to hidden dynamics and other non-idealities. The Deep Neural Network (DNN), with its superior capability of approximating abstract, nonlinear functions, proposes a novel approach for enhancing trajectory tracking control. This paper presents a DNN-based algorithm as an add-on module that improves the tracking performance of a classical feedback controller. Given a desired trajectory, the DNNs provide a tailored reference input to the controller based on their gained experience. The input aims to achieve a unity map between the desired and the output trajectory. The motivation for this work is an interactive "fly-as-you-draw" application, in which a user draws a trajectory on a mobile device, and a quadrotor instantly flies that trajectory with the DNN-enhanced control system. Experimental results demonstrate that the proposed approach improves the tracking precision for user-drawn trajectories after the DNNs are trained on selected periodic trajectories, suggesting the method's potential in real-world applications. Tracking errors are reduced by around 40-50% for both training and testing trajectories from users, highlighting the DNNs' capability of generalizing knowledge.

URL PDF HTML ☆

赞 0 踩 0

1707.05458 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Stabilization Control of the Differential Mobile Robot Using Lyapunov Function and Extended Kalman Filter

微分移动机器人稳定控制的Lyapunov函数与扩展卡尔曼滤波应用

T. T. Hoang, P. M. Duong, N. T. T. Van, T. Q. Vinh

AI总结本文设计了导航微分移动机器人到达目标位置的控制模型，采用扩展卡尔曼滤波进行状态估计，结合Lyapunov函数实现稳定控制，确保闭环系统渐近稳定与鲁棒性。

Comments arXiv admin note: text overlap with arXiv:1611.07112, arXiv:1611.07114

1707.05456 2026-06-04 cs.RO cs.NI cs.SY eess.SY 版本更新

Control of an Internet-based Robot System Using the Real-time Transport Protocol

基于实时传输协议的互联网机器人系统控制

P. M. Duong, T. T. Hoang, T. Q. Vinh

AI总结本文提出利用实时传输协议替代传统TCP和UDP进行机器人系统控制，通过理论分析、仿真和实验验证其可行性与有效性。

Comments in Proceeding of The 5th Vietnam Conference on Mechatronics, Ho chi minh City, Vietnam, 2010

1705.10259 2026-06-04 cs.MA cs.LO cs.RO cs.SY eess.SY 版本更新

Distributed Communication-aware Motion Planning for Multi-agent Systems from STL and SpaTeL Specifications

分布式通信感知多智能体系统运动规划

Zhiyu Liu, Bo Wu, Jin Dai, Hai Lin

AI总结本文提出一种分布式运动规划框架，结合STL和SpaTeL规格，实现通信质量与运动控制的协同设计。

Comments Submitted for publication on 2017 IEEE Conference on Decision and Control (CDC2017)

详情

AI中文摘要

在未来智能交通系统中，联网车辆通过假设通信可靠来实现安全操作。传统方法将控制设计与通信网络分开处理，但车辆运动会影响通信质量。本文提出一种控制理论框架，用于多智能体系统的分布式运动规划，满足复杂的时空规格并考虑通信质量。将期望运动规格和通信性能分别编码为信号临时逻辑（STL）和时空逻辑（SpaTeL）公式，并作为混合整数线性规划（MILP）系统和环境状态变量的约束。通过分布式模型预测控制（MPC）框架为每个智能体生成满足STL和SpaTeL规格的控制策略。通过多智能体系统的分布式通信感知运动规划仿真验证了所提框架的有效性。

英文摘要

In future intelligent transportation systems, networked vehicles coordinate with each other to achieve safe operations based on an assumption that communications among vehicles and infrastructure are reliable. Traditional methods usually deal with the design of control systems and communication networks in a separated manner. However, control and communication systems are tightly coupled as the motions of vehicles will affect the overall communication quality. Hence, we are motivated to study the co-design of both control and communication systems. In particular, we propose a control theoretical framework for distributed motion planning for multi-agent systems which satisfies complex and high-level spatial and temporal specifications while accounting for communication quality at the same time. Towards this end, desired motion specifications and communication performances are formulated as signal temporal logic (STL) and spatial-temporal logic (SpaTeL) formulas, respectively. The specifications are encoded as constraints on system and environment state variables of mixed integer linear programs (MILP), and upon which control strategies satisfying both STL and SpaTeL specifications are generated for each agent by employing a distributed model predictive control (MPC) framework. Effectiveness of the proposed framework is validated by a simulation of distributed communication-aware motion planning for multi-agent systems.

URL PDF HTML ☆

赞 0 踩 0

1705.05065 2026-06-04 cs.RO cs.AI cs.CV cs.SY eess.SY 版本更新

AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles

AirSim：面向自动驾驶车辆的高保真视觉与物理模拟

Shital Shah, Debadeepta Dey, Chris Lovett, Ashish Kapoor

AI总结本文提出基于Unreal引擎的AirSim模拟器，用于高效开发和测试自动驾驶算法，支持高频率物理模拟和多种协议，通过四旋翼实验验证其有效性。

Comments Accepted for Field and Service Robotics conference 2017 (FSR 2017)

1403.5204 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Adaptive Control of Robot Manipulators With Uncertain Kinematics and Dynamics

具有不确定运动学和动力学的机器人操作机自适应控制

Hanlei Wang

AI总结本文提出两种自适应控制方案，实现不管运动学和动力学不确定性的任务空间轨迹跟踪，通过数值模拟验证了控制器性能。

Comments 18 pages, 7 figures, revised for improving the presentation and adding some contents and references based on the reviewers' and AE's comments from IEEE Transactions on Automatic Control

1707.03092 2026-06-04 eess.SY cs.LG cs.RO cs.SY 版本更新

A Separation-Based Design to Data-Driven Control for Large-Scale Partially Observed Systems

基于分离的设计到数据驱动控制用于大规模部分观测系统

Dan Yu, Mohammadhussein Rafieisakhaei, Suman Chakravorty

AI总结本文研究了由偏微分方程（PDE）描述的状态动力学导致的 partially observed 随机最优控制问题，通过黑盒模拟模型求解开环确定性轨迹优化问题，并基于输入输出实验数据设计线性二次高斯控制器。

Comments 3 pages, 6 figures, In Robotics: Science and Systems (RSS) 2017 Workshop of "POMDPs in Robotics: State of The Art, Challenges, and Opportunities"

1707.02201 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Learning human behaviors from motion capture by adversarial imitation

通过对抗模仿学习学习人类行为

Josh Merel, Yuval Tassa, Dhruva TB, Sriram Srinivasan, Jay Lemmon, Ziyu Wang, Greg Wayne, Nicolas Heess

AI总结本文提出利用生成对抗模仿学习训练神经网络策略，从有限的不完全观测状态特征中生成人类化运动模式，即使演示来自不同物理参数的躯体，也能通过子技能策略解决任务。

1705.09415 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Near-Optimal Belief Space Planning via T-LQG

通过T-LQG实现接近最优的信念空间规划

Mohammadhussein Rafieisakhaei, Suman Chakravorty, P. R. Kumar

AI总结本文提出T-LQG方法，用于非线性机器人系统在观测和运动不确定性下的规划问题，提供近优反馈控制策略，解决POMDP问题。

Comments 3 pages, 3 figures, In Robotics: Science and Systems (RSS) 2017 Workshop of "POMDPs in Robotics: State of The Art, Challenges, and Opportunities"

1405.6341 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Efficient Model Learning for Human-Robot Collaborative Tasks

高效的人机协作任务模型学习

Stefanos Nikolaidis, Keren Gu, Ramya Ramakrishnan, Julie Shah

AI总结本文提出一种框架，通过联合动作演示学习人类用户模型，使机器人能自动计算稳健的协作策略。采用无监督学习聚类动作序列，学习逆强化学习奖励函数，并在混合可观测马尔可夫决策过程框架中应用，实现对新用户的类型推断和策略计算。

详情

DOI: 10.1145/2696454.2696455
Journal ref: Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction (HRI 2015)

AI中文摘要

我们提出了一种框架，用于从联合动作演示中学习人类用户模型，使机器人能够计算协作任务的稳健策略。学习过程完全自动，无需人工干预。首先，我们描述了使用无监督学习算法将演示的动作序列聚类为不同的人类类型。这些演示序列还被机器人用来通过逆强化学习算法学习代表每种类型的奖励函数。学习的模型随后作为混合可观测马尔可夫决策过程（MO-MDP）的一部分使用，其中人类类型是部分可观测变量。通过该框架，我们可以推断新用户类型（未包含在训练集中），并计算与新用户偏好一致且对人类动作偏离具有鲁棒性的机器人策略。最后，我们通过人类受试者实验数据验证了该方法，并进行了概念验证演示，其中一个人与小型工业机器人进行协作任务。

英文摘要

We present a framework for learning human user models from joint-action demonstrations that enables the robot to compute a robust policy for a collaborative task with a human. The learning takes place completely automatically, without any human intervention. First, we describe the clustering of demonstrated action sequences into different human types using an unsupervised learning algorithm. These demonstrated sequences are also used by the robot to learn a reward function that is representative for each type, through the employment of an inverse reinforcement learning algorithm. The learned model is then used as part of a Mixed Observability Markov Decision Process formulation, wherein the human type is a partially observable variable. With this framework, we can infer, either offline or online, the human type of a new user that was not included in the training set, and can compute a policy for the robot that will be aligned to the preference of this new user and will be robust to deviations of the human actions from prior demonstrations. Finally we validate the approach using data collected in human subject experiments, and conduct proof-of-concept demonstrations in which a person performs a collaborative task with a small industrial robot.

URL PDF HTML ☆

赞 0 踩 0

1706.01127 2026-06-04 cs.RO cs.SY eess.SY math.DS math.OC 版本更新

Virtual Constraints and Hybrid Zero Dynamics for Realizing Underactuated Bipedal Locomotion

虚拟约束与混合零动力学用于实现欠驱动双足运动

Jessy W Grizzle, Christine Chevallereau

AI总结本文提出了一种协调理论，用于设计反馈控制器实现欠驱动双足机器人稳定行走。通过引入虚拟约束和混合零动力学，同步关节相位变量并捕捉欠驱动特性。

Comments 17 pages, 4 figures, bookchapter

详情

AI中文摘要

欠驱动在人类运动中普遍存在，也应普遍存在于双足机器人运动中。本章提出了一种协调理论，用于设计反馈控制器实现欠驱动双足机器人稳定行走。引入了两个基本工具：虚拟约束和混合零动力学。虚拟约束是通过时间不变反馈控制器施加在机械模型状态变量上的关系。其作用之一是同步机器人关节到内部步态相位变量。另一个作用是诱导一个低维系统，即零动力学，捕捉机器人模型的欠驱动特性，而无需任何近似。为增强直观，首先建立了物理约束与虚拟约束之间的关系。从这里，开发了欠驱动双足模型的混合零动力学，并确立了其在设计渐近稳定的行走运动中的基本作用。本章包含大量参考了已实现这些技术的机器人。

英文摘要

Underactuation is ubiquitous in human locomotion and should be ubiquitous in bipedal robotic locomotion as well. This chapter presents a coherent theory for the design of feedback controllers that achieve stable walking gaits in underactuated bipedal robots. Two fundamental tools are introduced, virtual constraints and hybrid zero dynamics. Virtual constraints are relations on the state variables of a mechanical model that are imposed through a time-invariant feedback controller. One of their roles is to synchronize the robot's joints to an internal gait phasing variable. A second role is to induce a low dimensional system, the zero dynamics, that captures the underactuated aspects of a robot's model, without any approximations. To enhance intuition, the relation between physical constraints and virtual constraints is first established. From here, the hybrid zero dynamics of an underactuated bipedal model is developed, and its fundamental role in the design of asymptotically stable walking motions is established. The chapter includes numerous references to robots on which the highlighted techniques have been implemented.

URL PDF HTML ☆

赞 0 踩 0

1705.10432 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Fine-grained acceleration control for autonomous intersection management using deep reinforcement learning

基于深度强化学习的细粒度加速控制用于自动驾驶交叉口管理

Hamid Mirzaei, Tony Givargis

AI总结本文利用信任区域策略优化方法，实现自动驾驶车辆在网格街道中的细粒度加速控制，以达成全局管理目标。

Comments Accepted in IEEE Smart World Congress 2017

1705.08566 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Near-Optimal Separation Principle for Nonlinear Stochastic Systems Arising in Robotic Path Planning and Control

非线性随机系统在机器人路径规划与控制中的近最优分离原理

Mohammadhussein Rafieisakhaei, Suman Chakravorty, P. R. Kumar

AI总结本文提出了一种针对机器人路径规划与控制中非线性随机系统的近最优分离方法，通过小噪声假设实现可计算的近最优控制设计，同时推导出具有高斯噪声的非线性随机系统的轨迹优化线性二次调节器设计。

Comments 7 pages, 4 Figures, Submitted to 56th IEEE Conference on Decision and Control (CDC), 2017

1705.05727 2026-06-04 cs.RO cs.SY eess.SY nlin.CD 版本更新

A General Scheme Implicit Force Control for a Flexible-Link Manipulator

一种灵活连杆机械臂的通用隐式力控方案

Cecilia Murrugarra, Osberth De Castro, Juan Carlos Grieco, Gerardo Fernandez

AI总结本文提出了一种针对与柔顺环境交互的单连杆柔顺机械臂的隐式力控方案，基于机械臂的数学模型，考虑了柔性梁的动力学和重力作用，通过结构参数确定控制器参数，利用李雅普诺夫理论保证稳定性，采用内外闭环控制结构实现位置和力的跟踪控制及振动抑制。

Comments 16 pages, 14 figures

1705.05344 2026-06-04 cs.RO cs.SY eess.SY 版本更新

GP-ILQG: Data-driven Robust Optimal Control for Uncertain Nonlinear Dynamical Systems

GP-ILQG：基于数据的鲁棒最优控制用于不确定非线性动力学系统

Gilwoo Lee, Siddhartha S. Srinivasa, Matthew T. Mason

AI总结本文提出GP-ILQG算法，结合数据驱动的系统辨识方法与基于微分动态规划的鲁棒最优控制方法，有效解决现实与仿真之间的差距问题，实现快速修正模型并提升控制鲁棒性。

1705.05116 2026-06-04 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY 版本更新

Tuning Modular Networks with Weighted Losses for Hand-Eye Coordination

通过加权损失调节模块网络以提升手眼协调

Fangyi Zhang, Jürgen Leitner, Michael Milford, Peter I. Corke

AI总结本文提出端到端微调方法，通过加权损失提升模块化深度视觉-运动策略在平面抓取任务中的手眼协调性能。

Comments 2 pages, to appear in the Deep Learning for Robotic Vision (DLRV) Workshop in CVPR 2017

1705.04763 2026-06-04 cs.RO cs.SY eess.SY 版本更新

High-Precision Trajectory Tracking in Changing Environments Through $\mathcal{L}_1$ Adaptive Feedback and Iterative Learning

通过L1自适应反馈和迭代学习实现高精度轨迹跟踪

Karime Pereida, Rikky R. P. R. Duivenvoorden, Angela P. Schoellig

AI总结本文提出结合L1自适应反馈与迭代学习控制的框架，提升系统在未知动态扰动下的轨迹跟踪性能，通过实验验证其优于纯ILC方法的鲁棒性和泛化能力。

Comments 7 pages, 5 figures, Proc. of the 2017 IEEE International Conference on Robotics and Automation

详情

AI中文摘要

随着机器人等自动化系统被引入未知和动态环境，需要鲁棒且自适应的控制策略以应对扰动、未建模动态和参数不确定性。本文提出并证明了一种结合L1自适应反馈和迭代学习控制（ILC）的框架，以提高受未知和变化扰动影响的系统轨迹跟踪性能。L1自适应控制器使系统在存在未知和变化扰动的情况下表现出可重复的预定义行为，但并不意味着实现完美轨迹跟踪。ILC通过以往执行的经验改进跟踪性能。ILC的性能受限于底层系统的鲁棒性和可重复性，而在本方法中，这由L1自适应控制器处理。特别地，我们能够将学习的轨迹泛化到不同的系统配置，因为L1自适应控制器处理了系统底层的变化。我们通过四旋翼在未知动态扰动下的实验，展示了结合方法相比纯ILC在轨迹跟踪性能和泛化能力上的改进。这是首次在实验中展示L1自适应控制与ILC结合的工作。

英文摘要

As robots and other automated systems are introduced to unknown and dynamic environments, robust and adaptive control strategies are required to cope with disturbances, unmodeled dynamics and parametric uncertainties. In this paper, we propose and provide theoretical proofs of a combined $\mathcal{L}_1$ adaptive feedback and iterative learning control (ILC) framework to improve trajectory tracking of a system subject to unknown and changing disturbances. The $\mathcal{L}_1$ adaptive controller forces the system to behave in a repeatable, predefined way, even in the presence of unknown and changing disturbances; however, this does not imply that perfect trajectory tracking is achieved. ILC improves the tracking performance based on experience from previous executions. The performance of ILC is limited by the robustness and repeatability of the underlying system, which, in this approach, is handled by the $\mathcal{L}_1$ adaptive controller. In particular, we are able to generalize learned trajectories across different system configurations because the $\mathcal{L}_1$ adaptive controller handles the underlying changes in the system. We demonstrate the improved trajectory tracking performance and generalization capabilities of the combined method compared to pure ILC in experiments with a quadrotor subject to unknown, dynamic disturbances. This is the first work to show $\mathcal{L}_1$ adaptive control combined with ILC in experiment.

URL PDF HTML ☆

赞 0 踩 0

1702.02453 2026-06-04 cs.LG cs.RO cs.SY eess.SY 版本更新

Preparing for the Unknown: Learning a Universal Policy with Online System Identification

为未知做准备：学习通用策略与在线系统识别

Wenhao Yu, Jie Tan, C. Karen Liu, Greg Turk

AI总结本文提出了一种学习通用策略的方法，通过在线系统识别和大量训练示例，使策略在未知动态模型下具备鲁棒性，适用于多种动态模型和环境变化。

Comments Accepted as a conference paper at RSS 2017

详情

AI中文摘要

我们提出了一种学习控制策略的新方法，该方法能够在未知动态模型下有效运行。我们通过利用大量由物理模拟器生成的训练示例来创建此类策略。系统由两个组件组成：通用策略（UP）和在线系统识别（OSI）函数。我们描述我们的控制策略为通用，因为它是在广泛动态模型上训练的。这些动态模型的变化可能包括机器人组件的质量和惯性差异、摩擦系数变化或未知被操作物体的质量。通过在这些变化上训练通用策略，控制策略在未知环境中准备了更广泛的可能条件。系统第二部分利用系统的近期状态和动作历史来预测动态模型参数mu。在线系统识别的mu值然后作为输入提供给控制策略（连同系统状态）。UP-OSI是一种在广泛动态模型上适用且对环境突然变化具有响应性的稳健控制策略。我们评估了该系统在多种任务上的性能，包括cart-pole翻转问题、双倒立摆、跳蛙器的运动和机械臂的块投掷任务。UP-OSI在各种动态模型上均有效。此外，当测试动态模型超出训练范围时，UP-OSI在UP单独的情况下表现更优，即使UP被给予实际的动态模型值。除了创建更稳健的控制器的好处外，UP-OSI还具有缩小模拟与真实物理系统现实差距的潜力。

英文摘要

We present a new method of learning control policies that successfully operate under unknown dynamic models. We create such policies by leveraging a large number of training examples that are generated using a physical simulator. Our system is made of two components: a Universal Policy (UP) and a function for Online System Identification (OSI). We describe our control policy as universal because it is trained over a wide array of dynamic models. These variations in the dynamic model may include differences in mass and inertia of the robots' components, variable friction coefficients, or unknown mass of an object to be manipulated. By training the Universal Policy with this variation, the control policy is prepared for a wider array of possible conditions when executed in an unknown environment. The second part of our system uses the recent state and action history of the system to predict the dynamics model parameters mu. The value of mu from the Online System Identification is then provided as input to the control policy (along with the system state). Together, UP-OSI is a robust control policy that can be used across a wide range of dynamic models, and that is also responsive to sudden changes in the environment. We have evaluated the performance of this system on a variety of tasks, including the problem of cart-pole swing-up, the double inverted pendulum, locomotion of a hopper, and block-throwing of a manipulator. UP-OSI is effective at these tasks across a wide range of dynamic models. Moreover, when tested with dynamic models outside of the training range, UP-OSI outperforms the Universal Policy alone, even when UP is given the actual value of the model dynamics. In addition to the benefits of creating more robust controllers, UP-OSI also holds out promise of narrowing the Reality Gap between simulated and real physical systems.

URL PDF HTML ☆

赞 0 踩 0

1704.04722 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Bounded Distributed Flocking Control of Nonholonomic Mobile Robots

非holonomic移动机器人有界分布式编队控制

Thang Nguyen, Hung La, Vahid Azimi, Thanh-Trung Han

AI总结本文提出基于有界反馈的非holonomic移动机器人分布式编队控制方法，解决动态特性带来的挑战，实现速度一致性、避障和聚拢维持。

1705.01332 2026-06-04 eess.SY cs.RO cs.SY 版本更新

LiDAR-based Control of Autonomous Rotorcraft for the Inspection of Pier-like Structures: Proofs

基于LiDAR的自主旋翼飞行器对类似桥体结构的检测控制：证明

Bruno J. Guerreiro, Carlos Silvestre, Rita Cunha, David Cabecinhas

AI总结本文提出基于LiDAR的自主旋翼飞行器轨迹跟踪控制方法，通过定义基于LiDAR测量的运动学模型，结合LPV表示和H2控制理论，实现对桥体结构的高效检测。

Comments [1] B. J. Guerreiro, C. Silvestre, R. Cunha, and D. Cabecinhas, Lidar-based control of autonomous rotorcraft for the inspection of pier-like structures, IEEE Transactions in Control Systems Technology, 2017. (to appear)

1603.06716 2026-06-04 eess.SY cs.LO cs.RO cs.SY 版本更新

AWEsome：一种用于空载风能系统的开源测试平台

Philip Bechtle, Thomas Gehrmann, Christoph Sieg, Udo Zillmann

AI总结本文介绍了一种低成本、基于开源软件的空载风能系统测试平台AWEsome，可为研究团队和初创企业提供无需大额投资的飞行测试和控制策略验证机会。

Comments pdflatex, 25 pages, 9 figures

1704.03103 2026-06-04 cs.RO cs.AI cs.CG cs.SY eess.SY 版本更新

Minkowski Operations of Sets with Application to Robot Localization

Minkowski运算与机器人定位的应用

Benoit Desrochers, Luc Jaulin

AI总结本文通过引入Minkowski和与差的分离器，高效解决机器人在非结构化环境中基于声呐测量的定位问题，并通过测试案例验证了方法的有效性。

Comments In Proceedings SNR 2017, arXiv:1704.02421

1704.02075 2026-06-04 cs.RO cs.SY eess.SY 版本更新

On Sensing, Agility, and Computation Requirements for a Data-gathering Agile Robotic Vehicle

关于数据收集敏捷机器人车辆的感知、敏捷性和计算需求

Fangchang Ma, Sertac Karaman

AI总结本文研究了机器人车辆在未知数据源位置下收集信息的性能，分析了感知、敏捷性和计算能力对信息获取的影响，提出了理论结果和实验验证。

Comments 22 pages, 11 figures

详情

AI中文摘要

我们考虑了一种任务为通过访问一组空间分布的数据源来收集信息的机器人车辆，这些数据源的位置在事先未知，但可以在飞行中发现。我们假设涉及漂移的一阶机器人动力学，并假设数据源的位置是泊松分布的。在此设定下，我们以感知、敏捷性和计算能力来表征机器人的性能。更具体地说，机器人性能的表征包括其从远处感知目标位置的能力、快速移动的能力以及进行推理和规划的计算能力。我们还以每个数据源可以获取的信息量和分布来表征机器人的性能。我们的理论结果包括：目标位置的信息量分布对从远处感知目标的需求影响巨大；性能随着移动能力的增加而增加，但回报递减；并且计算需求随着规划的增加而比推理更快增加，同时随着感知范围和移动能力的增加。我们提供了计算实验来验证我们的理论结果。最后，我们展示了这些结果可以用于移动机器人系统的感知、驱动和计算能力的协同设计，以执行信息收集任务。我们的证明技术建立了机器人信息收集基本问题与统计力学中的最后通路渗透问题之间的新联系，这可能本身就有价值。

英文摘要

We consider a robotic vehicle tasked with gathering information by visiting a set of spatially-distributed data sources, the locations of which are not known a priori, but are discovered on the fly. We assume a first-order robot dynamics involving drift and that the locations of the data sources are Poisson-distributed. In this setting, we characterize the performance of the robot in terms of its sensing, agility, and computation capabilities. More specifically, the robot's performance is characterized in terms of its ability to sense the target locations from a distance, to maneuver quickly, and to perform computations for inference and planning. We also characterize the performance of the robot in terms of the amount and distribution of information that can be acquired at each data source. The following are among our theoretical results: the distribution of the amount of information among the target locations immensely impacts the requirements for sensing targets from a distance; performance increases with increasing maneuvering capability, but with diminishing returns; and the computation requirements increase more rapidly for planning as opposed to inference, with both increasing sensing range and maneuvering ability. We provide computational experiments to validate our theoretical results. Finally, we demonstrate that these results can be utilized in the co-design of sensing, actuation, and computation capabilities of mobile robotic systems for an information-gathering mission. Our proof techniques establish novel connections between the fundamental problems of robotic information-gathering and the last-passage percolation problem of statistical mechanics, which may be of interest on its own right.

URL PDF HTML ☆

赞 0 踩 0

1307.0276 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Controllability Analysis and Degraded Control for a Class of Hexacopters Subject to Rotor Failures

六旋翼无人机在旋翼失效情况下的可控性分析与退化控制

Guang-Xun Du, Quan Quan, Kai-Yuan Cai

AI总结本文分析了六旋翼无人机在旋翼失效时的可控性问题，提出了一种退化控制策略，证明当旋翼最大升力超过一定值时系统可控。

Comments 21 pages, 7 figures, submitted to Journal of Intelligent & Robotic Systems

1704.00888 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Discrete-Time Attitude Observer on SO(3) for Vision and GPS Fusion

基于SO(3)的离散时间姿态观测器用于视觉与GPS融合

Alireza Khosravian, Tat-Jun Chin, Ian Reid, Robert Mahony

AI总结本文提出一种离散时间几何姿态观测器，融合单目视觉与GPS速度测量，通过减轻视觉里程计姿态估计的固有漂移并直接估计北东下坐标系下的姿态，提供收敛性分析和收敛率下界。

Comments To appear in IEEE Conference on Robotics and Automation 2017

详情

Journal ref: Alireza Khosravian, Tat-Jun Chin, Ian Reid, Robert Mahony "A Discrete-Time Attitude Observer on SO(3) for Vision and GPS Fusion", in Proc. IEEE Conf. Robotics and Automation, 2017

AI中文摘要

本文提出了一种离散时间几何姿态观测器，用于融合单目视觉与GPS速度测量。该观测器利用视觉里程计处理单目图像得到的相对变换，并将其与GPS速度测量融合。该传感器融合的目标是减轻视觉里程计姿态估计的固有漂移，并直接估计北东下坐标系下的姿态。本文的主要贡献是提供严谨的稳定性分析，证明观测器的姿态估计以指数形式收敛到真实姿态，并提供观测器收敛率的下界。通过实验研究，我们证明该观测器能有效补偿纯单目视觉姿态估计的固有漂移，并且即使初始化时具有非常大的姿态误差，也能恢复北东下坐标系下的姿态。

英文摘要

This paper proposes a discrete-time geometric attitude observer for fusing monocular vision with GPS velocity measurements. The observer takes the relative transformations obtained from processing monocular images with any visual odometry algorithm and fuses them with GPS velocity measurements. The objectives of this sensor fusion are twofold; first to mitigate the inherent drift of the attitude estimates of the visual odometry, and second, to estimate the orientation directly with respect to the North-East-Down frame. A key contribution of the paper is to present a rigorous stability analysis showing that the attitude estimates of the observer converge exponentially to the true attitude and to provide a lower bound for the convergence rate of the observer. Through experimental studies, we demonstrate that the observer effectively compensates for the inherent drift of the pure monocular vision based attitude estimation and is able to recover the North-East-Down orientation even if it is initialized with a very large attitude error.

URL PDF HTML ☆

赞 0 踩 0

1704.00534 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Controlling a triangular flexible formation of autonomous agents

控制自主体的三角形柔性编队

Hector Garcia de Marina, Zhiyong Sun, Ming Cao, Brian D. O. Anderson

AI总结本文提出结合位置和距离梯度下降控制的算法，通过添加人工偏置使自主体形成共线配置，并引入旋转矩阵控制邻近自主体的方位角。

Comments 7 pages, accepted in the 20th World Congress of the International Federation of Automatic Control (IFAC)

1703.08557 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Towards a Functional System Architecture for Automated Vehicles

迈向自动驾驶车辆的功能系统架构

Simon Ulbrich, Andreas Reschka, Jens Rieken, Susanne Ernst, Gerrit Bagschik, Frank Dierkes, Marcus Nolte, Markus Maurer

AI总结本文提出了一种通用功能架构，用于自动驾驶车辆，该架构独立于具体实现，并基于Stadtpilot项目中的真实世界实现进行验证，涵盖环境感知、规划控制、定位、地图提供及人机交互等方面。

Comments Submitted for review to IEEE Transactions on Intelligent Transportation Systems, 16 pages, 4 figures

1703.08515 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Mean-Field Controllability and Decentralized Stabilization of Markov Chains, Part II: Asymptotic Controllability and Polynomial Feedbacks

均场可控性与马尔可夫链的去中心化稳定化，第二部分：渐近可控性与多项式反馈

Shiba Biswal, Karthik Elamvazhuthi, Spring Berman

AI总结本文研究了有限状态空间中群体代理连续时间马尔可夫链的均场反馈稳定化方法，证明了强连通支持概率分布可被时间不变输入稳定，并展示了渐近可控性及去中心化密度反馈律的存在性。

Comments Submitted to IEEE Conference on Decision and Control, 2017

详情

AI中文摘要

本文是两部分系列论文的第二部分，提出了一种方法用于有限状态空间中群体代理连续时间马尔可夫链（CTMC）的均场反馈稳定化。所得到的（均场）控制问题为控制具有期望全局稳定性性质的非线性系统。首先证明任何具有强连通支持的概率分布均可通过时间不变输入稳定。其次展示所有可能概率分布的渐近可控性，包括分配零密度到某些状态且不一定具有强连通支持的分布。最后证明当图是强连通且双向时，总存在一个全局渐近稳定的去中心化密度反馈律，其控制输入在平衡时为零。然后将闭环多项式反馈的合成问题框架为使用最新求和平方优化工具的优化问题。该优化问题寻找多项式反馈律，使候选李雅普诺夫函数成为闭环系统稳定性证书。本文方法在五顶点图的两个案例中进行了测试，并通过相应常微分方程组的数值模拟验证了所构造控制律的稳定性特性。

英文摘要

This paper, the second of a two-part series, presents a method for mean-field feedback stabilization of a swarm of agents on a finite state space whose time evolution is modeled as a continuous time Markov chain (CTMC). The resulting (mean-field) control problem is that of controlling a nonlinear system with desired global stability properties. We first prove that any probability distribution with a strongly connected support can be stabilized using time-invariant inputs. Secondly, we show the asymptotic controllability of all possible probability distributions, including distributions that assign zero density to some states and which do not necessarily have a strongly connected support. Lastly, we demonstrate that there always exists a globally asymptotically stabilizing decentralized density feedback law with the additional property that the control inputs are zero at equilibrium, whenever the graph is strongly connected and bidirected. Then the problem of synthesizing closed-loop polynomial feedback is framed as a optimization problem using state-of-the-art sum-of-squares optimization tools. The optimization problem searches for polynomial feedback laws that make the candidate Lyapunov function a stability certificate for the resulting closed-loop system. Our methodology is tested for two cases on a five vertex graph, and the stabilization properties of the constructed control laws are validated with numerical simulations of the corresponding system of ordinary differential equations.

URL PDF HTML ☆

赞 0 踩 0

1703.08243 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Mean-Field Controllability and Decentralized Stabilization of Markov Chains, Part I: Global Controllability and Rational Feedbacks

马尔可夫链的均场可控性与去中心化稳定化，第一部分：全局可控性与理性反馈

Karthik Elamvazhuthi, Vaibhav Deshmukh, Matthias Kawski, Spring Berman

AI总结本文研究了连续时间马尔可夫链的可控性和稳定化性质，通过转移率作为控制参数。证明了在强连通图下，小时间局部和全局可控性，以及存在去中心化线性反馈法稳定概率密度。

Comments Submitted to IEEE Conference on Decision and Control, 2017

详情

AI中文摘要

本文研究了连续时间马尔可夫链的可控性和稳定化性质，通过转移率作为控制参数。首先，我们证明了在强连通图下，从和到严格正的平衡配置的小时间局部和全局可控性。其次，我们证明了当允许转移率为负且目标密度位于概率密度集内部时，总是存在一个局部指数稳定的去中心化线性（密度）反馈律，该反馈律在平衡点处取零值并尊重图结构。对于双向图，即存在反向边的图，我们展示了这种线性控制律可通过形式为k(x) = a(x) + b(x)f(x)/g(x)的去中心化理性反馈律实现，该反馈律也尊重图结构和控制约束（正性和平衡点处零值）。这使得可以利用线性矩阵不等式（LMI）工具算法性地构造去中心化密度反馈控制器，以稳定机器人集群到目标任务分布，且在平衡点处无任务切换，如通过多个数值示例所展示的。

英文摘要

In this paper, we study the controllability and stabilizability properties of the Kolmogorov forward equation of a continuous time Markov chain (CTMC) evolving on a finite state space, using the transition rates as the control parameters. Firstly, we prove small-time local and global controllability from and to strictly positive equilibrium configurations when the underlying graph is strongly connected. Secondly, we show that there always exists a locally exponentially stabilizing decentralized linear (density-)feedback law that takes zero valu at equilibrium and respects the graph structure, provided that the transition rates are allowed to be negative and the desired target density lies in the interior of the set of probability densities. For bidirected graphs, that is, graphs where a directed edge in one direction implies an edge in the opposite direction, we show that this linear control law can be realized using a decentralized rational feedback law of the form k(x) = a(x) + b(x)f(x)/g(x) that also respects the graph structure and control constraints (positivity and zero at equilibrium). This enables the possibility of using Linear Matrix Inequality (LMI) based tools to algorithmically construct decentralized density feedback controllers for stabilization of a robotic swarm to a target task distribution with no task-switching at equilibrium, as we demonstrate with several numerical examples.

URL PDF HTML ☆

赞 0 踩 0

1702.00785 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Evaluation of Automated Vehicles Encountering Pedestrians at Unsignalized Crossings

自动驾驶车辆在无信号交叉口与行人互动的评估

Baiming Chen, Ding Zhao, Huei Peng

AI总结本文提出了一种评估自动驾驶车辆在无信号交叉口与行人互动安全性和可行性的新方法，通过分析数据和模拟行人反应，验证了自动驾驶策略的有效性。

1703.07736 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Circular formation control of fixed-wing UAVs with constant speeds

固定翼无人机恒速圆形成形控制

Hector Garcia de Marina, Zhiyong Sun, Murat Bronz, Gautier Hattenberger

AI总结本文提出了一种稳定固定翼无人机恒速圆形成形的算法，通过跟踪不同半径的圆来控制车辆相对于目标圆周的相位，确保团队在特定区域内保持约束。

Comments 6 pages, submitted to IROS 2017

1703.06416 2026-06-04 cs.MA cs.DC cs.RO cs.SY eess.SY 版本更新

A Passivity-Based Distributed Reference Governor for Constrained Robotic Networks

基于被动性的分布式参考管理者用于受约束的机器人网络

Tam Nguyen, Takeshi Hatanaka, Mamoru Doi, Emanuele Garone, Masayuki Fujita

AI总结本文提出了一种基于被动性的分布式参考管理者，用于预稳定化的移动机器人网络，通过梯度下降法和对偶上升法解决参考管理者问题，并通过仿真实验验证了其有效性。

Comments 8 pages, International Federation of Automatic Conference 2017, 8 figures

1703.06387 2026-06-04 cs.RO cs.SY eess.SY 版本更新

An opportunistic linear-convex algorithm for localization in mobile robot networks

一种机会主义的线性-凸算法用于移动机器人网络的定位

Sam Safavi, Usman Khan

AI总结本文提出一种分布式算法用于定位在有界区域内任意移动的机器人网络，通过机会主义策略在附近机器人存在时更新位置，基于重心坐标设计线性-凸更新方法，并证明其在噪声下的渐近收敛性。

详情

AI中文摘要

本文开发了一种分布式算法，用于定位在有界区域内任意移动的机器人网络。在移动网络中，主要挑战是机器人可能无法找到附近的机器人来实现分布式算法。我们通过提供一种机会主义算法来解决这个问题，该算法仅在附近有机器人时进行位置更新，否则不更新。我们假设每个机器人测量其运动和到附近机器人的距离的噪声版本。为了在$\mathbb{R}^m$中定位移动机器人网络，我们提供了一个简单的线性更新，基于重心坐标并具有线性-凸性质。我们将相应的定位算法抽象为线性时变（LTV）系统，并证明其渐近收敛到真实位置。我们首先考虑噪声less情况，其中距离和运动向量已知（测量完美），并提供收敛性的充分条件。然后评估算法在存在噪声时的性能，并提供修改以抵消噪声的不利影响。我们进一步证明，只要至少有一个已知的基站（位置完全已知的节点）存在，我们的算法可以精确跟踪移动网络。

英文摘要

In this paper, we develop a \textcolor{black}{\emph{distributed}} algorithm to localize a network of robots moving arbitrarily in a bounded region. In the case of such mobile networks, the main challenge is that the robots may not be able to find nearby robots to implement a distributed algorithm. We address this issue by providing an opportunistic algorithm that only implements a location update when there are nearby robots and does not update otherwise. We assume that each robot measures a noisy version of its motion and the distances to the nearby robots. To localize a network of mobile robots in~$\mathbb{R}^m$, we provide a simple \emph{linear} update, which is based on barycentric coordinates and is linear-convex. We abstract the corresponding localization algorithm as a Linear Time-Varying (LTV) system and show that it asymptotically converges to the true locations~of~the robots. We first focus on the noiseless case, where the distance and motion vectors are known (measured) perfectly, and provide sufficient conditions on the convergence of the algorithm. We then evaluate the performance of the algorithm in the presence of noise and provide modifications to counter the undesirable effects of noise. \textcolor{black}{We further show that our algorithm precisely tracks a mobile network as long as there is at least one known beacon (a node whose location is perfectly known).

URL PDF HTML ☆

赞 0 踩 0

1703.05019 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Finding a Feasible Initial Solution for Flatness-Based Multi-Link Manipulator Motion Planning under State and Control Constraints

为基于平坦度的多连杆机械臂运动规划在状态和控制约束下寻找可行的初始解

Keisuke Uto, Makoto Obayashi, Gaku Takano

AI总结本文提出一种方法，在非凸优化问题中寻找可行点并可靠求解，用于在状态和控制约束下规划多连杆机械臂的设定点运动。通过分析最终时间效应，构造初始可行解，结合线性规划和递归逆动力学算法进行数值实验验证。

Comments accepted to the SICE International Symposium on Control Systems 2017

1703.04881 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Multi-Objective Cooperative Search of Spatially Diverse Routes in Uncertain Environments

多目标协作搜索空间多样路线以应对不确定环境

Johnathan Votion, Yongcan Cao

AI总结本文提出一种多车辆协作系统，用于在未知环境中搜索有价值路线，通过调整权重和控制增益实现路径多样性，最终选择最优路径。

Comments submitted to a conference

详情

AI中文摘要

本文旨在为不确定环境中的协作无人系统开发新的导航和侦察能力。目标是设计一种协作多车辆系统，能够探测未知环境并找到对人员最有价值的路线。为此，多车辆系统首先探索空间多样路线，然后选择最安全的路线。所提出的协作路径规划器根据多个因素（如行驶距离、通行便利性及通行便利性的不确定性）依次生成一组空间多样路线。规划器对每个因素的依赖性通过加权分数改变，从而改变最优路线的判断标准。为惩罚不同车辆选择相同路径，使用控制增益增加其他车辆分配路线附近路径的成本。通过调整控制增益，可以实现路线的空间多样性。通过反复协作搜索不同路径，最终可选择出最有价值的最优路径。

英文摘要

This paper focuses on developing new navigation and reconnaissance capabilities for cooperative unmanned systems in uncertain environments. The goal is to design a cooperative multi-vehicle system that can survey an unknown environment and find the most valuable route for personnel to travel. To accomplish the goal, the multi-vehicle system first explores spatially diverse routes and then selects the safest route. In particular, the proposed cooperative path planner sequentially generates a set of spatially diverse routes according to a number of factors, including travel distance, ease of travel, and uncertainty associated with the ease of travel. The planner's dependence on each of these factors is altered by a weighted score, doing so changes the criteria for determining an optimum route. To penalize the selection of same paths by different vehicles, a control gain is used to increase the cost of paths that lie near the route(s) assigned to other vehicles. By varying the control gain, the spatial diversity among routes can be accomplished. By repeatedly searching for different paths cooperatively, an optimal path can be selected that yields the most valuable route.

URL PDF HTML ☆

赞 0 踩 0

1703.04550 2026-06-04 cs.RO cs.LG cs.NE cs.SY eess.SY 版本更新

Sensor Fusion for Robot Control through Deep Reinforcement Learning

通过深度强化学习实现机器人控制的传感器融合

Steven Bohez, Tim Verbelen, Elias De Coninck, Bert Vankeirsbilck, Pieter Simoens, Bart Dhoedt

AI总结本文提出通过深度强化学习实现机器人传感器信息融合，提升机器人在搜索和拾取任务中的鲁棒性和性能。

Comments 6 pages, 6 figures, submitted to IROS 2017

1604.02943 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Taming mismatches in inter-agent distances for the formation-motion control of second-order agents

消除交互距离不匹配以实现二阶代理的形成-运动控制

Hector Garcia de Marina, Bayu Jayawardhana, Ming Cao

AI总结本文分析距离不匹配对二阶代理标准梯度刚性形成控制的影响，指出其导致最终形状扭曲和群体稳态运动，并通过结合标准控制律与分布式估计器消除这些行为，同时利用不匹配参数控制群体的平移和旋转运动。

Comments 14 pages, conditionally accepted in Automatic Control, IEEE Transactions on

1703.03161 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Behavior-based Navigation of Mobile Robot in Unknown Environments Using Fuzzy Logic and Multi-Objective Optimization

基于模糊逻辑和多目标优化的未知环境中移动机器人行为导航

Thi Thanh Van Nguyen, Manh Duong Phung, Quang Vinh Tran

AI总结本文提出BBFM架构，通过模糊控制器和多目标优化协调机器人在未知环境中避障和避开局部极小值的问题，提升了导航精度和效率。

详情

DOI: 10.14257/ijca.2017.10.2.29
Journal ref: International Journal of Control and Automation, Vol. 10, No. 2 (2017), pp.349-364

AI中文摘要

本文提出一种名为BBFM的行为导航架构，用于解决在存在障碍物和局部极小值区域的未知环境中移动机器人的导航问题。在该架构中，复杂导航任务被分解为主要子任务或行为。每个行为由模糊控制器实现并独立执行以处理特定导航问题。模糊控制器被修改为仅包含模糊化和推理过程，使其输出表示行为的目标的隶属函数。所有控制器的隶属函数随后用作多目标优化过程的目标函数以协调所有行为。该过程的结果是整体控制信号，即帕累托最优的控制信号，用于控制机器人。进行了大量模拟、比较和实验。结果表明，所提出的架构在精度、平滑度、行驶距离和时间响应方面优于一些流行的基于行为的架构。

英文摘要

This study proposes behavior-based navigation architecture, named BBFM, to deal with the problem of navigating the mobile robot in unknown environments in the presence of obstacles and local minimum regions. In the architecture, the complex navigation task is split into principal sub-tasks or behaviors. Each behavior is implemented by a fuzzy controller and executed independently to deal with a specific problem of navigation. The fuzzy controller is modified to contain only the fuzzification and inference procedures so that its output is a membership function representing the behavior's objective. The membership functions of all controllers are then used as the objective functions for a multi-objective optimization process to coordinate all behaviors. The result of this process is an overall control signal, which is Pareto-optimal, used to control the robot. A number of simulations, comparisons, and experiments were conducted. The results show that the proposed architecture outperforms some popular behavior-based architectures in term of accuracy, smoothness, traveled distance, and time response.

URL PDF HTML ☆

赞 0 踩 0

1703.02899 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Model-Based Policy Search for Automatic Tuning of Multivariate PID Controllers

基于模型的策略搜索用于多变量PID控制器的自动调优

Andreas Doerr, Duy Nguyen-Tuong, Alonso Marco, Stefan Schaal, Sebastian Trimpe

AI总结本文提出基于模型的策略搜索框架，用于自动调优多变量PID控制器，通过数据驱动的方法解决复杂系统的控制器调优问题。

Comments Accepted final version to appear in 2017 IEEE International Conference on Robotics and Automation (ICRA)

1502.02474 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Planning for robotic exploration based on forward simulation

基于正向模拟的机器人探索规划

Mikko Lauri, Risto Ritala

AI总结本文提出基于POMDP的探索规划方法，结合信息论目标函数和正向模拟算法，通过改进的互信息近似方法提升机器人在部分已知环境中的探索效率。

Comments 19 pages, 11 figures in Robotics and Autonomous Systems (2016)

详情

DOI: 10.1016/j.robot.2016.06.008
Journal ref: Robotics and Autonomous Systems 83 (2016) 15-31

AI中文摘要

我们解决了控制移动机器人探索部分已知环境的问题。机器人的目标是最大化收集的环境信息量。我们将问题建模为部分可观测马尔可夫决策过程（POMDP）并采用正向模拟算法进行求解。我们提出了一种新的基于样本的互信息近似方法，适用于移动机器人。该近似方法可无缝集成到正向模拟规划算法中。我们研究了基于POMDP的规划在探索中的有效性，并通过与前沿探索结合来缓解其不足。在模拟和真实环境中实验结果表明，根据环境不同，基于POMDP的探索规划可比前沿探索表现更优。

英文摘要

We address the problem of controlling a mobile robot to explore a partially known environment. The robot's objective is the maximization of the amount of information collected about the environment. We formulate the problem as a partially observable Markov decision process (POMDP) with an information-theoretic objective function, and solve it applying forward simulation algorithms with an open-loop approximation. We present a new sample-based approximation for mutual information useful in mobile robotics. The approximation can be seamlessly integrated with forward simulation planning algorithms. We investigate the usefulness of POMDP based planning for exploration, and to alleviate some of its weaknesses propose a combination with frontier based exploration. Experimental results in simulated and real environments show that, depending on the environment, applying POMDP based planning for exploration can improve performance over frontier exploration.

URL PDF HTML ☆

赞 0 踩 0

1702.07335 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Approximately Optimal Continuous-Time Motion Planning and Control via Probabilistic Inference

近似最优连续时间运动规划与控制 via 概率推理

Mustafa Mukadam, Ching-An Cheng, Xinyan Yan, Byron Boots

AI总结本文提出PIPC算法，通过概率推理和高斯过程轨迹表示，实现对非线性性能指标的近似最优控制，在仿真中展示了其在递推时间窗口内的多系统应用能力。

Comments minor fixes and typos

1702.05314 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Control of an Unmanned Surface Vehicle with Uncertain Displacement and Drag

无人水面舰艇的不确定位移和阻力控制

Wilhelm B. Klinger, Ivan R. Bertaska, Karl D. von Ellenrieder, Manhar R. Dhanak

AI总结本文研究了在位移和阻力不确定时无人水面舰艇的控制问题，采用低层控制器进行实验测试，通过系统辨识确定水动力系数，并验证了改进的反步控制器在处理变量质量和阻力时的有效性。

详情

DOI: 10.1109/JOE.2016.2571158

AI中文摘要

本文对无人水面舰艇（USV）进行了实验测试，以评估在位移和阻力随时间变化且不确定时两种低层控制器的性能。USV是一种4.3米长、150千克的波适应模块化舰艇（WAM-V），具有可膨胀双体配置和水jet推进。进行了开环机动测试以表征车辆的动力学。通过系统辨识 maneuvering 数据确定了车辆的水动力系数，并用于控制系统的模拟。所得到的控制器在水面上进行了实地测试。变量质量和阻力测试表明，车辆的最佳控制由模型参考自适应反步速度和航向控制器实现。Liao等人（2010）开发的反步控制器被修改以考虑必要控制动作的过度预测和电机饱和。研究表明，当在改进的反步控制器的冲量控制子系统中实现自适应算法时，变量质量和阻力的影响被减轻。

英文摘要

Experimental testing of an unmanned surface vehicle (USV) has been performed to evaluate the performance of two low-level controllers when displacement and drag properties are time-varying and uncertain. The USV is a 4.3 meter long, 150 kilogram wave adaptive modular vessel (WAM-V) with an inflatable twin hull configuration and waterjet propulsion. Open loop maneuvering tests were conducted to characterize the dynamics of the vehicle. The hydrodynamic coefficients of the vehicle were determined through system identification of the maneuvering data and were used for simulations during control system development. The resulting controllers were experimentally field tested on-water. Variable mass and drag tests show that the vehicle is best controlled by a model reference adaptive backstepping speed and heading controller. The backstepping controller developed by Liao et. al (2010) is modified to account for an overprediction of necessary control action and motor saturation. It is shown that when an adaptive algorithm is implemented for the surge control subsystem of the modified backstepping controller, the effects of variable mass and drag are mitigated.

URL PDF HTML ☆

赞 0 踩 0

1702.04941 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Station-keeping control of an unmanned surface vehicle exposed to current and wind disturbances

无人水面舰艇在洋流和风扰动下的驻留控制

Edoardo I. Sarda, Huajin Qu, Ivan R. Bertaska, Karl D. von Ellenrieder

AI总结本文评估了无人水面舰艇在风和洋流扰动下的驻留控制性能，比较了非线性比例导数、反步法和滑模反馈控制器的效果，发现滑模控制器表现最佳，风前馈控制对部分控制器有显著提升作用。

详情

DOI: 10.1016/j.oceaneng.2016.09.037
Journal ref: Journal of Ocean Engineering, Volume 127, Pages 305-324, 2016

AI中文摘要

为评估无人水面舰艇在户外海洋环境中的驻留控制性能，进行了4米长、180千克的无人水面舰艇（USV）实地试验。该USV采用双体结构和定制推进系统，包含两个可旋转推力器。非线性比例导数、反步法和滑模反馈控制器在4-5节风速下进行了测试，包括有无风前馈控制的情况。测试时USV的纵轴与平均风向一致或垂直。结果显示滑模控制器整体表现最佳，风前馈控制对滑模控制器效果提升不显著，但对比例导数和反步法控制器在纵轴垂直于风向时有显著提升。对风速波动功率谱中存在长度尺度的分析表明，单个风速仪足以表征作用于USV的风速和方向。

英文摘要

Field trials of a 4 meter long, 180 kilogram, unmanned surface vehicle (USV) have been conducted to evaluate the performance of station-keeping heading and position controllers in an outdoor marine environment disturbed by wind and current. The USV has a twin hull configuration and a custom-designed propulsion system, which consists of two azimuthing thrusters, one for each hull. Nonlinear proportional derivative, backstepping and sliding mode feedback controllers were tested in winds of about 4-5 knots, with and without wind feedforward control. The controllers were tested when the longitudinal axis of the USV was aligned with the mean wind direction and when the longitudinal axis was perpendicular to the mean wind direction. It was found that the sliding mode controller performed best overall and that the addition of wind feedforward control did not significantly improve its effectiveness. However, wind feedforward control did substantially improve the performance of the proportional derivative and backstepping controllers when the mean wind direction was perpendicular to the longitudinal axis of the USV. An analysis of the length scales present in the power spectrum of the turbulent speed fluctuations in the wind suggests that a single anemometer is sufficient to characterize the speed and direction of the wind acting on the USV.

URL PDF HTML ☆

赞 0 踩 0

1702.04940 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Supervisory Switching Control of an Unmanned Surface Vehicle

无人水面舰艇的监督切换控制

Ivan R. Bertaska, Karl D. von Ellenrieder

AI总结本文提出了一种新的切换控制器方法以提升系统性能，通过三个控制器捕捉无人水面舰艇的行为，包括欠驱动非线性控制器、全驱动非线性控制器和线性反风速控制器，并通过实验验证监督切换控制系统的有效性。

1610.02797 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Guidance algorithm for smooth trajectory tracking of a fixed wing UAV flying in wind flows

固定翼无人机在风流中平滑轨迹跟踪的引导算法

Hector Garcia de Marina, Yuri A. Kapitanyuk, Murat Bronz, Gautier Hattenberger, Ming Cao

AI总结本文提出一种算法，用于解决固定翼无人机在恒定空速和恒定风扰下跟踪平滑曲线的问题，通过构造隐函数描述的引导向量场实现轨迹跟踪，并能通过离线调整避免无人机物理约束。

Comments 6 pages

1609.05264 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Asynchronous and Dynamic Coverage Control Scheme for Persistent Surveillance Missions

异步和动态覆盖控制方案用于持续监视任务

Jeffrey R. Peters, Sean J. Wang, Amit Surana, Francesco Bullo

AI总结本文提出了一种基于分解的多智能体持续监视任务覆盖控制方案，通过模块化框架解耦高层任务分配与低层运动规划，利用中央基站管理覆盖分配和监视参数，并通过非计划和异步交换传输至移动代理，实现负载平衡和有效路径规划。

详情

AI中文摘要

本文提出了一种基于分解的覆盖控制方案，用于在通信受限和动态环境中运行的多智能体持续监视任务。所提出的方法在模块化框架中解耦高层任务分配与低层运动规划。覆盖分配和监视参数由中央基站管理，并通过非计划和异步交换传输至移动代理。覆盖更新促进负载平衡，同时保持几何和时间特征，允许与通用路径规划器有效配对。具体而言，所提出的方案保证了（i）覆盖区域连通且集体覆盖环境，（ii）子区域可能仅在有界时间内未被覆盖，（iii）碰撞（或传感重叠）被内在避免，（iv）在静态事件可能性下，集体覆盖区域收敛到帕累托最优配置。该管理方案随后与满足宽松假设的通用路径规划器配对。通过模拟监视任务进行了说明。

英文摘要

A decomposition-based coverage control scheme is proposed for multi-agent, persistent surveillance missions operating in a communication-constrained, dynamic environment. The proposed approach decouples high-level task assignment from low-level motion planning in a modular framework. Coverage assignments and surveillance parameters are managed by a central base station, and transmitted to mobile agents via unplanned and asynchronous exchanges. Coverage updates promote load balancing, while maintaining geometric and temporal characteristics that allow effective pairing with generic path planners. Namely, the proposed scheme guarantees that (i) coverage regions are connected and collectively cover the environment, (ii) subregions may only go uncovered for bounded periods of time, (iii) collisions (or sensing overlaps) are inherently avoided, and (iv) under static event likelihoods, the collective coverage regions converge to a Pareto-optimal configuration. This management scheme is then paired with a generic path planner satisfying loose assumptions. The scheme is illustrated through simulated surveillance missions.

URL PDF HTML ☆

赞 0 踩 0

1612.07850 2026-06-04 cs.RO cs.CV cs.SY eess.SY 版本更新

Automatic Interpretation of Unordered Point Cloud Data for UAV Navigation in Construction

无人机在建筑施工中无序点云数据的自动解释

M. D. Phung, C. H. Quach, D. T. Chu, N. Q. Nguyen, T. H. Dinh, Q. P. Ha

AI总结本文提出了一种数据处理系统，用于自动为无人机生成航路点，以检查建筑和桥梁等结构表面。系统通过两个正交安装的2D激光扫描仪和惯性测量单元的数据，利用数据注册、表面检测和航路生成算法，实现结构点云重建和航路规划。

Comments In The 14th International Conference on Control, Automation, Robotics and Vision, ICARCV 2016

详情

DOI: 10.1109/ICARCV.2016.7838683

AI中文摘要

本工作旨在开发一种数据处理系统，能够自动为无人驾驶航空器（UAV）生成航路点，以检查建筑物和桥梁等结构的表面。输入包括由两个正交安装在UAV上的2D激光扫描仪和惯性测量单元（IMU）记录的数据。为实现目标，开发了处理所收集数据的算法，分为三类：（i）数据注册和滤波以生成结构的3D模型并控制点云密度以提高数据完整性；（ii）表面和障碍物检测以协助UAV的监控任务；（iii）航路点生成以设置飞行路径。不同数据集的实验表明，所开发的系统能够重建结构的3D点云，提取其表面和物体，并为UAV生成航路点以完成检查任务。

英文摘要

The objective of this work is to develop a data processing system that can automatically generate waypoints for navigation of an unmanned aerial vehicle (UAV) to inspect surfaces of structures like buildings and bridges. The input includes data recorded by two 2D laser scanners, orthogonally mounted on the UAV, and an inertial measurement unit (IMU). To achieve the goal, algorithms are developed to process the data collected. They are separated into three major groups: (i) the data registration and filtering to generate a 3D model of the structure and control the density of point clouds for data completeness enhancement; (ii) the surface and obstacle detection to assist the UAV in monitoring tasks; and (iii) the waypoint generation to set the flight path. Experiments on different data sets show that the developed system is able to reconstruct a 3D point cloud of the structure, extract its surfaces and objects, and generate waypoints for the UAV to accomplish inspection tasks.

URL PDF HTML ☆

赞 0 踩 0

1503.04894 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

Biomimetic Algorithms for Coordinated Motion: Theory and Implementation

仿生算法协调运动：理论与实现

Udit Halder, Biswadip Dey

AI总结本文基于生物飞行行为，提出两种覆盖与聚群策略，通过轮式机器人与Vicon系统验证了运动伪装理论，并展示了基于局部信息的拓扑速度对齐方法，证明了生物启发在多智能体机器人系统中的应用价值。

1503.03388 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Station Keeping through Beacon-referenced Cyclic Pursuit

通过信标参考的循环追捕实现轨道维持

Kevin S. Galloway, Biswadip Dey

AI总结本文研究了多智能体系统中循环恒定方位追捕的改进方法，通过信标和邻居的交互实现集体围绕信标旋转，分析了旋转半径和角度分离的参数影响，并验证了双智能体系统的稳定性。

1702.00325 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Hybrid Fuel Cells Power for Long Duration Robot Missions in Field Environments

混合燃料电池为野外环境中的长时间机器人任务提供动力

Jekan Thangavelautham, Danielle Gallardo, Daniel Strawser, Steven Dubowsky

AI总结本文提出混合燃料电池系统用于提升机器人在野外长时间任务中的续航能力，通过燃料电池与电池的结合解决传统电源的局限性。

Comments 8 pages, 5 figures in Field Robotics - 14th International Conference on Climbing and Walking Robots and the Support Technologies for Mobile Machines

详情

AI中文摘要

移动机器人常需执行长时间任务，包括搜索救援、哨兵、维修、监视和娱乐。当前电源技术限制了机器人在许多任务中的行走和攀爬能力。内燃机噪音大且排放有毒废气，而可充电电池能量密度低且自放电率高。理论上，燃料电池没有这些限制。特别是质子交换膜（PEM）可提供极高的能量密度，清洁且安静。然而，PEM燃料电池因性能退化而不可靠。这可通过在燃料电池电池混合配置中保护燃料电池来缓解，使用过滤电子设备确保燃料电池远离电气噪声，并通过电池隔离它免受电压尖峰影响。针对HOAP 2仿人机器人展示了模拟结果，表明燃料电池混合电源优于传统电池。

英文摘要

Mobile robots are often needed for long duration missions. These include search and rescue, sentry, repair, surveillance and entertainment. Current power supply technology limit walking and climbing robots from many such missions. Internal combustion engines have high noise and emit toxic exhaust while rechargeable batteries have low energy densities and high rates of self-discharge. In theory, fuel cells do not have such limitations. In particular Proton Exchange Membrane (PEMs) can provide very high energy densities, are clean and quiet. However, PEM fuel cells are found to be unreliable due to performance degradation. This can be mitigated by protecting the fuel cell in a fuel-cell battery hybrid configuration using filtering electronics that ensure the fuel cell is isolated from electrical noise and a battery to isolate it from power surges. Simulation results are presented for a HOAP 2 humanoid robot that suggests a fuel cell powered hybrid power supply superior to conventional batteries.

URL PDF HTML ☆

赞 0 踩 0

1701.08915 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Accelerated Evaluation of Automated Vehicles Using Piecewise Mixture Models

利用分段混合模型加速自动驾驶车辆评估

Zhiyuan Huang, Ding Zhao, Henry Lam, David J. LeBlanc

AI总结本文提出利用分段混合分布模型改进自动驾驶车辆评估方法，提升效率和准确性，实现评估过程加速四到五个数量级。

Comments 11 pages, 13 figures

详情

AI中文摘要

高度自动驾驶车辆的认证过程尚未由任何国家定义。目前，公司通过公共道路测试自动驾驶车辆，这耗时且低效。我们提出了加速评估概念，利用周围车辆的修改统计和重要抽样理论，将评估时间减少几个数量级，同时确保统计准确性。本文进一步改进该概念，使用分段混合分布模型代替单参数分布模型。我们开发并应用于向前碰撞控制系统，以应对进行变道的车辆。变道车辆的行为基于密歇根大学安全试点模型部署计划收集的超过403,581次变道数据建模。模拟结果证实，分段混合分布方法在准确性和效率上优于单参数分布方法，并将评估过程加速了几乎四个数量级。

英文摘要

The process to certify highly Automated Vehicles has not yet been defined by any country in the world. Currently, companies test Automated Vehicles on public roads, which is time-consuming and inefficient. We proposed the Accelerated Evaluation concept, which uses a modified statistics of the surrounding vehicles and the Importance Sampling theory to reduce the evaluation time by several orders of magnitude, while ensuring the evaluation results are statistically accurate. In this paper, we further improve the accelerated evaluation concept by using Piecewise Mixture Distribution models, instead of Single Parametric Distribution models. We developed and applied this idea to forward collision control system reacting to vehicles making cut-in lane changes. The behavior of the cut-in vehicles was modeled based on more than 403,581 lane changes collected by the University of Michigan Safety Pilot Model Deployment Program. Simulation results confirm that the accuracy and efficiency of the Piecewise Mixture Distribution method outperformed single parametric distribution methods in accuracy and efficiency, and accelerated the evaluation process by almost four orders of magnitude.

URL PDF HTML ☆

赞 0 踩 0

1609.02174 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Distributed sampled-data control of nonholonomic multi-robot systems with proximity networks

非holonomic多机器人系统的分布式采样数据控制与临近网络

Zhixin Liu, Lin Wang, Jinhuan Wang, Daoyi Dong, Xiaoming Hu

AI总结本文研究了通过距离诱导临近网络连接的多机器人系统的分布式采样数据控制问题，设计了基于最近邻规则的分布式采样数据控制器，通过混合闭环系统实现同步，并定量确定领导者数量以跟踪常量或时变信号。

Comments 15 pages, 3 figures

详情

DOI: 10.1016/j.automatica.2016.11.027
Journal ref: Automatica 77 (2017) 170-179

AI中文摘要

本文考虑了通过距离诱导临近网络连接的一组移动机器人系统的分布式采样数据控制问题。假设存在驻留时间以避免因位置突变导致的邻居关系振荡。基于最近邻规则设计了分布式采样数据控制律，结合连续时间动力学产生混合闭环系统。对于均匀独立初始状态，提供了保证无领导者的系统同步的充分条件。为了使所有机器人以期望的方位和速度移动，引入了领导者，并定量确定所需的领导者比例以跟踪常量或时变信号。所有条件仅依赖于邻居半径、最大初始移动速度和驻留时间，不假设邻居图的先验性质，这与现有文献不同。

英文摘要

This paper considers the distributed sampled-data control problem of a group of mobile robots connected via distance-induced proximity networks. A dwell time is assumed in order to avoid chattering in the neighbor relations that may be caused by abrupt changes of positions when updating information from neighbors. Distributed sampled-data control laws are designed based on nearest neighbour rules, which in conjunction with continuous-time dynamics results in hybrid closed-loop systems. For uniformly and independently initial states, a sufficient condition is provided to guarantee synchronization for the system without leaders. In order to steer all robots to move with the desired orientation and speed, we then introduce a number of leaders into the system, and quantitatively establish the proportion of leaders needed to track either constant or time-varying signals. All these conditions depend only on the neighborhood radius, the maximum initial moving speed and the dwell time, without assuming a prior properties of the neighbor graphs as are used in most of the existing literature.

URL PDF HTML ☆

赞 0 踩 0

1610.02849 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Automatic Gain Tuning of a Momentum Based Balancing Controller for Humanoid Robots

基于动量的平衡控制器自动增益调节技术

Daniele Pucci, Gabriele Nava, Francesco Nori

AI总结本文提出了一种用于双足机器人基于动量的平衡控制器自动增益调节技术，通过线性化闭环约束关节空间动力学并设计增益矩阵以实现系统稳定性和正定性。

Comments Accepted at IEEE-RAS International Conference on Humanoid Robots (HUMANOIDS). 2016

1511.01166 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A bi-criteria path planning algorithm for robotics applications

用于机器人应用的双标准路径规划算法

Zachary Clawson, Xuchu Ding, Brendan Englot, Thomas A. Frewen, William M. Sisson, Alexander Vladimirsky

AI总结本文提出一种高效的双标准路径规划算法，通过扩展状态空间来跟踪剩余预算，以满足二次成本约束。该算法在概率道路图上测试，用于最小化旅行距离同时控制总体威胁暴露。

Comments 19 pages, 12 figures; submitted for publication to IEEE Transactions on Automation Science and Engineering

1611.07114 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Multi-sensor perceptual system for mobile robot and sensor fusion-based localization

多传感器感知系统用于移动机器人及基于传感器融合的定位

T. T. Hoang, P. M. Duong, N. T. T. Van, D. A. Viet, T. Q. Vinh

AI总结本文提出一种扩展卡尔曼滤波方法，用于利用双四象限编码器、指南针、激光雷达和全方位摄像头对移动机器人进行定位，通过融合多种传感器数据实现高效定位。

Comments In 2012 International Conference on Control, Automation and Information Sciences (ICCAIS). arXiv admin note: substantial text overlap with arXiv:1611.07112

1611.07112 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Development of an EKF-based localization algorithm using compass sensor and LRF

基于指南针传感器和LRF的EKF定位算法开发

T. T. Hoang, P. M. Duong, N. T. T. Van, D. A. Viet, T. Q. Vinh

AI总结本文提出一种移动机器人感知系统，结合指南针和激光雷达数据，通过扩展卡尔曼滤波实现高精度定位，提升导航性能。

Comments In 12th International Conference on Control Automation Robotics & Vision (ICARCV), 2012. arXiv admin note: substantial text overlap with arXiv:1611.07114

1612.06008 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Optimal Control-Based UAV Path Planning with Dynamically-Constrained TSP with Neighborhoods

基于最优控制的无人机路径规划与动态约束TSP带邻居问题

Dae-Sung Jang, Hyeok-Joo Chae, Han-Lim Choi

AI总结本文提出一种基于采样 roadmap 算法的无人机路径规划方法，通过最优控制生成路径以减少计算时间并提升解的质量，解决动态约束 TSP 带邻居问题。

Comments 17 pages, 7 figures

1612.05594 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Importance sampling-based approximate optimal planning and control

基于重要性采样的近似最优规划与控制

Jie Fu

AI总结本文提出了一种非线性系统在非可微约束下的采样规划与最优控制方法，通过重要性采样算法迭代计算最优权重参数，确保策略的最优性和鲁棒性。

Comments submitted to IEEE ACC 2017

详情

AI中文摘要

本文提出了一种基于采样的非线性系统规划与最优控制方法。受可扩展规划算法的启发，将最优运动计划视为反馈控制器，并通过加权基函数的线性组合进行近似。本文的主要贡献是引入重要性采样，特别是模型参考自适应搜索算法，以迭代计算最优权重参数。关键思想是通过迭代估计参数化分布，使其收敛于狄拉克δ函数，无限峰值于全局最优权重。然后，利用此直接策略搜索，结合轨迹验证，确保对于一类非线性系统，所获得的策略不仅最优，而且对有界扰动具有鲁棒性。通过数值实验，包括具有非线性成本函数的线性系统和双极车运动规划，验证了方法的正确性和效率。

英文摘要

In this paper, we propose a sampling-based planning and optimal control method of nonlinear systems under non-differentiable constraints. Motivated by developing scalable planning algorithms, we consider the optimal motion plan to be a feedback controller that can be approximated by a weighted sum of given bases. Given this approximate optimal control formulation, our main contribution is to introduce importance sampling, specifically, model-reference adaptive search algorithm, to iteratively compute the optimal weight parameters, i.e., the weights corresponding to the optimal policy function approximation given chosen bases. The key idea is to perform the search by iteratively estimating a parametrized distribution which converges to a Dirac's Delta that infinitely peaks on the global optimal weights. Then, using this direct policy search, we incorporated trajectory-based verification to ensure that, for a class of nonlinear systems, the obtained policy is not only optimal but robust to bounded disturbances. The correctness and efficiency of the methods are demonstrated through numerical experiments including linear systems with a nonlinear cost function and motion planning for a Dubins car.

URL PDF HTML ☆

赞 0 踩 0

1612.04915 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Collaborative Object Transportation Using MAVs via Passive Force Control

利用MAVs的被动力控进行协同物体运输

Andrea Tagliabue, Mina Kamel, Sebastian Verling, Roland Siegwart, Juan Nieto

AI总结本文提出一种基于被动力控的MAVs协同运输策略，通过双六旋翼运输大型物体，无需通信链路、负载形状或抓取点位置信息。

Comments under review for the IEEE International Conference on Robotics and Automation (ICRA) 2017

1607.07797 2026-06-04 cs.RO cs.FL cs.SY eess.SY 版本更新

Combined Top-Down and Bottom-Up Approaches to Performance-guaranteed Integrated Task and Motion Planning of Cooperative Multi-agent Systems

结合自上而下与自下而上方法的性能保证协同多智能体系统集成任务与运动规划

Rafael Rodrigues da Silva, Bo Wu, Jin Dai, Hai Lin

AI总结本文提出一种分层框架，通过结合自下而上的反应式运动控制器与自上而下的任务计划，实现协同多智能体系统的性能保证任务与运动规划，通过假设-保证范式验证局部任务与全局任务的一致性。

Comments Submitted to Automatica

详情

AI中文摘要

我们提出了一种分层设计框架，用于自动合成协同多智能体系统的协调方案和控制策略，以满足正式的性能要求。通过将自下而上的反应式运动控制器与自上而下的任务计划相结合。一方面，从一个全局任务开始，该任务以所有智能体任务能力的正则语言指定，任务规划层位于所提出框架的顶部，将全局任务分解为与每个智能体个体能力一致的局部任务，并通过假设-保证范式论证局部任务的完成是否意味着全局任务的满足。另一方面，每个智能体关联的自下而上运动计划通过组合基本运动原语生成，这些原语通过差分动态逻辑（d$\mathcal{L}$）验证安全，通过可满足性模理论（SMT）求解器在面对局部任务要求和环境描述所施加的约束时搜索可行解。研究表明，所提出的框架能够处理动态环境，因为运动原语具有反应特性，使运动计划能够适应局部环境变化。此外，当SMT求解器无法找到可行解时，任务重新配置可以由运动规划层触发。整体设计框架的有效性通过自动化仓库案例研究得到验证。

英文摘要

We propose a hierarchical design framework to automatically synthesize coordination schemes and control policies for cooperative multi-agent systems to fulfill formal performance requirements, by associating a bottom-up reactive motion controller with a top-down mission plan. On one hand, starting from a global mission that is specified as a regular language over all the agents' mission capabilities, a mission planning layer sits on the top of the proposed framework, decomposing the global mission into local tasks that are in consistency with each agent's individual capabilities, and compositionally justifying whether the achievement of local tasks implies the satisfaction of the global mission via an assume-guarantee paradigm. On the other hand, bottom-up motion plans associated with each agent are synthesized corresponding to the obtained local missions by composing basic motion primitives, which are verified safe by differential dynamic logic (d$\mathcal{L}$), through a Satisfiability Modulo Theories (SMT) solver that searches feasible solutions in face of constraints imposed by local task requirements and the environment description. It is shown that the proposed framework can handle dynamical environments as the motion primitives possess reactive features, making the motion plans adaptive to local environmental changes. Furthermore, on-line mission reconfiguration can be triggered by the motion planning layer once no feasible solutions can be found through the SMT solver. The effectiveness of the overall design framework is validated by an automated warehouse case study.

URL PDF HTML ☆

赞 0 踩 0

1609.05258 2026-06-04 cs.RO cs.AI cs.CV cs.SY eess.SY 版本更新

The ACRV Picking Benchmark (APB): A Robotic Shelf Picking Benchmark to Foster Reproducible Research

ACRV 摘取基准 (APB)：一个促进可重复研究的机器人货架摘取基准

Jürgen Leitner, Adam W. Tow, Jake E. Dean, Niko Suenderhauf, Joseph W. Durham, Matthew Cooper, Markus Eich, Christopher Lehnert, Ruben Mangels, Christopher McCool, Peter Kujala, Lachlan Nicholson, Trung Pham, James Sergeant, Liao Wu, Fangyi Zhang, Ben Upcroft, Peter Corke

AI总结本文提出ACRV摘取基准(APB)，通过42个常见物品、广泛可用的货架和精确的物品排列指南，提供可重复的机器人摘取基准，支持完整机器人系统的比较。

Comments 8 pages, submitted to RA:Letters

1510.06496 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Synthesizing least-limiting guidelines for safety of semi-autonomous systems

为半自主系统安全合成最少限制的指南

Jana Tumova, Dimos V. Dimarogonas

AI总结本文提出系统方法生成最少限制指南，以在最坏情况下最小限制人类操作员。

Comments Extended version of CDC 2016 paper

1612.04324 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Stabilization and Trajectory Control of a Quadrotor with Uncertain Suspended Load

四旋翼载具带不确定悬挂负载的稳定与轨迹控制

Xu Zhou, Xiaoli Zhang, Jiucai Zhang, Rui Liu

AI总结本文研究四旋翼搭载不确定悬挂负载时的稳定与轨迹控制问题，通过比较三种控制器性能，发现负载质量不确定性主要影响稳定而非轨迹跟踪，提出关键运动质量概念并验证鲁棒控制器的有效性。

Comments 56 pages, 12 figures, article submitted to ASME Journal of Dynamic Systems Measurement and Control, 2016 April

详情

AI中文摘要

近年来，四旋翼搭载固定已知质量悬挂负载的稳定与轨迹控制已被广泛研究。然而，负载质量不总是事先已知，或在实际运输中可能变化。这种质量不确定性会给四旋翼系统带来不确定扰动，导致现有控制器的稳定性和轨迹跟踪性能变差。为提高这种情况下四旋翼的稳定性与轨迹跟踪能力，本文全面研究了不确定负载质量对四旋翼的影响。通过比较三种不同控制器——比例导数（PD）控制器、滑模控制器（SMC）和模型预测控制器（MPC）的性能，证明稳定而非轨迹跟踪误差是负载质量不确定性的主要影响因素。存在一个关键运动质量，使四旋翼能够维持期望的运输性能。此外，仿真结果验证了具有强抗扰能力的控制器在实际应用中的有效性。

英文摘要

Stabilization and trajectory control of a quadrotor carrying a suspended load with a fixed known mass has been extensively studied in recent years. However, the load mass is not always known beforehand or may vary during the practical transportations. This mass uncertainty brings uncertain disturbances to the quadrotor system, causing existing controllers to have worse stability and trajectory tracking performance. To improve the quadrotor stability and trajectory tracking capability in this situation, we fully investigate the impacts of the uncertain load mass on the quadrotor. By comparing the performances of three different controllers -- the proportional-derivative (PD) controller, the sliding mode controller (SMC), and the model predictive controller (MPC) -- stabilization rather than trajectory tracking error is proved to be the main influence in the load mass uncertainty. A critical motion mass exists for the quadrotor to maintain a desired transportation performance. Moreover, simulation results verify that a controller with strong robustness against disturbances is a good choice for practical applications.

URL PDF HTML ☆

赞 0 踩 0

1612.04023 2026-06-04 eess.SY cs.AI cs.RO cs.SY 版本更新

Proceedings of the The First Workshop on Verification and Validation of Cyber-Physical Systems

第一届验证与验证网络物理系统研讨会会议记录

Mehdi Kargahi, Ashutosh Trivedi

发表机构 * Reykjavík, Iceland（冰岛雷克雅未克）； MITL Specification Debugging for Monitoring of Cyber-Physical Systems（网络物理系统监控的MITL规格调试）； Automatic Synthesis of Controllers from Specifications using Control Certificates（使用控制证书从规范自动合成控制器）； A Compositional Framework for Preference-Aware Agents（偏好感知代理的组合框架）； Output Feedback Controller Design with Symbolic Observers for Cyber-physical Systems（网络物理系统符号观测器输出反馈控制器设计）； Towards an Approximate Conformance Relation for Hybrid I/O Automata（混合I/O自动机近似一致性关系）； On Nonlinear Prices in Timed Automata（时序自动机中的非线性价格）； Towards the Verification of Safety-critical Autonomous Systems in Dynamic Environments（动态环境中安全关键自主系统的验证）

AI总结本文介绍了首届网络物理系统验证与验证研讨会，探讨了验证与验证方法，包括控制、模拟和形式化方法等，旨在解决复杂软件和算法的验证问题。

详情

DOI: 10.4204/EPTCS.232
Journal ref: EPTCS 232, 2016

AI中文摘要

第一届国际网络物理系统验证与验证研讨会（V2CPS-16）于冰岛雷克雅未克举行的第十二届国际形式化方法整合会议（iFM 2016）期间召开。该研讨会旨在汇集形式化验证和网络物理系统（CPS）领域的研究人员和专家，讨论涵盖广泛验证与验证方法的主题，包括但不限于控制、模拟、形式化方法等。网络物理系统（CPS）是网络化计算和物理过程的整合，具有有意义的相互作用；前者监控、控制并影响后者，而后者也影响前者。CPS在机器人、交通、通信、基础设施、能源和制造系统中有广泛应用。许多安全关键系统，如化学过程、医疗设备、飞机飞行控制系统和汽车系统，确实属于CPS。CPS的先进能力需要复杂的软件和合成算法，这些算法难以验证。事实上，该领域中的许多问题都是不可判定的。因此，一个重要的步骤是找到特定的抽象，这些抽象可能在特定属性上算法上可验证，描述CPS的部分或整体行为。

英文摘要

The first International Workshop on Verification and Validation of Cyber-Physical Systems (V2CPS-16) was held in conjunction with the 12th International Conference on integration of Formal Methods (iFM 2016) in Reykjavik, Iceland. The purpose of V2CPS-16 was to bring together researchers and experts of the fields of formal verification and cyber-physical systems (CPS) to cover the theme of this workshop, namely a wide spectrum of verification and validation methods including (but not limited to) control, simulation, formal methods, etc. A CPS is an integration of networked computational and physical processes with meaningful inter-effects; the former monitors, controls, and affects the latter, while the latter also impacts the former. CPSs have applications in a wide-range of systems spanning robotics, transportation, communication, infrastructure, energy, and manufacturing. Many safety-critical systems such as chemical processes, medical devices, aircraft flight control, and automotive systems, are indeed CPS. The advanced capabilities of CPS require complex software and synthesis algorithms, which are hard to verify. In fact, many problems in this area are undecidable. Thus, a major step is to find particular abstractions of such systems which might be algorithmically verifiable regarding specific properties of such systems, describing the partial/overall behaviors of CPSs.

URL PDF HTML ☆

赞 0 踩 0

1612.02739 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Controlling Robot Morphology from Incomplete Measurements

从不完整测量中控制机器人形态

Martin Pecka, Karel Zimmermann, Michal Reinštein, Tomáš Svoboda

AI总结针对复杂形态机器人在城市搜索与救援任务中的地形穿越需求，提出通过自主控制处理不完整数据并确保安全性的方法。

Comments Accepted into IEEE Transactions to Industrial Electronics, Special Section on Motion Control for Novel Emerging Robotic Devices and Systems

1612.01476 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Modeling and Control of an Autonomous Three Wheeled Mobile Robot with Front Steer

自主三轮移动机器人前轮转向的建模与控制

Ayush Pandey, Siddharth Jha, Debashish Chakravarty

AI总结本文提出了一种自主三轮移动机器人的前轮转向建模与控制策略，设计了速度控制PID控制器并实现了数字控制框架，同时展示了轨迹控制框架，通过实验验证了系统模型并讨论了控制器性能与鲁棒性。

Comments IEEE International Conference on Robotic Computing 2017. (under review)

1612.01034 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Localization of networked robot systems subject to random delay and packet loss

网络机器人系统在随机延迟和丢包下的定位

Manh Duong Phung, Thi Thanh Van Nguyen, Thuan Hoang Tran, Quang Vinh Tran

AI总结本文针对通信延迟和丢包影响下的移动机器人定位问题，提出统一的状态空间表示法和最优线性估计器，通过相关因子整合延迟测量，验证了方法在仿真和真实机器人系统中的有效性。

Comments In 2013 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM

1611.10007 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.CO math.DS 版本更新

Structural Controllability of Multi-Agent Networks: Robustness against Simultaneous Failures

多智能体网络的结构可控性：同时故障下的鲁棒性

M. Amin Rahimian, Amir G. Aghdam

AI总结本文研究了多领导者多智能体系统的结构可控性，探讨了在通信链路和智能体同时失效时的保持性，提出联合(r,s)-可控性和联合t-可控性作为多智能体系统可靠性的量化指标。

详情

DOI: 10.1016/j.automatica.2013.06.023
Journal ref: Automatica, Volume 49, Issue 11, 2013, Pages 3149-3157

AI中文摘要

本文从图论角度研究了具有多个领导者的领导者-跟随者多智能体系统的结构可控性。研究了在通信链路和智能体同时失效时结构可控性的保持问题。先前研究探讨了智能体和通信链路丢失对信息流图可控性的影响。本文利用相应结果，引入了一些有用的指数和重要性度量，帮助表征和量化单个链路和智能体在整体网络可控性中的作用。现有结果通过考虑链路和智能体同时丢失的影响进行了扩展。为此，引入了联合(r,s)-可控性和联合t-可控性作为多智能体系统可靠性的量化指标，并研究了其重要性质。最后，引入了联合临界有向图的概念，并指出如果一个有向图是联合临界，则联合t-可控性是任何一组链路和智能体失效（数量小于t）后保持可控的必要且充分条件。文中通过各种例子阐述了分析结果。

英文摘要

In this paper, structural controllability of a leader-follower multi-agent system with multiple leaders is studied from a graph-theoretic point of view. The problem of preservation of structural controllability under simultaneous failures in both the communication links and the agents is investigated. The effects of the loss of agents and communication links on the controllability of an information flow graph are previously studied. In this work, the corresponding results are exploited to introduce some useful indices and importance measures that help characterize and quantify the role of individual links and agents in the controllability of the overall network. Existing results are then extended by considering the effects of losses in both links and agents at the same time. To this end, the concepts of joint (r,s)-controllability and joint t-controllability are introduced as quantitative measures of reliability for a multi-agent system, and their important properties are investigated. Lastly, the class of jointly critical digraphs is introduced and it is stated that if a digraph is jointly critical, then joint t-controllability is a necessary and sufficient condition for remaining controllable following the failure of any set of links and agents, with cardinality less than t. Various examples are exploited throughout the paper to elaborate on the analytical findings.

URL PDF HTML ☆

赞 0 踩 0

1611.09987 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.DS 版本更新

Digraphs with Distinguishable Dynamics under the Multi-Agent Agreement Protocol

具有可区分动态的有向图在多智能体协议下的研究

M. Amin Rahimian, Amir Ajorlou, Amir G. Aghdam

AI总结研究在多智能体网络中，通过观察代理的输出响应区分有向图的能力，提出基于图论的矩阵树定理扩展，揭示了多智能体网络健康与故障状态间响应区分能力与信息流图中节点路径的关系。

详情

DOI: 10.1002/asjc.868
Journal ref: Asian Journal of Control, Volume 16, Issue 5, 2014, Pages 1300-1311

AI中文摘要

本文研究了在多智能体网络中，通过观察代理的输出响应区分有向图的能力。给定一个固定观察点，旨在找到足够的图形条件，使得网络信息流有向图中一组边的失效可区分于另一组边。当后者为空时，这对应于给定观察代理响应下前者边集的可检测性。在结果开发过程中，证明了代数图论中所有主子式矩阵树定理的一个强大扩展，该定理将有向图转换后的拉普拉斯矩阵的主子式与图中顶点间路径的数量和最短路径长度相关联。结果揭示了区分健康和故障多智能体网络响应能力与信息流有向图中节点路径之间复杂的关系。结果对受多重链路失效影响的多智能体系统的操作和设计有直接影响。通过仿真和示例来说明分析发现。

英文摘要

In this work, the ability to distinguish digraphs from the output response of some observing agents in a multi-agent network under the agreement protocol has been studied. Given a fixed observation point, it is desired to find sufficient graphical conditions under which the failure of a set of edges in the network information flow digraph is distinguishable from another set. When the latter is empty, this corresponds to the detectability of the former link set given the response of the observing agent. In developing the results, a powerful extension of the all-minors matrix tree theorem in algebraic graph theory is proved which relates the minors of the transformed Laplacian of a directed graph to the number and length of the shortest paths between its vertices. The results reveal an intricate relationship between the ability to distinguish the responses of a healthy and a faulty multi-agent network and the inter-nodal paths in their information flow digraphs. The results have direct implications for the operation and design of multi-agent systems subject to multiple link losses. Simulations and examples are presented to illustrate the analytic findings.

URL PDF HTML ☆

赞 0 踩 0

1611.09436 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Proposal of algorithms for navigation and obstacles avoidance of autonomous mobile robot

自主移动机器人导航与障碍物避障算法的提出

T. T. Hoang, D. T. Hiep, P. M. Duong, N. T. T. Van, B. G. Duong, T. Q. Vinh

AI总结本文提出算法用于室内自主移动机器人导航与避障，利用激光雷达获取3D环境图像，通过改进的向量场直方图算法生成2D地图并控制轨迹跟踪，实验结果良好。

Comments In 2013 8th IEEE Conference on Industrial Electronics and Applications (ICIEA)

1611.09433 2026-06-04 cs.RO cs.HC cs.SY eess.SY 版本更新

A novel platform for internet-based mobile robot systems

一种基于互联网的移动机器人系统新平台

P. M. Duong, T. T. Hoang, N. T. T. Van, D. A. Viet, T. Q. Vinh

AI总结本文提出一种基于互联网的移动机器人系统软硬件架构，通过3G网络连接多传感器智能机器人，采用客户端-服务器架构实现数据传输，并通过避障和安全点达成等自主机制确保安全，为远程控制算法等研究提供实验平台。

Comments In 2012 7th IEEE Conference on Industrial Electronics and Applications (ICIEA)

1611.09431 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Localization of a unicycle-like mobile robot using LRF and omni-directional camera

使用LRF和 omnidirectional相机对类双轮车移动机器人进行定位

Tran Hiep Dinh, Manh Duong Phung, Thuan Hoang Tran, Quang Vinh Tran

AI总结本文提出利用扩展卡尔曼滤波器对配备LRF和 omnidirectional相机的类双轮车移动机器人进行定位，通过改进的最小二乘二次方法提取环境线段并利用线匹配算法提高定位精度。

Comments In 2012 IEEE International Conference on Control System, Computing and Engineering (ICCSCE)

详情

DOI: 10.1109/ICCSCE.2012.6487193

AI中文摘要

本文针对定位问题，采用扩展卡尔曼滤波器（EKF）对配备激光雷达（LRF）传感器和 omnidirectional相机的类双轮车移动机器人进行定位。LRF用于扫描环境并用线段描述，线段通过改进的最小二乘二次方法提取，其中引入动态阈值。相机用于确定机器人方位。EKF的预测步骤通过提取机器人运动学模型和输入信号参数进行，修正步骤通过实现线匹配算法和比较局部与全局地图中线段参数进行。线匹配算法中引入转换矩阵以降低计算成本。在真实移动机器人系统中进行了实验，结果证明了该方法在定位中的适用性。

英文摘要

This paper addresses the localization problem. The extended Kalman filter (EKF) is employed to localize a unicycle-like mobile robot equipped with a laser range finder (LRF) sensor and an omni-directional camera. The LRF is used to scan the environment which is described with line segments. The segments are extracted by a modified least square quadratic method in which a dynamic threshold is injected. The camera is employed to determine the robot's orientation. The prediction step of the EKF is performed by extracting parameters from the kinematic model and input signal of the robot. The correction step is conducted with the implementation of a line matching algorithm and the comparison between line's parameters of the local and global maps. In the line matching algorithm, a conversion matrix is introduced to reduce the computation cost. Experiments have been carried out in a real mobile robot system and the results prove the applicability of the method for the purpose of localization.

URL PDF HTML ☆

赞 0 踩 0

1611.09424 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Development of a multi-sensor perceptual system for mobile robot and EKF-based localization

移动机器人多传感器感知系统的发展及基于EKF的定位方法

T. T. Hoang, P. M. Duong, N. T. T. Van, D. A. Viet, T. Q. Vinh

AI总结本文提出一种基于现代传感器和多点通信通道的移动机器人感知系统，采用传感器融合模型处理数据以提高机器人定位与控制的准确性，通过扩展卡尔曼滤波器优化系统状态。

Comments In 2012 International Conference on Systems and Informatics (ICSAI). arXiv admin note: substantial text overlap with arXiv:1611.07112, arXiv:1611.07114

1311.3979 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Precision improvement of MEMS gyros for indoor mobile robots with horizontal motion inspired by methods of TRIZ

为室内移动机器人改进MEMS陀螺仪精度的水平运动启发的TRIZ方法

Dongmyoung Shin, Sung Gil Park, Byung Soo Song, Eung Su Kim, Oleg Kupervasser, Denis Pivovartchuk, Ilya Gartseev, Oleg Antipov, Evgeniy Kruchenkov, Alexey Milovanov, Andrey Kochetov, Igor Sazonov, Igor Nogtev, Sun Woo Hyun

AI总结本文利用TRIZ方法解决室内移动机器人中水平运动下MEMS陀螺仪精度提升问题，通过创新方法提高传感器性能。

Comments 6 pages, the paper is accepted to 9th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, Hawaii, USA (IEEE-NEMS 2014) as an oral presentation

1308.0037 2026-06-04 eess.SY cs.MA cs.NI cs.RO cs.SY math.OC 版本更新

Route Swarm: Wireless Network Optimization through Mobility

路由蜂群：通过移动性实现无线网络优化

Ryan K. Williams, Andrea Gasparri, Bhaskar Krishnamachari

AI总结本文提出一种混合架构，用于协调网络化机器人在传感和信息路由中的应用。通过动态重构机器人网络，确保静态无线节点间的高质量路由，实现信息流与物理控制的解耦。

Comments 9 pages, 4 figures, submitted to the IEEE International Conference on Intelligent Robots and Systems (IROS) 2014

详情

DOI: 10.1109/IROS.2014.6943092

AI中文摘要

在本文中，我们展示了一种新颖的混合架构，用于协调网络化机器人在传感和信息路由应用中的协同工作。所提出的信息和传感驱动的物理可重构机器人网络（INSPIRE）由物理控制平面（PCP）和信息控制平面（ICP）组成。PCP负责控制代理位置，ICP负责调节信息流向通信/传感目标。我们描述了一个实例，其中移动机器人网络动态重构以确保静态无线节点之间的高质量路由，这些节点作为信息流的源/目的地对。ICP指挥机器人向均匀分布的流分配配置，而流内的配置则最大化路由质量。PCP则通过基于势能的控制引导机器人，以根据ICP命令进行重构。这种称为路由蜂群的公式解耦了信息流和物理控制，从而在路由和传感需求以及机器人配置之间产生反馈。我们通过在现实无线网络环境下进行模拟来验证我们的提案。

英文摘要

In this paper, we demonstrate a novel hybrid architecture for coordinating networked robots in sensing and information routing applications. The proposed INformation and Sensing driven PhysIcally REconfigurable robotic network (INSPIRE), consists of a Physical Control Plane (PCP) which commands agent position, and an Information Control Plane (ICP) which regulates information flow towards communication/sensing objectives. We describe an instantiation where a mobile robotic network is dynamically reconfigured to ensure high quality routes between static wireless nodes, which act as source/destination pairs for information flow. The ICP commands the robots towards evenly distributed inter-flow allocations, with intra-flow configurations that maximize route quality. The PCP then guides the robots via potential-based control to reconfigure according to ICP commands. This formulation, deemed Route Swarm, decouples information flow and physical control, generating a feedback between routing and sensing needs and robotic configuration. We demonstrate our propositions through simulation under a realistic wireless network regime.

URL PDF HTML ☆

赞 0 踩 0

1511.02547 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Decentralized Algorithms for 3D Symmetric Formations in Robotic Networks: a Contraction Theory Approach

分布式算法用于机器人网络中的3D对称编队：一种收缩理论方法

Sumeet Singh, Edward Schmerling, Marco Pavone

AI总结本文提出分布式算法用于多机器人三维对称编队控制，结合循环追捕数学性质和收缩理论，确保全局收敛至对称编队，并通过仿真验证。

Comments Submitted to IEEE Transactions in Robotics

详情

DOI: 10.1109/ICRA.2015.7139355

AI中文摘要

本文提出用于多机器人三维编队控制的分布式算法。我们利用循环追捕的数学性质以及收缩和部分收缩理论的结果，设计出确保全局收敛到对称编队的分布式控制算法。首先考虑正多边形编队作为基础案例，然后扩展到约翰逊固体和其他多边形网格编队。算法进一步增强以控制编队大小并避免编队内其他机器人碰撞。在存在有界加性扰动的情况下评估算法的鲁棒性，并量化其对编队质量的影响。最后，我们提出一种将控制律嵌入复杂动态系统（如四旋翼无人机）的一般方法，并通过仿真和四旋翼无人机车队实验验证该方法。

英文摘要

This paper presents decentralized algorithms for formation control of multiple robots in three dimensions. Specifically, we leverage the mathematical properties of cyclic pursuit along with results from contraction and partial contraction theory to design decentralized control algorithms that ensure global convergence to symmetric formations. We first consider regular polygon formations as a base case, and then extend the results to Johnson solid and other polygonal mesh formations. The algorithms are further augmented to allow control over formation size and avoid collisions with other robots in the formation. The robustness properties of the algorithms are assessed in the presence of bounded additive disturbances and their effect on the quality of the formation is quantified. Finally, we present a general methodology for embedding the control laws on complex dynamical systems, in this case, quadcopters, and validate this approach via simulations and experiments on a fleet of quadcopters.

URL PDF HTML ☆

赞 0 踩 0

1510.06469 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Optimal Temporal Logic Planning in Probabilistic Semantic Maps

在概率语义地图中进行最优时间逻辑规划

Jie Fu, Nikolay Atanasov, Ufuk Topcu, George J. Pappas

AI总结本文研究了在概率语义地图中基于时间逻辑约束的机器人运动规划问题，提出通过引入置信度参数delta将随机控制问题转化为确定性最短路径问题，并设计启发函数以提高A*算法的效率和正确性。

Comments 8 pages, 6 figures. submitted to IEEE International Conference on Robotics and Automation 2016

详情

DOI: 10.1109/ICRA.2016.7487554

AI中文摘要

本文考虑了在通过语义同时定位与建图（SLAM）获得的概率地图中，机器人运动规划受时间逻辑约束的问题。地图分布的不确定性对保证线性时间逻辑（LTL）规范的正确性构成重大挑战。我们展示该问题可转化为一个最优控制问题，其中语义地图和逻辑公式评估都是随机的。我们的第一项贡献是通过引入置信度参数delta，将LTL子类的随机控制问题转化为确定性最短路径问题。从确定性问题获得的机器人轨迹在真实环境中具有最小成本并以概率delta满足逻辑规范。我们的第二项贡献是设计了一个可接受的启发函数，引导确定性问题的规划朝着满足时间逻辑规范的方向进行。这使我们能够使用A*算法获得最优且高效的解决方案。在模拟语义环境中使用差分驱动机器人验证了我们方法的性能和正确性。

英文摘要

This paper considers robot motion planning under temporal logic constraints in probabilistic maps obtained by semantic simultaneous localization and mapping (SLAM). The uncertainty in a map distribution presents a great challenge for obtaining correctness guarantees with respect to the linear temporal logic (LTL) specification. We show that the problem can be formulated as an optimal control problem in which both the semantic map and the logic formula evaluation are stochastic. Our first contribution is to reduce the stochastic control problem for a subclass of LTL to a deterministic shortest path problem by introducing a confidence parameter $delta$. A robot trajectory obtained from the deterministic problem is guaranteed to have minimum cost and to satisfy the logic specification in the true environment with probability $delta$. Our second contribution is to design an admissible heuristic function that guides the planning in the deterministic problem towards satisfying the temporal logic specification. This allows us to obtain an optimal and very efficient solution using the A* algorithm. The performance and correctness of our approach are demonstrated in a simulated semantic environment using a differential-drive robot.

URL PDF HTML ☆

赞 0 踩 0

1506.08762 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Passivity-Based Adaptive Control for Visually Servoed Robotic Systems

基于被动性的自适应控制用于视觉伺服机器人系统

Hanlei Wang

AI总结本文研究了具有不确定运动学、动力学和相机参数的机器人系统视觉伺服问题，提出两种基于被动性的自适应控制方案，通过Lyapunov分析证明在无需假设估计深度可逆的情况下，图像空间跟踪误差收敛至零。

Comments 18 pages, 7 figures, revised for improving the presentation and correcting several typos based on the reviewers' and AE's comments from IEEE Transactions on Automatic Control

1611.05095 2026-06-04 cs.LG cs.RO cs.SY eess.SY 版本更新

Learning Dexterous Manipulation Policies from Experience and Imitation

从经验与模仿中学习灵巧操作策略

Vikash Kumar, Abhishek Gupta, Emanuel Todorov, Sergey Levine

AI总结本文研究了通过经验与模仿学习反馈控制灵巧五指手非抓取操作的任务，提出基于轨迹优化的局部控制器，并通过深度学习和最近邻方法进行泛化，展示了小数据训练下的有效性和盲操作优势。

Comments Initial draft for a journal submission

详情

AI中文摘要

我们探索了基于学习的反馈控制方法，用于控制执行非抓取操作的灵巧五指手。首先，我们学习了能够从预定义初始状态开始执行任务的局部控制器。这些控制器是通过轨迹优化构建的，基于从传感器数据直接学习到的局部线性时变模型。在某些情况下，我们使用通过虚拟环境中的遥控收集的人类示范来初始化优化器。我们证明，这些控制器在模拟和物理平台上都能在初始条件的有限范围内稳健地执行任务。然后，我们考虑了两种泛化方法：深度学习和最近邻。我们发现最近邻方法性能更高。然而，神经网络也有其优势：它仅使用触觉和本体感觉反馈，而没有关于物体的视觉反馈（即盲操作），并且学习了一个时间不变的策略。相比之下，最近邻方法根据运动捕捉感知的初始物体状态切换时间变化的局部控制器。尽管两种泛化方法仍有改进空间，我们的工作表明（i）复杂的非抓取操作任务的局部轨迹控制器可以从惊人的少量训练数据中构建，（ii）此类控制器的集合可以插值形成更全局的控制器。结果总结在补充视频中：https://youtu.be/E0wmO6deqjo

英文摘要

We explore learning-based approaches for feedback control of a dexterous five-finger hand performing non-prehensile manipulation. First, we learn local controllers that are able to perform the task starting at a predefined initial state. These controllers are constructed using trajectory optimization with respect to locally-linear time-varying models learned directly from sensor data. In some cases, we initialize the optimizer with human demonstrations collected via teleoperation in a virtual environment. We demonstrate that such controllers can perform the task robustly, both in simulation and on the physical platform, for a limited range of initial conditions around the trained starting state. We then consider two interpolation methods for generalizing to a wider range of initial conditions: deep learning, and nearest neighbors. We find that nearest neighbors achieve higher performance. Nevertheless, the neural network has its advantages: it uses only tactile and proprioceptive feedback but no visual feedback about the object (i.e. it performs the task blind) and learns a time-invariant policy. In contrast, the nearest neighbors method switches between time-varying local controllers based on the proximity of initial object states sensed via motion capture. While both generalization methods leave room for improvement, our work shows that (i) local trajectory-based controllers for complex non-prehensile manipulation tasks can be constructed from surprisingly small amounts of training data, and (ii) collections of such controllers can be interpolated to form more global controllers. Results are summarized in the supplementary video: https://youtu.be/E0wmO6deqjo

URL PDF HTML ☆

赞 0 踩 0

1308.5133 2026-06-04 cs.RO cs.NE cs.SY eess.SY 版本更新

Performance Measurement Under Increasing Environmental Uncertainty In The Context of Interval Type-2 Fuzzy Logic Based Robotic Sailing

在区间型2模糊逻辑基础上的机器人航海中环境不确定性增加时的性能测量

Naisan Benatar, Uwe Aickelin, Jonathan M. Garibald

AI总结本文探讨了在环境不确定性显著变化时，传统性能指标如RMSE的不足，提出更复杂的比较方法，应用于机器人控制问题，证明其比简单方法更稳健。

Comments International Conference on Fuzzy Systems 2013 (Fuzz-IEEE 2013)

1611.03372 2026-06-04 cs.RO cs.AI cs.SE cs.SY eess.SY 版本更新

A stochastically verifiable autonomous control architecture with reasoning

一种具有推理能力的随机可验证自主控制架构

Paolo Izzo, Hongyang Qu, Sandor M. Veres

AI总结本文提出一种具有推理能力的随机可验证自主控制架构LISA，通过将系统抽象为DTMC和MDP模型，实现代理与环境的概率验证，提升设计与运行时的验证效率。

Comments Accepted at IEEE Conf. Decision and Control, 2016

1609.07015 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Distributed Consistent Data Association

分布式一致数据关联

Spyridon Leonardos, Xiaowei Zhou, Kostas Daniilidis

AI总结本文提出两种去中心化方法，通过一致的全局数据关联提升多传感器系统准确性，结合理论分析和实验验证方法有效性。

1610.05863 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Learning Quadrotor Dynamics Using Neural Network for Flight Control

使用神经网络学习四旋翼动力学用于飞行控制

Somil Bansal, Anayo K. Akametalu, Frank J. Jiang, Forrest Laine, Claire J. Tomlin

AI总结本文研究通过神经网络学习四旋翼动力学以合成不同轨迹的控制器，验证了仅使用平移和旋转轨迹训练模型的有效性。

详情

AI中文摘要

传统四旋翼或直升机控制学习方法聚焦于通过迭代改进名义控制器提升特定轨迹性能，如学习示范、迭代学习和强化学习。然而，这些方案中训练轨迹获取的信息如何用于合成更一般轨迹的控制器尚不明确。最近研究表明深度学习在推断直升机动力学方面的有效性。受深度学习泛化能力的启发，本文探讨是否可以利用基于神经网络的动力学模型来合成不同于训练轨迹的控制器。为此，我们仅使用平移和仅使用旋转训练轨迹分别学习四旋翼动力学模型，每种轨迹均可独立控制，然后用该模型同时控制四旋翼的偏航和位置，这因两种运动间的非线性耦合而具有挑战性。我们通过四旋翼测试平台实验验证了本方法。

英文摘要

Traditional learning approaches proposed for controlling quadrotors or helicopters have focused on improving performance for specific trajectories by iteratively improving upon a nominal controller, for example learning from demonstrations, iterative learning, and reinforcement learning. In these schemes, however, it is not clear how the information gathered from the training trajectories can be used to synthesize controllers for more general trajectories. Recently, the efficacy of deep learning in inferring helicopter dynamics has been shown. Motivated by the generalization capability of deep learning, this paper investigates whether a neural network based dynamics model can be employed to synthesize control for trajectories different than those used for training. To test this, we learn a quadrotor dynamics model using only translational and only rotational training trajectories, each of which can be controlled independently, and then use it to simultaneously control the yaw and position of a quadrotor, which is non-trivial because of nonlinear couplings between the two motions. We validate our approach in experiments on a quadrotor testbed.

URL PDF HTML ☆

赞 0 踩 0

1610.03518 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model

通过学习深度逆动力学模型实现仿真到现实世界的迁移

Paul Christiano, Zain Shah, Igor Mordatch, Jonas Schneider, Trevor Blackwell, Joshua Tobin, Pieter Abbeel, Wojciech Zaremba

AI总结本文提出通过学习深度逆动力学模型，在仿真与现实世界之间实现控制策略的迁移，解决仿真与现实差异导致的性能下降问题。

详情

AI中文摘要

在仿真中开发控制策略通常比直接在现实世界中运行实验更实用和安全。这适用于通过规划和优化获得的策略，甚至更适用于通过强化学习获得的策略，后者通常非常数据密集。然而，仿真中成功的策略在部署到现实机器人时往往无法工作。然而，策略在仿真中执行的整体思路在现实世界中通常仍然有效。本文研究了此类场景，其中仿真中遍历的状态序列在现实世界中仍然合理，即使控制细节不同，例如摩擦、接触、质量和几何属性的差异。在执行过程中，我们的方法在每个时间步计算仿真基于的控制策略会做什么，但不执行这些控制在现实机器人上，而是计算仿真期望的下一个状态，并依赖于学习的深度逆动力学模型来决定最合适的现实世界动作以达到这些状态。深度模型只有在训练数据足够的情况下才有效，我们还提出了一种数据收集方法来（逐步）学习深度逆动力学模型。我们的实验表明，我们的方法在处理仿真到现实世界模型差异的各种基线方法中表现良好，包括输出误差控制和高斯动态适应。

英文摘要

Developing control policies in simulation is often more practical and safer than directly running experiments in the real world. This applies to policies obtained from planning and optimization, and even more so to policies obtained from reinforcement learning, which is often very data demanding. However, a policy that succeeds in simulation often doesn't work when deployed on a real robot. Nevertheless, often the overall gist of what the policy does in simulation remains valid in the real world. In this paper we investigate such settings, where the sequence of states traversed in simulation remains reasonable for the real world, even if the details of the controls are not, as could be the case when the key differences lie in detailed friction, contact, mass and geometry properties. During execution, at each time step our approach computes what the simulation-based control policy would do, but then, rather than executing these controls on the real robot, our approach computes what the simulation expects the resulting next state(s) will be, and then relies on a learned deep inverse dynamics model to decide which real-world action is most suitable to achieve those next states. Deep models are only as good as their training data, and we also propose an approach for data collection to (incrementally) learn the deep inverse dynamics model. Our experiments shows our approach compares favorably with various baselines that have been developed for dealing with simulation to real world model discrepancy, including output error control and Gaussian dynamics adaptation.

URL PDF HTML ☆

赞 0 踩 0

1610.03028 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Notes on geometry of locomotion of 3-dimensional version of the Purcell's swimmer

关于三维Purcell游泳者运动几何的笔记

Sudin Kadam, Ravi Banavar

AI总结本文提出三维Purcell游泳者模型，结合Cox理论和阻力力理论，推导出纯运动学方程，用于低雷诺数下的运动分析。

Comments These are notes, and have not been submitted for any kind of publication

1610.01243 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

On the Construction of Safe Controllable Regions for Affine Systems with Applications to Robotics

关于仿射系统安全可控区域构造及其在机器人中的应用

Mohamed K. Helwa, Angela P. Schoellig

AI总结本文研究了构造仿射系统在块内可控区域的问题，提出了一种计算高效的算法，并通过实验验证了其在机器人系统中的有效性。

Comments 17 pages, 18 figures, under review for publication in Automatica

详情

基于可达集的无人机避撞方法

Yuchen Zhou, John S. Baras

AI总结本文提出基于可达集的无人机避撞算法，通过凸优化求解控制约束集，解决多无人机避撞问题。

Comments CDC 2015 fixed-wing nonlinear dynamics extension. CDC 2015 DOI: 10.1109/CDC.2015.7403154

1609.07437 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Distributed scaling control of rigid formations

分布式刚体编队的尺度控制

Hector Garcia de Marina, Bayu Jayawardhana, Ming Cao

AI总结本文通过引入分布式参数操控刚体编队的稳态运动，结合图刚性理论和承载刚性理论实现编队的可控缩放。

Comments 6 pages In proceedings 55th Conference on Decision and Control, year 2016

1609.07436 2026-06-04 cs.RO cs.SY eess.SY 版本更新

UAV attitude estimation using Unscented Kalman Filter and TRIAD

使用无迹卡尔曼滤波和TRIAD估计无人机姿态

Hector Garcia de Marina, Fernando J. Pereda, Jose Marina Giron-Sierra, Felipe Espinosa

AI总结本文提出一种基于UKF和TRIAD算法的AHRS，通过仿真和实验证明其在微控制器上具有低计算成本和实时性能。

Comments 10 pages

1609.07038 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Simultaneous Intermittent Communication Control and Path Optimization in Networks of Mobile Robots

移动机器人网络中的间歇通信控制与路径优化同时进行

Yiannis Kantaros, Michael M. Zavlanos

AI总结本文提出了一种间歇通信框架，通过移动机器人在连接移动图的边缘移动并仅在节点相遇时通信，实现网络的间歇连通性和路径优化，同时通过局部LTL公式分解和分布式冲突解决方案保证全局LTL公式的满足。

详情

AI中文摘要

本文提出了一种间歇通信框架用于移动机器人网络。具体而言，我们考虑机器人沿连接移动图的边移动，并仅在该图的节点相遇时通信，从而形成动态通信网络。我们提出的分布式控制器同时确保网络的间歇连通性和路径优化。我们展示间歇连通性需求可以被一个全局线性时间逻辑（LTL）公式封装。然后将其近似分解为局部LTL表达式并分配给机器人。为了避免由于这种近似分解可能产生的冲突行为，我们开发了一种分布式冲突解决方案，该方案基于分配的局部LTL表达式为每个机器人生成非冲突的离散运动计划，其组合满足全局LTL公式。通过适当引入生成运动计划执行中的延迟，我们还证明所提出的控制器可以异步执行。

英文摘要

In this paper, we propose an intermittent communication framework for mobile robot networks. Specifically, we consider robots that move along the edges of a connected mobility graph and communicate only when they meet at the nodes of that graph giving rise to a dynamic communication network. Our proposed distributed controllers ensure intermittent connectivity of the network and path optimization, simultaneously. We show that the intermittent connectivity requirement can be encapsulated by a global Linear Temporal Logic (LTL) formula. Then we approximately decompose it into local LTL expressions which are then assigned to the robots. To avoid conflicting robot behaviors that can occur due to this approximate decomposition, we develop a distributed conflict resolution scheme that generates non-conflicting discrete motion plans for every robot, based on the assigned local LTL expressions, whose composition satisfies the global LTL formula. By appropriately introducing delays in the execution of the generated motion plans we also show that the proposed controllers can be executed asynchronously.

URL PDF HTML ☆

赞 0 踩 0

1609.07006 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

SafeGuardPF: Safety Guaranteed Reactive Potential Fields for Mobile Robots in Unknown and Dynamic Environments

SafeGuardPF: 保障安全的反应势场用于未知和动态环境中的移动机器人

Rafael Rodrigues da Silva, Samuel Silva, Grigoriy Dubrovskiy, Hai Lin

AI总结本文提出SafeGuardPF方法，通过将反应势场运动控制器建模为混合自动机，并利用微分动态逻辑形式化验证其安全性，确保机器人在动态环境中避免碰撞。

Comments 8 pages, 9 figures, Submitted for publication in 2017 American Control Conference (ACC2017)

详情

AI中文摘要

自主导航在未知和动态环境中实现可靠避障仍具挑战性，尤其当存在移动障碍物时。为应对这一问题，本文采用反应势场方法，该方法在每个周期内仅需机器人相对于最近障碍物点的当前状态即可计算势场，从而更高效且适用于多智能体场景。本文的主要贡献是将反应势场运动控制器建模为混合自动机，并通过微分动态逻辑形式化验证其安全性，确保机器人在碰撞时不会发生故障，即只有在机器人静止时才可能发生碰撞。所提出控制器及验证结果通过仿真实验和Pioneer P3-AT机器人实现进行验证。

英文摘要

An autonomous navigation with proven collision avoidance in unknown and dynamic environments is still a challenge, particularly when there are moving obstacles. A popular approach to collision avoidance in the face of moving obstacles is based on model predictive algorithms, which, however, may be computationally expensive. Hence, we adopt a reactive potential field approach here. At every cycle, the proposed approach requires only current robot states relative to the closest obstacle point to find the potential field in the current position; thus, it is more computationally efficient and more suitable to scale up for multiple agent scenarios. Our main contribution here is to write the reactive potential field based motion controller as a hybrid automaton, and then formally verify its safety using differential dynamic logic. In particular, we can guarantee a passive safety property, which means that collisions cannot occur if the robot is to blame, namely a collision can occur only if the robot is at rest. The proposed controller and verification results are demonstrated via simulations and implementation on a Pioneer P3-AT robot.

URL PDF HTML ☆

赞 0 踩 0

1609.06662 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

On Efficient Computation of Shortest Dubins Paths Through Three Consecutive Points

在三个连续点上高效计算最短Dubins路径

Armin Sadeghi, Stephen L. Smith

AI总结本文研究了在三个连续点上计算曲率受限移动Dubins车辆最优路径的问题，提出新的几何分析方法，改进了Dubins TSP路线的计算效率和解的质量。

Comments Extended version of IEEE CDC 2016 paper

1609.06435 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Controlling rigid formations of mobile agents under inconsistent measurements

在不一致测量下控制移动代理的刚体编队

Hector Garcia de Marina, Ming Cao, Bayu Jayawardhana

AI总结本文提出基于局部估计器的梯度控制器，以增强对不一致测量的鲁棒性，通过机器人实验和仿真验证了其在消除集体运动不一致轨道方面的有效性。

Comments 10 pages

详情

DOI: 10.1109/TRO.2014.2373145
Journal ref: IEEE Transactions on Robotics, Volume 31, Issue 1, Feb. 2015

AI中文摘要

尽管近年来使用梯度控制器稳定自主代理的刚体编队取得了巨大成功，但最近在代理局部控制器中使用不一致测量时却报告了令人惊讶且引人入胜的不希望的集体运动。为了使现有的梯度控制对这种测量不一致具有鲁棒性，我们利用局部估计器，遵循著名的内部模型原理进行鲁棒输出调节控制。新的基于估计器的梯度控制仍然具有分布性质，并且即使在刚体编队中的代理数量增长时，也可以系统地构建。我们严格证明所提出控制能够保证指数收敛，然后通过机器人实验和计算机仿真证明所报告的由不一致引起的集体运动轨道得到有效消除。

英文摘要

Despite the great success of using gradient-based controllers to stabilize rigid formations of autonomous agents in the past years, surprising yet intriguing undesirable collective motions have been reported recently when inconsistent measurements are used in the agents' local controllers. To make the existing gradient control robust against such measurement inconsistency, we exploit local estimators following the well known internal model principle for robust output regulation control. The new estimator-based gradient control is still distributed in nature and can be constructed systematically even when the number of agents in a rigid formation grows. We prove rigorously that the proposed control is able to guarantee exponential convergence and then demonstrate through robotic experiments and computer simulations that the reported inconsistency-induced orbits of collective movements are effectively eliminated.

URL PDF HTML ☆

赞 0 踩 0

1609.06283 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robotic Swarm Control from Spatio-Temporal Specifications

二维机器人群的时空规范控制

Iman Haghighi, Sadra Sadraddini, Calin Belta

AI总结本文研究如何通过丰富的时空逻辑控制二维机器人群以实现复杂时空模式，将规范编码为混合整数线性约束，并通过优化机器人移动总和或相似度指标来规划轨迹。

Comments A shorter version of this paper is going to be published at the proceedings of the 55th international conference on decision and control (CDC 2016)

1609.05960 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Incremental Sampling-based Motion Planners Using Policy Iteration Methods

基于增量采样的运动规划器使用策略迭代方法

Oktay Arslan, Panagiotis Tsiotras

AI总结本文提出了一种基于策略迭代的运动规划算法，利用动态规划思想在随机图中求解最短路径问题，通过改进策略加速计算过程，适用于大规模并行化。

详情

AI中文摘要

最近随机运动规划的进步导致了一类新的基于采样的算法发展，这些算法提供了渐近最优性保证，例如RRT*和PRM*算法。仔细分析发现，这些算法中的所谓'重 wiring'步骤可以被解释为局部策略迭代（PI）步骤（即局部策略评估步骤后跟局部策略改进步骤），因此随着样本数趋于无穷大，这两种算法几乎肯定收敛到最优路径（概率1）。策略迭代，与价值迭代（VI）一样，是解决动态规划（DP）问题的常用方法。基于这一观察，最近提出了RRT#算法，该算法在每次迭代中对那些可能成为最优路径部分的顶点（即'有希望'的顶点）执行Bellman更新（即'备份'）。RRT#算法因此利用了动态规划思想，并在随机生成的图上逐步实现以获得高质量的解决方案。在本文中，基于这一关键洞察，我们探索了一类不同的动态规划算法来解决由迭代采样方法生成的随机图中的最短路径问题。这些算法利用策略迭代而不是价值迭代，因此更适合大规模并行化。与RRT*算法不同，策略改进在重 wiring步骤中不是仅在局部进行，而是在当前迭代中被分类为'有希望'的顶点集合上进行。这倾向于加快整个过程。所得到的算法，恰当地命名为策略迭代-RRT#（PI-RRT#），是第一种基于动态规划思想的随机运动规划新类算法，利用PI方法。

英文摘要

Recent progress in randomized motion planners has led to the development of a new class of sampling-based algorithms that provide asymptotic optimality guarantees, notably the RRT* and the PRM* algorithms. Careful analysis reveals that the so-called "rewiring" step in these algorithms can be interpreted as a local policy iteration (PI) step (i.e., a local policy evaluation step followed by a local policy improvement step) so that asymptotically, as the number of samples tend to infinity, both algorithms converge to the optimal path almost surely (with probability 1). Policy iteration, along with value iteration (VI) are common methods for solving dynamic programming (DP) problems. Based on this observation, recently, the RRT$^{\#}$ algorithm has been proposed, which performs, during each iteration, Bellman updates (aka "backups") on those vertices of the graph that have the potential of being part of the optimal path (i.e., the "promising" vertices). The RRT$^{\#}$ algorithm thus utilizes dynamic programming ideas and implements them incrementally on randomly generated graphs to obtain high quality solutions. In this work, and based on this key insight, we explore a different class of dynamic programming algorithms for solving shortest-path problems on random graphs generated by iterative sampling methods. These class of algorithms utilize policy iteration instead of value iteration, and thus are better suited for massive parallelization. Contrary to the RRT* algorithm, the policy improvement during the rewiring step is not performed only locally but rather on a set of vertices that are classified as "promising" during the current iteration. This tends to speed-up the whole process. The resulting algorithm, aptly named Policy Iteration-RRT$^{\#}$ (PI-RRT$^{\#}$) is the first of a new class of DP-inspired algorithms for randomized motion planning that utilize PI methods.

URL PDF HTML ☆

赞 0 踩 0

1510.07573 2026-06-04 cs.RO cs.CV cs.MA cs.SY eess.SY 版本更新

Generalized Regressive Motion: a Visual Cue to Collision

广义回归运动：碰撞的视觉线索

Krzysztof Chalupka, Michael Dickinson, Pietro Perona

AI总结研究提出广义回归运动作为碰撞检测的视觉线索，通过几何分析证明其在同类碰撞中的可靠性，并通过基于代理的建模显示其比 looming 更有效。

详情

DOI: 10.1088/1748-3190/11/4/046008

AI中文摘要

大脑和感觉系统进化以指导运动。关键任务是控制对静止障碍物的接近并检测移动生物。Looming 被提出为主要的单目视觉线索，用于检测其他动物的接近并避免与静止障碍物碰撞。在昆虫和脊椎动物大脑中发现了优雅的神经机制用于 looming 检测。然而，looming 未在两个移动动物碰撞的背景下进行分析。我们提出了一种替代策略，即广义回归运动（GRM），这与最近观察到的果蝇行为一致。几何分析证明 GRM 是同类碰撞的可靠线索，而基于代理的建模表明 GRM 比 looming 更有效用于检测接近、防止碰撞和维持移动性。

英文摘要

Brains and sensory systems evolved to guide motion. Central to this task is controlling the approach to stationary obstacles and detecting moving organisms. Looming has been proposed as the main monocular visual cue for detecting the approach of other animals and avoiding collisions with stationary obstacles. Elegant neural mechanisms for looming detection have been found in the brain of insects and vertebrates. However, looming has not been analyzed in the context of collisions between two moving animals. We propose an alternative strategy, Generalized Regressive Motion (GRM), which is consistent with recently observed behavior in fruit flies. Geometric analysis proves that GRM is a reliable cue to collision among conspecifics, whereas agent-based modeling suggests that GRM is a better cue than looming as a means to detect approach, prevent collisions and maintain mobility.

URL PDF HTML ☆

赞 0 踩 0

1609.05483 2026-06-04 eess.SY cs.CV cs.RO cs.SY 版本更新

Set-Point Regulation of Linear Continuous-Time Systems using Neuromorphic Vision Sensors

利用神经形态视觉传感器进行线性连续时间系统的设定点调节

Prince Singh, Sze Zheng Yong, Emilio Frazzoli

AI总结本文提出基于神经形态视觉传感器的H∞控制器，用于调节线性时不变系统的设定点，并在不稳定系统上验证了方法的有效性。

Comments Submitted to IEEE Transactions on Automatic Control

1609.05235 2026-06-04 cs.RO cs.SY eess.SY 版本更新

RFM-SLAM: Exploiting Relative Feature Measurements to Separate Orientation and Position Estimation in SLAM

RFM-SLAM：利用相对特征测量分离SLAM中的姿态和位置估计

Saurav Agarwal, Vikram Shree, Suman Chakravorty

AI总结本文提出RFM-SLAM框架，通过相对特征测量将SLAM问题建模为线性最小二乘问题，减少非线性优化计算量，避免现有方法因初始猜测导致的灾难性失败，同时在噪声增加时保持精度。

Comments 9 pages, submitted to IEEE ICRA 2017

详情

AI中文摘要

SLAM问题具有一个特殊性质，当机器人姿态已知时，估计机器人姿态历史和特征位置可以转化为标准线性最小二乘问题。本文开发了一种SLAM框架，利用相对特征到特征测量来利用SLAM的结构特性。相对特征测量用于对姿态到姿态的方位约束提出线性估计问题，随后通过迭代非线性在流形上的优化问题计算在相对旋转约束下的机器人方位最大似然估计。一旦计算出机器人方位，就解决一个线性问题来计算机器人位置和地图估计。我们的方法通过将优化问题缩小到比标准基于特征的图方法更小的规模，减少了非线性优化的计算负担。进一步，实验证明我们的方法避免了现有方法因使用里程计作为非线性优化初始猜测而导致的灾难性失败，同时在传感器噪声增加时保持精度。我们通过广泛的模拟和与现有最先进求解器的比较展示了我们的方法。

英文摘要

The SLAM problem is known to have a special property that when robot orientation is known, estimating the history of robot poses and feature locations can be posed as a standard linear least squares problem. In this work, we develop a SLAM framework that uses relative feature-to-feature measurements to exploit this structural property of SLAM. Relative feature measurements are used to pose a linear estimation problem for pose-to-pose orientation constraints. This is followed by solving an iterative non-linear on-manifold optimization problem to compute the maximum likelihood estimate for robot orientation given relative rotation constraints. Once the robot orientation is computed, we solve a linear problem for robot position and map estimation. Our approach reduces the computational burden of non-linear optimization by posing a smaller optimization problem as compared to standard graph-based methods for feature-based SLAM. Further, empirical results show our method avoids catastrophic failures that arise in existing methods due to using odometery as an initial guess for non-linear optimization, while its accuracy degrades gracefully as sensor noise is increased. We demonstrate our method through extensive simulations and comparisons with an existing state-of-the-art solver.

URL PDF HTML ☆

赞 0 踩 0

1609.03628 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Co-active Learning to Adapt Humanoid Movement for Manipulation

协同学习以适应人形机器人的运动用于操作

Ren Mao, John S. Baras, Yezhou Yang, Cornelia Fermuller

AI总结本文提出协同学习框架，通过人机交互适应机器人末端执行器的运动以应对不同约束环境，实验验证了方法的有效性。

1608.02683 2026-06-04 math.OC cs.DS cs.RO cs.SY eess.SY 版本更新

System Identification and Control of Valkyrie through SVA--Based Regressor Computation

基于SVA的瓦尔基里人形机器人系统辨识与控制

Shishir Kolathaya, Benjamin J. Morris, Ryan W. Sinnet, Aaron D. Ames

AI总结本文利用空间向量代数(SVA)实现瓦尔基里人形机器人的同时辨识与控制，通过空间惯性张量计算动力学参数，提出高效算法实现回归器评估，用于在线系统辨识与基于模型的控制。

Comments 8 pages, 15 figures

详情

AI中文摘要

本文演示了利用空间向量代数(SVA)对人形机器人Valkyrie进行的同时辨识与控制。特别是，通过空间惯性张量计算机器人动力学中的惯性、科里奥利-离心力和重力项。在假设链接长度或关节轴间距离准确已知的情况下，将证明机器人惯性属性可直接从惯量张量评估。提出了一种算法来评估回归器，其运行时间为O(n²)。该算法的效率为通过基于SVA的回归器实现在线系统辨识提供了手段，并作为副产品，提供了一种精确的基于模型的控制方法。通过在三个案例研究中的实现，提供了所提方法的实验验证：离线识别双摆和4自由度机械腿，以及在线识别和控制4自由度机械臂。

英文摘要

This paper demonstrates simultaneous identification and control of the humanoid robot, Valkyrie, utilizing Spatial Vector Algebra (SVA). In particular, the inertia, Coriolis-centrifugal and gravity terms for the dynamics of a robot are computed using spatial inertia tensors. With the assumption that the link lengths or the distance between the joint axes are accurately known, it will be shown that inertial properties of a robot can be directly evaluated from the inertia tensor. An algorithm is proposed to evaluate the regressor, yielding a run time of $O(n^2)$. The efficiency of this algorithm yields a means for online system identification via the SVA--based regressor and, as a byproduct, a method for accurate model-based control. Experimental validation of the proposed method is provided through its implementation in three case studies: offline identification of a double pendulum and a $4$-DOF robotic leg, and online identification and control of a $4$-DOF robotic arm.

URL PDF HTML ☆

赞 0 踩 0

1510.06263 2026-06-04 cs.RO cs.SY eess.SY 版本更新

An EKF-SLAM algorithm with consistency properties

具有一致性的EKF-SLAM算法

Axel Barrau, Silvere Bonnabel

AI总结本文针对EKF-SLAM中由于全局参考系原点和方位不可观测导致的不一致性问题，提出基于不变EKF的方法以解决状态空间对称性问题，并通过蒙特卡洛实验验证了理论结果。

Comments Submitted

1606.03736 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Enhancement of Low-cost GNSS Localization in Connected Vehicle Networks Using Rao-Blackwellized Particle Filters

利用 Rao-Blackwellized 粒子滤波器增强连接车辆网络中低成本 GNSS 定位

Macheng Shen, Ding Zhao, Jing Sun

AI总结本文提出利用 Rao-Blackwellized 粒子滤波器融合连接车辆组的 GNSS 信息并匹配数字地图，以提高定位精度，实验显示算法将误差降低50%，方差减少两个数量级。

Comments 7 pages, 7 figures, IEEE ITSC

详情

AI中文摘要

自动驾驶技术中准确定位是关键功能。然而，由于伪距测量存在偏差噪声，低成本 GNSS 接收器难以实现车道级精度。差分 GNSS 可提高精度，但需要大量基站投资。新兴的连接车辆技术提供了替代方案。本文显示，通过融合连接车辆组的 GNSS 信息并匹配数字地图以消除定位的共同偏差，可提高定位精度。Rao-Blackwellized 粒子滤波器（RBPF）用于联合估计伪距的共同偏差和车辆位置。多径偏差通过多假设检测-拒绝方法缓解。通过预测-更新过程利用时间相关性。所提方法在交叉口场景中与现有静态和平滑静态方法进行比较。仿真结果表明，所提算法将估计误差减少50%，估计方差减少两个数量级。

英文摘要

An essential function for automated vehicle technologies is accurate localization. It is difficult, however, to achieve lane-level accuracy with low-cost Global Navigation Satellite System (GNSS) receivers due to the biased noisy pseudo-range measurements. Approaches such as Differential GNSS can improve the accuracy, but usually require an enormous amount of investment in base stations. The emerging connected vehicle technologies provide an alternative approach to improving the localization accuracy. It has been shown in this paper that localization accuracy can be enhanced by fusing GNSS information within a group of connected vehicles and matching the configuration of the group to a digital map to eliminate the common bias in localization. A Rao-Blackwellized particle filter (RBPF) was used to jointly estimate the common biases of the pseudo-ranges and the vehicles positions. Multipath biases, which are non-common to vehicles, were mitigated by a multi-hypothesis detection-rejection approach. The temporal correlation was exploited through the prediction-update process. The proposed approach was compared to the existing static and smoothed static methods in the intersection scenario. Simulation results show that the proposed algorithm reduced the estimation error by fifty percent and reduced the estimation variance by two orders of magnitude.

URL PDF HTML ☆

赞 0 踩 0

1601.04862 2026-06-04 cs.RO cs.DC cs.NE cs.SY eess.SY 版本更新

Scalability in Neural Control of Musculoskeletal Robots

神经控制的肌骨机器人可扩展性

Christoph Richter, Sören Jentzsch, Rafael Hostettler, Jesús A. Garrido, Eduardo Ros, Alois C. Knoll, Florian Röhrbein, Patrick van der Smagt, Jörg Conradt

AI总结本文提出一种结合Myorobotics框架和SpiNNaker平台的系统，实现数十个神经控制的物理顺应性关节的可扩展性，通过闭环小脑模型实现低功耗高效神经控制。

Comments Accepted at IEEE Robotics and Automation Magazine on 2015-12-31

详情

DOI: 10.1109/MRA.2016.2535081

AI中文摘要

类人机器人是能够感知、行为、交互和感觉像人类的机器人。根据这一定义，类人机器人需要人形的机械硬件和驱动，以及类脑的控制和感知。最明显的实现方式是具有类脑神经控制器的人形肌骨机器人。尽管肌骨机器人硬件和神经控制软件已存在数十年，但尚未有可扩展的方法用于构建和控制类人人类尺度机器人。通过结合Myorobotics框架和SpiNNaker神经形态计算平台，我们展示了能够扩展到数十个神经控制、物理顺应性关节的系统。其核心实现了一个闭环小脑模型，提供最低功耗和最大扩展性的实时低层神经控制：更高阶（如皮层）神经网络和神经形态传感器如硅视网膜或耳蜗可以自然地被整合进来。

英文摘要

Anthropomimetic robots are robots that sense, behave, interact and feel like humans. By this definition, anthropomimetic robots require human-like physical hardware and actuation, but also brain-like control and sensing. The most self-evident realization to meet those requirements would be a human-like musculoskeletal robot with a brain-like neural controller. While both musculoskeletal robotic hardware and neural control software have existed for decades, a scalable approach that could be used to build and control an anthropomimetic human-scale robot has not been demonstrated yet. Combining Myorobotics, a framework for musculoskeletal robot development, with SpiNNaker, a neuromorphic computing platform, we present the proof-of-principle of a system that can scale to dozens of neurally-controlled, physically compliant joints. At its core, it implements a closed-loop cerebellar model which provides real-time low-level neural control at minimal power consumption and maximal extensibility: higher-order (e.g., cortical) neural networks and neuromorphic sensors like silicon-retinae or -cochleae can naturally be incorporated.

URL PDF HTML ☆

赞 0 踩 0

1604.07849 2026-06-04 cs.RO cs.SY eess.SY math.OC 版本更新

Distributed rotational and translational maneuvering of rigid formations and their applications

分布式刚体编队的旋转和翻译 maneuvering 及其应用

Hector Garcia de Marina, Bayu Jayawardhana, Ming Cao

AI总结本文提出一种分布式控制器，通过引入距离约束参数实现编队与运动控制，允许恒定集体翻译、旋转或组合，同时保证编队形状无畸变，应用于编队对齐和目标追踪。

Comments 14 pages

详情

Journal ref: Robotics, IEEE Transactions on, Volume 32, Issue 3, Pages 684 - 696, Year 2016

AI中文摘要

最近有报告指出，范围测量不一致或等效的预定 agent 间距离不匹配可能阻止流行的梯度控制器引导移动 agent 的刚体编队收敛到期望形状，甚至使编队停滞在任何位置。本文不将不匹配视为性能不佳的原因，而是将其作为设计参数，通过在每个距离约束中引入这样的参数对，设计出同时实现编队和运动控制的分布式控制器，不仅涵盖流行的梯度控制，更重要的是允许实现恒定集体翻译、旋转或其组合，同时保证渐近无编队形状畸变。此类运动控制结果随后应用于（a）编队方向对齐和（b）包围和跟踪移动目标。除了严谨的数学证明外，还通过移动机器人实验展示了所提出编队-运动分布式控制器的满意性能。

英文摘要

Recently it has been reported that range-measurement inconsistency, or equivalently mismatches in prescribed inter-agent distances, may prevent the popular gradient controllers from guiding rigid formations of mobile agents to converge to their desired shape, and even worse from standing still at any location. In this paper, instead of treating mismatches as the source of ill performance, we take them as design parameters and show that by introducing such a pair of parameters per distance constraint, distributed controller achieving simultaneously both formation and motion control can be designed that not only encompasses the popular gradient control, but more importantly allows us to achieve constant collective translation, rotation or their combination while guaranteeing asymptotically no distortion in the formation shape occurs. Such motion control results are then applied to (a) the alignment of formations orientations and (b) enclosing and tracking a moving target. Besides rigorous mathematical proof, experiments using mobile robots are demonstrated to show the satisfying performances of the proposed formation-motion distributed controller.

URL PDF HTML ☆

赞 0 踩 0

1608.06440 2026-06-04 cs.RO cs.CV cs.SY eess.SY 版本更新

A Delay-Tolerant Potential-Field-Based Network Implementation of an Integrated Navigation System

基于延迟容忍的势场网络的集成导航系统实现

Rachana Ashok Gupta, Ahmad A. Masoud, Mo-Yuen Chow

AI总结本文提出一种基于网络控制器的集成导航系统，通过互联网实现摄像头、无人地面车和远程服务器的实时网络化，旨在简化无人车导航同时保持系统鲁棒性。

详情

DOI: 10.1109/TIE.2009.2026764
Journal ref: The IEEE Transactions On Industrial Electronics, Vol. 57, No.2, February 2010, PP. 769-783

AI中文摘要

网络控制器（NCs）是能够将动态、空间扩展且功能专门化的模块转化为可执行目标导向组的设备，称为网络控制系统。本文探讨了设计和构建使用互联网作为通信介质的NC的实践方面。重点在于寻找兼容的控制器组件，这些组件可通过主机结构集成，使摄像头、无人地面车（UGV）、远程计算机服务器及必要的操作软件界面能够实时联网。目标是简化UGV的导航过程，同时保持系统性能的鲁棒性。本文描述了所提控制器的结构、其组件及其接口方式。提供了详尽的实验结果，包括性能评估和与之前实现的NC的比较。

英文摘要

Network controllers (NCs) are devices that are capable of converting dynamic, spatially extended, and functionally specialized modules into a taskable goal-oriented group called networked control system. This paper examines the practical aspects of designing and building an NC that uses the Internet as a communication medium. It focuses on finding compatible controller components that can be integrated via a host structure in a manner that makes it possible to network, in real-time, a webcam, an unmanned ground vehicle (UGV), and a remote computer server along with the necessary operator software interface. The aim is to deskill the UGV navigation process and yet maintain a robust performance. The structure of the suggested controller, its components, and the manner in which they are interfaced are described. Thorough experimental results along with performance assessment and comparisons to a previously implemented NC are provided.

URL PDF HTML ☆

赞 0 踩 0

1608.06420 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Harmonic Potential Field Approach for Joint Planning & Control of a Rigid, Separable Nonholonomic, Mobile Robot

一种基于谐振势场的方法用于刚性、可分离非完整移动机器人的联合规划与控制

Ahmad A. Masoud

AI总结本文提出一种基于谐振势场的方法，用于在移动机器人伺服层面实现路径规划，通过同步控制信号使机器人速度与势场梯度一致，解决复杂环境下的规划问题。

详情

DOI: 10.1016/j.robot.2013.02.007
Journal ref: Robotics And Autonomous Systems, Vol. 61, No. 6, June 2013 Page 593,615

AI中文摘要

本文的主要目标是提供一种工具，用于在移动机器人伺服层面进行路径规划。通过在伺服层面以可证明正确的方式执行复杂任务，可以显著提高操作速度、降低能耗并提高响应质量。传统规划局限于机器人的高层控制器，该阶段的指导速度信号通常通过电子速度控制器（ESC）转换为控制信号。本文展示了谐振势场（HPF）方法生成可证明正确、受约束且行为良好的轨迹和控制信号的能力，用于刚性、非完整机器人在静止、杂乱环境中。证明了基于HPF的伺服层面规划器能够解决现实情况下的大量挑战。所提出的方法将HPF规划器解决方案轨迹的丰富且可证明正确性质迁移至机器人本身。这通过同步控制信号实现，其目的是使机器人在局部坐标系中的速度与HPF梯度一致。通过将机器人表示为论文中所谓的分离形式，使两者之间的联系成为可能。所使用的上下文敏感且目标导向的控制信号在存在执行器噪声、饱和和参数不确定性时表现出良好的行为和鲁棒性。该方法通过仿真结果进行了开发、证明了正确性，并展示了方案的能力。

英文摘要

The main objective of this paper is to provide a tool for performing path planning at the servo level of a mobile robot. The ability to perform, in a provably correct manner, such a complex task at the servo level can lead to a large increase in the speed of operation, low energy consumption and high quality of response. Planning has been traditionally limited to the high level controller of a robot. The guidance velocity signal from this stage is usually converted to a control signal using what is known as an electronic speed controller (ESC). This paper demonstrates the ability of the harmonic potential field (HPF) approach to generate a provably correct, constrained, well behaved trajectory and control signal for a rigid, nonholonomic robot in a stationary, cluttered environment. It is shown that the HPF based, servo level planner can address a large number of challenges facing planning in a realistic situation. The suggested approach migrates the rich and provably correct properties of the solution trajectories from an HPF planner to those of the robot. This is achieved using a synchronizing control signal whose aim is to align the velocity of the robot in its local coordinates, with that of the gradient of the HPF. The link between the two is made possible by representing the robot using what the paper terms separable form. The context-sensitive and goal-oriented control signal used to steer the robot is demonstrated to be well behaved and robust in the presence of actuator noise, saturation and uncertainty in the parameters. The approach is developed, proofs of correctness are provided and the capabilities of the scheme are demonstrated using simulation results.

URL PDF HTML ☆

赞 0 踩 0

1608.05786 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Design of a Trajectory Tracking Controller for a Nanoquadcopter

小型四旋翼飞行器轨迹跟踪控制器设计

Carlos Luis, Jérôme Le Ny

AI总结本文设计了基于数学模型的四旋翼飞行器位置与轨迹控制算法，并通过仿真与实机测试验证了LQT控制器在轨迹跟踪中的优越性。

Comments Complete Technical Report

详情

AI中文摘要

本文旨在研究小型四旋翼飞行器的系统建模及设计位置与轨迹控制算法，最终目标是在仿真和真实平台上测试系统。所选平台为开源的Crazyflie 2.0。首先开发了描述四旋翼动力学的数学模型，其次创建了仿真环境以设计两种不同的控制架构：级联PID位置跟踪器和LQT轨迹跟踪器。最后阶段是在所选平台上测试控制器并比较其轨迹跟踪性能。仿真结果与实验结果一致，进一步的模型改进通过闭环模型识别技术提出。结果表明，LQT控制器在轨迹跟踪中表现更优，其位置RMS误差比PID控制器小四倍。尽管LQT控制努力更大，但消除了诱导电机饱和的高控制峰值。LQT控制器还通过超宽频双程测距系统进行了测试，与更精确的VICON系统比较表明，控制器在两种系统之间不同的噪声水平下均能跟踪轨迹。

英文摘要

The primary purpose of this study is to investigate the system modeling of a nanoquadcopter as well as designing position and trajectory control algorithms, with the ultimate goal of testing the system both in simulation and on a real platform. The open source nanoquadcopter platform named Crazyflie 2.0 was chosen for the project. The first phase consisted in the development of a mathematical model that describes the dynamics of the quadcopter. Secondly, a simulation environment was created to design two different control architectures: cascaded PID position tracker and LQT trajectory tracker. Finally, the implementation phase consisted in testing the controllers on the chosen platform and comparing their performance in trajectory tracking. Our simulations agreed with the experimental results, and further refinement of the model is proposed as future work through closed-loop model identification techniques. The results show that the LQT controller performed better at tracking trajectories, with RMS errors in position up to four times smaller than those obtained with the PID. LQT control effort was greater, but eliminated the high control peaks that induced motor saturation in the PID controller. The LQT controller was also tested using an ultra-wide band two-way ranging system, and comparisons with the more precise VICON system indicate that the controller could track a trajectory in both cases despise the difference in noise levels between the two systems.

URL PDF HTML ☆

赞 0 踩 0

1603.06443 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

A Scalable and Distributed Solution to the Inertial Motion Capture Problem

可扩展且分布式的惯性运动捕捉问题解决方案

Manon Kok, Sina Khoshfetrat Pakazad, Thomas B. Schön, Anders Hansson, Jeroen D. Hol

AI总结本文提出一种可扩展且分布式的惯性运动捕捉解决方案，通过定制信息传递方法解决计算复杂性问题，并应用于下肢配置数据进行验证。

Comments 14 pages, 5 figures. In proceedings of the 19th International Conference on Information Fusion, pp. 1348-1355, Heidelberg, Germany, July 2016

1311.5796 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Unscented Orientation Estimation Based on the Bingham Distribution

基于Bingham分布的无偏姿态估计

Igor Gilitschenski, Gerhard Kurz, Simon J. Julier, Uwe D. Hanebeck

AI总结本文提出基于Bingham分布的滤波器，考虑姿态估计的周期性，利用超球面上的分布特性提升非线性姿态估计的精度。

1608.02286 2026-06-04 cs.RO cs.MA cs.SY eess.SY math.DS 版本更新

Enforcing Biconnectivity in Multi-robot Systems

在多机器人系统中强制双连通性

Mehran Zareh, Lorenzo Sabattini, Cristian Secchi

AI总结本文提出一种去中心化梯度协议，通过增加拉普拉斯矩阵第三小特征值来增强多机器人系统的双连通性，同时引入算法估计拉普拉斯矩阵特征向量以定义梯度，验证了理论成果的有效性。

Comments arXiv admin note: text overlap with arXiv:1608.02276

详情

AI中文摘要

在多机器人系统中，保持连通性是一项关键任务，近年来受到广泛关注。如果单个机器人失效，连接系统可能被分割成两个或更多子集。如果在单个机器人失效的情况下保证网络连通性，则可以实现更稳健的通信，这种网络被称为双连通的。在Zareh2016biconnectivitycheck中，我们提出了一个双连通性检查的标准，基本上确定了拉普拉斯矩阵第三小特征值的下界。在本文中，我们介绍了一种去中心化的基于梯度的协议，用于在双连通性检查失败时增加拉普拉斯矩阵第三小特征值的值。我们还介绍了一种去中心化的算法，用于估计拉普拉斯矩阵的特征向量，这些特征向量用于定义梯度。模拟显示了理论发现的有效性。

英文摘要

Connectivity maintenance is an essential task in multi-robot systems and it has received a considerable attention during the last years. A connected system can be broken into two or more subsets simply if a single robot fails. A more robust communication can be achieved if the network connectivity is guaranteed in the case of one-robot failures. The resulting network is called biconnected. In \cite{Zareh2016biconnectivitycheck}, we presented a criterion for biconnectivity check, which basically determines a lower bound on the third-smallest eigenvalue of the Laplacian matrix. In this paper, we introduce a decentralized gradient-based protocol to increase the value of the third-smallest eigenvalue of the Laplacian matrix, when the biconnectivity check fails. We also introduce a decentralized algorithm to estimate the eigenvectors of the Laplacian matrix, which are used for defining the gradient. Simulations show the effectiveness of the theoretical findings.

URL PDF HTML ☆

赞 0 踩 0

1603.09672 2026-06-04 eess.SY cs.RO cs.SY math.DS 版本更新

Lyapunov stability of a rigid body with two frictional contacts

具有两个摩擦接触的刚体的李雅普诺夫稳定性

Péter L. Várkonyi, Yizhar Or

AI总结本文研究了具有两个摩擦性单向接触的平面刚体在非弹性碰撞下的李雅普诺夫稳定性分析，通过引入Poincare映射和不变关系，将系统动态行为编码为两个半解析标量函数，用于判断平衡状态的稳定性。

详情

AI中文摘要

机械系统李雅普诺夫稳定性意味着在位置和速度的小扰动下，动态响应保持在静态平衡配置的任意小邻域内有界。这种稳定性在涉及多个单向接触的机器人应用中非常受欢迎。然而，此类系统的李雅普诺夫稳定性分析极其困难，因为即使小扰动也可能导致混合动力学，其中解涉及许多非光滑转换的不同接触状态。本文研究了具有两个摩擦性单向接触的平面刚体在非弹性碰撞下的李雅普诺夫稳定性分析，针对一般类别的平衡配置和恒定外部载荷。系统在接触转换和碰撞下的混合动力学被公式化，并引入了两个接触状态的Poincare映射。利用不变关系，该Poincare映射被减少为两个半解析标量函数，完全编码了任何小初始扰动下的解的动态行为。这两个函数使几乎任何平衡状态的李雅普诺夫稳定性或不稳定性得以确定。结果通过模拟示例和在描述接触几何和外部载荷的二维参数空间中绘制稳定性与不稳定性区域来展示。

英文摘要

Lyapunov stability of a mechanical system means that the dynamic response stays bounded in an arbitrarily small neighborhood of a static equilibrium configuration under small perturbations in positions and velocities. This type of stability is highly desired in robotic applications that involve multiple unilateral contacts. Nevertheless, Lyapunov stability analysis of such systems is extremely difficult, because even small perturbations may result in hybrid dynamics where the solution involves many nonsmooth transitions between different contact states. This paper concerns with Lyapunov stability analysis of a planar rigid body with two frictional unilateral contacts under inelastic impacts, for a general class of equilibrium configurations under a constant external load. The hybrid dynamics of the system under contact transitions and impacts is formulated, and a \Poincare map at two-contact states is introduced. Using invariance relations, this \Poincare map is reduced into two semi-analytic scalar functions that entirely encode the dynamic behavior of solutions under any small initial perturbation. These two functions enable determination of Lyapunov stability or instability for almost any equilibrium state. The results are demonstrated via simulation examples and by plotting stability and instability regions in two-dimensional parameter spaces that describe the contact geometry and external load.

URL PDF HTML ☆

赞 0 踩 0

1608.02276 2026-06-04 cs.RO cs.MA cs.SY eess.SY math.DS 版本更新

Decentralized Biconnectivity Conditions in Multi-robot Systems

多机器人系统中的去中心化双连通性条件

Mehran Zareh, Lorenzo Sabattini, Cristian Secchi

AI总结本文提出去中心化方法，通过拉普拉斯矩阵的第三小特征值提供双连通性条件，确保网络在单个机器人失效时仍保持连通。

1608.00337 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Higher-Degree Stochastic Integration Filtering

高次随机积分滤波器

Syed Safwan Khalid, Naveed Ur Rehman, Shafayat Abrar

AI总结本文提出一种高次随机积分滤波器，用于非线性滤波，通过比较与现有滤波器的性能，证明其优越性。

Comments Submitted to IEEE Signal Processing Letters on 17th July 2016

1607.07848 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Towards Controllability of Wireless Network Quality using Mobile Robotic Routers

无线网络质量可控性研究：基于移动机器人中继节点

Pradipta Ghosh, Raktim Pal, Bhaskar Krishnamachari

AI总结本文研究通过移动机器人中继节点的部署与移动控制，优化无线网络通信质量，提出集中式和分布式两种优化算法，通过仿真实验验证其性能。

1607.04439 2026-06-04 cs.RO cs.NI cs.SY eess.SY 版本更新

A Networked Swarm Model for UAV Deployment in the Assessment of Forest Environments

用于森林环境评估的UAV部署网络化群体模型

Matthias R. Brust, Bogdan M. Strimbu

AI总结本文提出一种新型UAV群体编队飞行方法，通过自适应网络结构保持群体连通性，实现高精度森林环境数据采集与融合。

详情

DOI: 10.1109/ISSNIP.2015.7106967

AI中文摘要

自主无人机（UAV）因其广泛的应用前景而受到青睐。伴随复杂传感器的出现，UAV可配备通信适配器实现机间通信。在UAV间形成群体通信时，如何管理其通信结构和移动性成为问题。本文考虑建立高效的群体运动模型和UAV之间的网络拓扑结构，专门用于高质量的森林制图场景。森林环境具有高度异质性的树和障碍物分布，对UAV群体构成极大挑战。群体需不断避免与树碰撞，自主改变轨迹，这可能导致与群体断开连接，需在通过障碍后重新连接，同时继续收集需高效融合和评估的环境数据。本文提出了一种新型解决方案，提供自适应且可靠的网络结构，维持群体连通性和可通信性。这些特性对于从UAV群体采集的数据实现详细准确的环境描述至关重要。本文方法的主要特点是群体中UAV数量的高可扩展性和群体内的自适应网络拓扑。

英文摘要

Autonomous Unmanned Aerial Vehicles (UAVs) have gained popularity due to their many potential application fields. Alongside sophisticated sensors, UAVs can be equipped with communication adaptors aimed for inter-UAV communication. Inter-communication of UAVs to form a UAV swarm raises questions on how to manage its communication structure and mobility. In this paper, we consider therefore the problem of establishing an efficient swarm movement model and a network topology between a collection of UAVs, which are specifically deployed for the scenario of high-quality forest-mapping. The forest environment with its highly heterogeneous distribution of trees and obstacles represents an extreme challenge for a UAV swarm. It requires the swarm to constantly avoid possible collisions with trees, to change autonomously the trajectory, which can lead to disconnection to the swarm, and to reconnect to the swarm after passing the obstacle, while continue collecting environmental data that needs to be fused and assessed efficiently. In this paper, we propose a novel solution to the formation flight problem for UAV swarms. The proposed method provides an adaptive and reliable network structure, which maintains swarm connectivity and communicability. These characteristics are needed to achieve a detailed and accurate description of the environment from the data acquired by the UAV swarm. The main characteristics of our approach are high scalability regarding the number of UAVs in the swarm and the adaptive network topology within the swarm.

URL PDF HTML ☆

赞 0 踩 0

1601.05257 2026-06-04 eess.SY cs.RO cs.SY stat.AP 版本更新

Magnetometer calibration using inertial sensors

使用惯性传感器的磁力计校准

Manon Kok, Thomas B. Schön

AI总结本文提出一种实用算法，用于校准磁力计以应对磁干扰和传感器误差，并结合惯性测量进行姿态估计。

Comments 19 pages, 8 figures

1606.09278 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Motion Planning With Gamma-Harmonic Potential Fields

基于伽马谐波势场的运动规划

Ahmad A. Masoud

AI总结本文扩展了谐波势场方法，用于处理无法分割为独立区域的机器人工作空间。通过任务导向的概率描述符和目标点生成导航策略，能够适应存在矢量漂移场的复杂环境。

详情

DOI: 10.1109/TAES.2012.6324661
Journal ref: IEEE Transactions on Aerospace and Electronic Systems, 48 (4), 2012, pp. 2786 - 2801

AI中文摘要

本文扩展了谐波势场（HPF）方法，以实现更广泛的规划能力。该方法适用于机器人工作空间无法分割为独立几何子区域的情况。所提出的方法将任务导向的概率工作空间描述符作为输入，与目标点结合，生成导航策略，以将智能体从工作空间中的任意点引导至目标。该方法易于适应包含矢量漂移场的复杂环境。HPF方法的扩展基于非均匀导电介质中电流的物理类比。所得到的势场称为伽马谐波势场（GHPF）。提供了修改方法能够避免零概率（确定威胁）区域并收敛到目标的证明。通过仿真展示了规划器的能力。

英文摘要

This paper extends the capabilities of the harmonic potential field (HPF) approach to planning. The extension covers the situation where the workspace of a robot cannot be segmented into geometrical subregions where each region has an attribute of its own. The suggested approach uses a task-centered, probabilistic descriptor of the workspace as an input to the planner. This descriptor is processed, along with a goal point, to yield the navigation policy needed to steer the agent from any point in its workspace to the target. The approach is easily adaptable to planning in a cluttered environment containing a vector drift field. The extension of the HPF approach is based on the physical analogy with an electric current flowing in a nonhomogeneous conducting medium. The resulting potential field is known as the gamma-harmonic potential (GHPF). Proofs of the ability of the modified approach to avoid zero-probability (definite threat) regions and to converge to the goal are provided. The capabilities of the planer are demonstrated using simulation.

URL PDF HTML ☆

赞 0 踩 0

1607.02632 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Decentralized, Self-organizing, Potential field-based Control for Individuallymotivated, Mobile Agents in a Cluttered Environment: A Vector-Harmonic Potential Field Approach

去中心化、自组织、基于势场的控制方法用于在复杂环境中具有个体动机的移动智能体：一种向量谐波势场方法

Ahmad A. Masoud

AI总结本文提出了一种去中心化、自组织的势场控制方法，用于复杂环境中多个智能体的协同运动控制，通过向量谐波势场方法实现低计算成本和自适应性。

详情

DOI: 10.1109/TSMCA.2007.893483
Journal ref: IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, May 2007, Volume:37, Issue: 3, pp. 372-390

AI中文摘要

空间多代理系统正受到越来越多研究者的关注，旨在为涉及多个代理资源共享的应用开发理论基础。交通管理系统是其中之一。本文探讨了构建一个去中心化交通控制器，用于大量共享工作空间的智能体，其中包含静止禁区域。所提出的多代理运动控制器在满足工作空间几何条件时是完整的。其计算成本线性增长于智能体数量。控制器具有自组织特性，能够自主处理不完整信息和意外情况。此外，控制器具有开放结构，允许任何智能体加入或退出群体而不影响其他智能体的功能方式。为此，提出了一种去中心化的定义，将去中心化等同于在人工生命模式下智能体群体的自组织。该定义用于指导多代理控制器的构建。控制器采用势场方法实现。理论发展和仿真结果均被提供。

英文摘要

Spatial multi-agency has been receiving growing attention from researchers exploring many of the aspects and modalities of this phenomenon. The aim is to develop the theoretical background needed for a multitude of applications involving the sharing of resources by more than one agent. A traffic management system is one of these applications. Here, a large group of mobile robots that are operating in communication-limited, and sensory-limited modes are required to cope with each others presence as well as the contents of their environment while preserving their ability to reach their preset, independent goals. This work explores the construction of a decentralized traffic controller for a large group of agents sharing a workspace with stationary forbidden regions. The suggested multi-agent motion controller is complete provided that a lenient condition on the geometry of the workspace is upheld. It has a low computational effort that linearly increases with the number of agents. The controller is also self-organizing; therefore, it is able to deal, on its own, with incomplete information and unexpected situations. In addition to the above, the controller has an open structure to enable any agent to join or leave the group without the remaining agents having to adjust the manner in which they function. To meet these requirements, a definition of decentralization is suggested. This definition equates decentralization to self-organization in a group of agents operating in an artificial life mode. The definition is used to provide guidelines for the construction of the multi-agent controller. The controller is realized using the potential field approach. Theoretical developments, as well as simulation results, are provided.

URL PDF HTML ☆

赞 0 踩 0

1606.09270 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Kinodynamic Motion Planning: A Novel Type Of Nonlinear, Passive Damping Forces And Advantages

运动动力学路径规划：一种新型非线性、各向异性阻尼力及优势

Ahmad A. Masoud

AI总结本文提出一种结合非线性各向异性阻尼力的新型运动动力学路径规划方法，有效管理机器人动态并抑制惯性瞬态，无需完整系统动力学知识。

详情

DOI: 10.1109/MRA.2010.935794
Journal ref: IEEE Robotics And Automation Magazine, March 2010, Pp. 85-99

AI中文摘要

本文扩展了谐振势场方法在路径规划中的能力，涵盖机器人运动的运动学和动力学方面。所提出的方法通过将梯度引导场从谐振势场转换为控制信号，加入了新型非线性、各向异性阻尼力。两种方法的结合提供了一个既能引导机器人又能有效管理其动态的信号。运动动力学规划信号继承了谐振梯度场的引导能力，同时可轻松配置以高效抑制机器人轨迹中的惯性瞬态，而不影响操作速度。该方法适用于耗散系统以及受外力作用的系统，无需完整系统动力学知识。本文提供了理论发展和仿真结果。

英文摘要

This article extends the capabilities of the harmonic potential field approach to planning to cover both the kinematic and dynamic aspects of a robot motion. The suggested approach converts the gradient guidance field from a harmonic potential to a control signal by augmenting it with a novel type of damping forces called nonlinear, anisotropic, damping forces. The combination of the two provides a signal that can both guide a robot and effectively manage its dynamics. The kinodynamic planning signal inherits the guidance capabilities of the harmonic gradient field. It can also be easily configured to efficiently suppress the inertia-induced transients in the robot trajectory without compromising the speed of operation. The approach works with dissipative systems as well as systems acted on by external forces without needing the full knowledge of the system dynamics. Theoretical developments and simulation results are provided in this article.

URL PDF HTML ☆

赞 0 踩 0

1607.01478 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Mixed Strategy for Constrained Stochastic Optimal Control

混合策略用于受约束的随机最优控制

Masahiro Ono, Mahmoud El Chamie, Marco Pavone, Behcet Acikmese

AI总结本文提出混合策略用于受约束的随机最优控制，证明随机化控制输入在非凸优化问题中可降低成本，等于对偶间隙，并提出基于对偶优化的高效求解方法。

Comments 11 pages. 9 figures.Preliminary version of a working journal paper

详情

AI中文摘要

在具有随机约束的最优控制问题中，随机选择控制输入可以降低预期成本，例如随机模型预测控制（SMPC）。我们考虑具有初始随机化的控制器，即在开始时随机选择K+1个控制序列（称为K-随机化）。已知对于具有K个约束的有限状态、有限动作马尔可夫决策过程（MDP），K-随机化足以达到最小成本。我们发现，对于具有连续状态和动作空间的随机最优控制问题，相同结果也成立。进一步，我们证明当优化问题非凸时，控制输入的随机化可以导致成本降低，且该降低量等于对偶间隙。然后，我们提供随机解最优性的必要和充分条件，并开发基于对偶优化的高效求解方法。此外，在K=1的特殊情况（如联合概率约束问题）中，对偶优化可通过根查找更高效地解决。最后，我们在路径规划到未来火星任务的着陆、下降和着陆（EDL）规划等多个实际问题上测试理论并演示求解方法。

英文摘要

Choosing control inputs randomly can result in a reduced expected cost in optimal control problems with stochastic constraints, such as stochastic model predictive control (SMPC). We consider a controller with initial randomization, meaning that the controller randomly chooses from K+1 control sequences at the beginning (called K-randimization).It is known that, for a finite-state, finite-action Markov Decision Process (MDP) with K constraints, K-randimization is sufficient to achieve the minimum cost. We found that the same result holds for stochastic optimal control problems with continuous state and action spaces.Furthermore, we show the randomization of control input can result in reduced cost when the optimization problem is nonconvex, and the cost reduction is equal to the duality gap. We then provide the necessary and sufficient conditions for the optimality of a randomized solution, and develop an efficient solution method based on dual optimization. Furthermore, in a special case with K=1 such as a joint chance-constrained problem, the dual optimization can be solved even more efficiently by root finding. Finally, we test the theories and demonstrate the solution method on multiple practical problems ranging from path planning to the planning of entry, descent, and landing (EDL) for future Mars missions.

URL PDF HTML ☆

赞 0 踩 0

1607.00765 2026-06-04 cs.NE cs.RO cs.SY eess.SY 版本更新

Multi-Objective Design of State Feedback Controllers Using Reinforced Quantum-Behaved Particle Swarm Optimization

基于强化量子行为粒子群优化的多目标状态反馈控制器设计

Kaveh Hassani, Won-Sook Lee

AI总结本文提出一种新颖的多目标设计方法，利用量子行为粒子群优化算法（QPSO）优化LQR控制器配置，结合模拟退火和高斯邻域选择机制改进初始化方案，引入动态加权准则生成Pareto最优解，通过实验验证其在控制性能上的优越性。

详情

DOI: 10.1016/j.asoc.2015.12.024
Journal ref: Applied Soft Computing, 41, pp. 66-76, 2016

AI中文摘要

本文提出了一种新颖且通用的多目标设计范式，利用量子行为粒子群优化（QPSO）算法确定给定问题中LQR控制器的最佳配置，考虑一组竞争性目标。本文的主要贡献包括：（1）基于模拟退火算法和高斯邻域选择机制的改进初始化方案；（2）结合膜计算算法优势的局部搜索策略；（3）引入动态加权准则，将软约束和硬约束与控制目标动态结合，为设计者提供一组Pareto最优解，并根据实际偏好选择目标解。所提方法在两个控制基准上通过敏感性分析和全因子参数选择与梯度法、七种元启发式算法和试错法进行比较，实验结果表明，所提方法在控制器努力、暂态响应指标和稳态相关指标上均优于对手方法。

英文摘要

In this paper, a novel and generic multi-objective design paradigm is proposed which utilizes quantum-behaved PSO(QPSO) for deciding the optimal configuration of the LQR controller for a given problem considering a set of competing objectives. There are three main contributions introduced in this paper as follows. (1) The standard QPSO algorithm is reinforced with an informed initialization scheme based on the simulated annealing algorithm and Gaussian neighborhood selection mechanism. (2) It is also augmented with a local search strategy which integrates the advantages of memetic algorithm into conventional QPSO. (3) An aggregated dynamic weighting criterion is introduced that dynamically combines the soft and hard constraints with control objectives to provide the designer with a set of Pareto optimal solutions and lets her to decide the target solution based on practical preferences. The proposed method is compared against a gradient-based method, seven meta-heuristics, and the trial-and-error method on two control benchmarks using sensitivity analysis and full factorial parameter selection and the results are validated using one-tailed T-test. The experimental results suggest that the proposed method outperforms opponent methods in terms of controller effort, measures associated with transient response and criteria related to steady-state.

URL PDF HTML ☆

赞 0 踩 0

1607.00644 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Nearest Neighbor-based Rendezvous for Sparsely Connected Mobile Agents

基于最近邻的稀疏连接移动体 rendezvous

Ahmad A. Masoud

AI总结本文提出一种收敛的最近邻控制协议，用于非平凡动力学的移动体。协议保证即使每个体仅与单个最近邻通信，也能收敛到共同点。最近邻需位于任意小的优先区外，协议由两层结构组成，第一层为一阶动力学提供 rendezvous 信号，第二层将信号转换为适合现实体的控制信号。

1606.09275 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Harmonic Potential Approach For Simultaneous Planning And Control Of A Generic UAV Platform

一种谐波势场方法用于通用无人机平台的同时规划与控制

Ahmad A. Masoud

AI总结本文提出利用谐波势场方法实现多种无人机的同时规划与控制，通过生成密集参考速度场调节无人机速度，确保其向目标点移动并满足行为约束。

详情

DOI: 10.1007/s10846-011-9570-8
Journal ref: Journal Of Intelligent & Robotic Systems: Volume 65, Issue 1 (2012), Page 153-173

AI中文摘要

本文采用谐波势场方法解决多种无人飞行器的同时规划与控制问题。通过生成参考速度场的梯度来调节无人机速度，使其向目标点移动并满足事先编码在参考场中的约束。调节过程使用了称为虚拟速度吸引器（VVA）的新概念。谐波势场梯度与VVA的综合作用能够产生高效、易于实现且具有证明正确性的上下文敏感控制动作，适用于多种无人机。该方法已开发并提供了正确性证明及仿真结果。

英文摘要

Simultaneous planning and control of a large variety of unmanned aerial vehicles (UAVs) is tackled using the harmonic potential field (HPF) approach. A dense reference velocity field generated from the gradient of an HPF is used to regulate the velocity of the UAV concerned in a manner that would propel the UAV to a target point while enforcing the constraints on behavior that were a priori encoded in the reference field. The regulation process is carried-out using a novel and simple concept called the: virtual velocity attractor (VVA). The combined effect of the HPF gradient and the VVA is found able to yield an efficient, easy to implement, well-behaved and provably-correct context-sensitive control action that suits a wide variety of UAVs. The approach is developed and basic proofs of correctness are provided along with simulation results.

URL PDF HTML ☆

赞 0 踩 0

1606.08323 2026-06-04 math.OC cs.RO cs.SY eess.SY math.DS 版本更新

Simultaneous Mode, Input and State Estimation for Switched Linear Stochastic Systems

切换线性随机系统的模式、输入和状态联合估计

Sze Zheng Yong, Minghui Zhu, Emilio Frazzoli

AI总结本文提出了一种滤波算法，用于同时估计隐藏模式切换线性随机系统的模式、输入和状态。通过多模型方法，利用每个模式的线性输入和状态滤波器银行，通过识别关键性质实现最可能模型的估计。

Comments Submitted to SIAM Journal on Control and Optimization

1606.05124 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Robust Active Perception via Data-association aware Belief Space planning

通过数据关联意识的信念空间规划实现鲁棒的主动感知

Shashank Pathak, Antony Thomas, Asaf Feniger, Vadim Indelman

AI总结本文提出一种结合数据关联推理的信念空间规划方法，以应对定位不确定性和感知模糊环境中的挑战，通过设计新的成本函数提升主动解歧能力。

详情

AI中文摘要

我们开发了一种信念空间规划（BSP）方法，通过在规划中整合数据关联（DA）推理，同时考虑额外的不确定性来源，从而推动了该领域的前沿。现有BSP方法通常假设数据关联已知且完美，但在存在定位不确定性、模糊和感知混叠环境时，这一假设更难成立。相反，我们的数据关联意识信念空间规划（DA-BSP）方法在信念演化中显式推理数据关联，因此能更好地应对这些具有挑战性的现实场景。特别是，我们展示了由于感知混叠，后验信念成为概率分布函数的混合，设计了衡量预期模糊程度和后验不确定性的成本函数。使用这些以及标准成本（如控制惩罚、距离目标）在目标函数中，得到一个能够可靠表示动作影响且特别擅长主动解歧的通用框架。我们的方法因此适用于感知混叠环境中的鲁棒主动感知和自主导航。我们通过基本和现实的模拟展示了关键方面。

英文摘要

We develop a belief space planning (BSP) approach that advances the state of the art by incorporating reasoning about data association (DA) within planning, while considering additional sources of uncertainty. Existing BSP approaches typically assume data association is given and perfect, an assumption that can be harder to justify while operating, in the presence of localization uncertainty, in ambiguous and perceptually aliased environments. In contrast, our data association aware belief space planning (DA-BSP) approach explicitly reasons about DA within belief evolution, and as such can better accommodate these challenging real world scenarios. In particular, we show that due to perceptual aliasing, the posterior belief becomes a mixture of probability distribution functions, and design cost functions that measure the expected level of ambiguity and posterior uncertainty. Using these and standard costs (e.g.~control penalty, distance to goal) within the objective function, yields a general framework that reliably represents action impact, and in particular, capable of active disambiguation. Our approach is thus applicable to robust active perception and autonomous navigation in perceptually aliased environments. We demonstrate key aspects in basic and realistic simulations.

URL PDF HTML ☆

赞 0 踩 0

1602.02990 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Self-organized control for musculoskeletal robots

肌骨机器人中的自组织控制

Ralf Der, Georg Martius

AI总结本文提出了一种自组织控制方法，通过无功能控制器实现机器人与环境的动态交互，展示了其在肌肉驱动臂肩系统中的自组织行为及与物体动态的共振效应。

Comments 11 pages, 4 figures, 1 table

详情

AI中文摘要

随着机器人技术的快速发展，最优控制成为研究核心。传统方法中，控制器基于传感器历史数据和预设目标进行动作决策。然而，弹性驱动机器人面临严重挑战。本文提出自组织控制新范式，采用无自身功能的固定函数控制器，基于传感器历史数据。在Myorobotics工具包的肌肉驱动臂肩系统中，观察到多样化的自组织行为：当系统独处时，臂部产生伪随机姿态序列，也可被操控为确定性运动模式。最有趣的是，当附加物体后，控制器与物体内部动态产生共振：给半满瓶时，系统自发摇晃瓶身以产生最大水动态响应；附加摆锤时，控制器使其进入圆周模式。本文还讨论了该控制器范式在意图驱动行为生成中的应用前景。

英文摘要

With the accelerated development of robot technologies, optimal control becomes one of the central themes of research. In traditional approaches, the controller, by its internal functionality, finds appropriate actions on the basis of the history of sensor values, guided by the goals, intentions, objectives, learning schemes, and so on planted into it. The idea is that the controller controls the world---the body plus its environment---as reliably as possible. However, in elastically actuated robots this approach faces severe difficulties. This paper advocates for a new paradigm of self-organized control. The paper presents a solution with a controller that is devoid of any functionalities of its own, given by a fixed, explicit and context-free function of the recent history of the sensor values. When applying this controller to a muscle-tendon driven arm-shoulder system from the Myorobotics toolkit, we observe a vast variety of self-organized behavior patterns: when left alone, the arm realizes pseudo-random sequences of different poses but one can also manipulate the system into definite motion patterns. But most interestingly, after attaching an object, the controller gets in a functional resonance with the object's internal dynamics: when given a half-filled bottle, the system spontaneously starts shaking the bottle so that maximum response from the dynamics of the water is being generated. After attaching a pendulum to the arm, the controller drives the pendulum into a circular mode. In this way, the robot discovers dynamical affordances of objects its body is interacting with. We also discuss perspectives for using this controller paradigm for intention driven behavior generation.

URL PDF HTML ☆

赞 0 踩 0

1606.03754 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Towards Self-Calibrating Inertial Body Motion Capture

迈向自校准的惯性人体运动捕捉

Bertram Taetz, Gabriele Bleser, Markus Miezal

AI总结本文提出了一种新的在线方法，用于同时估计人体运动的肢体方位和位置及传感器到肢体的校准参数，结合了先进的运动、测量和生物力学模型及新的随机方程和先验知识。

Comments 9 pages, 6 figures, accepted to the 19th International Conference on Information Fusion

1411.0181 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Restricted Discrete Invariance and Self-Synchronization For Stable Walking of Bipedal Robots

受限离散不变性与自同步用于双足机器人稳定行走

Hamed Razavi, Anthony M. Bloch, Christine Chevallereau, J. W. Grizzle

AI总结本文研究了双足机器人稳定行走的低维子流形不变性，提出自同步概念，通过3D线性倒立摆模型分析，扩展至9自由度双足机器人，验证渐近稳定行走的可行性。

Comments Conference

详情

AI中文摘要

双足运动模型是混合系统，包含连续部分（由拉格朗日方程和执行器生成）和离散部分（腿部转移）。离散部分通常由连续状态空间中的局部嵌入共维一子流形（切换面）和重置映射组成。本文旨在识别切换面的低维子流形，使其在闭环动力学下不变，从而实现渐近稳定的周期性步态。本文首先研究经典的3D线性倒立摆（LIP）模型，通过分析结果更易获得。关键贡献是自同步概念，即摆动平面的周期趋于共同周期。通过3D LIP模型的不变性研究，将该概念扩展到9自由度3D双足机器人，并通过数值研究验证渐近稳定行走的可行性。

英文摘要

Models of bipedal locomotion are hybrid, with a continuous component often generated by a Lagrangian plus actuators, and a discrete component where leg transfer takes place. The discrete component typically consists of a locally embedded co-dimension one submanifold in the continuous state space of the robot, called the switching surface, and a reset map that provides a new initial condition when a solution of the continuous component intersects the switching surface. The aim of this paper is to identify a low-dimensional submanifold of the switching surface, which, when it can be rendered invariant by the closed-loop dynamics, leads to asymptotically stable periodic gaits. The paper begins this process by studying the well-known 3D Linear Inverted Pendulum (LIP) model, where analytical results are much easier to obtain. A key contribution here is the notion of \textit{self-synchronization}, which refers to the periods of the pendular motions in the sagittal and frontal planes tending to a common period. The notion of invariance resulting from the study of the 3D LIP model is then extended to a 9-DOF 3D biped. A numerical study is performed to illustrate that asymptotically stable walking may be obtained.

URL PDF HTML ☆

赞 0 踩 0

1606.02205 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Applying Gaussian distributed constraints to Gaussian distributed variables

将高斯分布约束应用于高斯分布变量

Andrew W. Palmer, Andrew J. Hill, Steven J. Scheding

AI总结本文提出一种分析方法，用于截断由高斯分布描述的不等式约束高斯分布变量，通过基于矩的高斯近似改进了截断分布，应用于不确定约束的卡尔曼滤波问题，仿真显示效果优于无约束和硬约束卡尔曼滤波。

Comments 34 pages, 12 figures, accepted to Information Fusion. Updated to fix minor errors in equations (33), (34), and (B.12)

1605.08542 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Distributed controller-estimator for target tracking of networked robotic systems under sampled interaction

分布式控制器-估计器用于网络化机器人系统在采样交互下的目标跟踪

Ming-Feng Ge, Zhi-Hong Guan, Bin Hu, Ding-Xin He, Rui-Quan Liao

AI总结本文提出两种新型分布式控制器-估计器算法，用于网络化机器人系统在采样交互下的目标跟踪问题，通过小值范数和李雅普诺夫稳定性理论分析，确定交互拓扑、采样周期等参数以实现跟踪误差的实际稳定性。

Comments 8 pages, 4 figures, Published in Automatica

详情

DOI: 10.1016/j.automatica.2016.03.008
Journal ref: Automatica, 2016, 69: 410-417

AI中文摘要

本文研究了在采样交互下网络化机器人系统（NRSs）的目标跟踪问题。目标被假设为时间变化的，并由二阶振荡器描述。提出了两种新颖的分布式控制器-估计器算法（DCEA），其包含连续和不连续信号。基于小值范数和李雅普诺夫稳定性理论的性质，给出了交互拓扑、采样周期和其他控制参数的条件，以实现跟踪误差的实际稳定性和稳定性区域的定量调节。通过与其他算法的比较和仿真示例，展示了所提DCEA的优势。

英文摘要

This paper investigates the target tracking problem for networked robotic systems (NRSs) under sampled interaction. The target is assumed to be time-varying and described by a second-order oscillator. Two novel distributed controller-estimator algorithms (DCEA), which consist of both continuous and discontinuous signals, are presented. Based on the properties of small-value norms and Lyapunov stability theory, the conditions on the interaction topology, the sampling period, and the other control parameters are given such that the practical stability of the tracking error is achieved and the stability region is regulated quantitatively. The advantages of the presented DCEA are illustrated by comparisons with each other and the existing coordination algorithms. Simulation examples are given to demonstrate the theoretical results.

URL PDF HTML ☆

赞 0 踩 0

1406.4047 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robot Impedance Control and Passivity Analysis with Inner Torque and Velocity Feedback Loops

机器人阻抗控制与内力和速度反馈环的被动性分析

Michele Focchi, Gustavo A. Medrano-Cerda, Thiago Boaventura, Marco Frigerio, Jonas Buchli, Darwin G. Caldwell, Claudio Semini

AI总结本文研究了机器人阻抗控制中内力和速度反馈环对稳定性区域和系统被动性的影响，通过仿真和实验数据验证，并提出基于内力环和正速度反馈环的关节阻抗控制器设计方法。

Comments 14 pages in Control Theory and Technology (2016)

详情

DOI: 10.1007/s11768-016-5015-z

AI中文摘要

带有时间时序逻辑约束的最优任务规划器

Yuchen Zhou, Dipankar Maity, John S. Baras

AI总结本文提出一种基于优化的方法，用于在动态环境中规划移动机器人路径，满足时间受限的时序约束。通过度量时序逻辑（MTL）编码任务规范，并将其转化为混合整数线性约束，利用混合整数线性规划求解器求解优化问题。

Comments European Control Conference 2015

详情

DOI: 10.1109/ECC.2015.7330634

AI中文摘要

在本文中，我们提出了一种基于优化的方法，用于在动态环境中规划移动机器人的路径，以满足时间受限的时序约束。时序逻辑（TL）可以处理非常复杂的任务规范，如安全、覆盖、运动序列等。我们使用度量时序逻辑（MTL）来编码具有时间约束的任务规范。然后，我们将MTL公式转换为混合整数线性约束，并使用混合整数线性规划求解器求解相关优化问题。这种方法不同于基于自动机的方法，后者生成环境和动态的有限抽象，并使用自动机理论方法正式生成满足TL的路径。我们已在多个复杂的动态环境中应用了我们的方法，这些环境受到时间时序规范的约束。

英文摘要

In this paper, we present an optimization based method for path planning of a mobile robot subject to time bounded temporal constraints, in a dynamic environment. Temporal logic (TL) can address very complex task specification such as safety, coverage, motion sequencing etc. We use metric temporal logic (MTL) to encode the task specifications with timing constraints. We then translate the MTL formulae into mixed integer linear constraints and solve the associated optimization problem using a mixed integer linear program solver. This approach is different from the automata based methods which generate a finite abstraction of the environment and dynamics, and use an automata theoretic approach to formally generate a path that satisfies the TL. We have applied our approach on several case studies in complex dynamical environments subjected to timed temporal specifications.

URL PDF HTML ☆

赞 0 踩 0

1604.06558 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Folding Assembly by Means of Dual-Arm Robotic Manipulation

通过双臂机械手实现折叠组装

Diogo Almeida, Yiannis Karayiannidis

AI总结本文提出一种适用于双臂机械手的折叠组装基本操作，用于更高层次的组装策略。通过实验验证了该方法在两个部件接触时的可行性。

Comments 7 pages, accepted for ICRA 2016

1604.05064 2026-06-04 cs.RO cs.SY eess.SY 版本更新

An Approximation Algorithm for a Shortest Dubins Path Problem

一种短路径问题的近似算法

Sivakumar Rathinam, Pramod Khargonekar

AI总结本文提出了一种改进的近似算法，将短路径问题的解的质量保证从3.04降低到2.04，通过实验证明了该方法的有效性。

1604.02930 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Implementation of haptic communication in comanipulative tasks: a statistical state machine model

在协同操作任务中实现触觉通信：一种统计状态机模型

Lucas Roche, Ludovic Saint-Bauzel

AI总结本文通过轻量化条件下的机械臂实验，探讨物理人机交互中的时间基通信机制，提出统计状态机模型并验证其与人类交互性能的接近性。

1604.00975 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Combining Vision, Machine Learning and Automatic Control to Play the Labyrinth Game

将视觉、机器学习和自动控制结合以玩迷宫游戏

Kristoffer Öfjäll, Michael Felsberg

AI总结本文提出结合确定性控制与学习控制的方法，利用视觉系统和PID控制器实现迷宫游戏的自动导航，通过学习提升控制性能。

Comments Presented at the SSBA Symposium 2012, Stockholm, Sweden

1604.00602 2026-06-04 eess.SY cs.RO cs.SY math.DS math.OC 版本更新

Convex Computation of the Basin of Stability to Measure the Likelihood of Falling: A Case Study on the Sit-to-Stand Task

用凸优化计算稳定性盆地以衡量跌倒的可能性：站立-坐下的任务案例研究

Victor Shia, Talia Moore, Ruzena Bajcsy, Ram Vasudevan

AI总结本文利用动力系统理论，开发出自动框架计算稳定性盆地，用于评估站立-坐下的稳定性策略。实验验证了该方法能区分不同稳定性策略。

Comments 11 pages, 9 figures

1603.08246 2026-06-04 eess.SY cs.LO cs.RO cs.SY 版本更新

Timed Automata Approach for Motion Planning Using Metric Interval Temporal Logic

基于度量区间时序逻辑的运动规划方法

Yuchen Zhou, Dipankar Maity, John S. Baras

AI总结本文研究在给定时间约束的高层规范下机器人运动规划问题，采用度量区间时序逻辑表示任务规范，并构造时序自动机以生成可行的运动路径序列。

Comments Full Version for ECC 2016

1505.01874 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Adaptive importance sampling for control and inference

自适应重要性采样用于控制与推断

Hilbert Johan Kappen, Hans Christian Ruiz

AI总结本文提出了一种基于交叉熵方法的自适应重要性采样算法，用于解决非线性随机控制问题，展示了其在潜在状态模型后验分布估计中的应用。

Comments 23 pages, 4 figures

详情

DOI: 10.1007/s10955-016-1446-7

AI中文摘要

路径积分（PI）控制问题是一类受限的非线性控制问题，可以形式化为费曼-卡茨路径积分并利用蒙特卡洛采样进行估计。本文回顾了有限时间 horizon 情况下的路径积分控制理论。随后，我们关注如何计算和表示控制解。在 PI 理论中，计算问题转化为重要性采样问题。高效的采样器是状态反馈控制器，其使用需要高效的表示。学习和表示有效的状态反馈控制器对于非线性随机控制问题是一个极具挑战性且大多未解决的问题。我们展示了如何利用交叉熵方法的思想来学习和表示此类控制器。我们推导出一种梯度下降方法，允许使用任意参数化来学习反馈控制器。我们将此方法称为路径积分交叉熵方法或 PICE。我们通过一些简单示例来说明此方法。路径积分控制方法可用于估计潜在状态模型的后验分布。在神经科学中，这些问题出现在使用 EM 方法从神经记录数据中估计连接性时。我们展示了路径积分控制方法作为粒子过滤的准确替代方案。

英文摘要

Path integral (PI) control problems are a restricted class of non-linear control problems that can be solved formally as a Feyman-Kac path integral and can be estimated using Monte Carlo sampling. In this contribution we review path integral control theory in the finite horizon case. We subsequently focus on the problem how to compute and represent control solutions. Within the PI theory, the question of how to compute becomes the question of importance sampling. Efficient importance samplers are state feedback controllers and the use of these requires an efficient representation. Learning and representing effective state-feedback controllers for non-linear stochastic control problems is a very challenging, and largely unsolved, problem. We show how to learn and represent such controllers using ideas from the cross entropy method. We derive a gradient descent method that allows to learn feed-back controllers using an arbitrary parametrisation. We refer to this method as the Path Integral Cross Entropy method or PICE. We illustrate this method for some simple examples. The path integral control methods can be used to estimate the posterior distribution in latent state models. In neuroscience these problems arise when estimating connectivity from neural recording data using EM. We demonstrate the path integral control method as an accurate alternative to particle filtering.

URL PDF HTML ☆

赞 0 踩 0

1410.7632 2026-06-04 cs.CV cs.RO cs.SY eess.SY 版本更新

On the Covariance of ICP-based Scan-matching Techniques

关于基于ICP的扫描匹配技术的协方差

Silvère Bonnabel, Martin Barczyk, François Goulette

AI总结本文研究了ICP算法计算旋转变换协方差的问题，指出点到点版本的ICP应用会导致错误协方差，通过数学证明验证点到平面版本的正确性。

Comments Accepted at 2016 American Control Conference

1505.07158 2026-06-04 eess.SY cs.DC cs.RO cs.SY 版本更新

Resilient and Decentralized Control of Multi-level Cooperative Mobile Networks to Maintain Connectivity under Adversarial Environment

多级协作移动网络在对抗环境下的鲁棒与去中心化控制以维持连通性

Juntao Chen, Quanyan Zhu

AI总结本文提出基于博弈论的框架，设计去中心化算法以最大化全局网络的代数连通性，并通过案例研究验证算法的有效性和网络鲁棒性。

Comments 9 pages, 6 figures

1603.04586 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Optimal Sensing via Multi-armed Bandit Relaxations in Mixed Observability Domains

通过混合可观测域中的多臂老虎机放松实现最优感知

Mikko Lauri, Risto Ritala

AI总结研究在混合可观测域中不确定决策问题，通过放松约束推导最优价值函数上界，并利用多臂老虎机的可计算最优策略提升搜索空间剪枝效率，实验显示在目标监控领域有效。

Comments 6 pages, 2 figures

1510.05344 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Topology-Guided Path Integral Approach for Stochastic Optimal Control

一种基于拓扑的路径积分方法用于随机最优控制

Jung-Su Ha, Han-Lim Choi

AI总结本文提出一种高效的连续时间连续空间随机最优控制方法，通过拓扑运动规划算法缓解局部极小问题，生成动态可行且无碰撞轨迹。

Comments 8 pages, 4 figures, accepted to IEEE International Conference on Robotics and Automation (ICRA) 2016

1603.02650 2026-06-04 eess.SY cs.RO cs.SY 版本更新

An MILP Approach for Real-time Optimal Controller Synthesis with Metric Temporal Logic Specifications

基于混合整数线性规划的实时最优控制器综合方法：带有度量时序逻辑规范

Sayan Saha, A. Agung Julius

AI总结本文提出一种混合整数线性规划方法，用于在动态环境中实时合成满足度量时序逻辑规范的控制器，通过优化性能指标确保系统行为正确。

Comments American Control Conference 2016 (extended version)

详情

AI中文摘要

本文的核心思想是合成反应控制器，使得系统闭环执行轨迹满足期望规范，以确保正确系统行为，同时优化期望性能标准。在我们的方法中，系统行为的正确性可以根据系统与环境的关系来定义，例如系统的输出轨迹终止在目标集而不进入不安全集。使用度量时序逻辑（MTL）规范可以进一步捕捉复杂系统行为和时间要求，例如输出轨迹必须在一定时间内经过若干路标后终止在目标集。给定一个混合逻辑动态（MLD）系统和以MTL公式或更简单的到达-避免规范表示的系统规范，我们的目标是在非确定性环境中找到满足规范的闭环轨迹。通过使用混合整数线性规划（MILP）框架，我们搜索输入信号空间以获得有效的轨迹，通过在必要时添加约束以满足MTL公式，以避免求解MILP问题的指数复杂性。我们还展示了在动态变化的环境中为移动机器人规划路径的实验结果，具有期望的任务规范。

英文摘要

The fundamental idea of this work is to synthesize reactive controllers such that closed-loop execution trajectories of the system satisfy desired specifications that ensure correct system behaviors, while optimizing a desired performance criteria. In our approach, the correctness of a system's behavior can be defined according to the system's relation to the environment, for example, the output trajectories of the system terminate in a goal set without entering an unsafe set. Using Metric Temporal Logic (MTL) specifications we can further capture complex system behaviors and timing requirements, such as the output trajectories must pass through a number of way-points within a certain time frame before terminating in the goal set. Given a Mixed Logical Dynamical (MLD) system and system specifications in terms of MTL formula or simpler reach-avoid specifications, our goal is to find a closed-loop trajectory that satisfies the specifications, in non-deterministic environments. Using an MILP framework we search over the space of input signals to obtain such valid trajectories of the system, by adding constraints to satisfy the MTL formula only when necessary, to avoid the exponential complexity of solving MILP problems. We also present experimental results for planning a path for a mobile robot through a dynamically changing environment with a desired task specification.

URL PDF HTML ☆

赞 0 踩 0

1603.02381 2026-06-04 cs.RO cs.SY eess.SY 版本更新

The Effect of Communication Topology on Scalar Field Estimation by Networked Robotic Swarms

网络化机器人群中通信拓扑对标量场估计的影响

Ragesh K Ramachandran, Spring Berman

AI总结本文研究了利用网络化机器人群重构二维标量场的问题，通过链式或网格拓扑结构进行通信，采用优化方法结合梯度计算，推导了可观测性格拉姆矩阵的迹界，验证了链式和网格拓扑的估计能力与鲁棒性。

1603.02038 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Unscented Bayesian Optimization for Safe Robot Grasping

无迹贝叶斯优化用于安全机器人抓取

José Nogueira, Ruben Martinez-Cantin, Alexandre Bernardino, Lorenzo Jamone

AI总结本文提出无迹贝叶斯优化算法，通过考虑输入噪声在安全区域寻找最优抓取策略，提升机器人抓取的安全性和效率。

Comments conference paper

详情

AI中文摘要

我们解决了在输入空间存在不确定性时的机器人抓取优化问题。通过试错探索策略实现抓取未知物体。贝叶斯优化是一种样本高效的优化算法，特别适合此设置，因为它能主动减少试验次数以学习待优化函数。事实上，这种主动对象探索策略与婴儿学习最佳抓取方式的策略相同。在学习抓取策略时，一些抓取参数配置可能对物体与机器人末端执行器之间相对姿态的误差非常敏感。我们称这些配置为不安全，因为抓取执行中的小误差可能将好的抓取变为坏的抓取。因此，为了降低抓取失败的风险，抓取应规划在安全区域。我们提出了一种新的算法，即无迹贝叶斯优化，能够在考虑输入噪声的情况下进行样本高效的优化以找到安全的极值。无迹贝叶斯优化的贡献是双方面的：一方面提供了一个新的决策过程，驱动探索到安全区域；另一方面提供了一个新的选择过程，选择在不进行额外分析或计算成本的情况下最优的抓取策略。这两个贡献都根植于无迹变换背后的强大理论，这是一种流行的非线性近似方法。我们在合成问题和现实的机器人抓取模拟中展示了其相对于经典贝叶斯优化的优势。结果表明，我们的方法在几次试验后就能获得最优且鲁棒的抓取策略，同时所选的抓取保持在安全区域。

英文摘要

We address the robot grasp optimization problem of unknown objects considering uncertainty in the input space. Grasping unknown objects can be achieved by using a trial and error exploration strategy. Bayesian optimization is a sample efficient optimization algorithm that is especially suitable for this setups as it actively reduces the number of trials for learning about the function to optimize. In fact, this active object exploration is the same strategy that infants do to learn optimal grasps. One problem that arises while learning grasping policies is that some configurations of grasp parameters may be very sensitive to error in the relative pose between the object and robot end-effector. We call these configurations unsafe because small errors during grasp execution may turn good grasps into bad grasps. Therefore, to reduce the risk of grasp failure, grasps should be planned in safe areas. We propose a new algorithm, Unscented Bayesian optimization that is able to perform sample efficient optimization while taking into consideration input noise to find safe optima. The contribution of Unscented Bayesian optimization is twofold as if provides a new decision process that drives exploration to safe regions and a new selection procedure that chooses the optimal in terms of its safety without extra analysis or computational cost. Both contributions are rooted on the strong theory behind the unscented transformation, a popular nonlinear approximation method. We show its advantages with respect to the classical Bayesian optimization both in synthetic problems and in realistic robot grasp simulations. The results highlights that our method achieves optimal and robust grasping policies after few trials while the selected grasps remain in safe regions.

URL PDF HTML ☆

赞 0 踩 0

1603.00955 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Decentralized State Estimation via a Hybrid of Consensus and Covariance intersection

通过一致性与协方差交叠的混合方法实现去中心化状态估计

Amirhossein Tamjidi, Suman Chakravorty, Dylan Shell

AI总结本文提出了一种新的递归信息一致性滤波器，用于去中心化动态状态估计，通过协方差交叠处理先验一致性，利用Metropolis Hastings马尔可夫链处理新信息一致性，并在大气散射问题中验证了方法的性能。

1603.00748 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY 版本更新

Continuous Deep Q-Learning with Model-based Acceleration

基于模型的连续深度Q学习加速

Shixiang Gu, Timothy Lillicrap, Ilya Sutskever, Sergey Levine

AI总结本文提出连续深度Q学习算法NAF及基于模型的加速方法，用于提升连续控制任务的样本效率和学习速度。

详情

AI中文摘要

模型无关强化学习已成功应用于多种挑战性问题，并扩展到处理大规模神经网络策略和价值函数。然而，模型无关算法的样本复杂性，特别是使用高维函数近似器时，限制了其在物理系统中的应用。本文探索了减少深度强化学习样本复杂性的算法和表示方法。我们提出两种互补技术来提高此类算法的效率。首先，我们推导出Q学习的连续变种，称为归一化优势函数（NAF），作为替代更常用的策略梯度和actor-critic方法。NAF表示允许我们应用带有经验回放的Q学习来处理连续任务，并在一组模拟机器人控制任务上显著提高性能。为进一步提高我们的方法效率，我们探索了使用学习模型来加速模型无关强化学习。我们展示迭代重新拟合的局部线性模型在这一点上特别有效，并在适用此类模型的领域中展示了显著更快的学习速度。

英文摘要

Model-free reinforcement learning has been successfully applied to a range of challenging problems, and has recently been extended to handle large neural network policies and value functions. However, the sample complexity of model-free algorithms, particularly when using high-dimensional function approximators, tends to limit their applicability to physical systems. In this paper, we explore algorithms and representations to reduce the sample complexity of deep reinforcement learning for continuous control tasks. We propose two complementary techniques for improving the efficiency of such algorithms. First, we derive a continuous variant of the Q-learning algorithm, which we call normalized adantage functions (NAF), as an alternative to the more commonly used policy gradient and actor-critic methods. NAF representation allows us to apply Q-learning with experience replay to continuous tasks, and substantially improves performance on a set of simulated robotic control tasks. To further improve the efficiency of our approach, we explore the use of learned models for accelerating model-free reinforcement learning. We show that iteratively refitted local linear models are especially effective for this, and demonstrate substantially faster learning on domains where such models are applicable.

URL PDF HTML ☆

赞 0 踩 0

1511.04634 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Motion Planning for Global Localization in Non-Gaussian Belief Spaces

非高斯信念空间中的全局定位运动规划

Saurav Agarwal, Amirhossein Tamjidi, Suman Chakravorty

AI总结本文提出了一种在不确定环境下进行运动规划的方法，用于处理模糊数据关联导致的多模态假设。通过递推方法逐步消除多模态信念，实现有限时间内精确定位。

Comments extends previous submission with updated figures, analysis and justifications. arXiv admin note: text overlap with arXiv:1506.01780

1602.00646 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Autonomous Agent Behaviour Modelled in PRISM -- A Case Study

基于PRISM的自主代理行为建模——一个案例研究

Ruth Hoffmann, Murray Ireland, Alice Miller, Gethin Norman, Sandor Veres

AI总结本文提出了一种抽象的自主性定义，用于建模自主场景，并通过小规模仿真模型推断定量数据，以验证无人飞行器在自主场景中的行为。

1403.5986 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Controllability Analysis for Multirotor Helicopter Rotor Degradation and Failure

多旋翼直升机旋翼退化与失效的可控性分析

Guang-Xun Du, Quan Quan, Binxian Yang, Kai-Yuan Cai

AI总结本文针对多旋翼系统在旋翼失效或磨损情况下的可控性分析问题，提出了一种易用的测量指标来评估控制权限，并推导出新的必要充分条件及可控性测试方法，验证了不同旋翼配置的多旋翼在故障容错能力上的差异。

Comments 21 pages, 4 figures

详情

DOI: 10.2514/1.G000731
Journal ref: AIAA Journal of Guidance, Control, and Dynamics, 2015, 38(5): 978-984

AI中文摘要

本文考虑了一类受旋翼失效或磨损影响的多旋翼系统的可控性分析问题。研究表明，传统的线性系统可控性理论不足以测试所考虑的多旋翼的可控性。为此，引入了一个易于使用的测量指标来评估可用的控制权限。基于此，推导出多旋翼的可控性的新必要充分条件。此外，提出了一种可控性测试程序。所提出的可控性测试方法应用于不同旋翼配置和不同旋翼效率参数的六旋翼类别，以展示其有效性。分析结果表明，不同旋翼配置的六旋翼具有不同的故障容错能力。因此，在采用任何故障容错控制策略之前，必须测试多旋翼的可控性。

英文摘要

This paper considers the controllability analysis problem for a class of multirotor systems subject to rotor failure/wear. It is shown that classical controllability theories of linear systems are not sufficient to test the controllability of the considered multirotors. Owing to this, an easy-to-use measurement index is introduced to assess the available control authority. Based on it, a new necessary and sufficient condition for the controllability of multirotors is derived. Furthermore, a controllability test procedure is approached. The proposed controllability test method is applied to a class of hexacopters with different rotor configurations and different rotor efficiency parameters to show its effectiveness. The analysis results show that hexacopters with different rotor configurations have different fault-tolerant capabilities. It is therefore necessary to test the controllability of the multirotors before any fault-tolerant control strategies are employed.

URL PDF HTML ☆

赞 0 踩 0

1310.2539 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Intrinsic filtering on Lie groups with applications to attitude estimation

在李群上的内在滤波及其在姿态估计中的应用

Axel Barrau, Silvere Bonnabel

AI总结本文提出了一种基于概率的李群系统内在滤波方法，通过构建连续时间不变观测器理论，证明误差方程为马尔可夫链，且在噪声下分布收敛，同时提出离散时间不变扩展卡尔曼滤波及数值方法用于姿态估计。

Comments Submitted

详情

AI中文摘要

本文提出了一种概率方法，用于具有不变性质的矩阵李群系统内在滤波问题。将具有连续时间模型和离散时间测量的不变问题纳入严谨的随机和几何框架。基于连续时间不变观测器理论，证明误差方程为马尔可夫链，且不依赖于状态估计。当滤波器增益固定且在无噪声情况下具有近全球收敛性时，噪声误差的分布被证明收敛于平稳分布，为李群上滤波的数学理论提供见解。从工程角度，我们引入了离散时间不变扩展卡尔曼滤波，证明信任协方差矩阵渐近收敛，并提出一些更复杂的基于样本的计算方法来计算卡尔曼增益。这些方法应用于姿态估计，推导出该领域的新型理论结果，并通过合成数据的仿真进行说明。

英文摘要

This paper proposes a probabilistic approach to the problem of intrinsic filtering of a system on a matrix Lie group with invariance properties. The problem of an invariant continuous-time model with discrete-time measurements is cast into a rigorous stochastic and geometric framework. Building upon the theory of continuous-time invariant observers, we show that, as in the linear case, the error equation is a Markov chain that does not depend on the state estimate. Thus, when the filter's gains are held fixed, and the filter admits almost-global convergence properties with noise turned off, the noisy error's distribution is proved to converge to a stationary distribution, providing insight into the mathematical theory of filtering on Lie groups. For engineering purposes we also introduce the discrete-time Invariant Extended Kalman Filter, for which the trusted covariance matrix is shown to asymptotically converge, and some numerically more involved sample-based methods as well to compute the Kalman gains. The methods are applied to attitude estimation, allowing to derive novel theoretical results in this field, and illustrated through simulations on synthetic data.

URL PDF HTML ☆

赞 0 踩 0

1407.6836 2026-06-04 eess.SY cs.RO cs.SY math.PR 版本更新

A Theory of Cheap Control in Embodied Systems

具身系统中的廉价控制理论

Guido Montufar, Keyan Ghazi-Zahedi, Nihat Ay

AI总结本文提出了一种设计具身代理廉价控制架构的框架，通过利用具身性实现更高效的通用逼近，对比传统非具身方法，展示通过条件受限玻尔兹曼机政策模型的定量案例研究，实验验证了理论的实用性。

Comments 27 pages, 10 figures

详情

DOI: 10.1371/journal.pcbi.1004427

AI中文摘要

我们提出了一种设计具身代理廉价控制架构的框架。我们的推导受经典通用逼近问题的启发，探索利用代理的具身性实现行为生成的新且更高效的通用逼近。该具身通用逼近与传统非具身通用逼近进行比较。为了说明我们的方法，我们展示了基于条件受限玻尔兹曼机政策模型的详细定量案例研究。与需要指数级参数的传统非具身通用逼近不同，在具身设置中，我们能够用大幅减小的模型生成所有可能的行为，从而获得廉价通用逼近。我们通过六足步行机实验测试并验证了理论。实验显示，理论预测的足够控制器复杂度是紧致的，这意味着该理论具有直接的实用意义。关键词：廉价设计，具身性，感觉运动环路，通用逼近，条件受限玻尔兹曼机

英文摘要

We present a framework for designing cheap control architectures for embodied agents. Our derivation is guided by the classical problem of universal approximation, whereby we explore the possibility of exploiting the agent's embodiment for a new and more efficient universal approximation of behaviors generated by sensorimotor control. This embodied universal approximation is compared with the classical non-embodied universal approximation. To exemplify our approach, we present a detailed quantitative case study for policy models defined in terms of conditional restricted Boltzmann machines. In contrast to non-embodied universal approximation, which requires an exponential number of parameters, in the embodied setting we are able to generate all possible behaviors with a drastically smaller model, thus obtaining cheap universal approximation. We test and corroborate the theory experimentally with a six-legged walking machine. The experiments show that the sufficient controller complexity predicted by our theory is tight, which means that the theory has direct practical implications. Keywords: cheap design, embodiment, sensorimotor loop, universal approximation, conditional restricted Boltzmann machine

URL PDF HTML ☆

赞 0 踩 0

1510.00771 2026-06-04 cs.CV cs.RO cs.SY eess.SY 版本更新

Design and Analysis of a Single-Camera Omnistereo Sensor for Quadrotor Micro Aerial Vehicles (MAVs)

单相机 omnistereo 传感器的设计与分析用于四旋翼微型飞行器（MAVs）

Carlos Jaramillo

AI总结本文提出一种适用于低负载四旋翼微型飞行器的单相机 omnistereo 传感器设计，通过共轴超曲面镜实现立体视觉，分析其几何特性与3D感知性能。

Comments 49 pages, 22 figures, journal article draft

详情

DOI: 10.3390/s16020217
Journal ref: Sensors 16 (2016) 217

AI中文摘要

我们描述了一种应用于微型飞行器（MAVs）的 omnistereo 系统的设计和3D感知性能。所提出的 omnistereo 模型采用一个单目相机，与一对双曲面镜（折叠catadioptric配置）共轴对齐。我们证明这种配置在安装在具有低负载的四旋翼MAV上进行立体视觉是可行的。理论上的单视角（SVP）约束帮助我们推导出传感器投影几何的解析解，并生成SVP兼容的全景图像，以从立体对应关系中计算3D信息（真正同步地）。我们对各种系统特性进行了广泛分析，如大小、catadioptric空间分辨率、视场。此外，我们提出了一种概率模型，用于估计从三角化中深度的不确定性，用于斜向后投影射线。我们期望通过我们的解决方案的可重复性来激励，因为它可以被适应（最优地）到其他基于catadioptric的omnistereo视觉应用。

英文摘要

We describe the design and 3D sensing performance of an omnidirectional stereo-vision system (omnistereo) as applied to Micro Aerial Vehicles (MAVs). The proposed omnistereo model employs a monocular camera that is co-axially aligned with a pair of hyperboloidal mirrors (folded catadioptric configuration). We show that this arrangement is practical for performing stereo-vision when mounted on top of propeller-based MAVs characterized by low payloads. The theoretical single viewpoint (SVP) constraint helps us derive analytical solutions for the sensor's projective geometry and generate SVP-compliant panoramic images to compute 3D information from stereo correspondences (in a truly synchronous fashion). We perform an extensive analysis on various system characteristics such as its size, catadioptric spatial resolution, field-of-view. In addition, we pose a probabilistic model for uncertainty estimation of the depth from triangulation for skew back-projection rays. We expect to motivate the reproducibility of our solution since it can be adapted (optimally) to other catadioptric-based omnistereo vision applications.

URL PDF HTML ☆

赞 0 踩 0

1505.06379 2026-06-04 eess.SY cs.GT cs.MA cs.RO cs.SY math.OC 版本更新

Communication-Free Distributed Coverage for Networked Systems

无通信分布式覆盖网络系统

A. Yasin Yazicioglu, Magnus Egerstedt, Jeff S. Shamma

AI总结本文提出一种无通信算法，用于由多个具有本地感知能力的移动代理对任意网络进行分布式覆盖。通过游戏理论框架，代理利用感官输入优化位置，以最大化覆盖范围。

1512.03351 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Adaptive Neural Control for Mobile Robots Autonomous Navigation

自适应神经控制用于自主导航的移动机器人

Monica Dragoicea, Ioan Dumitrache, Nicolae Constantin

AI总结本文提出了一种自适应神经控制策略，用于非holonomic移动机器人的自主导航，通过同时学习运动学转向和速度动力学，实现稳定跟踪。

Comments in Proceedings of the 7th Int. Symposium on Automatic Control and Computer Science SACCS 2001, Iasi, Romania, CD ISBN 973-8292-11-5, 2001

1512.03345 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Mobile Robots Adaptive Control Using Neural Networks

移动机器人自适应控制的神经网络方法

Ioan Dumitrache, Monica Dragoicea

AI总结本文提出了一种前馈控制策略，用于考虑移动机器人非线性模型及其输入输出交互，通过神经网络控制器补偿建模不确定性，实现智能控制策略。

1504.06917 2026-06-04 cs.RO cs.SY eess.SY math.OC 版本更新

Spline Path Following for Redundant Mechanical Systems

样条路径跟随用于冗余机械系统

Rajan Gill, Dana Kulić, Christopher Nielsen

AI总结本文提出了一种适用于由样条生成的框架曲线的路径跟随控制方法，通过求解约束二次优化问题解决冗余性，并通过实验验证了其在具有显著模型不确定性的4自由度机械臂上的有效性。

Comments Submitted to IEEE TRO (under review)

详情

DOI: 10.1109/TRO.2015.2489502
Journal ref: Robotics, IEEE Transactions on (Volume:31 , Issue: 6 ) 02 December 2015

AI中文摘要

路径跟随控制器使控制系统的输出能够接近并沿预指定路径移动，无需事先时间参数化。本文提出了一种适用于冗余机械系统工作空间中由样条生成的框架曲线的路径跟随控制设计方法。可接受的路径类别包括自相交曲线。通过设计解决合适定义的约束二次优化问题来解决运动学冗余性。通过部分反馈线性化，所提出的路径跟随控制器具有明确的物理意义。该方法在具有旋转和线性执行器链接的4自由度机械臂上进行了实验验证，该机械臂具有显著的模型不确定性。

英文摘要

Path following controllers make the output of a control system approach and traverse a pre-specified path with no apriori time parametrization. In this paper we present a method for path following control design applicable to framed curves generated by splines in the workspace of kinematically redundant mechanical systems. The class of admissible paths includes self-intersecting curves. Kinematic redundancies are resolved by designing controllers that solve a suitably defined constrained quadratic optimization problem. By employing partial feedback linearization, the proposed path following controllers have a clear physical meaning. The approach is experimentally verified on a 4-degree-of-freedom (4-DOF) manipulator with a combination of revolute and linear actuated links and significant model uncertainty.

URL PDF HTML ☆

赞 0 踩 0

1511.05996 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Uncertainty-based Arbitration of Human-Machine Shared Control

基于不确定性的机人类共享控制仲裁

Parker Owan, Joseph Garbini, Santosh Devasia

AI总结本文提出基于自动化不确定性的机人类共享控制仲裁方法，通过调整自主级别提升任务完成的容错能力，实验显示共享控制方法在任务完成误差容忍度上提升超5倍。

Comments 8 pages, 11 figures. Submitted to the 2016 American Control Conference

1511.04628 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Framework for Planning and Controlling Non-Periodic Bipedal Locomotion

一种用于非周期性双足运动规划与控制的框架

Ye Zhao, Benito R. Fernandez, Luis Sentis

AI总结本文提出了一种基于非周期性顶点状态鲁棒跟踪的双足运动规划与控制理论框架，通过混合相空间规划与控制方法实现非周期性步态生成及抗干扰能力。

Comments 33 pages, 18 figures, journal

详情

AI中文摘要

本文提出了一种基于鲁棒跟踪非周期性顶点状态的双足运动规划与控制理论框架。基于平面倒置摆模型，我们提出了一个混合相空间规划与控制框架，包含四个关键组件：(1) 一种步态转换求解器，能够在各种地形上动态跟踪非周期性顶点或关键帧状态；(2) 一种鲁棒混合自动机，有效制定规划与控制算法；(3) 一种相空间度量，用于测量到规划运动流形的距离；(4) 一种基于前度量的混合控制方法，以产生在外部干扰下的稳健动态运动。与其他运动框架相比，我们更关注非周期性步态生成和鲁棒性度量以处理干扰。这种关注使所提出的控制框架能够稳健地跟踪各种具有挑战性的地形上的非周期性顶点状态，并通过多个模拟示例进行了说明。此外，它还允许双足机器人在不连续地形上执行非周期性跳跃动作。

英文摘要

This study presents a theoretical framework for planning and controlling agile bipedal locomotion based on robustly tracking a set of non-periodic apex states. Based on the prismatic inverted pendulum model, we formulate a hybrid phase-space planning and control framework which includes the following key components: (1) a step transition solver that enables dynamically tracking non-periodic apex or keyframe states over various types of terrains, (2) a robust hybrid automaton to effectively formulate planning and control algorithms, (3) a phase-space metric to measure distance to the planned locomotion manifolds, and (4) a hybrid control method based on the previous distance metric to produce robust dynamic locomotion under external disturbances. Compared to other locomotion frameworks, we have a larger focus on non-periodic gait generation and robustness metrics to deal with disturbances. Such focus enables the proposed control framework to robustly track non-periodic apex states over various challenging terrains and under external disturbances as illustrated through several simulations. Additionally, it allows a bipedal robot to perform non-periodic bouncing maneuvers over disjointed terrains.

URL PDF HTML ☆

赞 0 踩 0

1510.08474 2026-06-04 eess.SY cs.LO cs.RO cs.SY 版本更新

Control with Probabilistic Signal Temporal Logic

基于概率信号时间逻辑的控制

Chanyeol Yoo, Calin Belta

AI总结本文提出概率信号时间逻辑用于连续信念空间中的复杂任务控制，设计高效合成算法以最大化任务满足概率，并通过无人机仿真验证。

Comments 7 pages, submitted to the 2016 American Control Conference (ACC 2016) on September, 30, 2015 (under review)

1509.01149 2026-06-04 eess.SY cs.DC cs.RO cs.SY 版本更新

Model Predictive Path Integral Control using Covariance Variable Importance Sampling

基于协方差变量重要性采样的模型预测路径积分控制

Grady Williams, Andrew Aldrich, Evangelos Theodorou

AI总结本文提出基于广义重要性采样方案的模型预测路径积分控制算法，利用GPU进行并行优化，改进了随机扩散过程的漂移和扩散项，通过仿真与差分动态规划的模型预测控制进行比较。

Comments 8 pages

1411.2276 2026-06-04 cs.RO cs.NE cs.SY eess.SY 版本更新

Trade-Offs in Exploiting Body Morphology for Control: from Simple Bodies and Model-Based Control to Complex Bodies with Model-Free Distributed Control Schemes

在利用身体形态进行控制中权衡：从简单身体和基于模型的控制到复杂身体与基于模型的分布式控制方案

Matej Hoffmann, Vincent C. Müller

AI总结本文探讨了在复杂身体设计中基于模型与无模型控制的权衡，分析了软体机器人自动接管控制的优缺点及模型构建的可行性。

详情

Journal ref: Helmut Hauser; Rudolf M. Füchslin & Rolf Pfeifer, ed., 'E-book on Opinions and Outlooks on Morphological Computation', 2014, pp. 185--194

AI中文摘要

为控制目的设计机器人身体的优化隐含在工程师的实践中，但缺乏系统的方法或工具，导致形态优化滞后于控制器的发展。随着柔软、变形或“软”身体的出现，其在控制中的潜力显著，有时被称为“形态计算”，即通过身体卸载控制所需计算。本文主张采用动态系统而非计算视角来审视该问题，并分析简单与复杂身体的优劣，批判性地回顾“软”身体自动接管控制任务的吸引力。同时，本文还探讨了设计空间中的关键维度——是否应使用基于模型的控制，以及在不同形态下开发忠实模型的可行性。

英文摘要

Tailoring the design of robot bodies for control purposes is implicitly performed by engineers, however, a methodology or set of tools is largely absent and optimization of morphology (shape, material properties of robot bodies, etc.) is lagging behind the development of controllers. This has become even more prominent with the advent of compliant, deformable or "soft" bodies. These carry substantial potential regarding their exploitation for control---sometimes referred to as "morphological computation" in the sense of offloading computation needed for control to the body. Here, we will argue in favor of a dynamical systems rather than computational perspective on the problem. Then, we will look at the pros and cons of simple vs. complex bodies, critically reviewing the attractive notion of "soft" bodies automatically taking over control tasks. We will address another key dimension of the design space---whether model-based control should be used and to what extent it is feasible to develop faithful models for different morphologies.

URL PDF HTML ☆

赞 0 踩 0

1510.07313 2026-06-04 eess.SY cs.AI cs.LO cs.RO cs.SY 版本更新

Safe Control under Uncertainty

在不确定性下的安全控制

Dorsa Sadigh, Ashish Kapoor

AI总结本文提出Probabilistic Signal Temporal Logic（PrSTL）用于定义随机属性并确保概率保证，通过该逻辑合成安全控制器，应用于四旋翼和自动驾驶车辆等案例。

Comments 10 pages, 6 figures, Submitted to HSCC 2016

1510.06460 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robust Satisfaction of Temporal Logic Specifications via Reinforcement Learning

通过强化学习实现时序逻辑规范的鲁棒满足

Austin Jones, Derya Aksaray, Zhaodan Kong, Mac Schwager, Calin Belta

AI总结本文提出通过强化学习最大化满足信号时序逻辑规范的概率和鲁棒性，通过机器人导航仿真验证其在概率和鲁棒性上的优越性。

Comments 8 pages, 4 figures

1405.7178 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Artificial Wrestling: A Dynamical Formulation of Autonomous Agents Fighting in a Coupled Inverted Pendula Framework

人工摔跤：一种自主代理在耦合倒立摆框架中的动态建模

Katsutoshi Yoshida, Shigeki Matsumoto, Yoichi Matsue

AI总结本文提出基于耦合倒立摆框架的自主代理对抗模型，通过动态控制器存储状态对应关系并生成控制力，实验表明延迟元素和量化分辨率影响性能。

Comments The 12th International Conference on Motion and Vibration Control (MOVIC 2014), August 3-7, 2014, Sapporo, Japan. This article was selected as an article of Mechanical Engineering Journal after minor revisions; the final version is available at http://dx.doi.org/10.1299/mej.14-00518

详情

AI中文摘要

我们开发了自主代理相互对抗的系统，灵感来自人类摔跤。为此，我们提出耦合倒立摆（CIP）框架：1）两个倒立摆的顶端通过连接杆相连；2）每个摆主要通过PD控制器稳定；3）并额外配备智能控制器。基于此框架，我们动态建模智能控制器，用于存储CIP模型的初始状态到最终状态的动态对应关系，接收模型状态向量并输出脉冲控制力以产生期望的最终状态。开发了该控制器的量化和降阶设计，基于离线学习方法获得实用控制流程。随后进行数值模拟以研究智能控制器的个体性能，显示通过在智能控制器中添加延迟元件可提高性能。结果表明，性能不仅取决于学习数据的量化分辨率，还取决于延迟元件的延迟时间。最后，我们将智能控制器安装到所提框架中的两个摆上，以演示倒立摆之间的自主竞争行为。

英文摘要

We develop autonomous agents fighting with each other, inspired by human wrestling. For this purpose, we propose a coupled inverted pendula (CIP) framework in which: 1) tips of two inverted pendulums are linked by a connection rod, 2) each pendulum is primarily stabilized by a PD-controller, 3) and is additionally equipped with an intelligent controller. Based on this framework, we dynamically formulate an intelligent controller designed to store dynamical correspondence from initial states to final states of the CIP model, to receive state vectors of the model, and to output impulsive control forces to produce desired final states of the model. Developing a quantized and reduced order design of this controller, we have a practical control procedure based on an off-line learning method. We then conduct numerical simulations to investigate individual performance of the intelligent controller, showing that the performance can be improved by adding a delay element into the intelligent controller. The result shows that the performance depends not only on quantization resolutions of learning data but also on delay time of the delay element. Finally, we install the intelligent controllers into both pendulums in the proposed framework to demonstrate autonomous competitive behavior between inverted pendulums.

URL PDF HTML ☆

赞 0 踩 0

1505.05908 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Cooperative localization for mobile agents: a recursive decentralized algorithm based on Kalman filter decoupling

移动代理的协同定位：基于卡尔曼滤波解耦的递归分布式算法

Solmaz S. Kia, Stephen Rounds, Sonia Martinez

AI总结本文提出一种递归分布式协同定位算法，通过解耦卡尔曼滤波实现分布式状态估计，提升移动代理的定位精度与效率。

详情

AI中文摘要

我们考虑具有通信和计算能力的移动代理的协同定位技术。首先概述了文献中不同的去中心化策略，特别关注这些算法如何维护团队成员状态估计之间的内在相关性。然后，我们提出了一种新的去中心化协同定位算法，它是集中式扩展卡尔曼滤波的分布式实现。在该算法中，每个代理传播新的中间局部变量，这些变量可用于更新阶段生成所需的传播交叉协方差项。每当网络中存在相对测量时，算法将进行该测量的代理声明为临时主代理。通过从临时地标获取信息，该代理可以计算并广播一组中间变量，每个机器人可以使用这些变量来更新其估计值，以匹配集中式扩展卡尔曼滤波器的估计结果。一旦完成更新，直到下一次相对测量之前不需要进一步通信。

英文摘要

We consider cooperative localization technique for mobile agents with communication and computation capabilities. We start by provide and overview of different decentralization strategies in the literature, with special focus on how these algorithms maintain an account of intrinsic correlations between state estimate of team members. Then, we present a novel decentralized cooperative localization algorithm that is a decentralized implementation of a centralized Extended Kalman Filter for cooperative localization. In this algorithm, instead of propagating cross-covariance terms, each agent propagates new intermediate local variables that can be used in an update stage to create the required propagated cross-covariance terms. Whenever there is a relative measurement in the network, the algorithm declares the agent making this measurement as the interim master. By acquiring information from the interim landmark, the agent the relative measurement is taken from, the interim master can calculate and broadcast a set of intermediate variables which each robot can then use to update its estimates to match that of a centralized Extended Kalman Filter for cooperative localization. Once an update is done, no further communication is needed until the next relative measurement.

URL PDF HTML ☆

赞 0 踩 0

1509.04826 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Inter-Robot Interactions in Multi-Robot Systems Using Braids

多机器人系统中使用编织群的机器人交互

Yancy Diaz-Mercado, Magnus Egerstedt

AI总结本文提出利用编织群元素描述机器人交互模式，以实现安全的协作运动和传感器信息交换。

1509.02054 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Underwater Doppler Navigation with Self-calibration

水下多普勒导航与自校准

Xianfei Pan, Yuanxin Wu

AI总结本文提出一种结合惯性测量单元和多普勒速度计的水下导航系统，通过分析证明在中等条件下系统可观测，且能无需外部GPS或声纳信标实现多普勒速度计参数的现场校准。

Comments To appear in Journal of Navigation

1508.07723 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A survey on unmanned aerial vehicle collision avoidance systems

关于无人机避障系统的综述

Hung Pham, Scott A. Smolka, Scott D. Stoller, Dung Phan, Junxing Yang

AI总结本文综述了最新无人机避障系统，分析了感知与检测、避障两部分的分类、优缺点及比较。

Comments This is only a draft

1508.05514 2026-06-04 stat.ML cs.CV cs.LG cs.RO cs.SY eess.SY 版本更新

Gaussian Mixture Reduction Using Reverse Kullback-Leibler Divergence

基于反向Kullback-Leibler散度的高斯混合减少

Tohid Ardeshiri, Umut Orguner, Emre Özkan

AI总结本文提出一种贪心混合减少算法，基于Kullback-Leibler散度进行混合成分的剪枝与合并，通过分析近似方法提高计算效率，并在模拟和实际数据中验证其性能优于现有方法。

1312.7602 2026-06-04 eess.SY cs.RO cs.SY math.DS math.PR 版本更新

A Martingale Approach and Time-Consistent Sampling-based Algorithms for Risk Management in Stochastic Optimal Control

一个基于鞅和时间一致采样算法的风险管理在随机优化控制中的方法

Vu Anh Huynh, Leonid Kogan, Emilio Frazzoli

AI总结本文提出基于鞅和时间一致采样算法的方法，用于解决具有风险约束的随机优化控制问题，通过将风险约束转化为鞅来构建时间一致的控制策略，并在扩展的状态空间中采样以近似最优反馈策略。

详情

AI中文摘要

本文考虑了一类具有风险约束的随机优化控制问题，其中风险约束以特定初始状态的失败概率有界形式表达。我们提出了一种鞅方法，将风险约束扩散为鞅以构建时间一致的控制策略。鞅代表了时间上的风险容忍水平。通过将系统动态与受控的鞅相结合，原始的风险约束问题被转化为随机目标问题。我们扩展了增量马尔可夫决策过程（iMDP）算法，通过在扩展的状态空间中采样并计算适当的边界条件来近似原始问题的最优反馈策略。我们证明该算法在概率上是稳健的且渐近最优。所提出算法的性能在具有受限碰撞概率的不确定杂乱环境中的运动规划和控制问题中得到了验证。

英文摘要

In this paper, we consider a class of stochastic optimal control problems with risk constraints that are expressed as bounded probabilities of failure for particular initial states. We present here a martingale approach that diffuses a risk constraint into a martingale to construct time-consistent control policies. The martingale stands for the level of risk tolerance over time. By augmenting the system dynamics with the controlled martingale, the original risk-constrained problem is transformed into a stochastic target problem. We extend the incremental Markov Decision Process (iMDP) algorithm to approximate arbitrarily well an optimal feedback policy of the original problem by sampling in the augmented state space and computing proper boundary conditions for the reformulated problem. We show that the algorithm is both probabilistically sound and asymptotically optimal. The performance of the proposed algorithm is demonstrated on motion planning and control problems subject to bounded probability of collision in uncertain cluttered environments.

URL PDF HTML ☆

赞 0 踩 0

1507.00525 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Reactive Robotized Interface for Lower Limb Rehabilitation: Clinical Results

一种用于下肢康复的反应式机器人接口：临床结果

Ludovic Saint-Bauzel, Viviane Pasqui, Isabelle Monteil

AI总结本文展示了MONIMAD在治疗小脑疾病患者下肢康复中的临床效果，通过分析坐姿到站立的姿势，利用模糊控制调整机器人接口运动以提高稳定性。

详情

DOI: 10.1109/TRO.2009.2019886
Journal ref: IEEE Transactions on Robotics, Institute of Electrical and Electronics Engineers (IEEE), 2009, pp.583 - 592

AI中文摘要

本文介绍了MONIMAD在下肢康复中的临床结果，该接口用于帮助小脑疾病患者康复。首先解决坐姿到站立动作的姿势分析，通过健康受试者实验确定了坐起动作分为预加速、加速、起始上升和上升几个阶段。通过分析压力中心、地面力和手柄的水平力分量，得出识别患者稳定性并调整机器人接口运动以适应人类自愿运动的规则。这些规则用于模糊控制器的实现，并在贝尔安医院对患者进行实验验证。

英文摘要

-This article presents clinical results from the use of MONIMAD, a reactive robotized interface for lower limb Rehabilitation of patients suffering from cerebellar disease. The first problem to be addressed is the postural analysis of sit-to-stand motion. Experiments with healthy subjects were performed for this purpose. Analysis of external forces shows that sit-to-stand transfer can be subdivided into several phases: preaccel-eration, acceleration, start rising, rising. Observation of Center of Pressure, ground forces and horizontal components force on handles yields rules to identify the stability of the patient and to adjust the robotic interface motion to the human voluntary movement. These rules are used in a fuzzy-based controller implementation. The controller is validated on experiments with diseased patients in Bellan Hospital.

URL PDF HTML ☆

赞 0 踩 0

1502.02251 2026-06-04 stat.ML cs.LG cs.RO cs.SY eess.SY 版本更新

From Pixels to Torques: Policy Learning with Deep Dynamical Models

从像素到扭矩：基于深度动态模型的策略学习

Niklas Wahlström, Thomas B. Schön, Marc Peter Deisenroth

AI总结本文提出一种高效的数据驱动强化学习算法，通过深度动态模型直接从像素信息学习闭环控制策略，解决高维观测下的连续状态-动作空间数据高效学习问题。

Comments 9 pages

详情

AI中文摘要

在开发完全自主系统中，利用非常高的维数观测进行数据高效学习连续状态-动作空间仍是一个关键挑战。本文考虑这一挑战的一个实例，即像素到扭矩问题，其中智能体必须仅从像素信息学习闭环控制策略。我们引入了一种数据高效、基于模型的强化学习算法，该算法直接从像素信息学习此类闭环策略。关键成分是深度动态模型，该模型使用深度自编码器学习图像的低维嵌入，并在该低维特征空间中学习预测模型。联合学习确保不仅静态属性，而且动态属性都被考虑在内。这对于长期预测至关重要，而长期预测是适应性模型预测控制策略的核心。与最先进的连续状态和动作强化学习方法相比，我们的方法学习速度快，可扩展到高维状态空间，并是向完全自主学习从像素到扭矩的重要一步。

英文摘要

Data-efficient learning in continuous state-action spaces using very high-dimensional observations remains a key challenge in developing fully autonomous systems. In this paper, we consider one instance of this challenge, the pixels to torques problem, where an agent must learn a closed-loop control policy from pixel information only. We introduce a data-efficient, model-based reinforcement learning algorithm that learns such a closed-loop policy directly from pixel information. The key ingredient is a deep dynamical model that uses deep auto-encoders to learn a low-dimensional embedding of images jointly with a predictive model in this low-dimensional feature space. Joint learning ensures that not only static but also dynamic properties of the data are accounted for. This is crucial for long-term predictions, which lie at the core of the adaptive model predictive control strategy that we use for closed-loop control. Compared to state-of-the-art reinforcement learning methods for continuous states and actions, our approach learns quickly, scales to high-dimensional state spaces and is an important step toward fully autonomous learning from pixels to torques.

URL PDF HTML ☆

赞 0 踩 0

1401.6904 2026-06-04 cs.RO cs.SY eess.SY math.OC 版本更新

Adaptive Visual Tracking for Robotic Systems Without Image-Space Velocity Measurement

无图像空间速度测量的机器人系统自适应视觉跟踪

Hanlei Wang

AI总结本文研究了无图像空间速度测量的机器人视觉跟踪问题，提出了一种新的图像空间观测器，通过未知动力学中的图像空间速度信息设计自适应控制器，解决了多不确定性分离问题，避免了过参数化。

Comments 21 pages, 3 figures, revised for making improvements based on the reviewers' and AE's comments from Automatica and for adding the journal reference

详情

DOI: 10.1016/j.automatica.2015.02.029
Journal ref: Automatica, 55: 294-301, May 2015

AI中文摘要

本文研究了机器人系统在无图像空间速度测量情况下的视觉跟踪问题，同时考虑了相机模型和机械臂动力学的不确定性。我们提出了一种新的图像空间观测器，利用未知动力学中包含的图像空间速度信息，设计了不使用图像空间速度信号的自适应控制器，其中深度速率无关的动力学参数和深度参数的适应由图像空间跟踪误差和观测误差驱动。所提观测器-自适应控制器的主要优势在于其简单性和在视觉伺服机器人系统中处理多个不确定性的分离，从而避免了现有工作的过参数化问题。通过李雅普诺夫分析，我们证明了图像空间跟踪误差渐近收敛到零。所提自适应控制方案的性能通过数值模拟进行了验证。

英文摘要

In this paper, we investigate the visual tracking problem for robotic systems without image-space velocity measurement, simultaneously taking into account the uncertainties of the camera model and the manipulator kinematics and dynamics. We propose a new image-space observer that exploits the image-space velocity information contained in the unknown kinematics, upon which, we design an adaptive controller without using the image-space velocity signal where the adaptations of the depth-rate-independent kinematic parameter and depth parameter are driven by both the image-space tracking errors and observation errors. The major superiority of the proposed observer-based adaptive controller lies in its simplicity and the separation of the handling of multiple uncertainties in visually servoed robotic systems, thus avoiding the overparametrization problem of the existing work. Using Lyapunov analysis, we demonstrate that the image-space tracking errors converge to zero asymptotically. The performance of the proposed adaptive control scheme is illustrated by a numerical simulation.

URL PDF HTML ☆

赞 0 踩 0

1506.03771 2026-06-04 math.NA cs.NA cs.RO 版本更新

Fast Methods for Eikonal Equations: an Experimental Survey

快速求解eikonal方程的方法：实验调查

Javier V. Gomez, David Alvarez, Santiago Garrido, Luis Moreno

AI总结本文通过实验对比了9种改进标准快速推进法的算法，重点研究单线程方法和各向同性环境下的计算效率提升。

1506.02312 2026-06-04 cs.AI cs.LG cs.RO cs.SY eess.SY 版本更新

A Framework for Constrained and Adaptive Behavior-Based Agents

一种用于约束和自适应行为基 agent 的框架

Renato de Pontes Pereira, Paulo Martins Engel

AI总结本文提出一种框架，通过强化学习节点整合到行为树中，解决约束 agent 的学习能力问题，并展示其与分层强化学习选项的关系，确保嵌套学习节点的收敛性。

Comments 2015; 15 pages

1309.0535 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.OC 版本更新

Decentralized Rigidity Maintenance Control with Range Measurements for Multi-Robot Systems

多机器人系统中基于距离测量的去中心化刚性维持控制

Daniel Zelazo, Antonio Franchi, Heinrich H. Bülthoff, Paolo Robuffo Giordano

AI总结本文提出了一种基于距离测量的去中心化策略，用于维持多机器人系统的形成刚性，同时允许图拓扑自由变化。通过扩展刚性理论到加权框架并引入刚性特征值，设计了分布式算法以估计相对位置参考框架，并用于生成维持刚性并满足碰撞避免等约束的局部控制作用。

Comments Preprint submitted to The International Journal of Robotics Research

详情

DOI: 10.1177/0278364914546173

AI中文摘要

本文提出了一种完全去中心化的策略，用于利用仅有的距离测量来维持多机器人系统的形成刚性，同时允许图拓扑随时间自由变化。首先，本文扩展了刚性理论到加权框架并引入了刚性特征值，当该值为正时确保框架的无穷小刚性。随后，提出了一种分布式算法，用于估计一组机器人之间的共同相对位置参考框架，该算法结合了仅有的距离测量以及一个具备测量两个其他机器人方位能力的代理。这一估计步骤被嵌入到随后的分布式算法中，用于估计加权框架的刚性特征值。最终，利用刚性特征值的估计值为每个代理生成局部控制作用，以维持刚性属性并施加额外约束，如碰撞避免和感知/通信范围限制及遮挡。此外，本文方法还允许机器人之间的通信和感知链接随时间自由变化，同时保持整个框架的刚性。所提出的方案通过由6个四旋翼无人机组成的机器人测试平台在复杂环境中进行了实验验证。

英文摘要

This work proposes a fully decentralized strategy for maintaining the formation rigidity of a multi-robot system using only range measurements, while still allowing the graph topology to change freely over time. In this direction, a first contribution of this work is an extension of rigidity theory to weighted frameworks and the rigidity eigenvalue, which when positive ensures the infinitesimal rigidity of the framework. We then propose a distributed algorithm for estimating a common relative position reference frame amongst a team of robots with only range measurements in addition to one agent endowed with the capability of measuring the bearing to two other agents. This first estimation step is embedded into a subsequent distributed algorithm for estimating the rigidity eigenvalue associated with the weighted framework. The estimate of the rigidity eigenvalue is finally used to generate a local control action for each agent that both maintains the rigidity property and enforces additional con- straints such as collision avoidance and sensing/communication range limits and occlusions. As an additional feature of our approach, the communication and sensing links among the robots are also left free to change over time while preserving rigidity of the whole framework. The proposed scheme is then experimentally validated with a robotic testbed consisting of 6 quadrotor UAVs operating in a cluttered environment.

URL PDF HTML ☆

赞 0 踩 0

1506.00547 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Differential Geometric SLAM

微分几何SLAM

David Evan Zlotnik, James Richard Forbes

AI总结本文提出基于微分几何的SLAM算法，通过数学方法实现状态和地图估计，证明在无噪声情况下渐进稳定，仿真验证其鲁棒性。

1412.6164 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Formation of Multiple Groups of Mobile Robots Using Sliding Mode Control

利用滑模控制形成多个移动机器人类群

Soumic Sarkar, Indra Narayan Kar

AI总结本文提出基于质心变换的滑模控制方法，用于多组移动机器人形成不同几何形状，通过设计子系统控制器并采用奇异摄动理论实现有限时间滑动表面控制，同时利用势函数梯度确保碰撞避免。

Comments 8 pages, 5 figures

1504.06002 2026-06-04 math.OC cs.RO cs.SY eess.SY math.DS 版本更新

Some Applications of Polynomial Optimization in Operations Research and Real-Time Decision Making

多项式优化在运筹学和实时决策中的某些应用

Amir Ali Ahmadi, Anirudha Majumdar

AI总结本文探讨了多项式优化在无线覆盖、无人机避障和四旋翼飞行控制中的应用，采用SOS和SDSOS优化方法解决相关问题。

1504.05723 2026-06-04 stat.CO cs.RO cs.SY eess.SY q-fin.CP 版本更新

Noise Robust Online Inference for Linear Dynamic Systems

针对线性动态系统的噪声鲁棒在线推断

Saikat Saha

AI总结本文提出一种新的噪声自适应 Rao-Blackwellized 粒子滤波器，通过分层高斯模型近似非高斯噪声密度，以提高鲁棒性和适应性，同时保持可扩展性和易实现性。

详情

AI中文摘要

我们重新审视了在非高斯环境下线性动态系统（LDS）的贝叶斯在线推断问题。噪声可以自然非高斯（偏斜和/或重尾）或为了处理虚假观测，噪声可以建模为重尾分布。然而，这种噪声鲁棒性可能在没有虚假观测时导致性能下降。因此，任何推断引擎不仅要鲁棒于噪声异常，还应适应潜在未知且随时间变化的噪声参数；同时应具有可扩展性和易实现性。为了解决这些问题，本文提出了一种新的噪声自适应 Rao-Blackwellized 粒子滤波器（RBPF），通过分层高斯模型作为任何非高斯（过程或测量）噪声密度的代理。这导致了可处理的条件线性高斯模型（CLGM）。然而，该框架需要一个有效的转移核作为粒子滤波（PF）的目标状态。这通常是未知的。我们概述了如何通过辅助潜在变量方法构建这样的核，至少对于包含许多常见非高斯噪声的某些类别。通过数值研究验证了该 RBPF 算法的有效性。

英文摘要

We revisit the Bayesian online inference problems for the linear dynamic systems (LDS) under non- Gaussian environment. The noises can naturally be non-Gaussian (skewed and/or heavy tailed) or to accommodate spurious observations, noises can be modeled as heavy tailed. However, at the cost of such noise robustness, the performance may degrade when such spurious observations are absent. Therefore, any inference engine should not only be robust to noise outlier, but also be adaptive to potentially unknown and time varying noise parameters; yet it should be scalable and easy to implement. To address them, we envisage here a new noise adaptive Rao-Blackwellized particle filter (RBPF), by leveraging a hierarchically Gaussian model as a proxy for any non-Gaussian (process or measurement) noise density. This leads to a conditionally linear Gaussian model (CLGM), that is tractable. However, this framework requires a valid transition kernel for the intractable state, targeted by the particle filter (PF). This is typically unknown. We outline how such kernel can be constructed provably, at least for certain classes encompassing many commonly occurring non-Gaussian noises, using auxiliary latent variable approach. The efficacy of this RBPF algorithm is demonstrated through numerical studies.

URL PDF HTML ☆

赞 0 踩 0

1503.07889 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Design and Implementation of an Inertial Navigation System for Pedestrians Based on a Low-Cost MEMS IMU

基于低成本MEMS IMU的行人惯性导航系统设计与实现

Francesco Montorsi, Fabrizio Pancaldi, Giorgio M. Vitetta

AI总结本文提出一种低成本MEMS IMU的行人惯性导航系统，通过改进的卡尔曼滤波和步态检测算法，有效抑制位置漂移，提升导航精度。

1503.01407 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Invariant EKF Design for Scan Matching-aided Localization

用于扫描匹配辅助定位的不变扩展卡尔曼滤波设计

Martin Barczyk, Silvère Bonnabel, Jean-Emmanuel Deschaud, François Goulette

AI总结本文提出基于IEKF和MEKF的室内定位方法，通过融合运动传感器数据与扫描匹配点云实现机器人位姿估计，实验验证了IEKF的优越性。

1502.07424 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Mechanical Design, Modelling and Control of a Novel Aerial Manipulator

新型空中机械臂的机械设计、建模与控制

Alexandros Nikou, Georgios C. Gavridis, Kostas J. Kyriakopoulos

AI总结本文提出了一种新型空中机械臂系统，通过技术优化问题确定机械结构、推力器数量及几何布局，设计了自适应反步控制器以实现末端执行器在笛卡尔空间中的精确位置和姿态控制，并通过仿真验证了系统性能。

Comments Comments: 8 Pages, 2015 IEEE International Conference on Robotics and Automation (ICRA '15), Seattle, WA, USA

1403.1202 2026-06-04 cond-mat.stat-mech cs.RO cs.SY eess.SY physics.bio-ph q-bio.PE 版本更新

Flocking and turning: a new model for self-organized collective motion

编队与转向：一种新的自组织集体运动模型

Andrea Cavagna, Lorenzo Del Castello, Irene Giardina, Tomas Grigera, Asja Jelic, Stefania Melillo, Thierry Mora, Leonardo Parisi, Edmondo Silvestri, Massimiliano Viale, Aleksandra M. Walczak

AI总结本文提出新的动态方程，用于描述极化动物群体的集体运动，考虑了相关转向，仅通过社会力。该模型在适当极限下恢复Vicsek模型，但能描述更复杂的转向行为。

Comments Accepted for the Special Issue of the Journal of Statistical Physics: Collective Behavior in Biological Systems, 17 pages, 4 figures, 3 videos

详情

DOI: 10.1007/s10955-014-1119-3
Journal ref: J.Stat.Phys. 158 (2015) 601-627

AI中文摘要

鸟群中的个体以相关方式移动，导致速度极化。对于线性运动，已有良好理解。然而，实际观察中，群体质心常转向，导致更复杂的动态，但仍保持强极化。本文提出新的动态方程，用于极化动物群体的集体运动，考虑相关转向，仅通过社会力。利用旋转对称性和守恒定律，用类似于旋转哈密顿形式的广义坐标来描述速度方向。显式推导该形式与个体速度动态的对应关系，从而获得新的集体运动模型。在适当的阻尼极限下恢复已知的Vicsek模型，该模型耗散旋转信息，不允许可极化的转向。尽管新模型在描述转向群体时最为成功，其动态在广泛动态范围内本质上不同于之前的模型，而在非常大的长度尺度上减少到Toner和Tu的流体动力学描述。因此，推导出的框架是通用的，可能描述任何强极化活性物质系统的集体运动。

英文摘要

Birds in a flock move in a correlated way, resulting in large polarization of velocities. A good understanding of this collective behavior exists for linear motion of the flock. Yet observing actual birds, the center of mass of the group often turns giving rise to more complicated dynamics, still keeping strong polarization of the flock. Here we propose novel dynamical equations for the collective motion of polarized animal groups that account for correlated turning including solely social forces. We exploit rotational symmetries and conservation laws of the problem to formulate a theory in terms of generalized coordinates of motion for the velocity directions akin to a Hamiltonian formulation for rotations. We explicitly derive the correspondence between this formulation and the dynamics of the individual velocities, thus obtaining a new model of collective motion. In the appropriate overdamped limit we recover the well-known Vicsek model, which dissipates rotational information and does not allow for polarized turns. Although the new model has its most vivid success in describing turning groups, its dynamics is intrinsically different from previous ones in a wide dynamical regime, while reducing to the hydrodynamic description of Toner and Tu at very large length-scales. The derived framework is therefore general and it may describe the collective motion of any strongly polarized active matter system.

URL PDF HTML ☆

赞 0 踩 0

1501.05628 2026-06-04 cs.RO cs.SY eess.SY math.DS 版本更新

Identification of a Hybrid Spring Mass Damper via Harmonic Transfer Functions as a Step Towards Data-Driven Models for Legged Locomotion

通过谐波传递函数识别混合弹簧质量阻尼器：迈向基于数据的腿部运动模型

İsmail Uyanık, Mustafa Mert Ankaralı, Noah J. Cowan, Ömer Morgül, Uluç Saranlı

AI总结本文通过数据驱动的频域系统识别方法，利用谐波传递函数构建混合结构动力学模型，展示其在腿部运动建模中的应用潜力。

Comments Draft submitted to the 17th International Conference on Advanced Robotics

详情

AI中文摘要

在手动构建的数学模型中，难以充分捕捉腿部运动的相关方面。即使简单的跑步模型也涉及非可积动力学，需要在模型控制器设计中使用可能不准确的近似方法。本文展示如何利用数据驱动的频域系统识别方法，获取围绕极限环的动态系统输入-输出特性，其混合结构特性类似于腿部运动系统。在某些假设下，可以将此类系统的极限环混合动力学近似为分段光滑的线性时间周期系统（LTP），进一步近似为时间周期性、分段LTI系统以减少识别过程中的参数自由度。本文使用一个简单的单维混合模型，通过线性执行器的动作诱导极限环，以展示方法细节。我们首先推导了示例模型的理论谐波传递函数。然后用小的啁啾信号激励模型，引入围绕其极限环的扰动，并展示系统识别结果以估计该模型的谐波传递函数。数据驱动的HTF模型与理论预测的比较展示了此类经验识别方法在腿部运动中的潜在有效性。

英文摘要

There are limitations on the extent to which manually constructed mathematical models can capture relevant aspects of legged locomotion. Even simple models for basic behaviors such as running involve non-integrable dynamics, requiring the use of possibly inaccurate approximations in the design of model-based controllers. In this study, we show how data-driven frequency domain system identification methods can be used to obtain input--output characteristics for a class of dynamical systems around their limit cycles, with hybrid structural properties similar to those observed in legged locomotion systems. Under certain assumptions, we can approximate hybrid dynamics of such systems around their limit cycle as a piecewise smooth linear time periodic system (LTP), further approximated as a time-periodic, piecewise LTI system to reduce parametric degrees of freedom in the identification process. In this paper, we use a simple one-dimensional hybrid model in which a limit-cycle is induced through the actions of a linear actuator to illustrate the details of our method. We first derive theoretical harmonic transfer functions of our example model. We then excite the model with small chirp signals to introduce perturbations around its limit-cycle and present systematic identification results to estimate the harmonic transfer functions for this model. Comparison between the data-driven HTF model and its theoretical prediction illustrates the potential effectiveness of such empirical identification methods in legged locomotion.

URL PDF HTML ☆

赞 0 踩 0

1501.02855 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Assessing Whole-Body Operational Space Control in a Point-Foot Series Elastic Biped: Balance on Split Terrain and Undirected Walking

评估点足系列弹性双足机器人的全身操作空间控制：在高坡分地形平衡与无定向行走

Donghyun Kim, Ye Zhao, Gray Thomas, Luis Sentis

AI总结本文提出双足机器人敏捷行为的控制与轨迹生成改进，展示全身操作空间控制在高坡分地形平衡和无定向行走中的应用，首次实现该控制在系列弹性双足机器人上的应用。

Comments 17 pages, 9 figures, 4 tables

详情

AI中文摘要

在本文中，我们提出了双足机器人敏捷行为的控制和轨迹生成的改进。我们证明了几年前开发的全身操作空间控制（WBOSC）适用于实现两种敏捷行为：在高坡分地形上平衡和在平坦地形上实现无定向行走。本文展示的是首次在双足机器人上实现WBOSC，特别是具有系列弹性执行器的双足机器人。我们提出并分析了一种新的算法，该算法通过选择足部放置位置动态平衡点足机器人。处理这些系统自然不稳定的动态是一个困难的问题，需要控制器和轨迹生成算法快速且高效地运行。我们提出了全面的开发和整合努力：双足系统的設計與建造以及实验基础设施，WBOSC的定制化以适应敏捷行为，以及新的轨迹生成算法。使用这种定制的控制器，我们首次进行了实验，使双足机器人在高坡分地形上平衡，展示了我们通过力反馈技术精确调节内部力的能力。最后，我们通过基于物理的仿真器和通过平面化运动设置的物理实验，展示了我们的在线轨迹生成算法的稳定能力。

英文摘要

In this paper we present advancements in control and trajectory generation for agile behavior in bipedal robots. We demonstrate that Whole-Body Operational Space Control (WBOSC), developed a few years ago, is well suited for achieving two types of agile behaviors, namely, balancing on a high pitch split terrain and achieving undirected walking on flat terrain. The work presented here is the first implementation of WBOSC on a biped robot, and more specifically a biped robot with series elastic actuators. We present and analyze a new algorithm that dynamically balances point foot robots by choosing footstep placements. Dealing with the naturally unstable dynamics of these type of systems is a difficult problem that requires both the controller and the trajectory generation algorithm to operate quickly and efficiently. We put forth a comprehensive development and integration effort: the design and construction of the biped system and experimental infrastructure, a customization of WBOSC for the agile behaviors, and new trajectory generation algorithms. Using this custom built controller, we conduct, for first time, an experiment in which a biped robot balances in a high pitch split terrain, demonstrating our ability to precisely regulate internal forces using force sensing feedback techniques. Finally, we demonstrate the stabilizing capabilities of our online trajectory generation algorithm in the physics-based simulator and through physical experiments with a planarized locomotion setup.

URL PDF HTML ☆

赞 0 踩 0

1501.02854 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Stability and Performance Limits of Latency-Prone Distributed Feedback Controllers

具有延迟的分布式反馈控制器的稳定性与性能极限

Ye Zhao, Nicholas Paine, Kwan Suk Kim, Luis Sentis

AI总结本文研究了具有延迟的分布式反馈控制器的稳定性与性能极限，探讨了在高阻尼反馈和低阻尼反馈之间平衡的重要性，并通过实验验证了分布式阻抗控制器的稳定性影响因素。

Comments 13 pages, 10 figures, 2 tables, 31 reference

详情

AI中文摘要

机器人控制系统日益依赖分布式反馈控制器来解决复杂传感和决策问题，如在高灵活度的人为中心机器人中遇到的问题。这些需求带来了计算负担的增加，从而导致更大的控制器延迟。为了通过最大化控制反馈增益来提高对机械扰动的鲁棒性，本文强调了在分布式控制器中在高和低级反馈控制努力之间进行权衡的必要性。具体而言，研究了分布式阻抗控制器的影响，其中阻尼反馈努力在靠近控制装置的位置执行，而刚度反馈努力在具有延迟的集中控制过程中执行。一个中心观察是，高阻抗分布式控制器的稳定性对阻尼反馈延迟非常敏感，但对刚度反馈延迟影响较小。本研究深入分析了这一观察，从而获得了对差异的物理理解。然后推导出一个实用的控制器崩溃增益规则，以帮助控制设计师考虑以分布式方式实现其控制应用的好处。这些考虑通过在高性能执行器和 omnidirectional 移动基座上的分析、模拟和实验测试进一步验证。

英文摘要

Robotic control systems are increasingly relying on distributed feedback controllers to tackle complex sensing and decision problems such as those found in highly articulated human-centered robots. These demands come at the cost of a growing computational burden and, as a result, larger controller latencies. To maximize robustness to mechanical disturbances by maximizing control feedback gains, this paper emphasizes the necessity for compromise between high- and low-level feedback control effort in distributed controllers. Specifically, the effect of distributed impedance controllers is studied where damping feedback effort is executed in close proximity to the control plant and stiffness feedback effort is executed in a latency-prone centralized control process. A central observation is that the stability of high impedance distributed controllers is very sensitive to damping feedback delay but much less to stiffness feedback delay. This study pursues a detailed analysis of this observation that leads to a physical understanding of the disparity. Then a practical controller breakdown gain rule is derived to aim at enabling control designers to consider the benefits of implementing their control applications in a distributed fashion. These considerations are further validated through the analysis, simulation and experimental testing on high performance actuators and on an omnidirectional mobile base.

URL PDF HTML ☆

赞 0 踩 0

1501.00505 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Adaptive Control of 4-DoF Robot manipulator

四自由度机器人操作臂的自适应控制

P. Mironchyk

AI总结本文提出了一种针对四自由度机器人操作臂动态参数不确定性的自适应控制策略，通过实验验证其在机器人仿真器中的有效性。

Comments 7 pages, 4(5) figures

1412.7824 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Multi Time Scale Behaviour of The Formation of Multiple Groups of Nonholonomic Wheeled Mobile Robots

多时间尺度下非完整轮式移动机器人多组形成行为

Soumic Sarkar, Indra Narayan Kar

AI总结本文提出基于质心的变换方法，将非完整轮式移动机器人的联合动力学分解为三个子系统，分别设计控制器以实现多组形变和质心动态的多时间尺度收敛，同时通过势函数梯度确保避障。

Comments arXiv admin note: text overlap with arXiv:1412.6164

详情

AI中文摘要

不同几何模式和形状通过群体代理生成，需要形成控制。本文应用基于质心的变换（CBT），将非完整轮式移动机器人（WMRs）的联合动力学分解为三个子系统：组内和组间形状动力学以及质心动力学。组内形状动力学可进一步分为每个组的形状动力学，从而形成多组概念。因此，为每个子系统设计了独立控制器。控制器的增益被选择使得整体系统成为奇异扰动系统，不同子系统在不同时间收敛到其期望值。本文进行了多时间尺度收敛分析。基于势函数梯度的负值被添加到控制器中，以确保机器人之间的避障。提供了仿真结果以证明所提控制器的有效性。

英文摘要

Different geometric patterns and shapes are generated using groups of agents, and this needs formation control. In this paper, Centroid Based Transformation (CBT), has been applied to decompose the combined dynamics of nonholonomic Wheeled Mobile Robots (WMRs) into three subsystems: intra and inter group shape dynamics, and the dynamics of the centroid. The intra group shape dynamics can further be partitioned into the shape dynamics of each group, giving the notion of multiple group. Thus separate controllers have been designed for each subsystem. The gains of the controllers are such chosen that the overall system becomes singularly perturbed system, and different subsystems converge to their desired values at different times. Then multi-time scale convergence analysis has been carried out in this paper. Negative gradient of a potential based function has been added to the controller to ensure collision avoidance among the robots. Simulation results have been provided to demonstrate the effectiveness of the proposed controller.

URL PDF HTML ☆

赞 0 踩 0

1412.6704 2026-06-04 eess.SY cs.RO cs.SY 版本更新

First Passage Value

首次通过值

Cenk Oguz Saglam, Katie Byl

AI总结本文扩展了首次通过时间的概念，提出首次通过值，用于描述更广泛的利益相关值，如能量消耗、距离或时间，并为给定置信水平提供首次通过值的界限。

详情

AI中文摘要

对于许多随机动态系统，首次通过时间（MFPT）是一个有用的概念，它给出了感兴趣状态之前的预期时间。本文从多个方面扩展了MFPT。首先，我们证明对于某些系统，仅使用第二大特征值计算的系统级MFPT可以捕捉大部分基本动态，即使对于相当复杂、高维系统也是如此。其次，我们将MFPT推广为首次通过值（MFPV），它提供了更广泛的利益相关值，例如能量消耗、距离或时间。第三，我们为给定置信水平提供了首次通过值（FPV）的界限。本文的核心是强调，为了我们的目标，许多混合系统可以近似为马尔可夫决策过程。因此，许多系统可以使用此框架有效控制。然而，我们的框架特别适用于亚稳系统。此类系统表现出有趣的长寿命行为，它们注定会不可避免地逃逸（例如，最终到达一个不同的失败或成功状态）。我们的目标是根据应用，最小化或最大化该值，直到逃逸。

英文摘要

For many stochastic dynamic systems, the Mean First Passage Time (MFPT) is a useful concept, which gives expected time before a state of interest. This work is an extension of MFPT in several ways. (1) We show that for some systems the system-wide MFPT, calculated using the second largest eigenvalue only, captures most of the fundamental dynamics, even for quite complex, high-dimensional systems. (2) We generalize MFPT to Mean First Passage Value (MFPV), which gives a more general value of interest, e.g., energy expenditure, distance, or time. (3) We provide bounds on First Passage Value (FPV) for a given confidence level. At the heart of this work, we emphasize that for our goals, many hybrid systems can be approximated as Markov Decision Processes. So, many systems can be controlled effectively using this framework. However, our framework is particularly useful for metastable systems. Such systems exhibit interesting long-living behaviors from which they are guaranteed to inevitably escape (e.g., eventually arriving at a distinct failure or success state). Our goal is then either minimizing or maximizing the value until escape, depending on the application.

URL PDF HTML ☆

赞 0 踩 0

1412.1251 2026-06-04 cs.RO cs.HC cs.SY eess.SY 版本更新

From Human-Computer Interaction to Human-Robot Social Interaction

从人机交互到人机社会交互

Tarek Toumi, Abdelmadjid Zidani

AI总结本文探讨了人机社会交互领域，结合人机交互与人机交互的研究，整合机器人的情感与能力概念，以提升人机交互效果，并讨论相关挑战。

1412.6029 2026-06-04 cs.RO cs.HC cs.SY eess.SY 版本更新

Pareto efficiency in synthesizing shared autonomy policies with temporal logic constraints

在具有时序逻辑约束下的共享自主策略合成中的帕累托效率

Jie Fu, Ufuk Topcu

AI总结本文提出一种两阶段策略合成算法，用于生成帕累托高效的协调与控制策略，通过整合Tchebychev标量化方法以优化人类努力与系统性能之间的平衡。

Comments 8 pages, 5 figures, submitted to ICRA 2015 conference

1404.2289 2026-06-04 eess.SY cs.RO cs.SY 版本更新

On the Minimal Revision Problem of Specification Automata

关于规范自动机的最小修正问题

Kangjin Kim, Georgios E. Fainekos, Sriram Sankaranarayanan

AI总结本文研究了机器人任务规划中规范自动机的最小修正问题，证明其为NP完全问题，并提出一种多项式时间的启发式算法以近似求解。

Comments 23 pages, 16 figures, 2 tables, International Joural of Robotics Research 2014 Major Revision (submitted)

详情

AI中文摘要

随着机器人逐步融入日常生活，提供安全且可证明正确操作的保障变得必要。此类保障可通过自动机理论的任务和使命规划实现，其中要求以时序逻辑规范表达。然而，在现实场景中，机器人可能无法满足所有用户任务需求。在这种情况下，机器人必须向用户提供无法完成任务的原因，并指示尽可能接近初始用户意图的任务。本文证明后者问题，即最小规范修正问题，是NP完全的。提出了一种启发式算法，可在多项式时间内计算最小修正问题（MRP）的良好近似解。算法的实验研究显示，在大多数问题实例中，该启发式算法实际上返回最优解。最后，一些算法未返回最优解的情况也被呈现。

英文摘要

As robots are being integrated into our daily lives, it becomes necessary to provide guarantees on the safe and provably correct operation. Such guarantees can be provided using automata theoretic task and mission planning where the requirements are expressed as temporal logic specifications. However, in real-life scenarios, it is to be expected that not all user task requirements can be realized by the robot. In such cases, the robot must provide feedback to the user on why it cannot accomplish a given task. Moreover, the robot should indicate what tasks it can accomplish which are as "close" as possible to the initial user intent. This paper establishes that the latter problem, which is referred to as the minimal specification revision problem, is NP complete. A heuristic algorithm is presented that can compute good approximations to the Minimal Revision Problem (MRP) in polynomial time. The experimental study of the algorithm demonstrates that in most problem instances the heuristic algorithm actually returns the optimal solution. Finally, some cases where the algorithm does not return the optimal solution are presented.

URL PDF HTML ☆

赞 0 踩 0

1406.4905 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Variational Gaussian Process State-Space Models

变分高斯过程状态空间模型

Roger Frigola, Yutian Chen, Carl E. Rasmussen

AI总结本文提出基于稀疏高斯过程的变分贝叶斯学习方法，用于高效学习非线性状态空间模型，实现对非线性动力系统后验的可计算性，相比传统参数模型，能灵活平衡模型容量与计算成本，避免过拟合。

1410.4622 2026-06-04 cs.RO cs.SY eess.SY math.AT stat.ML 版本更新

Robust Topological Feature Extraction for Mapping of Environments using Bio-Inspired Sensor Networks

鲁棒拓扑特征提取用于生物启发式传感器网络环境映射

Alireza Dirafzoon, Edgar Lobaton

AI总结本文利用生物启发式传感器网络收集的最小感知信息，通过概率运动模型提取弱接触信息，构建环境拓扑表示。采用拓扑数据分析提取主导特征，结合密度基子采样算法提高鲁棒性，并提出鲁棒尺度不变分类算法以量化特征。

Comments 14 pages, 7 figures

详情

AI中文摘要

本文提出了一种利用生物启发式传感器网络进行未知环境探索和映射的方法。通过受蟑螂运动特性启发的概率运动模型，提取弱接触信息以构建环境拓扑表示。利用节点间交互生成点云，代表环境流形的空间特征。通过拓扑数据分析生成持续区间用于拓扑映射。为提高采样数据对异常值的鲁棒性，采用密度基子采样算法。此外，提出一种鲁棒尺度不变分类算法用于持久图，以提供数据中所需特征的定量表示。同时，提出多种定义接触度量的策略，以提高拓扑方法的估计和分类性能。

英文摘要

In this paper, we exploit minimal sensing information gathered from biologically inspired sensor networks to perform exploration and mapping in an unknown environment. A probabilistic motion model of mobile sensing nodes, inspired by motion characteristics of cockroaches, is utilized to extract weak encounter information in order to build a topological representation of the environment. Neighbor to neighbor interactions among the nodes are exploited to build point clouds representing spatial features of the manifold characterizing the environment based on the sampled data. To extract dominant features from sampled data, topological data analysis is used to produce persistence intervals for features, to be used for topological mapping. In order to improve robustness characteristics of the sampled data with respect to outliers, density based subsampling algorithms are employed. Moreover, a robust scale-invariant classification algorithm for persistence diagrams is proposed to provide a quantitative representation of desired features in the data. Furthermore, various strategies for defining encounter metrics with different degrees of information regarding agents' motion are suggested to enhance the precision of the estimation and classification performance of the topological method.

URL PDF HTML ☆

赞 0 踩 0

1403.5195 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Experimental Implementation of an Invariant Extended Kalman Filter-based Scan Matching SLAM

基于不变扩展卡尔曼滤波的扫描匹配SLAM实验实现

Martin Barczyk, Silvère Bonnabel, Jean-Emmanuel Deschaud, François Goulette

AI总结本文提出将不变扩展卡尔曼滤波方法应用于扫描匹配SLAM问题，通过实验验证了其在机器人平台上的鲁棒性。

1410.2792 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Convex Model Predictive Control for Vehicular Systems

凸优化预测控制用于车辆系统

Tiffany A. Huang, Matanya B. Horowitz, Joel W. Burdick

AI总结本文提出一种基于旋转矩阵轨道ope的凸优化预测控制方法，适用于SO(n)状态空间的系统，无需局部线性化，避免传统方法的缺陷，适用于航空和车辆系统。

1410.0879 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Priority-based coordination of mobile robots

基于优先级的移动机器人协调

Jean Gregoire

AI总结本文研究了在交叉口协调多个移动机器人的问题，提出通过规划优先级而非具体轨迹来实现更鲁棒的协调系统，适用于自动驾驶车辆等共享道路的场景。

Comments PhD Thesis, 182 pages

详情

AI中文摘要

自20世纪80年代末以来，自动驾驶车辆的发展已成为主要工业国家的高强度研究领域。积极的社会经济效益包括减少事故、减少旅行时间、提高能效和减少对昂贵基础设施的需求。某些形式的车对车和/或车对基础设施协作是确保安全高效的全球交通系统所必需的。本文研究了在交叉口协调多个移动机器人的特定协作形式。大多数先前提出的协调系统涉及规划轨迹并沿规划轨迹控制机器人：即计划-编程范式，其中规划被视为动作生成机制。本文的方法是规划优先级——机器人通过交叉口的相对顺序，这比许多轨迹尊重相同的优先级要弱得多。更具体地说，优先级编码协调问题的解的同调类。优先级分配相当于选择某些同调类来解决协调问题，而不是特定轨迹。一旦分配了优先级，机器人将通过保持分配优先级的控制定律进行控制，即确保描述的轨迹属于所选同调类。这导致了一个更鲁棒的协调系统——能够以反应式方式处理大量意外事件——特别适用于自动驾驶车辆在交叉口的协调应用，其中汽车、公共交通和行人共享道路。

英文摘要

Since the end of the 1980's, the development of self-driven autonomous vehicles is an intensive research area in most major industrial countries. Positive socio-economic potential impacts include a decrease of crashes, a reduction of travel times, energy efficiency improvements, and a reduced need of costly physical infrastructure. Some form of vehicle-to-vehicle and/or vehicle-to-infrastructure cooperation is required to ensure a safe and efficient global transportation system. This thesis deals with a particular form of cooperation by studying the problem of coordinating multiple mobile robots at an intersection area. Most of coordination systems proposed in previous work consist in planning a trajectory and to control the robots along the planned trajectory: that is the plan-as-program paradigm where planning is considered as a generative mechanism of action. The approach of the thesis is to plan priorities -- the relative order of robots to go through the intersection -- which is much weaker as many trajectories respect the same priorities. More precisely, priorities encode the homotopy classes of solutions to the coordination problem. Priority assignment is equivalent to the choice of some homotopy class to solve the coordination problem instead of a particular trajectory. Once priorities are assigned, robots are controlled through a control law preserving the assigned priorities, i.e., ensuring the described trajectory belongs to the chosen homotopy class. It results in a more robust coordination system -- able to handle a large class of unexpected events in a reactive manner -- particularly well adapted for an application to the coordination of autonomous vehicles at intersections where cars, public transport and pedestrians share the road.

URL PDF HTML ☆

赞 0 踩 0

1410.0083 2026-06-04 eess.SY cs.AI cs.RO cs.SY 版本更新

Integrating active sensing into reactive synthesis with temporal logic constraints under partial observations

将主动感知整合到带有时序逻辑约束的反应合成中，在部分观察下

Jie Fu, Ufuk Topcu

AI总结本文提出在部分可观测和动态环境中，利用感知动作进行在线反应规划，通过主动感知策略减少不确定性，确保时序逻辑规范以概率1满足。

Comments 7 pages, 2 figures, submitted to American Control Conference 2015

详情

AI中文摘要

我们引入了在部分可观测和动态环境中，具有时序逻辑约束的系统中利用感知动作进行在线反应规划的概念。在动态环境信息不完整的情况下，反应控制器合成相当于解决一个具有部分观察的双人游戏，计算复杂度 impractically 高。为减轻高计算负担，通过感知动作进行在线重规划，避免在部分观察下解决反应系统的策略。相反，我们只解决一个策略，确保给定的时序逻辑规范在系统拥有完整环境观察时可以满足。此类策略随后被转换为基于观察到的状态序列（交互系统及其环境）做出控制决策的策略。当系统遇到一个信念——包含所有可能的当前状态假设的集合——对于观察策略未定义时，触发一系列感知动作，由主动感知策略选择，以减少系统信念中的不确定性。我们证明，在满足系统传感器集合的 mild 技术假设下，通过交替使用基于观察的策略和主动感知策略，可以以概率1满足给定的时序逻辑规范。

英文摘要

We introduce the notion of online reactive planning with sensing actions for systems with temporal logic constraints in partially observable and dynamic environments. With incomplete information on the dynamic environment, reactive controller synthesis amounts to solving a two-player game with partial observations, which has impractically computational complexity. To alleviate the high computational burden, online replanning via sensing actions avoids solving the strategy in the reactive system under partial observations. Instead, we only solve for a strategy that ensures a given temporal logic specification can be satisfied had the system have complete observations of its environment. Such a strategy is then transformed into one which makes control decisions based on the observed sequence of states (of the interacting system and its environment). When the system encounters a belief---a set including all possible hypotheses the system has for the current state---for which the observation-based strategy is undefined, a sequence of sensing actions are triggered, chosen by an active sensing strategy, to reduce the uncertainty in the system's belief. We show that by alternating between the observation-based strategy and the active sensing strategy, under a mild technical assumption of the set of sensors in the system, the given temporal logic specification can be satisfied with probability 1.

URL PDF HTML ☆

赞 0 踩 0

1401.7612 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Mathematical Modelling of Turning Delays in Swarm Robotics

群体机器人转向延迟的数学建模

Jake P. Taylor-King, Benjamin Franz, Christian A. Yates, Radek Erban

AI总结研究转向延迟对群体机器人行为的影响，通过传输方程建模并验证实验结果，证明延迟方程能更准确预测机器人寻找目标区域的时间。

Comments Submitted to the IMA Journal of Applied Mathematics

1407.5813 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Priority-based coordination of autonomous and legacy vehicles at intersection

基于优先级的自主车辆与传统车辆在交叉口的协调

Xiangjun Qian, Jean Gregoire, Fabien Moutarde, Arnaud De La Fortelle

AI总结本文提出一种基于优先级的协调方法，支持传统车辆安全通过交叉口，同时保持无碰撞和死锁特性。

Comments put in other preprint server

1406.0993 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Latent Kullback Leibler Control for Continuous-State Systems using Probabilistic Graphical Models

基于概率图模型的潜在Kullback-Leibler控制用于连续状态系统

Takamitsu Matsubara, Vicenç Gómez, Hilbert J. Kappen

AI总结本文提出将KL控制问题嵌入概率图模型中，通过连续状态和离散表示进行控制，展示两种方法在机器人运动控制中的应用。

Comments 9 pages, 5 figures, accepted in Uncertainty in Artificial Intelligence (UAI '14)

1406.1619 2026-06-04 cs.RO cs.SY eess.SY 版本更新

An Invariant Linear Quadratic Gaussian controller for a simplified car

一种不变的线性二次高斯控制器用于简化汽车

Sébastien Diemer, Silvère Bonnabel

AI总结本文提出一种不变的线性二次高斯控制器，用于跟踪简化汽车模型的参考轨迹，通过模拟验证其在高噪声或初始不确定性下的鲁棒性。

详情

AI中文摘要

本文考虑了基于单轮动力学模型的简化汽车跟踪参考轨迹的问题，其中仅测量位置，并且控制输入和测量值受到独立高斯噪声干扰。为解决此问题，我们设计了一种新型观测器-控制器：不变的线性二次高斯控制器（ILQG）。它基于线性二次高斯控制器，但方程稍作修改以利用问题的对称性。增益调节对估计轨迹的依赖性降低，因此对估计误差更不敏感。除了不变方法本身合理（无论参考轨迹朝西还是朝南，控制器性能不应依赖于此）外，我们通过仿真证明，ILQG在大噪声或大初始不确定性情况下优于传统LQG控制器。我们还表明，这些鲁棒性特性可能对运动规划应用也有用。

英文摘要

In this paper, we consider the problem of tracking a reference trajectory for a simplified car model based on unicycle kinematics, whose position only is measured, and where the control input and the measurements are corrupted by independent Gaussian noises. To tackle this problem we devise a novel observer-controller: the invariant Linear Quadratic Gaussian controller (ILQG). It is based on the Linear Quadratic Gaussian controller, but the equations are slightly modified to account for, and to exploit, the symmetries of the problem. The gain tuning exhibits a reduced dependency on the estimated trajectory, and is thus less sensitive to misestimates. Beyond the fact the invariant approach is sensible (there is no reason why the controller performance should depend on whether the reference trajectory is heading west or south), we show through simulations that the ILQG outperforms the conventional LQG controller in case of large noises or large initial uncertainties. We show that those robustness properties may also prove useful for motion planning applications.

URL PDF HTML ☆

赞 0 踩 0

1405.7392 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Information-Theoretic Stochastic Optimal Control via Incremental Sampling-based Algorithms

基于增量采样的信息论随机最优控制

Oktay Arslan, Evangelos Theodorou, Panagiotis Tsiotras

AI总结本文提出基于增量采样的算法，将RRT与信息论随机最优控制相结合，解决大系统中非线性随机微分方程的最优控制问题。

Comments 18 pages

详情

AI中文摘要

本文考虑由非线性随机微分方程表示的动力系统最优控制问题。已知最优控制策略可通过满足非线性偏微分方程（即哈密顿-雅可比-贝尔曼方程）的价值函数获得。该非线性PDE必须逆向求解，但对于大规模系统来说计算不可行。在某些假设下，应用对数变换后，最优策略可表示为路径积分。基于路径积分的控制方法最近被证明能优雅地解决广泛类别的随机最优控制问题。其中一个实现挑战是计算成本函数在未受力动力学轨迹上的期望值。在均匀采样轨迹上计算此类期望可能导致数值不稳定性，因为成本的指数化。因此，采样低成本轨迹对PI方法的实用实现至关重要。本文使用基于增量采样的算法从未受力系统动力学中采样有用的轨迹，并建立了RRT与信息论随机最优控制之间的新联系。我们展示了所提出方法在多个示例中的数值实现结果。

英文摘要

This paper considers optimal control of dynamical systems which are represented by nonlinear stochastic differential equations. It is well-known that the optimal control policy for this problem can be obtained as a function of a value function that satisfies a nonlinear partial differential equation, namely, the Hamilton-Jacobi-Bellman equation. This nonlinear PDE must be solved backwards in time, and this computation is intractable for large scale systems. Under certain assumptions, and after applying a logarithmic transformation, an alternative characterization of the optimal policy can be given in terms of a path integral. Path Integral (PI) based control methods have recently been shown to provide elegant solutions to a broad class of stochastic optimal control problems. One of the implementation challenges with this formalism is the computation of the expectation of a cost functional over the trajectories of the unforced dynamics. Computing such expectation over trajectories that are sampled uniformly may induce numerical instabilities due to the exponentiation of the cost. Therefore, sampling of low-cost trajectories is essential for the practical implementation of PI-based methods. In this paper, we use incremental sampling-based algorithms to sample useful trajectories from the unforced system dynamics, and make a novel connection between Rapidly-exploring Random Trees (RRTs) and information-theoretic stochastic optimal control. We show the results from the numerical implementation of the proposed approach to several examples.

URL PDF HTML ☆

赞 0 踩 0

1405.3094 2026-06-04 cs.RO cs.SY eess.SY 版本更新

The inverted Pendulum: A fundamental Benchmark in Control Theory and Robotics

倒立摆：控制理论与机器人学中的基本基准

Olfa Boubaker

AI总结本文探讨倒立摆作为控制理论与机器人学中重要基准的应用动机，分析其在实际设计中的实现，并通过150篇文献综述展示其在历史、现状及挑战中的发展。

Comments IEEE International Conference on Education and e-Learning Innovations (ICEELI), 1-3 July 2012, Sousse, Tunisia

1405.2363 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

A sampling-based approach to scalable constraint satisfaction in linear sampled-data systems---Part I: Computation

基于采样的可扩展约束满足方法在线性采样数据系统中的应用---第一部分：计算

Shahab Kaynama, Jeremy H. Gillula, Claire J. Tomlin

AI总结本文提出一种新的基于采样的算法，用于高维线性采样数据系统中精确逼近可行性核，解决约束满足问题，并在十二维飞行包线保护问题中验证了方法的有效性。

详情

AI中文摘要

采样数据（SD）系统由离散和连续时间组件组成，是实际中最常见的混合物理系统之一；大多数现代控制器是在数字平台上实现的，而被控制的植物动力学在时间上是连续变化的。与所有混合物理系统一样，确保硬约束满足是SD系统安全运行的关键。一种强大的分析工具用于保证此类约束满足是可行性核：所有初始条件的集合，对于存在安全保持控制律（即满足所有输入和状态约束的控制律）而言。在本文中，我们提出了一种新的基于采样的算法，该算法紧密逼近高维线性采样数据时间不变（LTI）系统的可行性核。与该领域先前的工作不同，我们的算法正式处理了SD系统的离散和连续特性。我们证明了我们近似技术的正确性和收敛性，提供了关于如何通过启发式方法优化采样过程的讨论，并在十二维飞行包线保护问题中展示了结果。

英文摘要

Sampled-data (SD) systems, which are composed of both discrete- and continuous-time components, are arguably one of the most common classes of cyberphysical systems in practice; most modern controllers are implemented on digital platforms while the plant dynamics that are being controlled evolve continuously in time. As with all cyberphysical systems, ensuring hard constraint satisfaction is key in the safe operation of SD systems. A powerful analytical tool for guaranteeing such constraint satisfaction is the viability kernel: the set of all initial conditions for which a safety-preserving control law (that is, a control law that satisfies all input and state constraints) exists. In this paper we present a novel sampling-based algorithm that tightly approximates the viability kernel for high-dimensional sampled-data linear time-invariant (LTI) systems. Unlike prior work in this area, our algorithm formally handles both the discrete and continuous characteristics of SD systems. We prove the correctness and convergence of our approximation technique, provide discussions on heuristic methods to optimally bias the sampling process, and demonstrate the results on a twelve-dimensional flight envelope protection problem.

URL PDF HTML ☆

赞 0 踩 0

1404.7073 2026-06-04 eess.SY cs.LG cs.LO cs.RO cs.SY 版本更新

Probably Approximately Correct MDP Learning and Control With Temporal Logic Constraints

在时序逻辑约束下进行近似正确马尔可夫决策过程的学习与控制

Jie Fu, Ufuk Topcu

AI总结本文提出在未知随机环境中，基于PAC-MDP方法学习满足时序逻辑规范的近优控制策略，通过多项式时间与空间复杂度实现高概率的近优策略生成。

Comments 9 pages, 5 figures, Accepted by 2014 Robotics: Science and Systems (RSS)

详情

AI中文摘要

我们考虑在未知随机环境中合成控制策略，以最大化满足给定时序逻辑规范的概率。我们将系统与环境的交互建模为具有初始未知转移概率的马尔可夫决策过程（MDP）。所开发的解决方案基于所谓的基于模型的近似正确马尔可夫决策过程（PAC-MDP）方法。该算法通过样本（即观测）、时间和空间，以多项式复杂度与MDP大小、自动机构造时序逻辑规范的大小、1/ε、1/δ和有限时间 horizon 相关，生成一个ε-近优策略，概率为1-δ。在此方法中，系统维护初始未知MDP的模型，并基于其学习模型和规范自动机构造产品MDP。在执行过程中，策略通过观察系统所采取的转移进行迭代更新。迭代在有限步骤内终止。以高概率，所生成的策略使得任何状态下，该策略满足规范的概率与最优策略之间的差异在预定义范围内。

英文摘要

We consider synthesis of control policies that maximize the probability of satisfying given temporal logic specifications in unknown, stochastic environments. We model the interaction between the system and its environment as a Markov decision process (MDP) with initially unknown transition probabilities. The solution we develop builds on the so-called model-based probably approximately correct Markov decision process (PAC-MDP) methodology. The algorithm attains an $\varepsilon$-approximately optimal policy with probability $1-δ$ using samples (i.e. observations), time and space that grow polynomially with the size of the MDP, the size of the automaton expressing the temporal logic specification, $\frac{1}{\varepsilon}$, $\frac{1}δ$ and a finite time horizon. In this approach, the system maintains a model of the initially unknown MDP, and constructs a product MDP based on its learned model and the specification automaton that expresses the temporal logic constraints. During execution, the policy is iteratively updated using observation of the transitions taken by the system. The iteration terminates in finitely many steps. With high probability, the resulting policy is such that, for any state, the difference between the probability of satisfying the specification under this policy and the optimal one is within a predefined bound.

URL PDF HTML ☆

赞 0 踩 0

1403.2174 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A New Technique for INS/GNSS Attitude and Parameter Estimation Using Online Optimization

一种新的INS/GNSS姿态和参数估计技术：使用在线优化

Yuanxin Wu, Jinling Wang, Dewen Hu

AI总结本文提出一种在线约束优化方法，用于同时估计INS/GNSS的姿态和其他相关参数，无需先验姿态或传感器噪声信息，通过数值结果验证其在高精度应用中的有效性。

Comments IEEE Trans. on Signal Processing, to appear

1404.3580 2026-06-04 cs.MA cs.NI cs.RO cs.SY eess.SY 版本更新

Joint Estimation and Localization in Sensor Networks

传感器网络中的联合估计与定位

Nikolay A. Atanasov, Roberto Tron, Victor M. Preciado, George J. Pappas

AI总结本文提出了一种分布式线性估计器用于无线传感器网络中动态目标的协同跟踪，证明了其在静态目标估计中的均方一致性，并通过分布式雅可比算法实现传感器定位，确保了联合定位与目标估计方法的强收敛性。

Comments 9 pages (two-column); 5 figures; Manuscript submitted to the 2014 IEEE Conference on Decision and Control (CDC)

详情

AI中文摘要

自主水下机器人在线近似最优保持策略

Patrick Walters, Warren E. Dixon

AI总结研究提出基于强化学习的 actor-critic 框架，用于在线近似自主水下机器人六自由度最优保持策略，实现状态和策略的统一有界收敛。

Comments 6 pages

1403.5374 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Transverse Contraction Criteria for Stability of Nonlinear Hybrid Limit Cycles

横向收缩准则用于非线性混合极限环的稳定性

Justin Z. Tang, Ian R. Manchester

AI总结本文提出基于线性矩阵不等式序列的微分条件，用于证明非线性混合极限环的轨道稳定性，通过凸优化工具如sum-of-squares编程寻找稳定性证书，无需事先知道稳定极限环在状态空间中的精确位置。

Comments Conference submission

1402.5639 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Decentralized Rendezvous of Nonholonomic Robots with Sensing and Connectivity Constraints

非holonomic机器人群体的去中心化会合

Zhen Kan, Justin Klotz, Eduardo L. Pasiliao, John M. Shea, Warren E. Dixon

AI总结研究针对具有非holonomic约束的轮式机器人，在保持网络连通性和碰撞避免的前提下，实现群体在指定点集的会合。采用去中心化时变控制器基于导航函数框架，利用局部传感反馈实现导航。

Comments 9 pages, 5 figures, submitted to Automatica

详情

AI中文摘要

本文考虑一组具有非holonomic约束的轮式机器人在指定点集实现会合，同时保持网络连通性和碰撞避免。每个机器人存在通信和传感约束，只有部分机器人知晓全局目标，其余机器人需在连通性约束下移动，使知晓目标的机器人引导群体到达目标。机器人还需在会合点附近外避免碰撞。为实现会合控制目标，基于导航函数框架开发了去中心化时变控制器，利用局部传感反馈（包括邻居位置反馈和绝对方位测量）导航机器人，实现导航期间的无线电静默。仿真结果展示了所提出方法的性能。

英文摘要

A group of wheeled robots with nonholonomic constraints is considered to rendezvous at a common specified setpoint with a desired orientation while maintaining network connectivity and ensuring collision avoidance within the robots. Given communication and sensing constraints for each robot, only a subset of the robots are aware or informed of the global destination, and the remaining robots must move within the network connectivity constraint so that the informed robots can guide the group to the goal. The mobile robots are also required to avoid collisions with each other outside a neighborhood of the common rendezvous point. To achieve the rendezvous control objective, decentralized time-varying controllers are developed based on a navigation function framework to steer the robots to perform rendezvous while preserving network connectivity and ensuring collision avoidance. Only local sensing feedback, which includes position feedback from immediate neighbors and absolute orientation measurement, is used to navigate the robots and enables radio silence during navigation. Simulation results demonstrate the performance of the developed approach.

URL PDF HTML ☆

赞 0 踩 0

1402.4004 2026-06-04 cs.RO cs.ET cs.SY eess.SY 版本更新

Design of a Hybrid Robot Control System using Memristor-Model and Ant-Inspired Based Information Transfer Protocols

基于忆阻器模型和蚂蚁启发信息传输协议的混合机器人控制系统设计

Ella Gale, Ben de Lacy Costello, Andrew Adamatzky

AI总结本文提出一种混合机器人控制方法，利用蚂蚁行为启发算法和忆阻器非线性特性，在传感器信息处理受限情况下实现快速有效的情报近似。

Comments Conference

1402.0289 2026-06-04 cs.CV cs.RO cs.SY eess.SY 版本更新

A Robust Framework for Moving-Object Detection and Vehicular Traffic Density Estimation

一种用于移动物体检测和车辆交通密度估计的稳健框架

Pranam Janney, Glenn Geers

AI总结本文提出了一种基于纹理度量的移动物体检测方法，具有计算成本低、参数调整少和抗噪声能力强的特点，实验表明其性能优于现有方法，并提出车辆交通密度估计的框架及对比分析。

1312.6573 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Trackability with Imprecise Localization

不精确定位下的跟踪性

Kyle Klein, Subhash Suri

AI总结研究在存在定位误差的情况下，跟踪器如何在同等速度下追击目标，分析最坏情况下的距离增长速率及保持目标距离所需的速度提升，以及障碍物对跟踪性能的影响。

Comments 17 pages, 9 figures

1311.4769 2026-06-04 cs.RO cs.SY eess.SY 版本更新

On 'A Kalman Filter-Based Algorithm for IMU-Camera Calibration: Observability Analysis and Performance Evaluation'

关于基于卡尔曼滤波算法的IMU-相机标定：可观测性分析与性能评估

Yuanxin Wu

AI总结本文分析了IMU-相机标定中可观测性问题，指出原始工作中的可观测性结论存在错误，影响了SLAM系统的发展。

Comments 3 pages. This work was done in 2009. Abstract revised and More refs added in this new version

1304.3663 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

Cooperative localization by dual foot-mounted inertial sensors and inter-agent ranging

通过双足安装惯性传感器和机间测距实现协同定位

John-Olof Nilsson, Dave Zachariah, Isaac Skog, Peter Händel

AI总结本文提出一种部分去中心化的系统架构，利用步进惯性导航和步进死 reckoning 降低计算和通信成本，实现大规模群体的协同定位。

Comments 14 pages

详情

DOI: 10.1186/1687-6180-2013-164
Journal ref: EURASIP Journal on Advances in Signal Processing 2013, 2013:164

AI中文摘要

本文讨论了通过双足安装惯性传感器和机间测距实现协同定位的实施挑战，并回顾了相关工作。系统架构和传感器融合被确定为关键挑战。提出了一种基于步进惯性导航和步进死 reckoning的部分去中心化系统架构，该架构可将计算成本和所需通信带宽降低两个数量级，同时与传统集中实现相比仅产生可忽略的信息损失。这使得联合全局状态估计对于多达编队规模的群体成为可能。此外，基于状态空间变换和边缘化的鲁棒且低成本的传感器融合方法被提出。变换和边缘化用于提供必要的灵活性，以支持所提出的基于采样更新的机间测距和无测距融合。最后，通过仿真和实时系统实现展示了所提实现的特点。

英文摘要

The implementation challenges of cooperative localization by dual foot-mounted inertial sensors and inter-agent ranging are discussed and work on the subject is reviewed. System architecture and sensor fusion are identified as key challenges. A partially decentralized system architecture based on step-wise inertial navigation and step-wise dead reckoning is presented. This architecture is argued to reduce the computational cost and required communication bandwidth by around two orders of magnitude while only giving negligible information loss in comparison with a naive centralized implementation. This makes a joint global state estimation feasible for up to a platoon-sized group of agents. Furthermore, robust and low-cost sensor fusion for the considered setup, based on state space transformation and marginalization, is presented. The transformation and marginalization are used to give the necessary flexibility for presented sampling based updates for the inter-agent ranging and ranging free fusion of the two feet of an individual agent. Finally, characteristics of the suggested implementation are demonstrated with simulations and a real-time system implementation.

URL PDF HTML ☆

赞 0 踩 0

1311.4625 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Control Contraction Metrics and Universal Stabilizability

控制收缩度量与通用可控性

Ian R. Manchester, Jean-Jacques E. Slotine

AI总结本文提出通用可控性概念，通过求解点wise线性矩阵不等式得到控制收缩度量，为非线性系统全局稳定提供了充分条件，并与控制Lyapunov函数理论有联系。

Comments Conference submission

1311.4527 2026-06-04 cs.AI cs.DC cs.MA cs.RO cs.SY eess.SY 版本更新

A message-passing algorithm for multi-agent trajectory planning

多智能体轨迹规划的消息传递算法

Jose Bento, Nate Derbinsky, Javier Alonso-Mora, Jonathan Yedidia

AI总结本文提出基于改进交替方向乘子法的新型算法，用于计算多智能体的无碰撞全局轨迹，具有自然并行化和易整合不同成本函数的优点。

Comments In Advances in Neural Information Processing Systems (NIPS), 2013. Demo video available at http://www.youtube.com/watch?v=yuGCkVT8Bew

1311.4419 2026-06-04 eess.SY cs.RO cs.SY physics.bio-ph 版本更新

Perception and Steering Control in Paired Bat Flight

群体飞行中的感知与转向控制

Zhaodan Kong, Kayhan Ozcimder, Nathan W. Fuller, John Baillieul

AI总结本文研究群体飞行中领导者与追随者的行为协调，提出基于虚拟目镜的转向法则，结合现有视觉转向法则生成轨迹，符合蝙蝠飞行数据统计特征。

Comments Submitted to the 19th World Congress of the International Federation of Automatic Control (IFAC)

详情

AI中文摘要

群体中的动物需要协调对环境特征和彼此的反应以安全移动。本文扩展了之前关于德克萨斯州约翰逊城附近Myotis velifer飞行模式的研究。这些蝙蝠每天傍晚以小群体形式从洞穴中出没，通常不超过三到四只，为研究领导者-追随者行为提供了理想对象。通过分析M. velifer群体的飞行路径，数据表明追随者蝙蝠的飞行行为受领导者蝙蝠影响，这种影响无法用现有追逐定律如经典追逐、恒定方位和运动伪装法充分解释。因此，我们提出了一种基于虚拟目镜的转向法则，该法则用于捕捉领导者-追随者对的几何配置。结果表明，该法则可与之前提出的视觉赋能转向法则结合，生成轨迹，其统计特征与数据集中蝙蝠的轨迹相符。结果表明，蝙蝠在导航时使用了环境和邻居的感知信息。

英文摘要

Animals within groups need to coordinate their reactions to perceived environmental features and to each other in order to safely move from one point to another. This paper extends our previously published work on the flight patterns of Myotis velifer that have been observed in a habitat near Johnson City, Texas. Each evening, these bats emerge from a cave in sequences of small groups that typically contain no more than three or four individuals, and they thus provide ideal subjects for studying leader-follower behaviors. By analyzing the flight paths of a group of M. velifer, the data show that the flight behavior of a follower bat is influenced by the flight behavior of a leader bat in a way that is not well explained by existing pursuit laws, such as classical pursuit, constant bearing and motion camouflage. Thus we propose an alternative steering law based on virtual loom, a concept we introduce to capture the geometrical configuration of the leader-follower pair. It is shown that this law may be integrated with our previously proposed vision-enabled steering laws to synthesize trajectories, the statistics of which fit with those of the bats in our data set. The results suggest that bats use perceived information of both the environment and their neighbors for navigation.

URL PDF HTML ☆

赞 0 踩 0

1311.4296 2026-06-04 cs.LG cs.NA cs.RO math.NA math.OC 版本更新

Reflection methods for user-friendly submodular optimization

用于用户友好的子模优化的反射方法

Stefanie Jegelka, Francis Bach, Suvrit Sra

AI总结本文提出一种高效的子模函数优化方法，通过连续最佳逼近问题的反射序列求解，实现连续和离散问题的双重解决，应用于图像分割任务。

Comments Neural Information Processing Systems (NIPS), États-Unis (2013)

详情

AI中文摘要

最近研究表明，子模性自然捕捉了机器学习、信号处理和计算机视觉中广泛出现的概念。因此，需要高效的子模函数优化方法，尤其是最小化问题。尽管一般子模最小化具有挑战性，我们提出了一种新方法，利用子模函数的现有分解性。与以往方法不同，该方法既非近似也非不切实际，也不需要复杂的参数调优。此外，它易于实现和并行化。我们方法的关键组成部分是将离散子模最小化问题转化为连续最佳逼近问题，通过一系列反射求解，并可通过阈值处理获得最优离散解。该方法解决了连续和离散问题，因此在学习、推断和重建中有应用。在实验中，我们展示了该方法在两个图像分割任务中的优势。

英文摘要

Recently, it has become evident that submodularity naturally captures widely occurring concepts in machine learning, signal processing and computer vision. Consequently, there is need for efficient optimization procedures for submodular functions, especially for minimization problems. While general submodular minimization is challenging, we propose a new method that exploits existing decomposability of submodular functions. In contrast to previous approaches, our method is neither approximate, nor impractical, nor does it need any cumbersome parameter tuning. Moreover, it is easy to implement and parallelize. A key component of our method is a formulation of the discrete submodular minimization problem as a continuous best approximation problem that is solved through a sequence of reflections, and its solution can be easily thresholded to obtain an optimal discrete solution. This method solves both the continuous and discrete formulations of the problem, and therefore has applications in learning, inference, and reconstruction. In our experiments, we illustrate the benefits of our method on two image segmentation tasks.

URL PDF HTML ☆

赞 0 踩 0

1311.2796 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Mixed Human-Robot Team Surveillance

混合人机团队监视

Vaibhav Srivastava, Amit Surana, Miguel P. Eckstein, Francesco Bullo

AI总结本文研究了混合人机团队在监视任务中的系统理论设计，提出人机决策模型及高效注意力分配策略，通过异常检测算法和随机车辆路由策略实现高效监视。

Comments 16 pages

详情

AI中文摘要

我们从系统理论的角度研究混合人机团队设计，以监视任务为背景。混合团队设计的三个关键耦合组件是（i）人操作员的策略，（ii）补偿错误人决策的策略，（iii）控制自动机的策略。本文调研了人决策的要素，包括证据聚合、态势感知、疲劳和记忆效应。我们将这些要素的模型结合起来，开发出一个单一的、连贯的人决策模型，用于双替代选择任务。我们利用该模型设计高效的注意力分配策略给人操作员。我们提出了一种异常检测算法，利用操作员可能的错误决策来确定被监视区域中的异常区域。最后，我们提出了一种随机车辆路由策略，以高概率监视异常区域。我们的混合团队设计依赖于确定性等价滚动时域控制框架。

英文摘要

We study the mixed human-robot team design in a system theoretic setting using the context of a surveillance mission. The three key coupled components of a mixed team design are (i) policies for the human operator, (ii) policies to account for erroneous human decisions, and (iii) policies to control the automaton. In this paper, we survey elements of human decision-making, including evidence aggregation, situational awareness, fatigue, and memory effects. We bring together the models for these elements in human decision-making to develop a single coherent model for human decision-making in a two-alternative choice task. We utilize the developed model to design efficient attention allocation policies for the human operator. We propose an anomaly detection algorithm that utilizes potentially erroneous decision by the operator to ascertain an anomalous region among the set of regions surveilled. Finally, we propose a stochastic vehicle routing policy that surveils an anomalous region with high probability. Our mixed team design relies on the certainty-equivalent receding-horizon control framework.

URL PDF HTML ☆

赞 0 踩 0

1311.1761 2026-06-04 cs.LG cs.AI cs.NE cs.RO cs.SY eess.SY 版本更新

Exploring Deep and Recurrent Architectures for Optimal Control

探索深度和循环架构以实现最优控制

Sergey Levine

AI总结本文探讨了将深度和循环神经网络应用于连续高维运动控制任务，通过强化学习算法训练控制器，比较不同架构的性能，并讨论深度学习在最优控制中的应用前景。

Comments Appears in the Neural Information Processing Systems (NIPS 2013) Workshop on Deep Learning

详情

AI中文摘要

复杂的多层神经网络在多个监督任务中取得了最先进的结果。然而，此类多层网络在控制领域的成功应用迄今为止主要局限于控制流水线的感知部分。本文探讨了将深度和循环神经网络应用于连续、高维运动任务，其中网络用于表示控制策略，将系统状态（由关节角度表示）直接映射到每个关节的扭矩。通过使用最近的强化学习算法guided policy search，可以成功训练具有数千参数的神经网络控制器，从而比较各种架构。我们讨论了运动控制任务与先前监督感知任务的区别，展示了比较各种架构的实验结果，并讨论了将深度学习技术应用于最优控制问题的未来方向。

英文摘要

Sophisticated multilayer neural networks have achieved state of the art results on multiple supervised tasks. However, successful applications of such multilayer networks to control have so far been limited largely to the perception portion of the control pipeline. In this paper, we explore the application of deep and recurrent neural networks to a continuous, high-dimensional locomotion task, where the network is used to represent a control policy that maps the state of the system (represented by joint angles) directly to the torques at each joint. By using a recent reinforcement learning algorithm called guided policy search, we can successfully train neural network controllers with thousands of parameters, allowing us to compare a variety of architectures. We discuss the differences between the locomotion control task and previous supervised perception tasks, present experimental results comparing various architectures, and discuss future directions in the application of techniques from deep learning to the problem of optimal control.

URL PDF HTML ☆

赞 0 踩 0

1310.7568 2026-06-04 q-bio.QM cs.RO cs.SY eess.SY 版本更新

Interlimb neural connection is not required for gait transition in quadruped locomotion

四肢神经连接在四足运动中步态转换中并非必需

Atsushi Tero, Masakazu Akiyama, Dai Owaki, Takeshi Kano, Akio Ishiguro, Ryo Kobayashi

AI总结本文提出一种耦合振荡器模型，通过身体物理相互作用生成步态模式，发现改变肢体运动速度可自发产生类似真实四足动物的步态转换，且改变物理特性可使 pacing 替代 trotting，模型有助于理解生物运动原理及设计节能腿式机器人。

Comments 6 pages, 2figures

1310.7062 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Real-Time Planning with Primitives for Dynamic Walking over Uneven Terrain

动态不平地形下基于原始体的实时规划

Ian R. Manchester, Jack Umenberger

AI总结本文提出一种基于有限运动原语的实时运动规划算法，利用虚拟holonomic约束构建闭式解和二叉搜索树，提升动态步行规划效率。

Comments Conference submission

1309.5401 2026-06-04 cs.RO cs.CV cs.SY eess.SY 版本更新

Nonmyopic View Planning for Active Object Detection

非我的视图规划用于主动物体检测

Nikolay Atanasov, Bharath Sankaran, Jerome Le Ny, George J. Pappas, Kostas Daniilidis

AI总结本文提出通过控制移动深度相机视角进行主动物体检测，通过规划视图序列平衡移动能耗与识别正确假设的概率，实验表明优于贪心方法。

Comments 12 pages (two-column); 7 figures; 2 tables; Manuscript submitted to the IEEE Transactions on Robotics (TRO)

详情

AI中文摘要

计算机视觉中的核心问题之一是语义重要物体的检测以及其姿态的估计。大多数物体检测工作基于单张图像处理，其性能受限于遮挡和外观和几何的模糊性。本文提出了一种主动检测方法，通过控制移动深度相机的视角进行物体检测。当初始静态检测阶段识别出感兴趣的物体时，会针对其类别和方向提出多个假设。传感器随后规划一系列视角，平衡移动所消耗的能量与识别正确假设的概率。我们提出了一个包含传感器移动性的主动假设检验问题，并使用基于点的近似POMDP算法进行求解。通过仿真和实际世界实验验证了我们的方法的有效性，结果表明我们的方法优于广泛使用的贪心视角选择方法，并在静态物体检测上提供了显著改进。

英文摘要

One of the central problems in computer vision is the detection of semantically important objects and the estimation of their pose. Most of the work in object detection has been based on single image processing and its performance is limited by occlusions and ambiguity in appearance and geometry. This paper proposes an active approach to object detection by controlling the point of view of a mobile depth camera. When an initial static detection phase identifies an object of interest, several hypotheses are made about its class and orientation. The sensor then plans a sequence of views, which balances the amount of energy used to move with the chance of identifying the correct hypothesis. We formulate an active hypothesis testing problem, which includes sensor mobility, and solve it using a point-based approximate POMDP algorithm. The validity of our approach is verified through simulation and real-world experiments with the PR2 robot. The results suggest that our approach outperforms the widely-used greedy view point selection and provides a significant improvement over static object detection.

URL PDF HTML ☆

赞 0 踩 0

1309.5390 2026-06-04 eess.SY cs.RO cs.SY math.DS math.OC 版本更新

Information Acquisition with Sensing Robots: Algorithms and Error Bounds

利用传感机器人获取信息：算法与误差界

Nikolay Atanasov, Jerome Le Ny, Kostas Daniilidis, George J. Pappas

AI总结本文提出一种非贪心算法，考虑传感器动态，提供子最优保证，优于传统贪心方法，适用于非线性传感模型。

Comments 9 pages (two-column); 2 figures; Manuscript submitted to the 2014 IEEE International Conference on Robotics and Automation

详情

AI中文摘要

利用可配置传感系统的功能需要解决困难的信息收集问题。对于无内部状态的传感系统，存在近优解方法。然而，当优化移动传感器轨迹时，解决方案通常是贪心的，很少提供性能保证。值得注意的是，在线性高斯假设下，问题变为确定性问题，可离线求解。基于子模性的方法通常忽略传感器动态，贪心选择环境中的信息位置。本文提出一种非贪心算法，不依赖子模性，考虑传感器动态。我们的方法在理论上优于广泛使用的贪心方法。结合线性化和模型预测控制，可用于生成适应性策略。应用包括气体浓度映射和目标跟踪。

英文摘要

Utilizing the capabilities of configurable sensing systems requires addressing difficult information gathering problems. Near-optimal approaches exist for sensing systems without internal states. However, when it comes to optimizing the trajectories of mobile sensors the solutions are often greedy and rarely provide performance guarantees. Notably, under linear Gaussian assumptions, the problem becomes deterministic and can be solved off-line. Approaches based on submodularity have been applied by ignoring the sensor dynamics and greedily selecting informative locations in the environment. This paper presents a non-greedy algorithm with suboptimality guarantees, which does not rely on submodularity and takes the sensor dynamics into account. Our method performs provably better than the widely used greedy one. Coupled with linearization and model predictive control, it can be used to generate adaptive policies for mobile sensors with non-linear sensing models. Applications in gas concentration mapping and target tracking are presented.

URL PDF HTML ☆

赞 0 踩 0

1309.1029 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Sensor Setups for State and Wind Estimation for Airborne Wind Energy Converters

用于空载风能转换器状态和风速估计的传感器配置

Maximilian Ranneberg

AI总结本文提出了一种无迹卡尔曼滤波器，用于空载风能转换器的空气动力学、状态和风况估计，通过不同传感器配置验证，展示了低成本高可靠性的可行性。

1309.1788 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Web Standards as Standard Pieces in Robotics

网络标准作为机器人中的标准组件

Sekou L. Remy

AI总结本文探讨了网络技术在机器人控制中的应用，指出尽管存在性能担忧，但网络技术能有效提升系统集成，并展示了通过Web服务器和浏览器实现开环和闭环控制的实验结果。

Comments 9th Annual IEEE International Conference on Automation Science and Engineering (IEEE CASE 2013)

详情

AI中文摘要

现代机器人技术常常利用网络技术来应对设计和操作的复杂性。许多网络技术已被正式化为标准，但机器人和控制系统领域的人士往往因担心网络速度或不确定性而回避这些标准。本文认为，尽管网络技术不适用于所有控制场景，但不应被完全摒弃，因为它们能为系统集成提供关键帮助。网络技术在过去十年中取得了显著进步。我们展示了使用Web服务器在不同网络拓扑上实现开环和闭环控制（3Hz至1kHz）的应用细节。此外，我们还考虑了使用Web浏览器实现植物控制的影响。实验结果证实，网络技术可用于有意义的控制，并突显了限制其适用性的设计选择。

英文摘要

Modern robotics often involves the use of web technologies as a means to cope with the complexity of design and operation. Many of these technologies have been formalized into standards, which are often avoided by those in robotics and controls because of a sometimes warranted fear that "the web" is too slow, or too uncertain for meaningful control applications. In this work we argue that while web technologies may not be applicable for all control, they should not be dismissed outright because they can provide critical help with system integration. Web technologies have also advanced significantly over the past decade. We present the details of an application of a web server to perform open and close-loop control (between 3Hz and 1kHz) over a variety of different network topologies. In our study we also consider the impact of a web browser to implement the control of the plant. Our results confirm that meaningful control can be performed using web technologies, and also highlight design choices that can limit their applicability.

URL PDF HTML ☆

赞 0 踩 0

1308.6250 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Circumnavigation of an Unknown Target Using UAVs with Range and Range Rate Measurements

利用UAV的测距和测距率测量绕未知目标 circumnavigation

Yongcan Cao, Jonathan Muse, David Casbeer, Derek Kingston

AI总结本文提出两种控制算法，使UAV能绕未知目标飞行。算法通过测距和测距率测量，使UAV在轨道外或在轨道上时趋近预定轨道切线，在轨道内时施零控制。两种算法在平滑性和饱和性上有所区别，通过Lyapunov函数设计证明算法能实现任意初始状态下的预定轨道。

Comments To appear in IEEE Conference on Decision and Control, 2013

1308.3015 2026-06-04 cs.RO cs.SY eess.SY stat.CO stat.ME 版本更新

On Generalized Bayesian Data Fusion with Complex Models in Large Scale Networks

在大规模网络中使用复杂模型的广义贝叶斯数据融合

Nisar Ahmed, Tsung-Lin Yang, Mark Campbell

AI总结本文提出新的广义贝叶斯分布式数据融合算法，用于处理动态网络拓扑和复杂信念模型，通过混合pdf和条件因子提升多机器人目标搜索中的融合效果。

Comments Revised version of paper submitted to 2013 Workshop on Wireless Intelligent Sensor Networks (WISeNET 2013) at Duke University, June 5, 2013

详情

AI中文摘要

近年来，通信、移动计算和人工智能的进步大大扩展了智能分布式传感器网络的应用空间。这反过来推动了开发广义贝叶斯分布式数据融合（DDF）算法，以在自主代理之间实现鲁棒且高效的资源共享，使用概率信念模型。然而，DDF在需要使用动态/即需网络拓扑和复杂信念模型（如高斯混合或混合贝叶斯网络）的一般现实应用中显著难以实施。为了解决这些问题，我们首先讨论了关于精确DDF和保守近似DDF的一些新关键数学见解。这些见解随后被用来开发基于混合pdf和条件因子的新型广义DDF算法。受多机器人目标搜索启发的数值示例显示，我们的方法导致显著更好的融合结果，因此有潜力增强传感器网络中的分布式智能推理。

英文摘要

Recent advances in communications, mobile computing, and artificial intelligence have greatly expanded the application space of intelligent distributed sensor networks. This in turn motivates the development of generalized Bayesian decentralized data fusion (DDF) algorithms for robust and efficient information sharing among autonomous agents using probabilistic belief models. However, DDF is significantly challenging to implement for general real-world applications requiring the use of dynamic/ad hoc network topologies and complex belief models, such as Gaussian mixtures or hybrid Bayesian networks. To tackle these issues, we first discuss some new key mathematical insights about exact DDF and conservative approximations to DDF. These insights are then used to develop novel generalized DDF algorithms for complex beliefs based on mixture pdfs and conditional factors. Numerical examples motivated by multi-robot target search demonstrate that our methods lead to significantly better fusion results, and thus have great potential to enhance distributed intelligent reasoning in sensor networks.

URL PDF HTML ☆

赞 0 踩 0

1308.2923 2026-06-04 cs.NI cs.RO cs.SY eess.SY math.OC 版本更新

Robotic Message Ferrying for Wireless Networks using Coarse-Grained Backpressure Control

用于无线网络的机器人信息中继：基于粗粒度背压控制

Shangxing Wang, Andrea Gasparri, Bhaskar Krishnamachari

AI总结本文研究了机器人在无线网络中中继信息的容量区域，提出粗粒度背压中继算法，通过优化匹配实现稳定传输，证明在可控移动下可实现常数容量并保证有限延迟。

详情

AI中文摘要

基于Bingham分布的方位递归估计

Gerhard Kurz, Igor Gilitschenski, Simon Julier, Uwe D. Hanebeck

AI总结本文提出基于Bingham分布的二维递归滤波器，用于解决具有180度对称性的圆形滤波问题，适用于实时应用，并可扩展至四元数以跟踪三维方位。

1304.3075 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Application of Evidential Reasoning to Helicopter Flight Path Control

证据推理在直升机飞行路径控制中的应用

Shoshana Abel

AI总结本文提出了一种专家系统推理和知识表示方法，用于在实时车辆导航系统中处理不确定性。提出了一种创新的证据推理方法，即求和与格点方法，并进行了数学推导、并行环境实现及原型软件开发和测试。

Comments Appears in Proceedings of the Second Conference on Uncertainty in Artificial Intelligence (UAI1986)