arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.05160 2026-06-04 cs.RO 版本更新

GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors

GRAIL: 从3D资产和视频先验生成人形机器人全身操作

Tianyi Xie, Haotian Zhang, Jinhyung Park, Zi Wang, Bowen Wen, Jiefeng Li, Xueting Li, Qingwei Ben, Haoyang Weng, Yufei Ye, David Minor, Tingwu Wang, Chenfanfu Jiang, Sanja Fidler, Jan Kautz, Linxi Fan, Yuke Zhu, Zhengyi Luo, Umar Iqbal, Ye Yuan

发表机构 * NVIDIA UCLA(加州大学洛杉矶分校)

AI总结 提出GRAIL全虚拟生成管线,利用3D资产和视频基础模型先验合成人机交互演示,无需物理搭建或遥操作,实现人形机器人全身操作策略的模拟到现实迁移。

Comments Project page: https://research.nvidia.com/labs/dair/grail/

详情
AI中文摘要

扩展人形机器人全身操作需要跨多样物体、全身运动和场景几何的机器人兼容演示,但遥操作和动作捕捉难以规模化,因为每次采集都依赖于物理设置、仪器化演员和机器人操作。我们提出GRAIL,一个在部署前完全保持虚拟的数字生成管线:它组合3D资产、模拟器就绪场景和视频基础模型(VFM)的先验,以合成交互,无需重建物理环境或遥操作机器人。GRAIL并非重建无约束的野外视频,而是从完全指定的3D配置开始,其中物体几何、相机参数、度量尺度、环境深度和机器人比例的角色在视频生成前已知,并在重建过程中重复使用。这种特权设置更好地约束了4D恢复,允许基于模型的物体跟踪、人体运动估计和交互感知优化,以重建度量的4D人-物交互(HOI)轨迹,减少了深度模糊和形态不匹配。我们将恢复的运动重定向到人形机器人,并训练互补的任务通用跟踪器:用于操作的对象感知潜在适配器和用于地形穿越的场景感知跟踪器。GRAIL生成了超过20,000个序列,涵盖拾取、物体操作、坐姿和地形穿越。仅使用GRAIL生成的数据,我们通过模拟到现实管线训练自我中心视觉策略,并将其部署在Unitree G1人形机器人上,在多样物体拾取上实现了84%的真实世界成功率,在爬楼梯上实现了90%的成功率。

英文摘要

Scaling humanoid loco-manipulation requires robot-compatible demonstrations across diverse objects, whole-body motions, and scene geometries, but teleoperation and motion capture are difficult to scale because each collection depends on physical setups, instrumented actors, and robot operation. We present GRAIL, a digital generation pipeline that remains fully virtual until deployment: it composes 3D assets, simulator-ready scenes, and priors from video foundation models (VFMs) to synthesize interactions without rebuilding physical environments or teleoperating the robot. Rather than reconstructing unconstrained in-the-wild videos, GRAIL starts from fully specified 3D configurations in which object geometry, camera parameters, metric scale, environment depth, and a robot-proportioned character are known before video generation and reused during reconstruction. This privileged setup better conditions 4D recovery, allowing model-based object tracking, human motion estimation, and interaction-aware optimization to reconstruct metric 4D human-object interaction (HOI) trajectories with reduced depth ambiguity and morphology mismatch. We retarget the recovered motions to a humanoid robot and train complementary task-general trackers: an object-aware latent adaptor for manipulation and a scene-aware tracker for terrain traversal. GRAIL produces over 20,000 sequences spanning pick-up, object manipulation, sitting, and terrain traversal. Using only GRAIL-generated data, we train egocentric visual policies through a sim-to-real pipeline and deploy them on a Unitree G1 humanoid, achieving 84\% real-world success on diverse object pick-up and 90\% success on stair-climbing.

2606.05159 2026-06-04 cs.RO 版本更新

X4Val: Learning Neural Surrogates for Variance-Reduced Policy Evaluation

X4Val: 学习方差缩减策略评估的神经代理模型

Rachel Luo, Michael Watson, Apoorva Sharma, Heng Yang, Han Qi, Edward Schmerling, Sushant Veer, Boris Ivanovic, Marco Pavone

发表机构 * NVIDIA Research(NVIDIA研究院) Harvard University(哈佛大学) Stanford University(斯坦福大学)

AI总结 提出X4Val框架,通过嵌入多域数据并学习可迁移预测器,结合控制变量估计器实现无配对样本下的方差缩减,在自动驾驶和机器人操作任务中方差降低达38.4%。

详情
AI中文摘要

对基于学习的机器人系统进行严格评估是部署的必要前提。然而,真实世界的测试数据收集成本高昂;此外,在典型的迭代开发环境中,从最新策略收集的数据规模必然有限。这促使我们利用异构数据源(包括仿真、历史策略日志以及从相关平台或环境收集的数据)的评估方法。虽然此类辅助数据丰富且廉价,但它们通常不能直接代表真实世界的结果——例如,仿真中的性能可能与真实世界中的性能存在显著差异——这使得它们在高置信度性能估计中的原则性使用具有挑战性。在本文中,我们介绍了X4Val,一个在存在非配对、多域数据的情况下进行方差缩减的真实世界指标估计的通用框架。X4Val将来自真实域和辅助域的样本嵌入到一个共享表示空间中,并学习一个可迁移的真实世界指标预测器;然后将这个学习到的预测器纳入控制变量估计器,即使在无配对样本的情况下也能实现方差缩减。我们提供了理论分析,并在自动驾驶和真实世界机器人操作任务上进行了实证评估,在这些领域中,X4Val实现了高达38.4%的方差缩减,并表现出相对于强基线的持续改进。这些结果表明,非配对的异构数据可以被利用来显著提高严格机器人系统验证的样本效率。

英文摘要

Rigorous evaluation of learning-based robotic systems is an essential prerequisite for deployment. However, real-world test data is expensive to gather; moreover, in a typical iterative development context, data gathered from the latest policy is necessarily limited in scale. This motivates evaluation methodologies that make use of heterogeneous data sources, including simulation, historical policy logs, and data collected from related platforms or environments. While such auxiliary data are abundant and inexpensive, they are generally not directly representative of real-world outcomes -- for example, performance in simulation may differ substantially from performance in the real world -- making their principled use for high-confidence performance estimation challenging. In this paper, we introduce X4Val, a general framework for variance-reduced real-world metric estimation in the presence of non-paired, multi-domain data. X4Val embeds samples from real and auxiliary domains into a shared representation space and learns a transferable predictor of real-world metrics; this learned predictor is then incorporated into a control-variates estimator, enabling variance reduction even when paired samples are unavailable. We provide theoretical analysis and empirical evaluations on autonomous driving and real-world robot manipulation tasks, domains across which X4Val achieves up to 38.4% variance reduction and demonstrates consistent improvements over strong baselines. These results show that non-paired, heterogeneous data can be leveraged to substantially improve the sample efficiency of rigorous robotic system validation.

2606.05143 2026-06-04 cs.RO 版本更新

HORIZON: Recoverability-Governed Curriculum for Physical-Domain Scaling

HORIZON: 基于可恢复性的物理域缩放课程

Chenhao Bai, Liqin Lu, Kaijun Wang, Hui Chen, Jin-Chuan Shi, Yuyang Liu, Hao Chen, Chunhua Shen

发表机构 * Zhejiang University, State Key Lab of CAD & CG(浙江大学,计算机辅助设计与图形学国家重点实验室) Zhejiang University of Technology(浙江工业大学)

AI总结 针对机器人策略在物理域缩放中的可学习性问题,提出基于可恢复性的前沿课程HORIZON,通过回滚和边界细化逐步扩展物理域,实验揭示了物理域扩展的三个规律。

Comments 16 pages, 9 figures

详情
AI中文摘要

扩展鲁棒的机器人策略需要的不仅仅是更广泛的随机化,因为物理域经验必须在整个训练过程中保持有序和可学习。我们研究了策略何时能从更难的物理中受益,并确定可恢复性是在策略物理域缩放中的核心约束。在在策略训练中,新的动态仅当它们足够接近当前策略以生成纠正性的在策略数据时才有用,而不是将轨迹崩溃为不可恢复的失败。使用四足运动作为具身泛化的物理要求高的基准,我们引入了HORIZON,一种检查点前沿课程,仅在当前策略的可恢复边界内扩展物理域。HORIZON使用回滚和边界细化来管理每个扩展步骤,将固定随机化转变为物理域增长的持续过程。实验揭示了物理域扩展的三个规律。首先,直接域扩展在物理轴上是非均匀的,并且通常在没有阶段排序的情况下不可学习。其次,域组合是非单调的,在紧凑核心之外添加更多域可能会稀释可恢复的联合样本并降低整体鲁棒性。第三,孤立专家的离线蒸馏不能替代在策略课程生成的联合交互。这些结果共同将物理域泛化框架为具身控制的持续增长问题,以可恢复性作为在策略扩展的组织原则。

英文摘要

Scaling robust robot policies requires more than broader randomization, because physical-domain experience must remain organized and learnable throughout training. We study when a policy can benefit from harder physics and identify recoverability as a central constraint in on-policy physical-domain scaling. In on-policy training, new dynamics are useful only insofar as they remain close enough to the current policy to generate corrective on-policy data, rather than collapsing rollouts into unrecoverable failures. Using quadruped locomotion as a physically demanding benchmark for embodied generalization, we introduce HORIZON, a checkpointed frontier curriculum that expands physical domains only within the current policy's recoverable boundary. HORIZON uses rollback and boundary refinement to govern each expansion step, turning fixed randomization into a continual process of physical-domain growth. Experiments reveal three regularities of physical-domain expansion. First, direct domain widening is uneven across physical axes and often unlearnable without staged ordering. Second, domain composition is non-monotonic, and adding more domains beyond a compact core can dilute recoverable joint samples and reduce overall robustness. Third, offline distillation of isolated experts cannot substitute for the joint interaction generated by on-policy curriculum. Together, these results frame physical-domain generalization as a continual growth problem for embodied control, with recoverability as the organizing principle for on-policy expansion.

2606.05015 2026-06-04 cs.RO 版本更新

Generalization of World Models under Environmental Variability for Vision-based Quadrotor Navigation

环境变异性下基于视觉的四旋翼导航的世界模型泛化

Luca Zanatta, Grzegorz Malczyk, Kostas Alexis

发表机构 * Norwegian University of Science and Technology(挪威科学与技术大学)

AI总结 通过基于视觉的四旋翼导航测试,研究世界模型在不同环境随机性下的鲁棒性,发现自监督预训练阶段的泛化能力是模拟到现实迁移的强预测因子,并识别出离散潜在大小和训练序列长度是关键因素。

详情
AI中文摘要

世界模型,即学习预测环境演化的生成模型,已成为样本高效机器人学习的有前景工具。然而,它们对环境变异性的鲁棒性仍知之甚少。为解决这一问题,我们以基于视觉的四旋翼导航为测试平台进行系统研究,在不同环境随机性水平下训练基于DreamerV3的世界模型,并通过跨环境验证(涵盖自监督学习预训练和强化学习微调)在所有水平上评估它们。然后,我们将所有世界模型及相关导航策略部署到真实四旋翼上,在未见环境中进行测试,包括一次开环运行,其中模型仅接收2.5秒的真实感官输入,之后所有传感器被切断,系统完全依靠想象导航穿越12米距离。结果表明,自监督预训练阶段的世界模型鲁棒性是模拟到现实迁移的强预测因子:在跨环境自监督验证中泛化良好的每个模型都成功部署到真实世界,通过窄至0.67米的间隙,而在模拟策略评估中占主导地位的模型却在真实平台上失败。我们进一步识别出(a)离散潜在大小和(b)训练序列长度是控制世界模型质量的主要因素。

英文摘要

World models, learned generative models that predict how an environment evolves, have become a promising tool for sample-efficient robot learning. Yet how robust they are to environmental variability remains poorly understood. To address this, we conduct a systematic study using vision-based quadrotor navigation as a testbed problem, training DreamerV3-based world models under varying levels of environmental randomness and evaluating them across all levels through cross-environment validation, spanning both Self-Supervised Learning (SSL) pretraining and Reinforcement Learning (RL) fine-tuning. We then deploy all world models and associated navigation policies on a real quadrotor in unseen environments, including an open-loop run where the model receives just 2.5s of real sensory input before all sensors are cut off, leaving the system to navigate entirely in imagination over a 12m traverse. Our results show that world model robustness during SSL pretraining is a strong predictor of sim-to-real transfer: every model that generalized well in cross-environment SSL validation deployed successfully in the real world, passing through gaps as narrow as 0.67m, whereas the model that dominated simulation policy evaluation failed on the real platform. We further identify (a) the discrete latent size and (b) the training-sequence length as the dominant factors governing world model quality.

2606.05011 2026-06-04 cs.CV cs.RO 版本更新

CIPER: A Unified Framework for Cross-view Image-retrieval and Pose-estimation

CIPER: 跨视图图像检索与姿态估计的统一框架

Yurim Jeon, Dongseong Seo, Seung-Woo Seo

发表机构 * Seoul National University(首尔国立大学)

AI总结 提出CIPER框架,通过共享Transformer编码器和任务特定令牌联合进行城市级跨视图检索与精确3自由度姿态估计,实现互惠特征学习。

Comments 16 pages, 5 figures

详情
AI中文摘要

跨视图地理定位通过将地面图像与航拍图像数据库匹配来估计其地理位置。现有方法要么通过大规模检索,要么通过精确姿态估计来处理,但无法兼顾:基于检索的方法能够进行广域搜索,但牺牲了定位精度;而姿态估计方法仅在狭窄的搜索空间内实现高精度。简单级联这些流程会导致误差传播和特征表示不一致。我们将跨视图地理定位形式化为一个统一问题,要求同时进行城市级检索和精确的3自由度姿态估计。我们提出CIPER(跨视图图像检索与姿态估计变换器),这是一种单一架构,通过互惠特征学习联合执行两项任务。CIPER使用共享的Transformer编码器和任务特定令牌,将全局检索特征与空间定位线索分离。为了弥合地面和航拍视图之间的大领域差距,我们引入了一个双向Transformer姿态解码器,该解码器使用地面特征作为空间查询进行双向交叉注意力。一种集合预测策略进一步在统一的多任务目标下实现稳定的3自由度回归。在VIGOR、KITTI和Ford Multi-AV上的实验表明,特别是在有限的视野和任意方向条件下,性能具有竞争力。代码可在https://github.com/yurimjeon1892/CIPER获取。

英文摘要

Cross-view geo-localization estimates the geographic location of a ground image by matching it against an aerial image database. Existing methods tackle this through either large-scale retrieval or precise pose estimation, but not both: retrieval-based methods enable wide-area search at the cost of localization accuracy, while pose estimation methods achieve high precision within only a narrow search space. Naively cascading these pipelines introduces error propagation and inconsistent feature representations. We formulate cross-view geo-localization as a unified problem requiring simultaneous city-scale retrieval and precise 3-DoF pose estimation. We propose CIPER (Cross-view Image-retrieval and Pose-estimation transformER), a single architecture that jointly performs both tasks through mutually beneficial feature learning. CIPER uses a shared transformer encoder with task-specific tokens to disentangle global retrieval features from spatial localization cues. To bridge the large domain gap between ground and aerial views, we introduce a two-way transformer pose decoder that uses ground features as spatial queries for bidirectional cross-attention. A set prediction strategy further enables stable 3-DoF regression under a unified multi-task objective. Experiments on VIGOR, KITTI, and Ford Multi-AV demonstrate competitive performance, especially under limited field-of-view and arbitrary orientation conditions. Code is available at https://github.com/yurimjeon1892/CIPER.

2606.04989 2026-06-04 cs.HC cs.RO 版本更新

What Can Eye Gaze Teach Us About Real-World Cycling? Insights From the Oxford RobotCycle Project

眼动能教会我们关于真实世界骑行的什么?来自牛津RobotCycle项目的见解

Benjamin Hardin, Efimia Panagiotaki, Daniele De Martini, Lars Kunze

发表机构 * University of Oxford(牛津大学) University of the West of England(西英格兰大学)

AI总结 本研究利用可穿戴眼动追踪眼镜,通过分析不同环境(如自行车道、汽车道和共享公交车道)和事件(如超车和行人)下的眼动模式,揭示了骑行中感知危险的潜意识差异,并评估了眼动追踪在估计骑行压力和认知负荷方面的潜力。

详情
AI中文摘要

尽管对骑行情境的身体危险已有较多了解,但对骑行的感知危险知之甚少。此外,危险感知可能在潜意识层面被过滤,因此难以自我报告。为此,这些潜意识感知可以通过眼动等生理指标揭示。本文探讨了英国牛津骑行的感知安全性,并研究了可穿戴眼动追踪眼镜在不同环境和事件下产生关于感知差异见解的能力。本文发现,在自行车道、汽车道和共享公交车道之间,眼动模式发生变化,代表了每种车道类型的不同认知挑战。本文表明,不同交叉路口的眼动模式显著不同,这可能对骑行者的压力有影响。最后,与无事件骑行相比,在超车和道路行人等事件发生时,眼动模式存在差异。本文总结了使用可穿戴眼动追踪器估计压力和骑行者工作量的优点和局限性。

英文摘要

Although much is known about the physical danger of cycling situations, less is understood about the perceived danger of cycling. Furthermore, perception of danger may be filtered at a subconscious level and therefore difficult for one to self-report. To this end, these subconscious perceptions can be revealed through physiological metrics such as eye gaze. This paper explores the perceived safety of cycling in Oxford, United Kingdom and explores the ability of wearable eye tracking glasses to produce insights about the differences in perception under different environments and events. This paper finds that eye gaze patterns change between using bike lanes, car lanes and shared bus lanes, representing different cognitive challenges of each lane type. This paper presents that different intersections have significantly different eye gaze patterns which may have implications for cyclist stress. Finally, eye gaze patterns differ in the presence of events such as passes and pedestrians in the road compared to when cycling with no events. This paper draws conclusions on the benefits and limitations of using wearable eye trackers to estimate stress and cyclist workload.

2606.04968 2026-06-04 cs.RO 版本更新

Potential-Guided Flow Matching for Vision-Language-Action Policy Improvement

势引导的流匹配用于视觉-语言-动作策略改进

Yunpeng Mei, Jiakai He, Hongjie Cao, Chenyu Wang, Xiaowen Zhu, Yihan Zhou, Jiamin Wang, Chenbo Xin, Peng Cheng, Yuxuan Yang, Yijie Wang, Xinhu Zheng, Gao Huang, Jie Chen, Gang Wang

发表机构 * Nanyang Technological University(南洋理工大学) Tsinghua University(清华大学) University of Science and Technology of China(中国科学技术大学)

AI总结 提出ForesightFlow,一种自引导流匹配策略,通过解耦优势加权流匹配和一步边界估计器,无需外部评论家即可改进视觉-语言-动作策略。

详情
AI中文摘要

大型视觉-语言-动作(VLA)策略越来越多地被训练为动作块上的条件生成模型。然而,部署会产生混合质量的体验——成功的演示、部分完成、可恢复的错误和失败——这些难以与标准模仿一起使用。完整的行为克隆(BC)模仿失败,过滤后的BC丢弃有用的子轨迹,而离线强化学习增加了大型评论家。我们引入了ForesightFlow,一种自引导流匹配策略,它为每个生成的动作块增加一个学习到的成功势轨迹。同一个流提出并评分候选动作,实现了无需外部评论家的最佳K选择推理。关键问题是策略改进和价值校准需要不同的监督:优势加权应强调高质量动作,但将相同的权重应用于势坐标会抑制失败梯度并产生过度自信的分数。我们通过解耦优势加权流匹配来解决这个问题,将指数化优势权重仅应用于动作速度,同时均匀训练势速度。我们进一步推导了条件流匹配的一步边界估计器,允许通过单次停止梯度前向传递计算优势。在五个BEHAVIOR-1K模拟任务和五个真实世界双臂任务中,ForesightFlow优于模仿基线,在模拟成功率上与最强的分离评论家基线持平,提高了真实世界成功率,并将训练计算量减少了38%。消融实验表明,解耦防止了价值幻觉,一步估计器保持了候选排名保真度,自引导采样改善了长时程执行。

英文摘要

Large vision-language-action (VLA) policies are increasingly trained as conditional generative models over action chunks. Yet deployment produces mixed-quality experience-successful demonstrations, partial completions, recoverable mistakes, and failures-that is difficult to use with standard imitation. Full behavior cloning (BC) imitates failures, filtered BC discards useful sub-trajectories, and offline reinforcement learning adds a large critic. We introduce ForesightFlow, a self-guided flow-matching policy that augments each generated action chunk with a learned success-potential trajectory. The same flow proposes and scores candidate actions, enabling best-of-$K$ inference without an external critic. The key issue is that policy improvement and value calibration require different supervision: advantage weighting should emphasize high-quality actions, but applying the same weights to potential coordinates suppresses failure gradients and creates overconfident scores. We address this with decoupled advantage-weighted flow matching, applying exponentiated advantage weights only to action velocities while training potential velocities uniformly. We further derive a one-step boundary estimator for conditional flow matching, allowing advantage computation with a single stop-gradient forward pass. Across five BEHAVIOR-1K simulation tasks and five real-world bimanual tasks, ForesightFlow improves over imitation baselines, matches the strongest separate-critic baseline in simulation success, improves real-world success, and reduces training compute by $38\%$. Ablations show that decoupling prevents value hallucination, the one-step estimator preserves candidate-ranking fidelity, and self-guided sampling improves long-horizon execution.

2606.04884 2026-06-04 cs.RO 版本更新

D$^3$-MoE:Dual Disentangled Diffusion Mixture-of-Experts for Style-Controllable End-to-End Autonomous Driving

D$^3$-MoE:面向风格可控的端到端自动驾驶的双解耦扩散混合专家模型

Renju Feng, Rukang Wang, Ning Xi, Jianguo Yu, Liping Lu, Pan Zhou, Duanfeng Chu

发表机构 * Intelligent Transportation Systems Research Center, Wuhan University of Technology(武汉理工大学智能交通系统研究中心) School of Mechanical and Electronic Engineering, Wuhan University of Technology(武汉理工大学机械电子工程学院) School of Computer Science and Artificial Intelligence, Wuhan University of Technology(武汉理工大学计算机科学与人工智能学院) Hubei Key Laboratory of Distributed System Security, School of Cyber Science and Engineering, Huazhong University of Science and Technology(湖北省分布式系统安全重点实验室,华中科技大学网络空间安全学院)

AI总结 提出D$^3$-MoE框架,通过行为轴(扩散生成与选择解耦)和物理轴(纵向与横向专家解耦)的双重解耦,实现风格可控的端到端自动驾驶,在NAVSIM基准上达到SOTA规划性能。

Comments 8 pages, 6 figures

详情
AI中文摘要

传统的端到端自动驾驶框架在训练于高方差的人类演示时经常遭受“风格平均化”困境,产生同质化、风格不可控甚至运动学不安全的策略。为了克服这一限制,我们提出了D$^3$-MoE(双解耦扩散混合专家模型),该模型沿两个互补轴解耦轨迹建模。在行为轴上,生成与选择解耦:一个风格条件扩散过程在单个场景中并行合成多风格候选轨迹,允许下游模块根据用户偏好或评估分数选择最优轨迹。在物理轴上,解耦的纵向和横向路由器在推理时激活各自的专家,这些专家使用来自正交地面真值运动学的自监督目标进行训练,无需人工标签。这些激活的专家采用扩散变换器(DiT)架构,并配备风格条件自适应层归一化(AdaLN)和非对称横向融合交叉注意力,独立预测其对应的物理状态,然后重新组装成统一的、运动学一致的轨迹。在具有挑战性的NAVSIM基准上的广泛评估表明,D$^3$-MoE实现了最先进的规划性能,默认达到88.2 PDMS和84.3 EPDMS。此外,我们的“三选最佳”集成策略有效拓宽了多模态解空间,将性能提升至91.3 PDMS和87.5 EPDMS。定量和定性分析共同证实了该框架在规划质量和风格可控性方面的优势。

英文摘要

Traditional end-to-end autonomous driving frameworks frequently suffer from the "style-averaging" dilemma when trained on high-variance human demonstrations, yielding homogenized, style-uncontrollable, and even kinematically unsafe policies. To overcome this limitation, we present D$^3$-MoE (Dual Disentangled Diffusion Mixture-of-Experts), which disentangles trajectory modeling along two complementary axes. On the behavioral axis, generation is decoupled from selection: a style-conditioned diffusion process synthesizes multi-style candidate trajectories in parallel within a single scene, allowing a downstream module to select the optimal trajectory based on user preference or an evaluation score. On the physical axis, decoupled longitudinal and lateral routers activate their respective experts during inference time, trained without manual labels using self-supervised targets from orthogonal ground-truth kinematics. These activated experts, architected as Diffusion Transformers (DiT) and equipped with style-conditioned AdaLN and asymmetric lateral-fusion cross-attention, independently predict their corresponding physical state before being reassembled into a unified, kinematically coherent trajectory. Extensive evaluations on the challenging NAVSIM benchmark demonstrate that D$^3$-MoE achieves state-of-the-art planning performance, reaching 88.2 PDMS and 84.3 EPDMS by default. Moreover, our Best-of-Three ensemble strategy effectively broadens the multi-modal solution space, raising performance to 91.3 PDMS and 87.5 EPDMS. Both quantitative and qualitative analyses jointly confirm the framework's advantages in planning quality and style controllability.

2606.04853 2026-06-04 cs.RO 版本更新

Teaching Robots to Say 'I Don't Know' : SENTINEL for Uncertainty-Aware SLAM

教机器人说‘我不知道’:用于不确定性感知SLAM的SENTINEL

Abhishek S, Badrikanath Praharaj, Sreeram MV

发表机构 * University of California, Berkeley(加州大学伯克利分校) Stanford University(斯坦福大学)

AI总结 提出SENTINEL框架,通过几何扫描统计和跨模态深度一致性为低成本2D LiDAR提供无训练、无标签的可靠性评分,拒绝损坏扫描并回退到轮式里程计,防止SLAM无声损坏。

Comments 6 pages, 10 figures, 3 tables, This paper was accepted at Uncertainty in Open-World Robotics Workshop in conjunction with Internation conference of robotics and automation (ICRA 2026)

详情
AI中文摘要

低成本2D LiDAR缺乏高端传感器用于诊断测量故障的强度通道,但它们广泛用于教育和预算机器人平台。我们提出SENTINEL,一个无需训练、无需标签的可靠性估计框架,为仅测距的LiDAR提供有效的诊断信号。SENTINEL结合基于几何的扫描统计与LiDAR和RGB-D相机之间的跨模态深度一致性,计算每个扫描的可靠性分数(0到1)。当分数低于阈值时,损坏的扫描被拒绝,机器人回退到校准的轮式里程计,防止无声的SLAM损坏。我们在配备RPLidar A2M12和Intel RealSense D435i的GEFIER R1四轮差速转向机器人上评估SENTINEL,在包含中央障碍物上受控透明和反射故障元素的185 cm × 245 cm场地中。跨五种表面条件(包括玻璃、镜子、光面纸以及混合镜子和光面纸条件)的空间可靠性图显示了干净情况和故障情况之间的清晰分离,允许受影响区域被识别为拒绝或噪声。由于这些故障模式在仿真中不存在,验证完全在真实硬件上进行。

英文摘要

Low-cost 2D LiDARs lack the intensity channel that higher-end sensors use to diagnose measurement failures, yet they are widely used on educational and budget robotics platforms. We present SENTINEL, a training - free, label - free reliability estimation framework that gives range - only LiDAR an effective diagnostic signal. SENTINEL combines geometry-based scan statistics with cross - modal depth consistency between LiDAR and an RGB - D camera to compute a per - scan reliability score between 0 and 1. When the score falls below a threshold, corrupted scans are rejected and the robot falls back to calibrated wheel odometry, preventing silent SLAM corruption. We evaluate SENTINEL on a GEFIER R1 four - wheel skid-steer robot equipped with an RPLidar A2M12 and an Intel RealSense D435i in a 185 cm by 245 cm arena containing controlled transparent and reflective failure elements on a central obstacle. Spatial reliability maps across five surface conditions, including glass, mirror, shiny paper, and a mixed mirror and shiny-paper condition, show clear separation between clean and failure cases, allowing affected regions to be identified as reject or noise. Because these failure modes are absent in simulation, validation is performed entirely on real hardware.

2606.04829 2026-06-04 cs.RO 版本更新

M3imic: Learning a Versatile Whole-Body Controller for Multimodal Motion Mimicking

M3imic: 学习用于多模态运动模仿的通用全身控制器

Zuxing Lu, Ziang Zheng, Yao Lyu, Jingyu Liu, Feihong Zhang, Song Lu, Xin Yuan, Changyin Sun, Xingxing Zuo, Shengbo Eben Li

发表机构 * School of Automation, Southeast University(东南大学自动化学院) School of Vehicle and Mobility, Tsinghua University(清华大学车辆与移动性学院) Department of Robotics, Mohamed Bin Zayed University of Artificial Intelligence(马尔代夫比兹亚德大学人工智能学院机器人系)

AI总结 提出M3imic框架,通过模态特定编码器将异构运动参考模态(机器人关节角度、人体姿态轨迹、末端执行器位姿)映射到共享潜在空间,并利用大规模强化学习训练单一策略,实现无需模态特定重训练的sim-to-real迁移。

详情
AI中文摘要

构建通用全身控制器对于使人形机器人在广泛的下游任务(包括 locomotion 和 loco-manipulation)中具备多样化的运动能力至关重要。不同任务依赖于不同的运动参考模态:locomotion 主要依赖于协调的机器人关节轨迹,而 manipulation 则需要精确的末端执行器轨迹跟踪。现有方法常常忽视密集的机器人关节角度与稀疏的末端执行器位姿之间的表示不匹配问题。为解决这一问题,我们提出了 Multi-Modal Mimic (M3imic),一个通用的多模态全身控制框架,它使用模态特定编码器将异构运动参考模态(包括机器人关节角度、人体姿态轨迹和末端执行器位姿)映射到共享潜在空间,从而统一这些模态。利用模拟器中的大规模强化学习,我们训练了一个单一策略,该策略能够在无需模态特定重训练的情况下实现跨多种运动参考模态的 sim-to-real 迁移。在 Unitree G1 机器人上进行了广泛的仿真和真实世界实验以评估所提出的框架。在仿真中,该策略在未见过的测试数据集上达到了 98.42% 的峰值成功率,展示了其卓越的泛化能力。代码可在 https://github.com/Renforce-Dynamics/MultiModalWBC 获取。

英文摘要

Building a general-purpose whole-body controller is essential for enabling diverse motion capabilities in humanoid robots across a wide range of downstream tasks, including locomotion and loco-manipulation. Different tasks rely on distinct motion reference modalities: locomotion primarily depends on coordinated robot joint trajectories, whereas manipulation requires precise end-effector trajectory tracking. Existing methods often overlook the representational mismatch between dense robot joint angles and sparse end-effector poses. To address this, we propose Multi-Modal Mimic (M3imic), a versatile multi-modal whole-body control framework that unifies heterogeneous motion reference modalities, including robot joint angles, human pose trajectories, and end-effector poses, using modality-specific encoders to map them into a shared latent space. Leveraging large-scale reinforcement learning in the simulator, we train a single policy that achieves sim-to-real transfer across multiple motion reference modalities without modality-specific retraining. Extensive simulation and real-world experiments on the Unitree G1 robot are conducted to evaluate the proposed framework. In simulation, the policy achieves a peak success rate of 98.42\% on an unseen test dataset, demonstrating its exceptional generalization capability. The code is available at https://github.com/Renforce-Dynamics/MultiModalWBC

2606.04825 2026-06-04 cs.RO 版本更新

HapTile: A Haptic-Informed Vision-Tactile-Language-Action Dataset for Contact-Rich Imitation Learning

HapTile: 用于接触丰富模仿学习的触觉感知视觉-触觉-语言-动作数据集

Amirhosein Alian, Yongqiang Zhao, Shiyi Gu, Xuyang Zhang, Zhuo Chen, Christopher E. Mower, Haitham Bou-Ammar, Shan Luo

发表机构 * King’s College London, UK(伦敦国王学院) Huawei, Noah’s Ark Lab, UK(华为、诺亚实验室) University College London, UK(伦敦大学学院)

AI总结 提出HapTile数据集,通过集成指尖触觉反馈和操作员触觉感知,为接触丰富的机器人操作任务提供视觉-触觉-语言-动作联合数据,并验证其在策略学习中的有效性。

详情
AI中文摘要

尽管触觉感知对于可靠操作至关重要,但大多数现有的视觉-语言-动作(VLA)数据集仍然仅基于视觉,而那些确实包含触觉信息的数据集通常缺乏任务多样性、语言条件和动作轨迹的联合组合。此外,现有的遥操作流程很少为操作员提供触觉反馈,尽管触觉反馈在演示质量和操作稳定性中具有公认的作用。在这项工作中,我们提出了HapTile,一个接触基础的视觉触觉操作数据集,它通过嵌入两个层次的物理交互感知超越了仅视觉轨迹数据集:机器人末端执行器上的指尖触觉反馈,以及遥操作侧的触觉感知演示。数据收集平台将触觉反馈直接集成到遥操作控制器中,使操作员能够实时感知接触交互。它基于一个标准且可复现的机器人系统构建,该系统配备了定制设计的指尖触觉传感器。该数据集涵盖了日常操作任务,包括拾取与放置、折叠、按压、堆叠以及其他常规活动,这些任务涉及广泛的接触丰富技能。每个任务都配有语言指令,用于根据操作目标对策略进行条件化,同时还有同步的视觉触觉观察和动作轨迹。此外,我们使用两个基线模型对接触丰富的策略学习进行了基准研究,以评估所提出的接触基础数据集的有效性。数据集和更多详细信息可在我们的网站上获取:haptile-dataset.github.io。

英文摘要

Despite the importance of tactile sensing for reliable manipulation, most existing Vision-Language-Action (VLA) datasets remain vision-only, and those that do incorporate tactile information typically lack the joint combination of task diversity, language conditioning, and action trajectories. Furthermore, existing teleoperation pipelines rarely provide haptic feedback to the operator, despite its established role in demonstration quality and manipulation stability. In this work, we present HapTile, a contact-grounded visuotactile manipulation dataset that advances beyond vision-only trajectory datasets by embedding physical interaction sensing at two levels: fingertip tactile feedback at the robot end-effector, and haptic-informed demonstrations at the teleoperator side. The data collection platform integrates haptic feedback directly into the teleoperation controller, enabling the operator to perceive contact interactions in real time. It is built around a standard and reproducible robotic system equipped with custom-designed fingertip tactile sensors. The dataset comprises everyday manipulation tasks spanning a broad range of contact-rich skills, including pick-and-place, folding, pressing, stacking, and other routine activities. Each task is paired with language instructions that condition the policy on the manipulation objective, together with synchronized visuotactile observations and action trajectories. In addition, we provide a benchmarking study on contact-rich policy learning using two baseline models to evaluate the effectiveness of the proposed contact-grounded dataset. The dataset and additional details are available on our website: haptile-dataset.github.io.

2606.04818 2026-06-04 cs.RO 版本更新

Real-World Deployment of a 5G-Connected Edge-Controlled Aerial Robot in Industrial Subterranean Mines

工业地下矿井中5G连接边缘控制空中机器人的实际部署

Achilleas Santi Seisa, Emanuele Pagliari, Gerasimos Damigos, Elias Small, George Nikolakopoulos

发表机构 * Robotics and AI Group, Department of Computer, Electrical and Space Engineering, Luleå University of Technology(机器人与人工智能小组,计算机、电气与空间工程系,吕勒奥技术大学)

AI总结 本文首次在实际工业地下矿井中部署了由边缘卸载控制器控制的5G连接自主飞行空中机器人,采用模型预测控制器(MPC)生成平滑无碰撞路径,展示了边缘控制机器人系统在时间关键、安全高效未来部署中的潜力。

Comments 6 pages, 8 figures, MED 2026

详情
AI中文摘要

本文介绍了首次由边缘卸载控制器控制的5G连接空中机器人的实际自主飞行,旨在弥合受控设置与实际设置之间的差距。该机器人在一个活跃的工业地下矿井中运行,而高层控制器部署在附近的基于Kubernetes的边缘集群中。机器人与边缘之间的通信通过5G新无线电(NR)独立组网(SA)网络实现。所选的控制器是模型预测控制器(MPC),它生成控制动作,使机器人能够在采矿环境中无缝导航。人类操作员为空中机器人选择航点,MPC生成平滑、无碰撞的路径以自主执行。所提出的基于5G边缘的闭环系统在实际工业环境中进行了评估,展示了边缘控制机器人系统在时间关键、安全高效的未来部署中的潜力。

英文摘要

This article presents the first real-world autonomous flight of a 5G-connected aerial robot controlled by an edge-offloaded controller, and aims to bridge the gap between controlled and factual setups. The robot operates within an active industrial subterranean mine, while the high-level controller is deployed in a nearby Kubernetes-based edge cluster. Communication between the robot and the edge is enabled via a 5G New Radio (NR) Standalone (SA) network. The chosen controller is a Model Predictive Controller (MPC), which generates control actions to allow the robot to navigate seamlessly through the mining environment. A human operator selects waypoints for the aerial robot, and the MPC generates smooth, collision-free paths for autonomous executions. The proposed 5G edge-based closed-loop system is evaluated in a real industrial setting and demonstrates the potential of edge-controlled robotic systems toward time-critical, safe and efficient future deployments.

2606.04788 2026-06-04 cs.CV cs.RO 版本更新

Z-FLoc: Zero-Shot Floorplan Localization via Geometric Primitives

Z-FLoc: 基于几何基元的零样本楼层平面定位

Ayumi Umemura, Toshinori Kuwahara, Marc Pollefeys, Daniel Barath

发表机构 * ETH Zurich(苏黎世联邦理工学院) Tohoku University(东北大学)

AI总结 提出一种零样本楼层平面定位方法,通过从单目3D重建的鸟瞰图中提取直线和圆等几何基元,并与楼层平面进行鲁棒匹配,无需重新训练即可泛化到新环境。

详情
AI中文摘要

视觉定位——在预先存在的地图中估计相机姿态——是计算机视觉中的一个基本问题。楼层平面是一种有吸引力的地图表示:它们对于大多数建筑来说易于获取、紧凑,并且固有地不受视觉外观变化的影响。然而,弥合相机观测与楼层平面几何之间的严重领域差距仍然具有挑战性。现有方法通过数据驱动学习来解决这一差距,但它们需要大规模训练数据和特定环境的重新训练,限制了实际部署。我们提出了一种零样本楼层平面定位方法,无需任何重新训练即可泛化到新环境。我们的关键见解是,主导几何基元——直线和圆——在人造环境中无处不在,并提供外观不变的结构约束。我们从单目3D重建的鸟瞰图投影中提取这些基元,并通过鲁棒估计框架内的专用最小求解器将它们与楼层平面进行匹配。在模拟和真实数据集上的实验表明,我们的方法在未见过的环境上优于最先进的基于学习的方法,同时在所有实验中使用单一固定的超参数集。源代码将公开提供。

英文摘要

Visual localization -- estimating a camera pose within a pre-existing map -- is a fundamental problem in computer vision. Floorplans are an attractive map representation: they are readily available for most buildings, compact, and inherently invariant to visual appearance changes. However, bridging the severe domain gap between camera observations and floorplan geometry remains challenging. Existing methods address this gap through data-driven learning, yet they require large-scale training data and environment-specific retraining, limiting their practical deployment. We propose a zero-shot floorplan localization method that generalizes to novel environments without any retraining. Our key insight is that dominant geometric primitives -- lines and circles -- are ubiquitous in human-made environments and provide appearance-invariant structural constraints. We extract these primitives from a bird's-eye-view (BEV) projection of monocular 3D reconstructions and match them to the floorplan via dedicated minimal solvers within a robust estimation framework. Experiments on both simulated and real-world datasets show that our approach outperforms state-of-the-art learning-based methods on unseen environments, while using a single fixed set of hyperparameters across all experiments. The source code will be made publicly available.

2606.04776 2026-06-04 cs.RO 版本更新

SoftPINCH: EMG-Driven Soft Exoskeleton Assistance for Finger Flexion and Grasping

SoftPINCH: 用于手指屈曲和抓取的EMG驱动软体外骨骼辅助

Nicklas Nikolaj Grønvall, Magnus Malthe Sigsgaard Nielsen, Xiaofeng Xiong, Saravana Prashanth Murali Babu

发表机构 * SDU Soft Robotics(SDU软机器人实验室) The Maersk Mc-Kinney Moller Institute(马士基麦金尼摩勒研究所) University of Southern Denmark(丹麦南部大学) Odense, Denmark(丹麦奥丁斯)

AI总结 提出一种结合肌腱驱动软体外骨骼、指尖磁接触传感和神经EMG解码的EMG驱动软体可穿戴外骨骼系统SoftPINCH,用于拇指-食指屈曲和捏取辅助,实验表明CNN+LSTM模型在解码中达到99.4%准确率,且主动辅助可显著降低肌肉用力。

Comments Submitted to 18th International Conference on the Simulation of Adaptive Behavior (SAB 2026)

详情
AI中文摘要

表面肌电图(sEMG)提供了一种非侵入式接口,用于检测手部运动意图并控制可穿戴辅助设备。然而,由于EMG信号受噪声、运动伪影、电极放置、肌肉疲劳和受试者间差异的影响,可靠的EMG驱动手部辅助仍然具有挑战性。同时,许多手部外骨骼在机械上仍具有限制性或笨重,限制了舒适性和自然手部运动。本工作提出了SoftPINCH,一种用于拇指-食指屈曲和捏取辅助的EMG驱动软体可穿戴外骨骼。该系统结合了肌腱驱动的软体外骨骼、指尖磁接触传感和用于基于意图辅助的神经EMG解码。在食指和拇指运动期间记录前臂肌肉的表面EMG,并评估了三种独立于受试者的解码架构:LSTM、CNN+LSTM和带注意力的CNN+LSTM。CNN+LSTM和CNN+LSTM-attention模型均达到99.4%的LOSO测试准确率,优于达到97.8%的独立LSTM。然而,注意力机制相比CNN+LSTM并未提供显著改进,表明基于CNN的特征提取足以实现鲁棒的EMG表示。因此,由于高准确率和较低的架构复杂度,选择了CNN+LSTM模型进行实时部署。功能评估表明,主动外骨骼辅助在孤立手指屈曲和物体抓取期间减少了肌肉用力。在负重抓取期间,辅助在所有测试负载下均减少了肌肉用力,在最高负载下减少了92.6%。这些结果证明了SoftPINCH通过实时EMG驱动软体机器人控制实现直观、低用力捏取辅助的潜力。

英文摘要

Surface electromyography (sEMG) provides a non-invasive interface for detecting hand-movement intention and controlling wearable assistive devices. However, reliable EMG-driven hand assistance remains challenging because EMG signals are affected by noise, motion artifacts, electrode placement, muscle fatigue, and inter-subject variability. At the same time, many hand exoskeletons remain mechanically restrictive or bulky, limiting comfort and natural hand motion. This work presents SoftPINCH, an EMG-driven soft wearable exoskeleton for thumb-index finger flexion and pinch grasp assistance. The system combines a tendon-driven soft exoskeleton, fingertip magnetic contact sensing, and neural EMG decoding for intention-based assistance. Surface EMG was recorded from forearm muscles during index and thumb movements, and three subject-independent decoding architectures were evaluated: LSTM, CNN+LSTM, and CNN+LSTM with attention. The CNN+LSTM and CNN+LSTM-attention models both achieved 99.4% LOSO test accuracy, outperforming the standalone LSTM, which reached 97.8%. However, the attention mechanism did not provide a significant improvement over CNN+LSTM, indicating that CNN-based feature extraction was sufficient for robust EMG representation. The CNN+LSTM model was therefore selected for real-time deployment due to its high accuracy and lower architectural complexity. Functional evaluation showed that active exoskeleton assistance reduced muscular effort during isolated finger flexion and object grasping. During weighted grasping, assistance reduced muscular effort across all tested loads, with a 92.6% reduction at the highest load. These results demonstrate the potential of SoftPINCH for intuitive, low-effort pinch assistance using real-time EMG-driven soft robotic control.

2606.04749 2026-06-04 cs.RO cs.LG 版本更新

COP-Q: Safety-First Reinforcement Learning for Robot Control via Cholesky-Ordered Projection

COP-Q:基于Cholesky有序投影的安全优先强化学习机器人控制

Guopeng Li, Moritz A. Zanger, Matthijs T. J. Spaan, Julian F. P. Kooij

发表机构 * Department of Cognitive Robotics, Delft University of Technology(代尔夫特理工大学认知机器人系) Department of Intelligent Systems, Delft University of Technology(代尔夫特理工大学智能系统系) School of Transportation, Southeast University(东南大学交通学院)

AI总结 提出COP-Q方法,通过Cholesky分解编码目标优先级并利用联合Q值空间的广义置信界,在安全优先的离线策略强化学习中平衡安全与奖励目标,减少过度保守性,提升样本效率。

Comments 7 pages, 6 figures, 2 tables

详情
AI中文摘要

安全机器人控制需要在满足安全约束的同时最大化回报。在离线策略安全强化学习中,奖励和安全Q值通常由独立的评论家集成学习,每个目标的不确定性独立处理。这种按目标处理的方式忽略了目标间的相关性,可能导致过于保守的价值估计,从而降低样本效率。为解决此问题,我们提出Cholesky有序投影Q学习(COP-Q),一种安全优先的方法,将目标间协方差纳入向量值Q值估计中。COP-Q在联合Q值空间中构建广义置信界,并使用Cholesky分解以顺序形式编码目标优先级。这在对安全目标保持保守性的同时,自适应地减少对奖励目标的过度保守性。得到的估计同时用于时序差分目标计算和演员优化。COP-Q引入最小的计算开销,并且与大多数现有深度Q学习框架兼容。在Brax中的机器人运动和安全健身房中的安全导航实验(涵盖硬安全和软安全设置)表明,与代表性基线相比,COP-Q实现了强大的安全性能以及有竞争力或更高的样本效率。

英文摘要

Safe robot control requires maximizing return while satisfying safety constraints. In off-policy safe reinforcement learning, reward and safety Q-values are commonly learned by separate critic ensembles, with uncertainty handled independently for each objective. This objective-wise treatment neglects inter-objective correlation and can lead to overly conservative value estimates, thereby reducing sample efficiency. To address this issue, we propose Cholesky-Ordered Projection Q-learning (COP-Q), a safety-first method that incorporates inter-objective covariance into vector-valued Q-value estimation. COP-Q constructs a generalized confidence bound in the joint Q-value space and uses Cholesky factorization to encode objective priority in a sequential form. This preserves conservatism on safety while adaptively reducing excessive conservatism on the reward objective. The resulting estimate is used in both temporal-difference target computation and actor optimization. COP-Q incurs minimal computational overhead and is readily compatible with most existing deep Q-learning frameworks. Experiments on robot locomotion in Brax and safe navigation in Safety-Gymnasium, covering both hard- and soft-safety settings, demonstrate that COP-Q achieves strong safety performance together with competitive or improved sample efficiency relative to representative baselines.

2606.04618 2026-06-04 cs.RO 版本更新

BPDA-GMM: Bayesian Probabilistic Data Association via Gaussian Mixture Models for Semantic SLAM

BPDA-GMM:基于高斯混合模型的贝叶斯概率数据关联用于语义SLAM

Thanh Nguyen Canh, Haolan Zhang, Xiem HoangVan, Antonio Sgorbissa, Nak Young Chong

发表机构 * School of Information Science, Japan Advanced Institute of Science and Technology(信息科学学系,日本科学技术大学) University of Engineering and Technology, Vietnam National University(工程技术大学,越南国家大学)

AI总结 提出BPDA-GMM在线贝叶斯概率数据关联框架,通过狄利克雷过程先验和中文餐馆过程模型实现语义SLAM中增长式地标关联,并利用α散度退火处理模糊关联,提升轨迹精度和语义建图鲁棒性。

详情
AI中文摘要

概率数据关联(PDA)在感知混淆场景中改进了语义SLAM,但现有方法通常假设固定的地标集、随着地图增长重新计算关联权重,或依赖手动调整的零假设权重。为解决这些限制,我们提出了 extbf{BPDA-GMM},一个用于具有增长式对象级地图的语义SLAM的在线贝叶斯PDA框架。BPDA-GMM使用狄利克雷过程先验来诱导中文餐馆过程(CRP)关联模型,其中累积证据倾向于已有地标,而浓度参数将概率质量分配给新地标。对于每个语义检测,通过联合语义-几何门选择合理候选,计算CRP加权的关联概率,并以闭合形式将对象地标更新为语义高斯。所得地标集形成高斯混合模型,其主导分量作为最大混合语义因子传递给后端。当关联权重不确定时,一个由模糊触发的$α$-散度退火步骤提高了区分度。最后,解耦的后端将语义因子的位姿雅可比置零,使得噪声检测能够细化地标而不直接扰动轨迹。在仿真和真实室内数据集上的实验表明,与最先进的基线相比,轨迹精度、语义建图质量以及对感知混淆和分类器错误的鲁棒性均有所提升。代码和视频公开于https://github.com/thanhnguyencanh/BPDA-SLAM。

英文摘要

Probabilistic data association (PDA) improves semantic SLAM in perceptually aliased scenes, but existing methods often assume a fixed landmark set, recompute association weights as the map grows, or rely on hand-tuned null-hypothesis weights. To address these limitations, we propose \textbf{BPDA-GMM}, an online Bayesian PDA framework for semantic SLAM with a growing object-level map. BPDA-GMM uses a Dirichlet-process prior to induce a Chinese Restaurant Process (CRP) association model, where accumulated evidence favors existing landmarks, and the concentration parameter assigns probability mass to new landmarks. For each semantic detection, plausible candidates are selected by a joint semantic-geometric gate, CRP-weighted association probabilities are computed, and object landmarks are updated as semantic Gaussians in closed form. The resulting landmark set forms a Gaussian mixture model, and its dominant component is passed to the back-end as a max-mixture semantic factor. When association weights are inconclusive, an ambiguity-triggered $α$-divergence tempering step improves discrimination. Finally, a decoupled back-end zeroes the pose Jacobian of semantic factors, allowing noisy detections to refine landmarks without directly perturbing the trajectory. Experiments in simulation and on a real indoor dataset demonstrate improved trajectory accuracy, semantic mapping quality, and robustness to perceptual aliasing and classifier errors over state-of-the-art baselines. Code and video are publicly available at https://github.com/thanhnguyencanh/BPDA-SLAM.

2606.04569 2026-06-04 cs.RO 版本更新

MineXplore: An Open-Source Reinforcement Learning Exploration Benchmark for GNSS-Denied Underground Environment

MineXplore: 面向GNSS拒止地下环境的开源强化学习探索基准

Abhishek S, Badrikanath Praharaj, Sreeram MV

发表机构 * University of California, Berkeley(加州大学伯克利分校) Stanford University(斯坦福大学)

AI总结 提出基于真实矿井数据的开源MuJoCo导航基准MineXplore,通过六阶段管道重建隧道网络,验证了在GNSS拒止、光照退化等极端条件下策略学习的稳定性与可复现性。

Comments 7 pages,11 figures, Submitted to the workshop Xplore:Cross-Disciplinary aspects of Exploration in Robotics, Reinforcement Learning and Search Held at International Conference on Robotics and Automation (ICRA)

详情
AI中文摘要

地下矿井为自主机器人导航带来了极端条件:GPS被拒止,光照退化,隧道拓扑具有丰富的环路且非凸。目前开源生态中尚不存在基于真实生产矿井几何结构且兼容GPU加速学习管道的仿真基准。我们提出了MineXplore,一个基于Leung等人2017年智利地下铜矿数据集的开源MuJoCo导航基准。该环境通过六阶段轮廓到MJCF管道重建了一个104,423平方米的隧道网络,包含八边形墙壁横截面、LiDAR源锯齿状墙壁几何、三个地形摩擦区域、全局5度倾斜和周期性点光源。几何保真度通过交并比(IoU)为0.9538(与源测量图对比)得到验证,表面纹理相似度在六个结构维度上达到79.4%。通过RLlib在五个独立随机种子上训练的单智能体PPO基线实现了88.89%的最佳滚动覆盖率(5个种子中有3个达到90%覆盖目标),证实MineXplore在真实地下感知和拓扑条件下支持稳定且可复现的策略学习。

英文摘要

Underground mines present extreme conditions for autonomous robot navigation: GPS is denied, lighting is degraded, and tunnel topology is loop-rich and non-convex. Simulation benchmarks grounded in real production-mine geometry and compatible with GPU-accelerated learning pipelines do not yet exist in the open-source ecosystem. We present MineXplore, an open-source MuJoCo-based navigation benchmark derived from the Leung et al. 2017 Chilean underground copper mine dataset. The environment reconstructs a 104,423 sq.m tunnel network through an six-stage contour-to-MJCF pipeline incorporating octagonal wall cross-sections, LiDAR-sourced jagged wall geometry, three terrain friction zones, a global 5 degree incline, and periodic spot lighting. Geometric fidelity is validated at an Intersection over Union (IoU) of 0.9538 against the source survey map, and surface texture similarity scores 79.4% across six structural dimensions. A single-agent PPO baseline trained via RLlib across five independent random seeds achieves a best rolling coverage of 88.89% (3 of 5 seeds reaching the 90% coverage target), confirming that MineXplore supports stable and reproducible policy learning under realistic underground sensing and topology.

2606.04534 2026-06-04 cs.RO 版本更新

MAD: Mapping-Aware World Models for Agile Quadrotor Flight

MAD: 面向敏捷四旋翼飞行的地图感知世界模型

Xinhong Zhang, Runqing Wang, Yunfan Ren, Ding Yu, Boyu Zhou, Jian Sun, Fang Deng, Jie Chen, Gang Wang

发表机构 * State Key Lab of Autonomous Intelligent Unmanned Systems, Beijing Institute of Technology, Beijing 100081, China(自主智能无人系统国家重点实验室,北京理工大学,北京100081,中国) Zhongguancun Academy, Beijing 100094, China(中关村学院,北京100094,中国) School of Computer Science and Technology, Tongji University(同济大学计算机科学与技术学院) Department of Mechanical and Energy Engineering, Southern University of Science and Technology(南方科技大学机械与能源工程系) Harbin Institute of Technology(哈尔滨工业大学)

AI总结 提出地图感知世界模型MAD,通过重构机器人中心占用和可见性网格地图学习几何感知的潜在动力学,在视觉导航和竞速任务中实现更高成功率、更快飞行速度和更好跨任务迁移。

Comments 12 pages, 14 figures

详情
AI中文摘要

在杂乱场景中的敏捷四旋翼飞行需要的不仅仅是从深度图像到控制命令的反应式映射:飞行器必须记住已观测的区域,推断附近的占用空间,并在部分可见性和严格延迟下行动。在本文中,我们提出了地图感知梦想家(MAD),一种用于基于视觉的四旋翼飞行的几何感知世界模型。MAD不是将原始图像重建作为主要的自监督目标,而是学习循环潜在动力学,该动力学重构机器人中心的占用和可见性网格地图以及本体感受状态。这种设计迫使潜在状态以与碰撞避免直接相关的方式编码局部几何、可见性历史和自运动。MAD使用GPU并行地图构建模块在DiffAero中训练,该模块为占用和可见性提供高通量监督。学习到的表示用于三种策略学习模式:基于想象的MAD-Dreamer以及基于PPO和SHAC的特征提取器变体。在视觉导航和竞速任务中,基于MAD的智能体比相应的纯视觉基线实现了更高的成功率、更快的飞行速度和更好的跨任务迁移。该模型还从深度观测中产生可解释的地图预测和准确的自运动估计。我们进一步在配备Intel RealSense D435i的物理四旋翼上部署学习到的策略,并在有限感知下演示了安全的室内和室外飞行,在仿真中达到9.66 m/s,在真实世界森林实验中达到5.05 m/s。这些结果表明,地图感知世界模型在模块化空中导航和端到端学习之间提供了一个实用的中间地带。

英文摘要

Agile quadrotor flight in cluttered scenes requires more than a reactive mapping from a depth image to a control command: the vehicle must remember which regions have been observed, infer nearby occupied space, and act under partial visibility and tight latency. In this paper, we present Mapping-Aware Dreamer (MAD), a geometry-aware world model for vision-based quadrotor flight. Instead of using raw-image reconstruction as the main self-supervised objective, MAD learns recurrent latent dynamics that reconstruct robocentric occupancy and visibility grid maps together with proprioceptive states. This design forces the latent state to encode local geometry, visibility history, and ego-motion in a form that is directly relevant to collision avoidance. MAD is trained in DiffAero using a GPU-parallel map-construction module that provides high-throughput supervision for occupancy and visibility. The learned representation is used in three policy-learning modes: imagination-based MAD-Dreamer and feature-extractor variants based on PPO and SHAC. Across visual navigation and racing tasks, MAD-based agents achieve higher success rates, faster flight, and better cross-task transfer than corresponding vision-only baselines. The model also produces interpretable map predictions and accurate ego-motion estimates from depth observations. We further deploy the learned policy on a physical quadrotor with an Intel RealSense D435i and demonstrate safe indoor and outdoor flight under limited sensing, reaching 9.66 m/s in simulation and 5.05 m/s in real-world forest experiments. These results show that mapping-aware world models provide a practical middle ground between modular aerial navigation and end-to-end learning.

2606.04518 2026-06-04 cs.RO 版本更新

Cooperative Circumnavigation for Multiple Unmanned Surface Vehicles Without External Localization

无外部定位的多无人水面艇协同环绕航行

Xueming Liu, Lin Li, Xiang Zhou, Tianjiang Hu, Qingrui Zhang

发表机构 * School of Aeronautics and Astronautics, Sun Yat-sen University (Shenzhen Campus)(航空工程学院,中山大学(深圳校区))

AI总结 针对无外部定位的多无人水面艇,提出基于异构感知和耦合振荡器的协同环绕框架,利用最大相关熵卡尔曼滤波和伪线性卡尔曼滤波估计相对位置,实现指定半径的均匀圆形编队。

Comments 17 pages, 15 figures

详情
AI中文摘要

本文提出了一种针对多无人水面艇(USV)在无外部定位条件下运行的协同目标环绕框架。目标是仅利用有限的本船传感,围绕目标保持指定半径的均匀圆形编队。该框架采用异构感知策略,区分与目标之间以及USV之间的非对称传感关系。具体而言,USV通过主动感知和艇间通信获取相对距离和位移测量,而通过被动传感器获取对非合作目标的方位测量。为了估计相对位置——包括USV之间以及每个USV与目标之间的相对位置——我们分别采用了最大相关熵卡尔曼滤波和伪线性卡尔曼滤波。设计了一个基于耦合振荡器的编队控制器,以确保系统可观测性同时实现环绕航行。理论分析表明,该控制器确保USV之间的相对运动以及每个USV与目标之间的相对运动满足持续激励条件,从而保证基于卡尔曼滤波器的可观测性。通过数值仿真验证了所提方法的有效性。

英文摘要

This paper proposes a cooperative target circumnavigation framework for multiple unmanned surface vehicles (USVs) operating without external localization. The objective is to maintain a uniform circular formation of a specified radius around a target using only limited onboard sensing. The framework adopts a heterogeneous perception strategy that distinguishes between the asymmetric sensing relationships with the target and among the USVs. Specifically, the USVs obtain relative range and displacement measurements through active perception and inter-vehicle communication, while bearing measurements to a non-cooperative target are acquired via passive sensors. To estimate relative positions--both among USVs and between each USV and the target--we employ a Maximum Correntropy Kalman Filter and a Pseudo-Linear Kalman Filter, respectively. A coupled oscillator-based formation controller is designed to ensure system observability while achieving circumnavigation. Theoretical analysis demonstrates that the controller ensures the relative motions between the USVs, as well as that between each USV and the target, satisfy the persistent excitation condition, thereby guaranteeing observability of the Kalman-based filters. The effectiveness of the proposed approach is validated through numerical simulations.

2606.04477 2026-06-04 cs.RO 版本更新

TransTac: Visuo-Tactile Modality Transition via Ultraviolet-Encoded Transparent Elastomers

TransTac: 通过紫外编码透明弹性体实现视觉-触觉模态转换

Lingyue Yang, Bin Fang

发表机构 * Beijing University of Posts and Telecommunications(北京邮电大学)

AI总结 提出一种透明紫外编码双目视觉触觉传感器TransTac,结合视觉观察与标记触觉重建,通过先验引导的Delaunay立体匹配算法实现鲁棒稀疏三角化,在零样本触觉图像识别上达到83.3%准确率,并显著增强跨模态对齐。

Comments Accepted at IEEE International Conference on Robotics and Automation (ICRA) 2026. 8 pages, 7 figures

详情
AI中文摘要

基于视觉的触觉传感器(VBTS)能够恢复高分辨率接触几何形状,但通常依赖于不透明的弹性体层,这阻碍了视觉透明性;而RGB-D相机提供全局深度感知,但在近距离时性能显著下降。为解决这一局限,我们提出了TransTac,一种透明的紫外(UV)编码双目VBTS,它将视觉观察和基于标记的触觉重建集成在一个紧凑设备中。该系统采用嵌入UV反射标记的透明弹性体,以及一种先验引导的Delaunay立体匹配算法,用于鲁棒的稀疏三角化。为了可靠地检测密集分布的半透明标记,我们开发了一种轻量级检测器,能够在接触和变形下实现稳定定位。所提出的先验引导的Delaunay匹配相比全局分配基线,将对应鲁棒性提高了约21%,同时保持高重建精度。在语义评估中,TransTac在触觉图像上实现了高达83.3%的零样本识别准确率,超过不透明触觉基线约50个百分点。嵌入分析进一步揭示了与自然图像的跨模态对齐显著增强,类中心相似度从约0.2提升至超过0.77。受控的近距实验量化了RGB-D深度可靠性的下降,并展示了通过视觉-触觉集成实现的扩展几何覆盖。最后,实现了一个紧凑原型,硬件成本约为70美元。

英文摘要

Vision-based tactile sensors (VBTS) recover high-resolution contact geometry but typically rely on opaque elastomer layers that prevent visual transparency, while RGB-D cameras provide global depth perception yet degrade significantly at close range. To address this limitation, we present TransTac, a transparent ultraviolet (UV)-encoded binocular VBTS that integrates visual observation and marker-based tactile reconstruction within a single compact device. The system employs a transparent elastomer embedded with UV-reflective markers and a prior-guided Delaunay stereo matching algorithm for robust sparse triangulation. To reliably detect densely distributed semitransparent markers, we develop a lightweight detector that enables stable localization under contact and deformation. The proposed prior-guided Delaunay matching improves correspondence robustness by approximately 21% compared with global assignment baselines while maintaining high reconstruction accuracy. In semantic evaluation, TransTac achieves up to 83.3% zero-shot recognition accuracy on tactile images, exceeding opaque tactile baselines by approximately 50 percentage points. Embedding analysis further reveals substantially stronger cross-modal alignment with natural images, with class-center similarity increasing from around 0.2 to over 0.77. Controlled near-distance experiments quantify the degradation of RGB-D depth reliability and demonstrate extended geometric coverage enabled by visuo-tactile integration. Finally, a compact prototype is implemented with an approximate hardware cost of $70.

2606.04436 2026-06-04 cs.CV cs.RO 版本更新

3DThinkVLA: Endowing Vision-Language-Action Models with Latent 3D Priors via 3D-Thinking-Guided Co-training

3DThinkVLA:通过3D思维引导的协同训练赋予视觉-语言-动作模型潜在3D先验

Jiaxin Shi, Xidong Zhang, Fucai Zhu, Zhe Li, Siyu Zhu, Weihao Yuan

发表机构 * Shanghai Jiao Tong University(上海交通大学) Harbin Institute of Technology(哈尔滨工业大学) Nanyang Technological University(南洋理工大学) Fudan University(复旦大学) Nanjing University(南京大学) Daimon Robotics(达梦机器人) Great Bay University(大亚大学)

AI总结 提出3D思维引导的协同训练框架,通过解耦3D几何感知与空间推理并在不同特征层次注入,使VLA模型在动作预测中隐式进行3D空间推理,无需3D传感器或外部模型,在多个基准上达到最优性能。

详情
AI中文摘要

我们提出了一种3D思维引导的协同训练框架,使视觉-语言-动作(VLA)模型能够在动作预测过程中隐式地进行3D空间推理。我们的核心见解是,3D几何感知和3D空间推理是两种不同的能力,可以在不同的特征层次上解耦并注入。在训练过程中,三个紧密耦合的组件主要在潜在空间中协同工作:(1)为了获得几何先验,一个潜在3D几何感知模块将中间视觉特征与3D基础模型对齐,在不修改VLM骨干架构的情况下获取低级几何线索。(2)作为补充,一个在线3D推理蒸馏模块通过共享推理锚点令牌缓解提示引发的推理差距。在3D VLM协同训练期间,该锚点作为第一个输出令牌发出,以稳健地编码空间先验。在VLA训练期间,它作为插入在任务指令和动作指令之间的输入令牌,将高级空间思维从显式教师推理提示转移到学生动作提示,无需链式思维文本生成。(3)然后,这些解耦的几何和推理特征通过空间增强的动作集成统一起来,该集成将它们作为分层空间条件共同注入到动作查询令牌中,以防止动作捷径。在部署时,我们的方法仅保留其轻量级适配器以执行隐式3D推理,丢弃用于监督的3D基础模型和教师分支。因此,它纯粹在2D图像上运行,无需3D传感器、外部模型或显式文本生成,同时防止预训练VLM的灾难性遗忘,在LIBERO、LIBERO-PLUS、SimplerEnv和真实世界操作任务上实现了最先进的性能。

英文摘要

We propose a 3D-thinking-guided co-training framework that enables vision-language-action (VLA) models to perform 3D spatial reasoning implicitly during action prediction. Our core insight is that 3D geometry perception and 3D spatial reasoning are distinct capabilities that can be disentangled and injected at different feature hierarchies. During training, three tightly coupled components work in concert primarily within the latent space: (1) To gain geometric priors, a latent 3D geometry perception module aligns intermediate visual features with a 3D foundation model, acquiring low-level geometric cues without architectural modifications to the VLM backbone. (2) Complementing this, an online 3D reasoning distillation module mitigates the prompt-induced reasoning gap via a shared reasoning anchor token. During 3D VLM co-training, this anchor is emitted as the first output token to robustly encode spatial priors. During VLA training, it serves as an input token inserted between the task and action instructions, transferring high-level spatial thinking from explicit teacher reasoning prompts to student action prompts without chain-of-thought text generation. (3) These disentangled geometric and reasoning features are then united by a spatially augmented action integration, which jointly injects them into the action-query tokens as hierarchical spatial conditions to prevent action shortcuts. At deployment, our method retains only its lightweight adapters to perform implicit 3D reasoning, discarding the 3D foundation model and the teacher branch used for supervision. Consequently, it operates purely on 2D images without 3D sensors, external models, or explicit text generation while preventing catastrophic forgetting of the pretrained VLM, achieving state-of-the-art performance on LIBERO, LIBERO-PLUS, SimplerEnv, and real-world manipulation tasks.

2606.04361 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.DS math.OC 版本更新

When Freshness Is Not Enough: Distribution-Aware Age of Information for Networked LQR Control

当新鲜度不足时:面向网络化LQR控制的分布感知信息年龄

Abdullah Y. Etcibasi, C. Emre Koksal, Eylem Ekici

发表机构 * Department of Electrical and Computer Engineering, The Ohio State University(电气与计算机工程系,俄亥俄州立大学)

AI总结 本文研究网络化控制系统中,仅最小化平均信息年龄(AoI)不足以优化LQR跟踪性能,需考虑调度间隔的完整分布(包括高阶矩和指数矩)。

详情
AI中文摘要

信息年龄(AoI)已成为无线更新系统设计的核心指标,尤其是在新鲜测量支持跟踪、估计和控制的场景中。尽管其广泛应用,但将平均AoI或峰值AoI作为闭环性能的替代指标通常基于直觉而非控制理论推导。本文探讨了最小化平均AoI是否对网络化控制系统最优。对于具有延迟间歇更新的标量线性时不变系统,我们证明,在状态无关调度策略下,无限时域LQR跟踪问题可简化为对调度间隔分布的优化。所得目标函数依赖于调度过程的高阶统计矩,在不稳定或相关情况下还依赖于指数矩,而非仅依赖于其均值。因此,具有相同平均AoI的策略可能产生显著不同的跟踪成本。我们进一步将分析扩展到具有指数衰减自相关的扰动,并推导出揭示完整间隔分布作用的等效成本公式。最后,使用NGSIM US-101数据集中的真实车辆轨迹验证理论。实证结果与预测的性能趋势一致,表明仅凭平均AoI不足以进行面向控制的网络设计。

英文摘要

Age of Information (AoI) has become a central metric for the design of wireless update systems, especially in applications where fresh measurements support tracking, estimation, and control. Despite its popularity, the use of mean AoI or peak AoI as a surrogate for closed-loop performance is often motivated by intuition rather than by a control-theoretic derivation. This paper examines whether minimizing the mean AoI is in fact optimal for networked control systems. For scalar linear time-invariant systems with delayed intermittent updates, we show that, under state-independent scheduling policies, the infinite-horizon LQR tracking problem reduces to an optimization over the distribution of inter-scheduling intervals. The resulting objective depends on higher-order statistical moments, and in unstable or correlated regimes on exponential moments, of the inter-scheduling process rather than only on its mean. Consequently, policies with identical mean AoI can induce substantially different tracking costs. We further extend the analysis to disturbances with exponentially decaying autocorrelation and derive equivalent cost formulations that expose the role of the full interval distribution. Finally, we validate the theory using real vehicle trajectories from the NGSIM US-101 dataset. The empirical results match the predicted performance trends, demonstrating that mean AoI alone is insufficient for control-oriented network design.

2606.04355 2026-06-04 cs.RO 版本更新

Think Fast and Far: Long-Horizon Online POMDP Planning via Rapid State Sampling

快速思考与远见:通过快速状态采样实现长时域在线POMDP规划

Yuanchu Liang, Edward Kim, J. Arden Knoll, Wil Thomason, Zachary Kingston, Lydia E. Kavraki, Hanna Kurniawati

发表机构 * Australian National University(澳大利亚国立大学) Rice University(里士大学)

AI总结 提出一种基于快速状态采样的在线POMDP求解器ROP-RAS3,通过宏动作生成和信念空间采样,有效解决长时域POMDP问题,在多种高维连续/离散混合空间中显著优于现有方法。

Comments @inproceedings{Liang2026Thinking, title = {Think Fast and Far: Long-Horizon Online POMDP Planning via Rapid State Sampling}, author = {Yuanchu Liang and Edward Kim and J.Arden Knoll and Wil Thomason and Zachary Kingston and Lydia E. Kavraki and Hanna Kurniawati}, year = 2026, booktitle = {International Journal of Robotics Research (to appear)} }

详情
AI中文摘要

部分可观测马尔可夫决策过程(POMDP)是不确定性下运动规划的通用且原则性框架。尽管POMDP求解器的可扩展性有了巨大提升,但长时域POMDP仍然难以求解。为缓解这一困难,本文提出了一种新的近似在线POMDP求解器,称为基于参考的快速状态空间采样在线POMDP规划(ROP-RAS3)。ROP-RAS3利用新颖的极快采样运动规划技术对状态空间进行采样,并在线生成多样化的宏动作,然后用于偏置信念空间采样并推断高质量策略,而无需对动作空间进行穷举枚举——这是现代在线POMDP求解器的一个基本约束。ROP-RAS3以依赖于采样动作数量而非动作空间大小的速率收敛到近最优的基于参考的解。ROP-RAS3在多种长时域POMDP上进行了评估,这些POMDP具有高达3000个前瞻步骤和35维状态空间,其中状态、动作和观测空间可以是连续的、离散的或离散与连续的混合。尽管基于参考的最优解可能与最优POMDP解不同,但经验结果表明,在所有这些问题中,就成功率而言,ROP-RAS3优于其他最先进方法多达数倍。我们还通过物理机器人演示展示了我们方法的能力。这项工作扩展了我们ISRR24论文的理论和实证结果。代码可在 exttt{https://github.com/RDLLab/ROPRAS3} 找到。

英文摘要

Partially Observable Markov Decision Processes (POMDPs) are a general and principled framework for motion planning under uncertainty. Despite tremendous improvement in the scalability of POMDP solvers, long-horizon POMDPs remain difficult to solve. To alleviate the difficulty, this paper proposes a new approximate online POMDP solver, called Reference-Based Online POMDP Planning via Rapid State Space Sampling (ROP-RAS3). ROP-RAS3 uses novel extremely fast sampling-based motion planning techniques to sample the state space and generate a diverse set of macro actions online, which are then used to bias belief-space sampling and infer high-quality policies without requiring exhaustive enumeration of the action space -- a fundamental constraint for modern online POMDP solvers. ROP-RAS3 converges to a near-optimal reference-based solution at a rate that depends on the number of sampled actions, rather than the size of the action space. ROP-RAS3 is evaluated on various long-horizon POMDPs with up to 3000 lookahead steps and 35-dimensional state spaces, where the state, action and observation spaces can be continuous, discrete, or a hybrid of discrete and continuous. Although the reference-based optimal solution may not be the same as the optimal POMDP solution, empirical results indicate that in all of these problems, in terms of success rate, ROP-RAS3 outperforms other state-of-the-art methods by up to multiple folds. We also demonstrate the capability of our approach on a physical robot demonstration. This work extends the theory and empirical results of our ISRR24 paper. Code can be found at \texttt{https://github.com/RDLLab/ROPRAS3}.

2605.04607 2026-06-04 cs.RO 版本更新

Right Model, Right Time: Real-Time Cascaded-Fidelity MPC for Bipedal Walking

正确模型,正确时机:用于双足行走的实时级联保真度MPC

Franek Stark, Felix Wiebe, Shubham Vyas, Dennis Mronga, Frank Kirchner

发表机构 * Robotics Innovation Center at the German Research Center for Artificial Intelligence (DFKI)(德国人工智能研究中心机器人创新中心)

AI总结 提出一种多阶段全身模型预测控制方法,结合近视野详细全身模型与远视野简化单刚体模型,降低计算复杂度并保持预测能力,在通用MPC框架acados中求解,无需预设足迹位置,在18自由度双足机器人HyPer-2上验证。

Comments Presented at IEEE ICRA 2026 Workshop "2cnd Workshop on Frontiers of Optimization for Robotics"

详情
Journal ref
Proceedings of the 2nd ICRA Workshop on Frontiers of Optimization for Robotics, 2026
AI中文摘要

本文提出了一种用于双足行走的多阶段全身模型预测控制(MPC)方法,在近视野中结合详细的全身模型,在后续预测步骤中结合简化的单刚体模型。这降低了计算复杂度,同时保留了预测能力。所得到的非线性最优控制问题完全在通用现成的非线性MPC框架acados中求解,使用序列二次规划(SQP)。给定接触时间表和目标行走速度,控制器优化关节扭矩,而不依赖于预设的足迹位置。该控制器在18自由度双足机器人HyPer-2的MuJoCo仿真中得到验证。

英文摘要

This paper presents a multi-phase whole-body model predictive control (MPC) approach for bipedal walking, combining a detailed whole-body model in the near horizon with a simplified single-rigid-body model in the later prediction steps. This reduces computational complexity while retaining prediction capabilities. The resulting nonlinear optimal control problem is solved entirely within the general-purpose, off-the-shelf nonlinear MPC framework acados, using sequential quadratic programming (SQP). Given a contact schedule and a target walking speed, the controller optimizes joint torques without depending on preselected footstep locations. The controller is validated in MuJoCo simulation on the 18-DoF bipedal robot HyPer-2.

2606.04269 2026-06-04 cs.RO cs.AI cs.CV 版本更新

Instant-Fold: In-Context Imitation Learning for Deformable Object Manipulation

Instant-Fold: 可变形物体操作的情境模仿学习

Yilong Wang, Cheng Qian, Edward Johns

发表机构 * The Robot Learning Lab(机器人学习实验室) Imperial College London(伦敦帝国学院)

AI总结 提出Instant-Fold框架,通过单次人类演示的情境模仿学习,无需梯度更新即可推断并执行多种可变形物体操作模式,在仿真训练后零样本迁移到真实世界。

详情
AI中文摘要

可变形物体操作(DOM)具有挑战性,因为其状态是高维、部分可观测的,并且通过长时间跨度、拓扑变化的交互演变,涉及多种有效的操作模式。我们引入了Instant-Fold,一个用于DOM的情境模仿学习框架。给定单次人类演示,我们的策略直接从演示中推断并执行多种操作模式,包括空间执行和顺序的变化,无需梯度更新。我们的方法首先通过时间对比预训练学习变形感知的视觉表示,然后基于演示的条件流匹配变换器策略预测执行预期操作模式的动作。完全在仿真中训练的Instant-Fold能够泛化到多种折叠模式,并零样本迁移到真实世界环境,无需额外的数据收集或微调。视频可在https://instant-fold.github.io获取。

英文摘要

Deformable object manipulation (DOM) is challenging due to high-dimensional, partially observable states that evolve through long-horizon, topology-changing interactions with multiple valid manipulation modes. We introduce Instant-Fold, an in-context imitation learning framework for DOM. Given a single human demonstration, our policy infers and executes diverse manipulation modes directly from the demonstration, including variations in spatial execution and ordering, without requiring gradient updates. Our approach first learns deformation-aware visual representations via temporal contrastive pretraining, after which a flow-matching transformer policy conditioned on the demonstration predicts actions to execute the intended manipulation mode. Trained entirely in simulation, Instant-Fold generalizes across diverse folding modes and transfers zero-shot to real-world settings without additional data collection or finetuning. Videos are available at https://instant-fold.github.io.

2606.04248 2026-06-04 cs.RO 版本更新

RSC: Decentralized Rigid Formation Flocking for Large-Scale Swarms via Hybrid Predictive Control and Online Reconfiguration

RSC:通过混合预测控制与在线重配置实现大规模集群的分散式刚性编队集群

Ganyu Zou, Linhan Wang, Chen Dai, Siji Chen, Chang-Tien Lu

发表机构 * University of Science and Technology of China(中国科学技术大学)

AI总结 提出一种分散式控制框架RSC,结合有限时域轨迹预测与反应式人工势场安全控制器,并引入在线领航-跟随重配置机制,在25架无人机杂乱环境中实现83%的编队保持、避障与目标跟踪成功率。

Comments 8 pages, 4 figures, two-column format

详情
AI中文摘要

分散式刚性编队集群要求自主智能体集群在移动过程中仅依靠局部感知和通信来维持预定的几何构型。然而,现有的分散式控制方法在杂乱环境中难以保持严格的智能体间距离约束,常常遭遇局部极小死锁、高频控制振荡或避障时灵活性有限等问题,导致成功率低。为解决这些限制,我们提出了刚性集群控制(RSC),一种用于大规模刚性编队集群的分散式控制框架。为了通过鲁棒的长期规划逃离局部极小同时确保短期安全,RSC在混合架构中集成了有限时域轨迹预测与反应式人工势场(APF)安全控制器。此外,为了在穿越障碍后加速编队重组而不中断任务执行,RSC引入了一种基于稳定角色交换的在线领航-跟随重配置机制。在25架无人机的挑战性杂乱环境中的广泛评估表明,RSC可靠地统一了刚性编队保持、避障和目标跟踪。在严格的成功标准——无碰撞运行且最大相对边长度误差低于10%下,RSC实现了83%的成功率,显著优于成功率低于5%的现有启发式和基于学习的基线方法。

英文摘要

Decentralized rigid formation flocking requires a swarm of autonomous agents to maintain a predetermined geometric configuration while moving, relying solely on local sensing and communication. However, existing decentralized control methods struggle to maintain strict inter-agent distance constraints in cluttered environments, often suffering from local minima deadlocks, high frequency control oscillations, or limited flexibility during obstacle navigation, resulting in low success rate. To address these limitations, we propose Rigid Swarm Control (RSC), a decentralized control framework for large-scale rigid formation flocking. To escape local minima via robust long-term planning while ensuring short-term safety, RSC integrates finite-horizon trajectory predictions with a reactive artificial potential field (APF) safety controller within a hybrid architecture. Furthermore, to accelerate formation reassembly after obstacle traversal without interrupting task execution, RSC introduces an online leader-follower reconfiguration mechanism based on stable role exchange. Extensive evaluations in challenging cluttered environments with 25 UAVs demonstrate that RSC reliably unifies rigid formation maintenance, obstacle avoidance, and target tracking. Under strict success criteria - collision-free operation with a maximum relative edge-length error below 10%, RSC achieves an 83% success rate, significantly outperforming existing heuristic and learning-based baselines that fall below 5%.

2606.04233 2026-06-04 cs.RO 版本更新

What Are We Actually Benchmarking in Robot Manipulation?

我们究竟在机器人操作中基准测试什么?

Tianchong Jiang, Xiangshan Tan, Samuel Wheeler, Luzhe Sun, Tewodros W. Ayalew, Matthew Walter

发表机构 * Toyota Technological Institute at Chicago(丰田技术研究所芝加哥分校) University of Chicago(芝加哥大学) Argonne National Laboratory(阿贡国家实验室)

AI总结 本文通过识别基准测试的四种失效模式(捷径可解性、缺乏统计显著性、渐进过拟合和数据源依赖性),并提出相应诊断方法,对LIBERO、CALVIN、SimplerEnv、RoboCasa和RoboTwin 2.0进行审计,发现多数基准测试存在缺陷,并发布了诊断工具。

Comments 31 pages, 6 figures

详情
AI中文摘要

机器人基准测试分数衡量的是在固定评估设置下的成功率,但通常被当作通用操作能力的证据。我们识别出四种失效模式,每种模式都会削弱或否定基准测试作为该能力有效代理的作用:捷径可解性、缺乏统计显著性、渐进过拟合和数据源依赖性。我们为每种失效模式提出一种诊断方法。我们使用这些诊断方法审计了LIBERO、CALVIN、SimplerEnv、RoboCasa和RoboTwin 2.0。LIBERO和CALVIN未通过多项诊断。RoboCasa和RoboTwin 2.0未通过较少,尽管它们在近期进展声明中出现的频率远低于前者。在LIBERO上,一个没有语言编码器的0.09B探针得分达到或接近报告的最优结果,且大多数报告的性能提升无法证明具有统计显著性。在CALVIN上,在训练范围内随机化块的位置会降低所有测试策略的性能。我们发布了四种诊断方法及其参考实现,供作者和审稿人在将基准测试分数视为进展证据之前使用。代码和工件可在https://ripl.github.io/manipulation_benchmark_audit/获取。

英文摘要

A robotics benchmark score measures success under one fixed evaluation setup, yet is routinely treated as evidence of general manipulation capability. We identify four failure modes, each of which weakens or invalidates a benchmark's role as a valid proxy for that capability: shortcut solvability, lack of statistical significance, creeping overfitting, and data-source dependence. We propose one diagnostic per failure mode. We audit LIBERO, CALVIN, SimplerEnv, RoboCasa, and RoboTwin 2.0 under these diagnostics. LIBERO and CALVIN fail multiple diagnostics. RoboCasa and RoboTwin 2.0 fail fewer, despite appearing far less often in recent progress claims. On LIBERO, a 0.09B probe with no language encoder scores at or near reported SOTA, and most reported gains are not provably statistically significant. On CALVIN, randomizing block poses within the training range drops performance for every tested policy. We release the four diagnostics with reference implementations for authors and reviewers to apply before treating a benchmark score as evidence of progress. Code and artifacts are available at https://ripl.github.io/manipulation_benchmark_audit/.

2606.04226 2026-06-04 cs.RO cs.AI 版本更新

PerceptTwin: Semantic Scene Reconstruction for Iterative LLM Planning and Verification

PerceptTwin:面向迭代LLM规划与验证的语义场景重建

Charlie Gauthier, Sacha Morin, Liam Paull

发表机构 * Department of Computer Science and Operations Research, Université de Montréal(蒙特利尔大学计算机科学与运筹学系) Mila - Quebec AI Institute(魁北克人工智能研究所) CIFAR AI Chair(CIFAR人工智能主席)

AI总结 提出PerceptTwin自动管道,从机器人感知的语义场景表示构建交互式仿真,结合LLM法官验证规划正确性与人类偏好,提升规划成功率约39%。

Comments Accepted at ICRA 2026 (Vienna); published on arxiv for archival purposes. See also https://percept-twin.github.io/

详情
AI中文摘要

仿真环境对于机器人策略学习以及规划验证与确认都很有用。传统上,创建仿真的过程是繁重的。为机器人运行的每个单独环境创建定制的仿真环境是不可行的。在这项工作中,我们引入了PerceptTwin,这是一个全自动管道,直接从机器人感知栈产生的语义场景表示构建交互式仿真。PerceptTwin结合了开放词汇对象地图与3D资产生成、 afford预测和常识条件检查。这些交互式仿真可用于在机器人硬件上执行规划之前验证和完善规划。借鉴AI对齐文献,我们还引入了一个LLM法官,用于验证规划的正确性和与人类偏好的一致性。实验表明,PerceptTwin反馈允许LLM规划器完善规划、增强安全性并抵抗有害的黑盒提示攻击。在我们的任务套件中,PerceptTwin使GPT5、GPT5Mini和GPT5Nano规划器的规划成功率平均提高约39%。此外,对于因未满足技能前提条件而失败的规划,PerceptTwin还将人类规划验证平均提高高达18%。我们的结果证明了从机器人感知进行开放词汇场景仿真作为更安全、更可靠的机器人规划基础的潜力。

英文摘要

Simulation environments are useful for both robot policy learning and planning verification and validation. Traditionally, the process of creating a simulation was onerous. Creating a bespoke simulation environment for each individual environment that a robot would operate in was simply infeasible. In this work, we introduce PerceptTwin, a fully automatic pipeline that constructs interactive simulations directly from semantic scene representations produced by a robot's perception stack. PerceptTwin combines open-vocabulary object maps with 3D asset generation, affordance prediction, and commonsense condition checking. These interactive simulations can be used to validate and refine plans before they are executed on the robot hardware. Borrowing from the AI alignment literature, we also introduce an LLM judge that verifies plan correctness and alignment with human preferences. Experiments show that PerceptTwin feedback allows LLM planners to refine plans, enhance safety, and resist harmful black-box prompting attacks. In our suite of tasks, PerceptTwin improves plan success by an average of approximately 39% for GPT5, GPT5Mini, and GPT5Nano planners. Additionally, PerceptTwin also improves human plan verification by up to 18% on average for plans that fail due to unfilled skill preconditions. Our results demonstrate the potential of open-vocabulary scene simulation from robot perception as a foundation for safer, more reliable robot planning.

2606.04222 2026-06-04 cs.RO 版本更新

Towards Estimating Normal and Shear Interface Pressures in Prosthetic Sockets via Least Squares and Mechanics Modeling

通过最小二乘和力学建模估算假肢接受腔中的法向和剪切界面压力

Axel González Cornejo, Tianhao Yu, Chi Hwan Lee, Edgar Bolívar-Nieto

发表机构 * University of California, Berkeley(加州大学伯克利分校) University of Michigan(密歇根大学)

AI总结 针对假肢接受腔界面压力测量中剪切力缺失和传感器串扰问题,提出一种基于稀疏传感和最小二乘的准静态弹簧-质量接触模型,通过全局力/力矩和局部压力数据验证模型性能。

详情
AI中文摘要

假肢接受腔的适配仍然主要依靠手工和迭代,客观适配指标仍然有限。挑战之一在于缺乏残肢-接受腔界面的长期真实压力数据。传统压力传感器随时间漂移,且仅能捕捉接受腔内稀疏位置的法向压力,缺失了生物力学分析的关键分量:剪切力。尽管某些传感器可以同时报告法向和剪切界面应力,但由于测量串扰,这些分量往往难以解耦。一个潜在的解决途径是开发能够增强现有测量的模型。本文引入了一个测试平台,使用两种互补的验证信号评估稀疏压力传感下的模型性能:(i)通过人工残肢传递的全局力螺旋(即正交坐标系中的总力和力矩),以及(ii)由稀疏传感簇(每个簇由四个电容传感通道组成)测量的局部界面载荷(即每个仪器位置处右手正交坐标系中解耦的法向和剪切压力分量)。本文不呈现全场压力估计,而是聚焦于一个分析序列,量化候选力学模型在受控条件下解释全局和局部测量的能力。评估了一个准静态弹簧-质量接触模型,并通过两阶段凸最小二乘问题识别其参数。静态加载下的验证表明,估计恒定偏置项可以减少力螺旋通道的稳态偏移,并改善与局部测量的一致性。帕累托前沿敏感性分析进一步说明了当包含偏置项时,全局和局部目标之间的权衡如何变化。

英文摘要

Prosthetic socket fitting remains largely manual and iterative, and objective fit metrics are still limited. Part of the challenge is the lack of long-term real-life pressure data at the residual limb--socket interface. Traditional pressure sensors are prone to drift over time, and capture only normal pressures at sparse locations within the socket, missing a critical component for biomechanical analysis: shear. Although some sensors can report both normal and shear interface stresses, these components are often difficult to decouple because of measurement crosstalk. One potential path forward is to develop models that can augment available measurements. This work introduces a testbed to evaluate model performance under sparse pressure sensing using two complementary validation signals: (i) the global wrench (\ie, total forces and moments expressed in an orthonormal frame) transmitted through the socket, by an artificial residual-limb, and (ii) local interface loads (\ie, decoupled normal and shear pressure components in a right-hand-rule orthogonal frame that lives in each instrumented location) measured by sparse sensing clusters, each composed of four capacitance-sensing channels. Rather than presenting full-field pressure estimates, the focus is on an analysis sequence that quantifies how well candidate mechanical models explain both global and local measurements under controlled conditions. A quasi-static spring--mass contact model is evaluated, and its parameters are identified via a two-stage convex least-squares problem. Validation under static loading shows that estimating constant bias terms reduces steady offsets in the wrench channels and improves agreement with local measurements. A Pareto-front sensitivity analysis further illustrates how the trade-off between global and local objectives changes when bias terms are included.

2606.04206 2026-06-04 cs.RO 版本更新

DLO-Lab: Benchmarking Deformable Linear Object Manipulations with Differentiable Physics

DLO-Lab: 基于可微物理的可变形线性物体操作基准测试

Junyi Cao, Yian Wang, Ziyan Xiong, Chunru Lin, Zhehuan Chen, Chuang Gan

发表机构 * DLO-Lab(DLO实验室)

AI总结 针对机器人操作绳索、电缆等可变形线性物体(DLO)的挑战,提出一个可微模拟器,支持多种材料属性,并构建基准任务套件,结合专用DLO智能体,评估策略学习算法并验证仿真到现实的迁移。

Comments ICML 2026, the project page: https://dlo-lab-26.github.io/

详情
AI中文摘要

我们解决了使机器人能够操作可变形线性物体(DLO),如绳索、电缆和橡皮筋的挑战。先前的工作主要集中于狭窄的、任务特定的问题,通常依赖于真实世界的演示或手工制作的启发式方法。然而,这些方法难以扩展到实践中遇到的各种材料和任务,并且收集足够多样化的真实世界数据通常是不切实际的。此外,现有的仿真环境对可泛化DLO操作所需的广泛材料行为支持有限。为了克服这些限制,我们引入了一个明确设计用于多功能DLO操作的可微模拟器。我们的模拟器模拟了广泛的材料属性——包括(不可)延伸性、弹性、弯曲塑性以及与其他物体的复杂交互——为学习和评估操作技能提供了坚实的基础。基于此模拟器,我们提出了一个代表性任务的基准套件,突出了DLO操作的独特挑战。这些任务的成功执行通常受到DLO固有的拓扑复杂性和抓取敏感性的阻碍。因此,我们引入了一个专门的DLO智能体,通过提出战略性抓取点并将长视界任务分解以最大化控制权,明确管理这些挑战。最后,我们使用我们的框架评估了各种策略学习算法,并进行了仿真到现实的迁移实验,展示了我们平台在推进DLO操作方面的潜力。

英文摘要

We address the challenge of enabling robots to manipulate deformable linear objects (DLOs), such as ropes, cables, and rubber bands. Prior work has primarily focused on narrow, task-specific problems, often relying on real-world demonstrations or handcrafted heuristics. Such approaches, however, struggle to scale to the wide variety of materials and tasks encountered in practice, and collecting sufficiently diverse real-world data is often impractical. Additionally, existing simulation environments offer limited support for the broad spectrum of material behaviors necessary for generalizable DLO manipulation. To overcome these limitations, we introduce a differentiable simulator explicitly designed for versatile DLO manipulation. Our simulator models a wide range of material properties-including (in)extensibility, elasticity, bending plasticity, and complex interactions with other objects-providing a robust foundation for learning and evaluating manipulation skills. Building on this simulator, we propose a benchmark suite of representative tasks that highlight the unique challenges of DLO manipulation. The successful execution of these tasks is often hindered by the topological complexity and grasp sensitivity inherent to DLOs. Therefore, we introduce a specialized DLO agent that explicitly manages these challenges by proposing strategic grasping points and decomposing long-horizon tasks to maximize control authority. Finally, we evaluate various policy-learning algorithms using our framework, alongside sim-to-real transfer experiments, demonstrating our platform's potential to advance DLO manipulation.

2606.04188 2026-06-04 cs.LG cs.AI cs.RO 版本更新

Dual Advantage Fields

双优势场

Alexey Zemtsov, Maxim Bobrin, Alexander Nikulin, Dmitry V. Dylov, Fakhri Karray, Vladislav Kurenkov, Martin Takáč, Arip Asadulaev

发表机构 * NUST MISIS(努斯大学材料科学与工程学院) MSU(莫斯科大学) Computational Imaging Lab(计算成像实验室) MBZUAI(马斯喀特人工智能研究院) dunnolab(杜诺实验室) Innopolis University(因诺波利斯大学)

AI总结 提出双优势场(DAF)方法,利用双线性对偶值模型生成局部优势信号,通过动作-效应模型预测折扣特征位移并与目标方向对齐来评分动作,实现离线目标条件强化学习中的策略提取。

Comments Accepted by ICML 2026 Workshop on Decision-Making from Offline Datasets to Online Adaptation: Black-Box Optimization to Reinforcement Learning

详情
AI中文摘要

离线目标条件强化学习需要长期可达性估计和局部动作比较。双目标表示提供捕获全局目标可达性的值场,但它们不直接指定在给定状态下应优先选择哪个动作。我们提出双优势场(DAF),一种策略提取方法,将双线性对偶值模型转化为局部优势信号。在双线性对偶参数化下,目标嵌入是值场关于状态表示的梯度。DAF学习一个动作-效应模型,预测由动作引起的折扣特征位移,并通过该位移与目标方向的对齐程度对动作进行评分。在可实现的情况下,该分数等于目标条件Bellman优势,从而提供标准的局部策略改进保证。在OGBench的 locomotion、manipulation 和 puzzle 任务上,DAF改进了聚合RLiable指标,并在局部正确动作与直接朝向最终目标移动不同的设置中表现强劲。

英文摘要

Offline goal-conditioned reinforcement learning requires both long-horizon reachability estimates and local action comparisons. Dual goal representations provide value fields that capture global goal reachability, but they do not directly specify which action should be preferred at a given state. We propose Dual Advantage Fields, a policy-extraction method that turns a bilinear dual value model into a local advantage signal. Under bilinear dual parameterization, the goal embedding is the gradient of the value field with respect to the state representation. DAF learns an action-effect model that predicts the discounted feature displacement induced by an action and scores actions by the alignment between this displacement and the goal direction. In the realizable case, this score equals the goal-conditioned Bellman advantage, yielding a standard local policy-improvement guarantee. On OGBench locomotion, manipulation, and puzzle tasks, DAF improves aggregate RLiable metrics and performs strongly in settings where locally correct actions differ from direct movement toward the final goal.

2606.04185 2026-06-04 cs.RO 版本更新

Distribution-Free Risk-Aware Planning and Control Under Uncertainty Using Conformal Spectral Risk Control

基于共形谱风险控制的免分布风险感知规划与控制

Junsik Eom, Tulga Ersal

发表机构 * Department of Mechanical Engineering, University of Michigan(密歇根大学机械工程系)

AI总结 提出一种免分布的风险感知模型预测控制框架,通过扩展共形风险控制到谱风险度量,生成预测集以在不确定性下保证风险低于用户指定阈值,并在车辆避障仿真中验证了安全性和效率提升。

Comments Submitted to IEEE Robotics and Automation Letters

详情
AI中文摘要

在动态和不确定环境中的安全导航通常依赖于对真实潜在不确定性的准确估计或假设。然而,由于数据有限或信息不完善,准确描述真实不确定性分布往往很困难。即使在高风险规避水平下,对不确定性及其相关风险的错误理解也可能导致危险决策。为了解决这个问题,我们提出了一种风险感知模型预测控制(RA-MPC)框架,该框架结合预测集来保证风险控制在用户指定阈值以下,而无需对潜在不确定性分布做出假设。为了生成预测集,我们开发了一种免分布的风险量化框架,将共形风险控制(CRC)扩展到一般谱风险度量。然后,我们证明将预测集纳入MPC框架即使在不确定性错误指定的情况下也能提供关于谱风险约束满足的统计安全保证。我们在模拟的车辆避障场景中验证了所提出的框架,与基线RA-MPC框架相比,展示了更高的安全性和更短的求解时间。

英文摘要

Safe navigation in dynamic and uncertain environments often relies on accurate estimation of, or assumptions about, the true underlying uncertainty. However, accurately characterizing the true uncertainty distribution is often difficult due to limited data or imperfect information. An incorrect understanding of the uncertainty and its associated risk may lead to dangerous decisions even under high levels of risk aversion. To address this issue, we propose a risk-aware model predictive control (RA-MPC) framework that incorporates prediction sets to guarantee risk control below a user-specified threshold without requiring assumptions about the underlying uncertainty distribution. To generate the prediction sets, we develop a distribution-free risk quantification framework that extends conformal risk control (CRC) to general spectral risk measures. We then show that incorporating the prediction sets into the MPC framework provides statistical safety guarantees in terms of spectral risk constraint satisfaction even under uncertainty misspecification. We validate the proposed framework in simulated vehicle obstacle avoidance scenarios, demonstrating improved safety and reduced solve time compared to a baseline RA-MPC framework.

2606.04172 2026-06-04 cs.RO 版本更新

Affordance2Action: Task-Conditioned Scene-level Affordance Grounding for Real-Time Manipulation

Affordance2Action: 任务条件下的场景级功能区域定位用于实时操作

Litao Liu, Yifan Han, Pengfei Yi, Wenbo Yu, Hanqing Wang, Haoran Du, Enze Yuan, Zilin Yuan, Ruiding Feng, Michael Liu, Qi Zhang, Jingjin Yu

发表机构 * Department of Computer Science, Rutgers University-New Brunswick(罗格斯大学新布朗斯维尔回声分校计算机科学系) The Hong Kong University of Science and Technology (GZ)(香港科学与技术大学(GZ)) Shanghai AI Laboratory(上海人工智能实验室)

AI总结 提出Affordance2Action框架,通过构建A2A-Bench基准和A2A-AffordGen标注流程,解决场景级任务条件功能区域定位中的多区域对应问题,并支持实时操作。

Comments 23 pages

详情
AI中文摘要

任务条件操作需要将指令定位到与任务相关的功能部件,而非物体类别。这种设置依赖于场景,并且在杂乱场景中通常是一对多的:同一物体在不同任务中可能提供不同的交互,而单个任务可能对应一个功能区域或多个有效功能区域,具体取决于场景布局。现有的功能区域数据集和基准与此设置不一致,因为它们通常侧重于抓取或物体级功能区域,依赖合成场景,或假设单一的指令-区域对应关系。我们提出了Affordance2Action (A2A),一个以基准为中心的学习框架,用于场景级、任务条件的功能区域定位。其核心是A2A-Bench,一个面向操作的基准,涵盖了日常场景中的单区域和多区域指令对应关系,其中多区域对应关系突显了现实多物体环境中功能区域定位的模糊性和多样性。为了大规模构建该基准,我们构建了A2A-AffordGen,一个代理辅助的标注流程,结合了语言模型过滤、交互式部件分割、实例级遮罩细化、任务推理指令生成和人工验证。A2A-Bench的监督进一步支持多种下游应用,其中实时功能区域定位和功能区域条件操作策略是两个代表性示例。实验表明,A2A暴露了通用分割、基于VLM的定位和功能区域蒸馏基线中的显著差距,同时改进了任务级定位并为下游操作提供了有用的空间先验。所有数据集和代码将公开发布,以促进开放研究。

英文摘要

Task-conditioned manipulation requires grounding instructions to task-relevant functional parts rather than object categories. This setting is scene-dependent and often one-to-many in cluttered scenes: the same object may afford different interactions across tasks, while a single task may correspond to either one functional region or multiple valid functional regions, depending on the scene layout. Existing affordance datasets and benchmarks remain misaligned with this setting, as they typically focus on grasping or object-level affordances, rely on synthetic scenes, or assume a single instruction-region correspondence. We present Affordance2Action (A2A), a benchmark-centered learning framework for scene-level, task-conditioned part affordance grounding. At its core is A2A-Bench, a manipulation-oriented benchmark that covers both single-region and multi-region instruction correspondences in everyday scenes, with the latter highlighting the ambiguity and diversity of affordance grounding in realistic multi-object environments. To construct it at scale, we build A2A-AffordGen, an agent-assisted annotation pipeline that combines language-model filtering, interactive part segmentation, instance-level mask-out refinement, task-reasoning instruction generation, and human verification. A2A-Bench's supervision further supports diverse downstream applications, with real-time affordance grounding and affordance-conditioned manipulation policies as two representative examples. Experiments show that A2A exposes substantial gaps in generic segmentation, VLM-based grounding, and affordance distillation baselines, while improving task-level localization and providing useful spatial priors for downstream manipulation. All datasets and code will be publicly released to promote open research.

2606.04158 2026-06-04 cs.RO 版本更新

Multi-Agent Next-Best-View Optimization for Risk-Averse Planning

多智能体风险规避规划中的下一最佳视角优化

Amirhossein Mollaei Khass, Vivek Pandey, Guangyi Liu, Athanasios Cosse, Emrah Bayrak, Nader Motee

发表机构 * Department of Mechanical Engineering and Mechanics, Lehigh University(莱文大学机械工程与力学系) Amazon Robotics(亚马逊机器人)

AI总结 提出一种分布式、风险感知的多智能体下一最佳视角框架,通过共识ADMM优化信息增益并建模碰撞风险,在降低通信开销的同时接近集中式方法的映射质量和轨迹安全性。

Comments 8 pages, 5 figures. Submitted to IROS 2026

详情
AI中文摘要

在不确定和未知环境中,多智能体下一最佳视角选择用于安全路径规划需要信息丰富、安全感知且高效的协调。集中式方法依赖于共享原始传感器数据或大量通信开销,导致可扩展性有限。我们提出一种分布式、风险感知的多智能体NBV框架,其中每个机器人维护一个私有的局部3D高斯溅射地图,团队共同最大化沿规划轨迹的掩蔽区域内的期望信息增益。通过通信图上的共识ADMM求解分布式目标,每个机器人仅交换候选视角、规划轨迹描述符和标量EIG贡献。通过局部3DGS地图上的平均风险价值对每条轨迹的碰撞风险进行建模,并用于塑造掩蔽半径和评分规划路径。在多个团队规模的Gibson环境中的实验表明,分布式公式在映射质量和轨迹安全性方面接近集中式基线,同时将通信量降低数个数量级。

英文摘要

Multi-agent Next-Best-View (NBV) selection for safe path planning in uncertain and unknown environments requires informative, safety-aware, and efficient coordination. Centralized approaches rely on sharing raw sensor data or significant communication overhead, resulting in limited scalability. We propose a distributed, risk-aware multi-agent NBV framework in which each robot maintains a private local 3D Gaussian Splatting map and the team jointly maximizes expected information gain (EIG) restricted to masked zones along planned trajectories. The resulting distributed objective is solved by Consensus ADMM (C-ADMM) over a communication graph, with each robot exchanging only candidate viewpoints, planned trajectory descriptors, and scalar EIG contributions. Collision risk along each trajectory is modeled via Average Value-at-Risk (AV@R) over the local 3DGS map and used both to shape the masking radius and to score planned paths. Experiments in Gibson environments at multiple team sizes show that the distributed formulation approaches the centralized baseline in mapping quality and trajectory safety while reducing communication by orders of magnitude.

2606.04157 2026-06-04 cs.RO 版本更新

Selecting haptic guidance models in teleoperation: guidelines from a comparative user study

遥操作中触觉引导模型的选择:来自比较用户研究的指南

Alexis Boulay, Margot Vulliez, David Daney

发表机构 * Farm3, Besançon, France(法国贝桑松Farm3) Auctus Team, Inria, Talence, France(法国塔兰西Inria Auctus团队)

AI总结 通过用户研究比较弹簧-阻尼器、势场和引导管三种触觉引导模型,提出基于环境特征和实时评估指标的模型选择指南。

Comments EUROHAPTICS 2026 - EuroHaptics International Conference, Jul 2026, Sienna, Italy

详情
AI中文摘要

遥操作中的触觉引导通过力反馈增强操作员性能。本文提出了考虑任务、环境和操作员的最合适模型选择指南。我们定义了一个统一公式,将最常见的模型(弹簧-阻尼器、势场和引导管)表示为具有特定模型引导函数的刚度-阻尼系统的变体。我们进行了一项用户研究,在垂直农业任务中比较了三种经典模型在六种不同环境条件下的场景。结果显示没有普遍优越的模型:弹簧-阻尼器在杂乱环境中表现优异,势场在自由空间中表现良好(但在障碍物附近存在风险),而引导管提供了平衡的折衷。我们提出了新颖的客观指标来评估交互,并表明引导力大小与舒适度和信任度评分相关。这些发现通过环境特征和实时评估指标提供了实用的模型选择指南。

英文摘要

Haptic guidance in teleoperation enhances operator performance through force feedback. This paper presents guidelines to select the most appropriate model considering the task, the environment and the operator. We define a unified formulation expressing most common models (spring-damper, potential field, and guiding tube) as variations of a stiffness-damping system with model-specific guiding functions. We conducted a user study comparing the three classical models across six scenarios with varying environmental conditions in a vertical farming task. Results show no universally superior model: spring-damper excels in cluttered environments, potential field in free spaces (but it shows risks near obstacles), and guiding tube offers a balanced compromise. We propose novel objective metrics to evaluate the interaction, and show that guiding force magnitude correlates with comfort and trust scores. These findings provide practical model selection guidelines through environmental characteristics and real-time evaluation metrics.

2606.04149 2026-06-04 cs.RO 版本更新

CoPark: Learning Reactive Parking via Self-Play

CoPark:通过自我对弈学习反应式泊车

Jiarong Wei, Yanxing Chen, Sinuo Song, Yin Wu, Anna Rehr, Abhinav Valada

发表机构 * Department of Computer Science, University of Freiburg(弗赖堡大学计算机科学系) CARIAD SE(CARIAD SE公司) Technical University of Munich(慕尼黑技术大学)

AI总结 提出CoPark,一种基于残差策略的多智能体自我对弈强化学习方法,通过固定先验与残差头结合,在反应式泊车中实现高精度与安全交互的平衡,显著优于基线方法。

详情
AI中文摘要

学习一个能够以高几何精度达到目标同时与附近智能体安全交互的单一策略面临相互冲突的目标。精度有利于固定几何计划的执行,而交互则要求在另一智能体侵入时立即偏离,导致针对一个目标优化的策略往往在另一个目标上失败。我们在反应式自主泊车的背景下研究这一问题,其中多辆车必须达到指定车位,终端精度达到亚米级,同时在整个操作过程中对邻近车辆保持响应。我们提出CoPark,一种基于残差策略架构的多智能体自我对弈RL方法。预计算的离线计划提供固定的动作先验,而残差头学习反应式修正。残差策略在自我对弈下学习行为,弥补数据和脚本的不足,而固定先验保持纯策略难以可靠达到的车位框架几何。关键设计是一种合作伙伴威胁调制的通道非对称先验释放。连续威胁信号将纵向通道的权限转移给残差头以实现让行,而横向通道仍锚定在预计算参考上以保持亚米级车位对齐。闭环细化层修正动作网格离散化带来的残差终端误差。我们在六个停车场训练策略,并在我们的新反应式泊车基准(包括Dragon Lake Parking (DLP)和DeepScenario Open 3D (DSC3D))上进行零样本评估。CoPark实现了约70-85%的成功率,碰撞率仅为3-6%,显著优于经典、模仿学习和大规模RL基线。重要的是,结果展示了涌现的交互行为,如倒车让行、中途让行、狭窄通道通行和排队。

英文摘要

Learning a single policy that reaches a goal with high geometric precision while interacting safely with nearby agents poses conflicting objectives. Precision favors commitment to a fixed geometric plan, whereas interaction requires immediate deviation when another agent intrudes, causing policies optimized for one objective to often fail at the other. We study this problem in the context of reactive autonomous parking, where multiple vehicles must reach assigned slots with sub-meter terminal accuracy while remaining responsive to neighboring vehicles throughout the maneuver. We propose CoPark, a multi-agent self-play RL approach built on a residual-policy architecture. A precomputed offline plan provides a fixed action prior, while a residual head learns the reactive corrections. The residual policy learns behaviors under self-play, where data and scripting fall short, while the fixed prior holds the slot-frame geometry that pure policies struggle to reach reliably. The key design is a partner-threat-modulated, channel-asymmetric release of the prior. A continuous threat signal shifts authority of the longitudinal channel to the residual head to enable yielding, while the lateral channel remains anchored to the precomputed reference to preserve sub-meter slot alignment. A closed-loop refinement layer corrects residual terminal error from action-grid discretization. We train our policy on six parking lots and evaluate zero-shot on our new reactive-parking benchmark spanning Dragon Lake Parking (DLP) and DeepScenario Open 3D (DSC3D). CoPark achieves ~70-85% success with only 3-6% collision rate, substantially outperforming classical, imitation-learning, and large-scale RL baselines. Importantly, the results demonstrate emergent interaction behaviors such as reverse-yielding, mid-maneuver yielding, tight-corridor passing, and queuing.

2606.04130 2026-06-04 cs.RO 版本更新

CLAW: Learning Continuous Latent Action World Models via Adversarial Latent Regularization

CLAW: 通过对抗潜在正则化学习连续潜在动作世界模型

Tewodros Ayalew, Matthew Jeung, Samuel Wheeler, Xiao Zhang, Andre de la Cruz Arce, Kaylene Stocking, Michael Maire, Matthew R. Walter

发表机构 * University of Chicago(芝加哥大学) Toyota Technological Institute at Chicago(芝加哥丰田技术研究所) Argonne National Laboratory(阿贡国家实验室)

AI总结 提出CLAW框架,利用对抗潜在正则化和扩散视频生成,从无动作视频中端到端学习世界模型与连续潜在动作表示,支持观察模仿学习和目标导向规划。

Comments 8 pages, 15 pages of supplementary material

详情
AI中文摘要

我们引入了CLAW,一个完全端到端的自监督框架,用于直接从无动作视频中联合学习世界模型和连续潜在动作表示。我们的方法利用对抗潜在正则化和基于扩散的视频生成来捕获结构化和语义上有意义的动作表示,同时建模丰富的、可预测的环境动态,而不依赖于任何动作标签或注释。通过同时训练潜在动作模型和世界模型,CLAW学会仅从视觉观察中推理推断的动作如何引起环境转变。我们展示了由此产生的潜在动作世界模型支持从观察中模仿学习和目标导向规划。在模仿学习中,从原始视频中提取的潜在动作实现了行为克隆。对于规划,CLAW生成潜在动作序列并将其映射到可执行动作以达到期望目标。跨多种任务和实体的广泛实验表明,CLAW产生了语义上有意义的潜在动作表示,支持有效的动作迁移,并实现了规划和从观察中模仿,优于现有方法。

英文摘要

We introduce CLAW, a fully end-to-end self-supervised framework for learning a world model jointly with continuous latent action representations directly from action-free videos. Our approach leverages adversarial latent regularization and diffusion-based video generation to capture structured and semantically meaningful action representations while modeling rich, predictive environment dynamics, without relying on any action labels or annotations. By simultaneously training the Latent Action Model and world model, CLAW learns to reason about how inferred actions induce environment transitions from visual observations alone. We show that the resulting latent action world model supports both imitation learning from observation and goal-directed planning. In imitation learning, latent actions extracted from raw videos enable behavior cloning. For planning, CLAW generates sequences of latent actions and maps them to executable actions to reach desired goals. Extensive experiments across diverse tasks and embodiments demonstrate that CLAW produces semantically meaningful latent action representations, supports effective action transfer, and enables planning and imitation from observation, outperforming existing methods.

2606.04123 2026-06-04 math.OC cs.AI cs.RO 版本更新

Semantic Constraint Synthesis for Adaptive Trajectory Optimization via Large Language Models

基于大语言模型的语义约束综合用于自适应轨迹优化

Eleanor Brosius, Yuji Takubo, Daniele Gammelli, Simone D'Amico, Marco Pavone

发表机构 * Stanford University(斯坦福大学)

AI总结 提出利用大语言模型将自然语言描述的任务需求转化为可执行的轨迹优化代码和数学公式,在航天器交会场景中实现了从语义需求重构凸轨迹优化问题的高成功率。

Comments 7 pages, 4 figures, Presented as a short paper at IEEE CVPR 2026, AI4Space Workshop

详情
AI中文摘要

轨迹优化是实现太空探索中安全可靠自主操作的关键组成部分。随着太空任务在频率、复杂性和范围上的增加,迫切需要快速制定数学上合理的轨迹优化问题,以准确反映任务目标和操作约束。然而,将任务意图转化为易于处理的轨迹优化分析公式需要大量的领域专业知识。本文提出一个框架,利用大语言模型(LLMs)将任务需求和约束的自然语言描述转化为可执行的轨迹优化代码及相应的数学公式。在航天器交会场景中的实验表明,从语义任务需求重构凸轨迹优化问题具有高成功率。最终,这项工作凸显了LLMs在连接高层意图与形式化优化模型方面的潜力,从而实现更灵活高效的航天器轨迹设计。

英文摘要

Trajectory optimization is a critical component for enabling safe and reliable autonomous operations in space exploration. As space missions increase in frequency, complexity, and scope, there is a growing need to rapidly formulate mathematically sound trajectory optimization problems that accurately reflect mission objectives and operational constraints. However, translating mission intent into tractable analytical formulations for trajectory optimization requires substantial domain expertise. This paper presents a framework that leverages large language models (LLMs) to translate natural language descriptions of mission requirements and constraints into executable trajectory optimization code and corresponding mathematical formulations. Experiments in spacecraft rendezvous scenarios demonstrate a high success rate in reconditioning a convex trajectory optimization problem from semantic mission requirements. Ultimately, this work highlights the potential of LLMs to bridge high-level intent and formal optimization models, enabling more flexible and efficient trajectory design of spacecraft.

2606.04111 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

AgenticDiffusion: Agentic Diffusion-based Path Planning for Vision-Based UAV Navigation

AgenticDiffusion:基于智能体扩散的视觉无人机导航路径规划

Faryal Batool, Muhammad Ahsan Mustafa, Fawad Mehboob, Valerii Serpiva, Dzmitry Tsetserukou

发表机构 * University of Engineering and Technology, Lahore(拉合尔工程与技术大学)

AI总结 提出AgenticDiffusion多视角无人机导航框架,结合语言引导推理、开放词汇目标定位、视觉扩散规划与NMPC,通过协调第一人称和俯视图提升室内导航效率,在40次真实实验中实现80%任务成功率。

详情
AI中文摘要

室内无人机导航需要在有限视场观测下实现高效探索、场景理解和可靠轨迹执行。现有的基于视觉的导航框架通常依赖单视角观测,限制了其对遮挡、目标可见性和全局场景结构的推理能力。在这项工作中,我们提出了AgenticDiffusion,一个多视角无人机导航框架,在统一的空中导航流程中协调语言引导推理、开放词汇目标定位、基于视觉的扩散规划以及NMPC。给定自然语言指令和同步的第一人称视角(FPV)与俯视图观测,该框架在轨迹执行前确定最具信息量的导航视角并生成任务计划。使用开放词汇定位模型定位目标后,特定视角的扩散规划器生成用于无人机执行的导航轨迹。通过互补视角,所提框架减少了重复目标探索,并提高了在杂乱室内环境中的导航效率。该框架在四个真实无人机导航场景中进行了验证,涉及自适应视角选择、多阶段任务执行、长时域导航和安全着陆点选择。实验结果表明,在40次真实试验中,总体任务成功率达到80%,而扩散规划器实现了100%的轨迹生成成功率。

英文摘要

Indoor UAV navigation requires efficient exploration, scene understanding, and reliable trajectory execution under limited field-of-view observations. Existing vision-based navigation frameworks typically rely on single-view observations, limiting their ability to reason about occlusions, target visibility, and global scene structure. In this work, we propose AgenticDiffusion, a multi-view UAV navigation framework that coordinates language-guided reasoning, open-vocabulary target grounding, vision-based diffusion planning, and NMPC within a unified aerial navigation pipeline. Given a natural language instruction and synchronized first-person-view (FPV) and top-view observations, the framework determines the most informative viewpoint for navigation and generates a mission plan prior to trajectory execution. The targets are localized using an open-vocabulary grounding model, after which viewpoint-specific diffusion planners generate navigation trajectories for UAV execution. Using complementary viewpoints, the proposed framework reduces repeated target exploration and improves navigation efficiency in cluttered indoor environments. The framework was validated in four real-world UAV navigation scenarios involving adaptive viewpoint selection, multi-stage mission execution, long-horizon navigation, and safe landing-site selection. The experimental results demonstrated an overall mission success rate of 80% in 40 real-world trials, while the diffusion planners achieved a trajectory generation success rate of 100%.

2606.04072 2026-06-04 cs.RO cs.DC cs.LG cs.SY eess.SY 版本更新

CADET: A Modular Platform for Evaluating Distributed Cooperative Autonomy in Connected Autonomous Vehicles

CADET:用于评估网联自动驾驶车辆中分布式协作自主性的模块化平台

Pragya Sharma, Brian Wang, Mani Srivastava

发表机构 * UCLA Amazon Scholar(亚马逊学者)

AI总结 提出CADET模块化平台,通过解耦自动驾驶堆栈并集成网络与工作负载仿真,系统评估分布式协作自主系统在真实部署条件下的安全性与性能。

详情
Journal ref
ICRA 2026
AI中文摘要

深度学习模型日益成为自动驾驶汽车(AV)管道的核心,然而其集成传统上遵循单一设计,即感知、规划和控制在同一车载计算机上执行。这种设计忽视了协作自主的新兴范式,即车辆通过车联网(V2X)连接与路侧单元(RSU)、边缘服务器和云托管智能进行交互。协作感知和控制提高了安全性和效率,但也引入了系统级挑战:网络延迟、计算异构性和多租户争用,所有这些都严重影响实时决策。这些挑战因对大型基础模型的日益依赖而进一步放大,这些模型的规模需要云部署。我们提出CADET(通过分布式实验工具包实现协作自主),这是一个模块化平台,用于在真实部署条件下对分布式协作自主系统进行系统化和可重复的评估。CADET将自动驾驶堆栈解耦为可组合的模块,这些模块可以灵活地部署在车辆、基础设施和边缘/云层级上。该框架集成了最先进的模型,引入了基于轨迹的网络和工作负载仿真,并提供了同步的模型级、系统级和任务级检测。通过V2V和V2I实验,我们表明分布式部署选择从根本上影响安全性,其中V2V意图数据包优于基于云的感知,而RSU辅助感知在过载并发请求之前维持安全性。尽管专为自动驾驶管道设计,CADET也支持数据集驱动的实验,使系统和机器学习研究人员能够独立于完整的车辆仿真来基准测试分布式推理工作负载。CADET是开源的,代码和演示可在https://nesl.github.io/cadet-web获取。

英文摘要

Deep learning models are increasingly central to autonomous vehicle (AV) pipelines, yet their integration has traditionally followed a monolithic design where perception, planning, and control execute on a single onboard computer. This design overlooks the emerging paradigm of cooperative autonomy, where vehicles interact with roadside units (RSUs), edge servers, and cloud-hosted intelligence through vehicle-to-everything (V2X) connectivity. Cooperative perception and control improve safety and efficiency, but also introduce systems-level challenges: network latency, compute heterogeneity, and multi-tenant contention, all critically affect real-time decision-making. These challenges are further amplified by the increasing reliance on large foundation models, whose scale necessitates cloud deployment. We present CADET (Cooperative Autonomy through Distributed Experimentation Toolkit), a modular platform for systematic and reproducible evaluation of distributed cooperative autonomy systems under realistic deployment conditions. CADET decouples the AV stack into composable modules that can be flexibly deployed across vehicles, infrastructure, and edge/cloud tiers. The framework integrates state-of-the-art models, incorporates trace-driven network and workload emulation, and provides synchronized model-, system-, and task-level instrumentation. Through V2V and V2I experiments, we show that distributed deployment choices fundamentally shape safety, with V2V intent packets outperforming cloud-based perception and RSU-assisted perception sustaining safety until overloaded by concurrent requests. Although designed for AV pipelines, CADET also supports dataset-driven experimentation, enabling systems and ML researchers to benchmark distributed inference workloads independently of full vehicle simulation. CADET is open source, with code and demo available at https://nesl.github.io/cadet-web.

2606.04046 2026-06-04 cs.CV cs.AI cs.CL cs.LG cs.RO 版本更新

Dive into the Scene: Breaking the Perceptual Bottleneck in Vision-Language Decision Making via Focus Plan Generation

深入场景:通过焦点计划生成打破视觉-语言决策中的感知瓶颈

Boyuan Xiao, Bohong Chen, Yumeng Li, Ji Feng, Yao-Xiang Ding, Kun Zhou

发表机构 * University of Science and Technology of China(中国科学技术大学) Tsinghua University(清华大学)

AI总结 提出SceneDiver方法,通过从粗到细的焦点计划生成,逐步构建场景图并分解任务,减少视觉幻觉,提升视觉-语言模型和视觉-语言-动作模型在具身决策任务中的表现。

Comments Accepted at ICML 2026

详情
AI中文摘要

在具身视觉-语言决策任务(如机器人操作和导航)中,视觉-语言模型和视觉-语言-动作模型(VLMs & VLAs)是具有不同优势的强大工具:VLMs更擅长长期规划,而VLAs更擅长反应控制。然而,它们的性能受到相同感知瓶颈的限制:由于模型无法区分任务相关对象与干扰物,导致视觉幻觉。原则上,准确识别并聚焦关键对象同时过滤无关对象是突破这一限制的关键。一个直接的解决方案是一步聚焦:直接关注重要对象。然而,这种方法被证明无效,因为有效的聚焦本质上需要深度场景理解。为此,我们提出SceneDiver,一种利用VLMs长期规划能力的从粗到细的焦点计划生成方法,首先构建整体场景图以建立初步理解,然后通过识别、理解和分析的迭代循环逐步将任务分解为更简单的子问题。为了实现反应控制,我们还设计了一个轻量级适配器,将深思熟虑的聚焦能力蒸馏到VLAs中。在标准具身AI基准上的评估证实,我们的方法显著减少了VLMs和VLAs的视觉幻觉,同时在需要快速执行的任务中保持了计算效率。我们的代码和数据发布在:https://future-item.github.io/SceneDiver。

英文摘要

In embodied vision-language decision making tasks such as robotic manipulation and navigation, Vision-Language and Vision-Language-Action Models (VLMs & VLAs) are powerful tools with different benefits: VLMs are better at long-term planning, while VLAs are better at reactive control. However, their performance is limited by the same perceptual bottleneck: visual hallucinations arise due to the models' inability to distinguish task-relevant objects from distractors. In principle, accurate identification and focus on critical objects while filtering out irrelevant ones is the key to break this limitation. A straightforward solution is one-step focus: directly attending to essential objects. However, this approach proves ineffective because effective focus inherently requires deep scene understanding. To this end, we propose SceneDiver, a coarse-to-fine focus plan generation method for VLMs leveraging their long-term planning abilities, that first constructs a holistic scene graph to establish initial comprehension, then progressively decomposes the task into simpler sub-problems through an iterative cycle of recognition, understanding, and analysis. To enable reactive control, we also design a lightweight adapter for distilling the deliberate focus ability into VLAs. Evaluations on standard embodied AI benchmarks confirm that our method substantially reduces visual hallucinations for both VLMs and VLAs, while preserving computational efficiency in tasks requiring fast execution. Our code and data are released at: https://future-item.github.io/SceneDiver.

2606.03943 2026-06-04 cs.RO cs.CV cs.LG 版本更新

PointAction: 3D Points as Universal Action Representations for Robot Control

PointAction: 3D点作为机器人控制的通用动作表示

Mutian Tong, Han Jiang, Qiao Feng, Lingjie Liu, Jiatao Gu

发表机构 * University of Pennsylvania(宾夕法尼亚大学)

AI总结 提出PointAction框架,通过微调视频生成模型联合预测未来RGB帧和动态3D点图,将点动力学作为与具体本体无关的动作接口,再由扩散动作解码器映射为可执行动作,以减少RGB动作歧义并跨任务/本体迁移。

Comments Project page: https://oriontmt.github.io/pointaction/

详情
AI中文摘要

视频-动作模型(VAM)利用预训练视频扩散模型捕获的广泛视觉动态,为通用机器人操作提供了有前景的路径。然而,仅RGB视频展开无法直接操作:它们未明确指定度量3D运动、接触几何和细粒度空间约束,导致动作基础不明确。同时,跨不同任务和本体的动作监督扩展仍然成本高昂。我们提出PointAction,一个通过显式基于点的4D建模将视频预测桥接到机器人动作的框架。PointAction微调基础视频生成模型,联合预测未来RGB帧和动态3D点图,产生任务相关场景几何的时间一致3D运动。这些点动力学作为结构化的、与本体无关的动作接口,由基于扩散的动作解码器映射为可执行的机器人动作。通过使用度量3D点动力学作为视频预测和控制之间的接口,PointAction减少了仅RGB动作基础的不确定性,并支持在有限动作监督下跨任务和本体的迁移。实验表明,PointAction在机器人场景上实现了最先进的4D生成质量,在模拟中优于现有基线,并泛化到预训练中未见过的两个真实机器人手臂。

英文摘要

Video-Action Models (VAMs) leverage the broad visual dynamics captured by pre-trained video diffusion models, offering a promising path toward generalizable robot manipulation. However, RGB-only video rollouts are not directly actionable: they leave metric 3D motion, contact geometry, and fine-grained spatial constraints under-specified, making action grounding ambiguous. Meanwhile, scaling action supervision across diverse tasks and embodiments remains costly. We present PointAction, a framework that bridges video predictions to robot actions through explicit point-based 4D modeling. PointAction fine-tunes a foundation video generation model to jointly predict future RGB frames and dynamic 3D pointmaps, producing temporally consistent 3D motion of task-relevant scene geometry. These point dynamics serve as a structured, embodiment-agnostic action interface, which a diffusion-based action decoder maps to executable robot actions. By using metric 3D point dynamics as the interface between video prediction and control, PointAction reduces the ambiguity of RGB-only action grounding and supports transfer across tasks and embodiments with limited action supervision. Experiments show that PointAction achieves state-of-the-art 4D generation quality on robot scenes, outperforms existing baselines in simulation, and generalizes to two real robot arms unseen during pretraining.

2606.03784 2026-06-04 cs.RO 版本更新

Revisiting Embodied Chain-of-Thought for Generalizable Robot Manipulation

重新审视具身思维链以实现可泛化的机器人操作

Nan Sun, Yuan Zhang, Yongkun Yang, Wentao Zhao, Peiyan Li, Jun Guo, Wenxuan Song, Pengxiang Ding, Runze Suo, Yifei Su, Xin Xiao, Xinghang Li, Huaping Liu

发表机构 * Tsinghua University(清华大学) Xiaomi Robotics(小米机器人) Peking University(北京大学) CASIA HKUST(GZ)(香港科技大学(广州)) Zhejiang University(浙江大学) Fudan University(复旦大学) Wuhan University(武汉大学) Shanghai Innovation Institute(上海创新研究院)

AI总结 本文通过构建最大规模具身思维链语料库,提出ERVLA模型,利用思维链作为表征塑造监督而非测试时推理,显著提升了机器人操作的可泛化性。

详情
AI中文摘要

具身思维链旨在连接语言推理与机器人控制,但其有效形式和集成策略仍待探索。本文在大规模视觉-语言-动作模型上重新审视具身思维链。我们构建了迄今为止最大的具身思维链语料库,包含978,743条轨迹、226.3M样本和2592.5小时机器人数据。通过大量实验,我们发现有效的具身思维链应将高层语义理解具体化为可操作的动作指导,例如末端执行器运动描述和图像空间轨迹,而仅靠高层推理带来的收益有限。我们进一步表明,当显式思维链用作自回归动作前缀时,其扩展性不可靠,因为存在复合推理错误和不稳定的推理-动作耦合。为解决这些问题,我们提出ERVLA,一种将具身思维链用作表征塑造监督而非强制性测试时推理的VLA模型。ERVLA采用推理丢弃策略进行训练,使模型在训练期间吸收丰富的推理痕迹,同时在推理时无需思维链解码直接预测动作。这种设计提高了随预训练数据增加的扩展性,并避免了自回归不稳定性。ERVLA在LIBERO-Plus上达到86.9%的成功率,在VLABench上达到53.2%的成功率,展现出强大的分布外泛化能力。在真实机器人实验中,ERVLA进一步优于竞争性基线,尤其是在需要语义消歧和长时域执行的任务上。代码、数据和模型检查点将发布。

英文摘要

Embodied chain-of-thought (CoT) aims to bridge linguistic reasoning and robotic control, but its effective form and integration strategy remain underexplored. In this paper, we revisit embodied CoT for vision-language-action (VLA) models at large scale. We construct the largest embodied CoT corpus to date, comprising 978,743 trajectories, 226.3M samples, and 2592.5 hours of robot data. Through extensive experiments, we find that effective embodied CoT should ground high-level semantic understanding into concrete action guidance, such as end-effector movement descriptions and image-space trajectories, while high-level reasoning alone brings only marginal gains. We further show that explicit CoT does not scale reliably when used as an autoregressive action prefix, as it suffers from compounding inference errors and unstable reasoning-action coupling. To address these limitations, we propose ERVLA, a VLA model that uses embodied CoT as representation-shaping supervision rather than mandatory test-time reasoning. ERVLA is trained with a reasoning-dropout strategy, enabling the model to absorb rich reasoning traces during training while predicting actions directly without CoT decoding during inference. This design improves scalability with increasing pre-training data and avoids autoregressive instability. ERVLA achieves state-of-the-art performance on LIBERO-Plus with an 86.9% success rate and reaches 53.2% success rate on VLABench, demonstrating strong out-of-distribution generalization. In real-robot experiments, ERVLA further outperforms competitive state-of-the-art baselines, especially on tasks requiring semantic disambiguation and long-horizon execution.

2606.03598 2026-06-04 cs.RO cs.AI cs.CV 版本更新

PHASER: Phase-Aware and Semantic Experience Replay for Vision-Language-Action Models

PHASER: 面向视觉-语言-动作模型的相位感知与语义经验回放

Ziyang Chen, Shaoguang Wang, Weiyu Guo, Qianyi Cai, He Zhang, Pengteng Li, Yiren Zhao, Yandong Guo

发表机构 * Thrust of AI, HKUST(Guangzhou)(人工智能 thrust,香港科技大学(广州)) AI 2 Robotics, Shenzhen, China(人工智能与机器人,深圳,中国)

AI总结 提出PHASER框架,通过相位感知容量分配和多模态干扰路由策略,结合自动相位提取管线Auto-PC,解决VLA模型在持续学习中的灾难性遗忘问题,在LIBERO基准上平均成功率提升高达31%。

Comments 20 pages, 8 figures, 12 tables

详情
AI中文摘要

视觉-语言-动作(VLA)模型在语言条件机器人操作中取得了显著成功。然而,在开放环境中部署这些模型需要持续获取新技能,这一过程不可避免地会严重遗忘先前学习的行为。虽然经验回放(ER)是一种标准的缓解策略,但简单的均匀采样从根本上与操作轨迹的时间特征不一致。它系统性地欠采样短暂但因果关键的子技能,导致相位饥饿,并完全忽略了历史任务中不同程度的遗忘。为克服这些限制,我们提出PHASER,一种架构无关的持续学习框架。PHASER采用以相位为中心的容量分配,确保所有子技能获得平等的记忆支持,并结合多模态干扰路由策略,动态优先处理遗忘风险高的历史相位。此外,为实现完全自主的终身适应,我们集成了Auto-PC,一种轻量级管线,结合无监督动作信号变化点检测和基于VLM的语义验证,无需大量人工监督即可提取时间边界。在LIBERO持续学习套件上对三个VLA骨干网络的评估表明,PHASER取得了显著的实证改进,与匹配预算的ER相比,平均成功率(ASR)提升高达31%,并在LIBERO-Goal CL设置中达到87.8%的最终ASR。

英文摘要

Vision-Language-Action (VLA) models have achieved remarkable success in language-conditioned robotic manipulation. However, deploying these models in open-ended environments requires continuously acquiring novel skills, a process that inevitably triggers severe catastrophic forgetting of previously learned behaviors. While experience replay (ER) serves as a standard mitigating strategy, naive uniform sampling fundamentally misaligns with the temporal characteristics of manipulation trajectories. It systematically under-samples brief but causally critical sub-skills, leading to phase starvation, and completely overlooks the varying degrees of forgetting across historical tasks. To overcome these limitations, we introduce PHASER, an architecture-agnostic continual learning framework. PHASER employs a phase-centric capacity allocation to guarantee equal memory support for all sub-skills, coupled with a multi-modal interference routing strategy that dynamically prioritizes historical phases at high risk of forgetting. Furthermore, to enable fully autonomous lifelong adaptation, we integrate Auto-PC, a lightweight pipeline combining unsupervised action-signal change-point detection with VLM-based semantic verification to extract temporal boundaries without intensive manual supervision. Evaluated across three VLA backbones on LIBERO continual learning suites, PHASER yields substantial empirical improvements, increasing Average Success Rate (ASR) by up to 31% over matched-budget ER and achieving an 87.8% final ASR on the LIBERO-Goal CL setting.

2606.03441 2026-06-04 cs.RO cs.LG 版本更新

PerchRL: Vision-Based Agile Perching on Inclined Platforms under Rapid and Irregular Motion

PerchRL:基于视觉的快速不规则运动倾斜平台敏捷着陆

Zihong Lu, Zongzhuo Liu, Huaxu Li, Jinqiang Cui, Jie Mei, Youmin Gong, U Kei Cheang, Boyu Zhou

发表机构 * SUSTech(四川大学) HITSZ(哈尔滨工业大学) PCL(鹏城实验室) Differential Robotics(差分机器人实验室)

AI总结 提出PerchRL强化学习框架,通过两阶段学习策略(状态预训练+视觉微调)和混合学习框架(可见性感知状态增强+主动感知奖励),实现四旋翼在快速不规则运动倾斜平台上的自主视觉着陆。

详情
AI中文摘要

自主视觉引导的四旋翼在移动倾斜平台上的着陆对于空地协作至关重要,但由于有限的视场角(FOV)而具有挑战性。本文提出PerchRL,一种基于强化学习(RL)的框架,用于在快速和不规则运动下的倾斜平台上进行基于视觉的敏捷着陆。具体而言,我们采用两阶段学习策略,包括基于状态的预训练和基于视觉的微调。为了提高对不同平台运动的泛化能力,我们使用随机化的平台轨迹来防止过拟合,并采用时间增强方法从历史观测中捕捉潜在运动模式。在基于视觉的微调过程中,提出了一种混合学习框架,包括可见性感知状态增强和主动感知奖励,以提高在间歇性视觉丢失下的鲁棒性。大量的仿真和真实世界实验证明了PerchRL的可行性、稳定性和实时性能,而在不同四旋翼平台上的成功部署进一步验证了其适应性。源代码将发布以惠及社区。

英文摘要

Autonomous vision-based perching of quadrotors on moving inclined platforms is critical for air-ground collaboration but remains challenging due to the limited field of view (FOV). In this paper, we propose PerchRL, a reinforcement learning (RL) framework for vision-based agile perching on inclined platforms under rapid and irregular motion. Specifically, we employ a two-stage learning strategy consisting of state-based pre-training followed by vision-based fine-tuning. To improve generalization across diverse platform motions, we employ randomized platform trajectories to prevent overfitting and temporal augmentation methods to capture latent motion patterns from historical observations. During vision-based fine-tuning, a hybrid learning framework consisting of visibility-aware state augmentation and active perception rewards is presented to improve robustness under intermittent visual loss. Extensive simulation and real-world experiments demonstrate the feasibility, stability, and real-time performance of PerchRL, while successful deployment across distinct quadrotor platforms further validates its adaptability. The source code will be released to benefit the community.

2606.03175 2026-06-04 cs.CV cs.RO 版本更新

Ask When It Pays: Cost-Aware Open-Ended Interaction for Instance Goal Navigation

在值得时询问:面向实例目标导航的成本感知开放式交互

Xunyi Zhao, Sihao Lin, Gengze Zhou, Zerui Li, Shijie Li, Wei Tao, Jiajun Liu, Qi Wu

发表机构 * Adelaide University(阿德莱德大学) Responsible AI Research Centre, Australian Institute for Machine Learning(负责任人工智能研究中心,澳大利亚机器学习研究所) Institute for Infocomm Research (I2R), A*STAR(信息与通信研究院(I2R),A*STAR) iMotion CSIRO Data61 Project Website(CSIRO Data61项目网站)

AI总结 针对实例目标导航中语言歧义问题,提出一种成本敏感的不确定性减少方法,通过信息增益分析确定有效问题类型,并构建基准测试和加权成功率指标,实现零样本MLLM导航器仅在预期收益大于成本时查询。

详情
AI中文摘要

实例目标导航(IGN)要求具身智能体根据不明确的自然语言描述,在干扰物中找到特定对象实例。这种歧义通常无法仅通过感知和语言解决,因此与oracle的交互成为消歧的自然机制。先前的交互方法允许oracle查询,但将轻量级澄清和路径级指导同等对待,使得智能体通过重复的高信息量问题提高成功率,而非高效解决潜在歧义。我们将交互式IGN重新定义为成本敏感的不确定性减少问题,其中智能体应提出其答案相对于惩罚能最大程度减少导航不确定性的问题。为此,我们对现有导航语料库进行信息增益分析,以识别哪些线索能减少导航不确定性,从而得到一组紧凑的问题类型和数据驱动的成本。然而,现有的交互式导航基准并未建模不同问题类型的成本,也未评估智能体使用交互的效率,因此不适合研究成本敏感的交互。基于此分类,我们构建了一个用于诊断交互行为和效率的基准,以及一个加权成功率指标,该指标根据推导出的成本对每次查询进行惩罚。我们进一步提出了一种零样本MLLM导航器,仅在预期不确定性减少证明交互成本合理时,才在每个决策步骤有选择地进行查询。

英文摘要

Instance Goal Navigation (IGN) requires an embodied agent to find a specific object instance among distractors from an under-specified natural-language description. Such ambiguity often cannot be resolved from perception and language alone, making interaction with an oracle a natural mechanism for disambiguation. Prior interactive methods allow oracle queries but treat lightweight clarification and route-level guidance alike, letting agents boost success rate through repeated high-information questions rather than by resolving the underlying ambiguity efficiently. We recast interactive IGN as a cost-sensitive uncertainty-reduction problem, where the agent should ask the question whose answer provides the largest reduction in navigation uncertainty relative to its penalty. To this end, we apply an information-gain analysis on existing navigation corpora to identify which cues reduce navigation uncertainty, yielding a compact set of question types and data-derived weights. However, existing interactive navigation benchmarks do not model the cost of different question types or evaluate how efficiently agents use interaction, making them unsuitable for studying cost-sensitive interaction. Based on this taxonomy, we construct a benchmark for diagnosing interaction behavior and efficiency, together with a Weighted Success Rate metric that penalizes each query by its derived cost. We further propose a zero-shot MLLM navigator that selectively queries at each decision step only when the expected uncertainty reduction justifies the interaction cost.

2606.02636 2026-06-04 cs.RO cs.AI 版本更新

Too Much of a Good Thing: When sim2real Efforts Impede Policy Learning (And What to Do About It)

过犹不及:当 sim2real 努力阻碍策略学习(以及如何应对)

Kyle Morgenstein, Bharath Masetty, Stephen Welch, Luis Sentis

发表机构 * Apptronik University of Texas at Austin(得克萨斯大学奥斯汀分校)

AI总结 本文指出 sim2real 努力与策略学习之间存在激励错位,导致模拟器锁定和策略探索不足,并提出通过 sim2sim2real 范式仅以机器人运动学为设计约束的潜在解决方案。

详情
AI中文摘要

虽然 sim2real 努力对于有效将策略迁移到硬件上是必要的,但过犹不及。我们认为,sim2real 努力导致了与策略学习的激励错位,由于现实世界施加的不合理约束,导致模拟器锁定和策略探索不足。我们对当前问题状态进行了诊断和解释,并提出了一种潜在解决方案,即通过 sim2sim2real 范式,仅以机器人的运动学作为唯一设计约束。

英文摘要

While sim2real efforts are necessary for effective policy transfer to hardware, there is such a thing as too much of a good thing. We argue that sim2real efforts have led to misaligned incentives with policy learning, resulting in simulator lock in and poor policy exploration due to the unreasonable constraints imposed by the real world. We offer a diagnosis and explanation of the current status of the problem, and propose a potential solution via a sim2sim2real paradigm that leverages the robot's kinematics as the sole design constraint.

2605.21446 2026-06-04 cs.RO cs.AI 版本更新

Lost in Fog: Sensor Perturbations Expose Reasoning Fragility in Driving VLAs

迷失在雾中:传感器扰动暴露驾驶VLA的推理脆弱性

Abhinaw Priyadershi, Jelena Frtunikj

发表机构 * NVIDIA Corporation, USA(NVIDIA公司,美国) NVIDIA GmbH, Germany(NVIDIA德国公司)

AI总结 通过受控传感器扰动实验,发现因果链解释的一致性可作为轨迹可靠性的高保真指标,并证明启用因果链生成可提升轨迹精度。

详情
AI中文摘要

可解释的自主驾驶规划器不仅依赖于生成解释,还依赖于这些解释在真实传感器退化下的可靠性。本文对自主驾驶中视觉-语言-动作(VLA)模型的鲁棒性进行了受控扰动研究,评估了Alpamayo R1(10B参数)在八种传感器扰动(四种强度的高斯噪声、两种光照极端条件和两种雾浓度;约18,000次推理试验)下的1,996个场景。我们发现推理一致性是轨迹可靠性的高保真指标:当扰动后因果链(CoC)解释发生变化时,轨迹偏差激增5.3倍(21.8米 vs 4.1米),跨攻击类型的相关系数r=0.99,每样本点双列相关系数r_pb=0.53(Cohen's d=1.12)。受控消融实验表明,在匹配的推理设置下,启用CoC生成与轨迹精度提升相关(平均提升11.8%;p<0.0001)。在测试的噪声范围(σ∈{10,30,50,70})内,退化近似线性(R²=0.957),而标准输入预处理防御仅提供边际缓解。综上,这些结果将CoC一致性确立为规划安全的定量代理,并激励基于推理的运行时监控以实现更安全的VLA部署。

英文摘要

Interpretable autonomous driving planners depend not only on generating explanations, but also on those explanations remaining reliable under real-world sensor degradation. In this paper we present a controlled perturbation study of Vision-Language-Action (VLA) robustness in autonomous driving, evaluating Alpamayo R1 (10B parameters) across 1,996 scenarios under eight sensor perturbations (Gaussian noise at four intensities, two lighting extremes, and two fog levels; ${\sim}18{,}000$ inference trials). We find that reasoning consistency is a high-fidelity indicator of trajectory reliability: when Chain-of-Causation (CoC) explanations change after perturbation, trajectory deviation spikes $5.3{\times}$ (21.8m vs 4.1m), with $r\!=\!0.99$ across attack types and $r_{pb}\!=\!0.53$ per-sample (Cohen's $d\!=\!1.12$). A controlled ablation provides evidence that enabling CoC generation is associated with improved trajectory accuracy (11.8% on average across conditions; $p < 0.0001$) under matched inference settings. Over the tested noise range ($σ\in \{10, 30, 50, 70\}$), degradation is approximately linear ($R^2\!=\!0.957$), while standard input preprocessing defenses provide only marginal relief. Together, these results establish CoC consistency as a quantitative proxy for planning safety and motivate reasoning-based runtime monitoring for safer VLA deployment.

2605.19294 2026-06-04 cs.RO cs.AI 版本更新

DEFLECT: Temporal Counterfactual Preference Learning for Delay-Robust Asynchronous VLAs

DEFLECT: 面向延迟鲁棒异步VLA的时间反事实偏好学习

Yixiang Zhu, Yonghao Chen, Zijie Yang, Yusong Hu, Xinyu Chen

发表机构 * The Hong Kong University of Science and Technology (Guangzhou)(香港科学与技术大学(广州)) One Robotics

AI总结 针对异步视觉-语言-动作(VLA)策略中陈旧观测导致的预测-执行不匹配问题,提出离线后训练框架DEFLECT,通过反事实偏好监督学习偏好与执行时间对齐的动作,无需人工标注、在线部署或架构修改,显著提升高延迟下的任务成功率。

详情
AI中文摘要

视觉-语言-动作(VLA)策略越来越依赖异步推理,将大模型延迟隐藏在持续的机器人运动背后。虽然这避免了同步动作块执行的“走走停停”行为,但产生了预测-执行不匹配:下一个动作块是根据推理开始时的陈旧观测计算得出的,但仅在机器人和场景发生变化后才执行。因此,适合预测时状态的动作可能与执行时状态不对齐。现有的运行时修复、行为克隆和偏好对齐方法并未直接教导策略解决这种陈旧输入不匹配问题。我们提出DEFLECT,一个面向延迟鲁棒异步VLA的离线后训练框架。DEFLECT将延迟引起的不匹配转化为反事实偏好监督:冻结的参考VLA从未来的执行时间观测生成偏好块,并从陈旧的预测时间观测生成拒绝块。可训练策略在相同的部署时间输入下对两个块进行评分,学习偏好与执行时间对齐的动作,同时监督微调锚点保留专家动作流形。DEFLECT不需要人工偏好标签、奖励模型、在线机器人部署、架构更改或额外的推理时间计算。在Kinetix、LIBERO和三个真实机器人任务上,DEFLECT相比强异步VLA基线提高了延迟鲁棒性,在高延迟下成功率提升高达6.4个百分点,并在真实规模VLA的最长延迟下实现4.6个百分点的增益。

英文摘要

Vision-Language-Action (VLA) policies increasingly rely on asynchronous inference to hide large-model latency behind ongoing robot motion. While this avoids the stop-and-go behavior of synchronous action-chunk execution, it creates a prediction-execution mismatch: the next chunk is computed from a stale observation at inference start but executed only after the robot and scene have evolved. As a result, actions that fit the prediction-time state can become misaligned with the execution-time state. Existing runtime repair, behavior-cloning, and preference-alignment approaches do not directly teach the policy to resolve this stale-input mismatch. We propose DEFLECT, an offline post-training framework for delay-robust asynchronous VLAs. DEFLECT converts latency-induced mismatch into counterfactual preference supervision: a frozen reference VLA generates a preferred chunk from the future execution-time observation and a rejected chunk from the stale prediction-time observation. The trainable policy scores both chunks under the same deployment-time input, learning to favor execution-time-aligned actions while a supervised fine-tuning anchor preserves the expert action manifold. DEFLECT requires no human preference labels, reward models, online robot rollouts, architectural changes, or additional inference-time computation. Across Kinetix, LIBERO, and three real-robot tasks, DEFLECT improves delay robustness over strong asynchronous VLA baselines, raising high-latency success by up to 6.4 percentage points and achieving a 4.6 percentage-point gain at the longest delay on a real-scale VLA.

2605.15949 2026-06-04 cs.RO 版本更新

A Reproducible and Physically Feasible Dynamic Parameter Identification Framework for a Low-Cost Robot Arm

低成本机器人臂的可重复且物理可行的动力学参数辨识框架

Junji Oaki, Koki Yamane, Koki Inami, Sho Sakaino

发表机构 * Institute of Systems and Information Engineering, University of Tsukuba(系统与信息工程研究所,茨川大学)

AI总结 针对低成本机器人臂CRANE-X7,提出一种结合最小二乘、半定规划投影和闭环输入误差精化的可重复且物理可行的动力学参数辨识方法,并通过主成分分析和惯性矩阵正定性审核确保模型统计一致性与物理可行性。

Comments 11 pages, 8 figures, 7 tables, 1 algorithm and 2 appendices

详情
AI中文摘要

本文针对由模块化智能驱动器驱动的低成本机器人臂CRANE-X7,提出了一种可重复且物理可行的动力学参数辨识框架。为提高实际可辨识性,根据近似连杆对称性移除惯性积,将刚体模型从65个基础参数减少至39个。辨识运动是在实际关节限位下,由结构化的单关节和相邻关节基元手工设计而成。所提出的流程结合了预处理、基于逆动力学回归的普通最小二乘(OLS)、用于可行性恢复的条件半定规划(SDP)投影以及闭环输入误差(CLIE)精化。在共同的主成分分析(PCA)空间中分析来自40个结构化轨迹的候选解,以选择一个统计上中心的代表性模型。由于统计中心性本身不能保证物理可接受性,最终选定的模型需通过所有位姿下的惯性矩阵正定性审核,并在必要时通过局部化的后CLIE SDP救援步骤进行修正。实验表明,参数云从OLS到SDP再到CLIE逐渐变得更加集中,而最终接受的模型在保留的验证运动上保持了高预测精度。这些结果为低成本机器人平台获得统计一致且物理可行的动力学模型提供了一条实用途径。

英文摘要

This paper presents a reproducible and physically feasible dynamic parameter identification framework for CRANE-X7, a low-cost robot arm driven by modular smart actuators. To improve practical identifiability, products of inertia are removed according to approximate link symmetry, reducing the rigid-body model from 65 to 39 base parameters. Identification motions are hand-designed from structured single-joint and adjacent-joint primitives under practical joint-range limits. The proposed pipeline combines preprocessing, inverse-dynamics-regressor-based ordinary least squares (OLS), conditional semidefinite-programming (SDP) projection for feasibility recovery, and closed-loop input error (CLIE) refinement. Candidate solutions from 40 structured trajectories are analyzed in a common principal component analysis (PCA) space to select a statistically central representative model. Because statistical centrality alone does not ensure physical acceptability, the selected model is finally screened by an all-pose positive-definiteness audit of the inertia matrix and, when necessary, corrected by a localized post-CLIE SDP rescue step. Experiments show that the parameter cloud becomes progressively more concentrated from OLS to SDP and CLIE, while the final accepted model preserves high predictive accuracy on held-out validation motions. These results demonstrate a practical route to statistically coherent and physically feasible dynamic models for low-cost robot platforms.

1902.10607 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Necessary and Sufficient Conditions for Passivity of Velocity-Sourced Impedance Control of Series Elastic Actuators

速度源阻抗控制系列弹性驱动器被动性的必要和充分条件

Fatih Emre Tosun, Volkan Patoglu

发表机构 * Faculty of Engineering and Natural Sciences, Sabancı University(工程与自然科学学院,萨班奇大学)

AI总结 本文研究了系列弹性驱动器速度源阻抗控制架构的被动性条件,提出了非保守的设计指南以实现null阻抗和纯弹簧的触觉显示,并强调了在积分控制器中包含物理阻尼的重要性。

Comments Submitted to IEEE T-RO, 12 pages, 10 figures, 7 tables

详情
AI中文摘要

系列弹性驱动(SEA)因其在物理人机交互应用中的稳定性鲁棒性和力控制精度而变得普遍。已提出几种SEA的阻抗控制架构。其中,具有内层速度环、中层扭矩环和外层阻抗环的级联控制器因其简单性、鲁棒性和性能而特别受欢迎。本文推导了确保该级联控制器架构在渲染两种最常见的虚拟阻抗模型时被动性的必要和充分条件。基于新建立的被动性条件,我们提供了非保守的设计指南,以触觉显示null阻抗和纯弹簧,同时确保交互的被动性。我们还展示了在推导被动性条件时,当使用积分控制器时,包含物理阻尼的重要性。特别是,我们展示了物理阻尼对系统被动性的影响。

英文摘要

Series Elastic Actuation (SEA) has become prevalent in applications involving physical human-robot interaction as it provides considerable advantages over traditional stiff actuators in terms of stability robustness and fidelity of force control. Several impedance control architectures have been proposed for SEA. Among these alternatives, the cascaded controller with an inner-most velocity loop, an intermediate torque loop and an outer-most impedance loop is particularly favoured for its simplicity, robustness, and performance. In this paper, we derive the \emph{necessary and sufficient conditions} to ensure the passivity of this cascade-controller architecture for rendering two most common virtual impedance models. Based on the newly established passivity conditions, we provide non-conservative design guidelines to haptically display a null impedance and a pure spring while ensuring the passivity of interaction. We also demonstrate the importance of including physical damping in the actuator model during derivation of passivity conditions, when integral controllers are utilized. In particular, we show the adversary effect of physical damping on system passivity.

2304.10891 2026-06-04 cs.LG cs.AI cs.CV cs.RO cs.SY eess.SY 版本更新

Transformer-Based Autonomous Driving Models and Deployment-Oriented Compression: A Survey

基于Transformer的自动驾驶模型与面向部署的压缩:综述

Juan Zhong, Yuhang Shi, Zukang Xu, Xi Chen

发表机构 * Renmin University of China(中国人民大学) Artificial Intelligence Innovation and Incubation Institute, Fudan University(复旦大学人工智能创新与孵化院) Shanghai Academy of AI for Science(上海人工智能科学研究院) Department of houmo.ai(houmo.ai部门)

AI总结 本文综述了基于Transformer的自动驾驶模型,并从部署角度分析了压缩与加速策略(如量化、剪枝、知识蒸馏等)如何影响模型设计、部署性、鲁棒性和安全性。

详情
AI中文摘要

基于Transformer的模型正成为自动驾驶的核心范式,因为它们能够捕捉感知、预测和规划中的长程空间依赖、多智能体交互和多模态上下文。然而,它们在真实车辆中的部署仍然困难,因为高容量注意力架构带来了显著的延迟、内存和能量开销。本综述回顾了具有代表性的基于Transformer的自动驾驶模型,并按任务角色、感知配置和架构设计进行组织。更重要的是,我们从面向部署的角度审视这些模型,分析效率约束如何在实际中重塑模型设计选择。我们进一步回顾了与基于Transformer的驾驶系统相关的压缩和加速策略,包括量化、剪枝、知识蒸馏、低秩近似和高效注意力,并讨论了它们的优势、局限性和任务依赖性。我们不将压缩视为孤立的后期处理步骤,而是强调其作为直接影响部署性、鲁棒性和安全性的系统级设计考虑。最后,我们指出了面向标准化、安全感知和硬件感知的高效自动驾驶系统评估的开放挑战和未来研究方向。

英文摘要

Transformer-based models are becoming a central paradigm in autonomous driving because they can capture long-range spatial dependencies, multi-agent interactions, and multimodal context across perception, prediction, and planning. At the same time, their deployment in real vehicles remains difficult because high-capacity attention-based architectures impose substantial latency, memory, and energy overhead. This survey reviews representative Transformer-based autonomous driving models and organizes them by task role, sensing configuration, and architectural design. More importantly, it examines these models from a deployment-oriented perspective and analyzes how efficiency constraints reshape model design choices in practice. We further review compression and acceleration strategies relevant to Transformer-based driving systems, including quantization, pruning, knowledge distillation, low-rank approximation, and efficient attention, and discuss their benefits, limitations, and task-dependent applicability. Rather than treating compression as an isolated post-processing step, we highlight it as a system-level design consideration that directly affects deployability, robustness, and safety. Finally, we identify open challenges and future research directions toward standardized, safety-aware, and hardware-conscious evaluation of efficient autonomous driving systems.

2605.00416 2026-06-04 cs.RO 版本更新

Learning While Deploying: Fleet-Scale Reinforcement Learning for Generalist Robot Policies

在部署中学习:面向通用机器人策略的机群规模强化学习

Yi Wang, Xinchen Li, Pengwei Xie, Pu Yang, Buqing Nie, Yunuo Cai, Qinglin Zhang, Chendi Qu, Jeffrey Wu, Jianheng Song, Xinlin Ren, Jingshun Huang, Mingjie Pan, Siyuan Feng, Zhi Chen, Jianlan Luo

发表机构 * Shanghai Innovation Institute(上海创新研究院) AGIBOT Finch Columbia University(哥伦比亚大学)

AI总结 提出LWD框架,通过机群规模的离线到在线强化学习,结合分布式隐式价值学习与伴随匹配Q学习,持续后训练通用视觉-语言-动作策略,在16台双臂机器人上实现95%平均成功率。

Comments No

详情
AI中文摘要

通用机器人策略日益受益于大规模预训练,但仅靠离线数据不足以实现稳健的实世界部署。已部署的机器人会遇到分布偏移、长尾故障、任务变化以及人类纠正机会,这些是固定演示数据集无法完全捕获的。我们提出了“在部署中学习”(LWD),一个机群规模的离线到在线强化学习框架,用于通用视觉-语言-动作(VLA)策略的持续后训练。从预训练的VLA策略开始,LWD通过使用在机器人机群中收集的自主 rollout 和人类干预,在部署、共享物理经验、策略改进和重新部署之间形成闭环。为了稳定地从异构、稀疏奖励的机群数据中学习,LWD结合了分布式隐式价值学习(DIVL)进行鲁棒的价值估计,以及通过伴随匹配的Q学习(QAM)在基于流的VLA动作生成器中进行策略提取。我们在一个由16台双臂机器人组成的机群上,在八个真实世界操作任务(包括语义杂货补货和3-5分钟的长时域任务)上验证了LWD。单个通用策略随着机群经验的积累而改进,平均成功率达到95%,在长时域任务上提升最大。

英文摘要

Generalist robot policies increasingly benefit from large-scale pretraining, but offline data alone is insufficient for robust real-world deployment. Deployed robots encounter distribution shifts, long-tail failures, task variations, and human correction opportunities that fixed demonstration datasets cannot fully capture. We present Learning While Deploying (LWD), a fleet-scale offline-to-online reinforcement learning framework for continual post-training of generalist Vision-Language-Action (VLA) policies. Starting from a pretrained VLA policy, LWD closes the loop between deployment, shared physical experience, policy improvement, and redeployment by using autonomous rollouts and human interventions collected across a robot fleet. To stabilize learning from heterogeneous, sparse-reward fleet data, LWD combines Distributional Implicit Value Learning (DIVL) for robust value estimation with Q-learning via Adjoint Matching (QAM) for policy extraction in flow-based VLA action generators. We validate LWD on a fleet of 16 dual-arm robots across eight real-world manipulation tasks, including semantic grocery restocking and 3--5 minute long-horizon tasks. A single generalist policy improves as fleet experience accumulates, reaching an average success rate of 95%, with the largest gains on long-horizon tasks.

2604.25050 2026-06-04 cs.RO 版本更新

DiscreteRTC: Discrete Diffusion Policies are Natural Asynchronous Executors

DiscreteRTC:离散扩散策略是自然的异步执行器

Pengcheng Wang, Kaiwen Hong, Chensheng Peng, Katherine Driggs-Campbell, Masayoshi Tomizuka, Chenfeng Xu, Chen Tang

发表机构 * UC Berkeley(加州大学伯克利分校) UIUC(伊利诺伊大学香槟分校) UT Austin(德克萨斯大学奥斯汀分校) UCLA(加州大学洛杉矶分校)

AI总结 针对同步执行器在动态任务中的致命停顿问题,提出DiscreteRTC方法,利用离散扩散策略的原生修复能力实现异步执行,在动态模拟和真实操作任务中取得更高成功率。

详情
AI中文摘要

与聊天机器人不同,物理AI必须在世界不断变化的同时行动。因此,无论推理速度有多快,同步执行器的块间停顿对于动态任务都是致命的。异步执行——边行动边思考——因此是一个结构性要求,而实时分块(RTC)通过将块转换重新定义为修复(冻结已承诺的动作并一致地生成剩余部分)使其可行。然而,基于流匹配策略的RTC在结构上并非最优:其修复来自推理时的修正而非基础策略,导致几乎没有预训练收益、需要特定微调、启发式指导以及增加延迟的额外计算。在这项工作中,我们观察到离散扩散策略通过迭代去掩码生成动作,是自然的异步执行器,一次性解决了所有限制:由于修复是其原生操作,因此无需微调,而提前停止进一步提供了自适应指导并降低了推理成本。我们提出了DiscreteRTC,用原生去掩码替代外部修正,并在动态模拟基准和真实世界动态操作任务上展示了其比连续RTC和其他基线更高的成功率。总之,DiscreteRTC实现更简单,无需额外代码即可启用异步修复;推理更快,仅需从头生成动作约0.7倍的计算量;执行更好,在真实世界曲棍球防守任务中,成功率比流匹配RTC高65%,比训练时流匹配RTC高30%。更多可视化见https://outsider86.github.io/DiscreteRTCSite/。

英文摘要

Unlike chatbots, physical AI must act while the world keeps evolving. Therefore, the inter-chunk pause of synchronous executors are fatal for dynamic tasks regardless of how fast the inference is. Asynchronous execution -- thinking while acting -- is therefore a structural requirement, and real-time chunking (RTC) makes it viable by recasting chunk transitions as inpainting: freezing committed actions and consistently generating the remainder. However, RTC with flow-matching policy is structurally suboptimal: its inpainting comes from inference-time corrections rather than the base policy, yielding little pre-training benefit, specific fine-tuning, heuristic guidance, and extra computation that inflates the latency. In this work, we observe that discrete diffusion policies, which generate actions by iteratively unmasking, are natural asynchronous executors that resolve all limitations at once: they are fine-tuning free since inpainting is their native operation, while early stopping further provides adaptive guidance and reduces inference cost. We propose DiscreteRTC, which replaces external corrections with native unmasking, and show on dynamic simulated benchmarks and real-world dynamic manipulation tasks that it achieves higher success rates than continuous RTC and other baselines. In summary, DiscreteRTC is simpler to implement with 0 lines of additional code to enable async inpainting, faster at inference with only ~0.7 computation compared with generating actions from scratch, and better at execution with 65% higher success rate in real-world hockey defend task compared with flow-matching RTC, and 30% higher compared with training-time flow-matching RTC. More visualizations are on https://outsider86.github.io/DiscreteRTCSite/.

2604.12645 2026-06-04 cs.RO cs.AI 版本更新

Contextual Multi-Task Reinforcement Learning for Autonomous Reef Monitoring

上下文多任务强化学习用于自主珊瑚礁监测

Melvin Laux, Yi-Ling Liu, Rina Alo, Sören Töpper, Mariela De Lucas Alvarez, Frank Kirchner, Rebecca Adam

发表机构 * University of Bremen(不莱梅大学)

AI总结 针对水下动力学不确定性和任务变化,提出上下文多任务强化学习框架,学习可复用的控制策略,在模拟环境中实现高效训练、零样本泛化和鲁棒性。

Comments To be published in IEEE OCEANS 2026 (Sanya) conference proceedings

详情
AI中文摘要

尽管自主水下航行器有望实现海洋生态系统监测,但其部署从根本上受限于在高度不确定和非平稳的水下动力学下控制航行器的难度。为了解决这些挑战,我们采用数据驱动的强化学习方法来补偿未知动力学和任务变化。传统的单任务强化学习容易过拟合训练环境,从而限制了所学策略的长期实用性。因此,我们提出使用上下文多任务强化学习范式,允许我们学习可复用于各种任务的控制器,例如在一个珊瑚礁中检测牡蛎,在另一个珊瑚礁中检测珊瑚。我们评估上下文多任务强化学习是否能有效学习自主水下珊瑚礁监测的鲁棒且可泛化的控制策略。我们在HoloOcean中的模拟珊瑚礁环境中训练了一个单一上下文相关策略,该策略能够解决多个相关的监测任务。在我们的实验中,我们经验性地评估了上下文策略在样本效率、对未见任务的零样本泛化以及对变化水流的鲁棒性方面的表现。通过利用多任务强化学习,我们旨在提高训练效率以及所学策略的可重用性,从而向更可持续的自主珊瑚礁监测程序迈进一步。

英文摘要

Although autonomous underwater vehicles promise the capability of marine ecosystem monitoring, their deployment is fundamentally limited by the difficulty of controlling vehicles under highly uncertain and non-stationary underwater dynamics. To address these challenges, we employ a data-driven reinforcement learning approach to compensate for unknown dynamics and task variations. Traditional single-task reinforcement learning has a tendency to overfit the training environment, thus, limit the long-term usefulness of the learnt policy. Hence, we propose to use a contextual multi-task reinforcement learning paradigm instead, allowing us to learn controllers that can be reused for various tasks, e.g., detecting oysters in one reef and detecting corals in another. We evaluate whether contextual multi-task reinforcement learning can efficiently learn robust and generalisable control policies for autonomous underwater reef monitoring. We train a single context-dependent policy that is able to solve multiple related monitoring tasks in a simulated reef environment in HoloOcean. In our experiments, we empirically evaluate the contextual policies regarding sample-efficiency, zero-shot generalisation to unseen tasks, and robustness to varying water currents. By utilising multi-task reinforcement learning, we aim to improve the training effectiveness, as well as the reusability of learnt policies to take a step towards more sustainable procedures in autonomous reef monitoring.

2604.04974 2026-06-04 cs.RO 版本更新

From Video to Control: A Survey of Learning Manipulation Interfaces from Temporal Visual Data

从视频到控制:基于时间视觉数据学习操作接口的综述

Linfang Zheng, Zikai Ouyang, Chen Wang, Jia Pan, Wei Zhang

发表机构 * School of Automation and Intelligent Manufacturing (AiM)(自动化与智能制造学院) Guangdong Provincial Key Laboratory of Fully Actuated System Control Theory and Technology(广东省全驱动系统控制理论与技术省级重点实验室) Southern University of Science and Technology(南方科技大学) The University of Hong Kong(香港大学) Peng Cheng Laboratory(鹏城实验室) LimX Dynamics

AI总结 本文综述了利用无动作标注的时间视频数据学习机器人操作控制接口的方法,提出以接口为中心的三种分类:直接视频-动作策略、潜在动作方法和显式视觉接口,并分析了控制集成特性与未来研究方向。

详情
AI中文摘要

视频是物理动力学的可扩展观测:它捕捉物体如何移动、接触如何展开以及场景在交互中如何演变——所有这些都不需要机器人动作标签。然而,将这种时间结构转化为可靠的机器人控制仍然是一个开放的挑战,因为视频缺乏动作监督,并且在具身、视角和物理约束方面与机器人经验不同。本综述回顾了利用无动作标注的时间视频来学习机器人操作控制接口的方法。我们引入了一种以接口为中心的分类法,按照视频到控制接口的构建位置及其启用的控制属性进行组织,识别出三个家族:直接视频-动作策略(保持接口隐式)、潜在动作方法(通过紧凑的学习中间体路由时间结构)以及显式视觉接口(预测下游控制的可解释目标)。对于每个家族,我们分析了控制集成特性——如何闭合回路、执行前可以验证什么以及失败在何处进入。跨家族的综合分析表明,最紧迫的开放挑战集中在机器人集成层——将视频衍生的预测连接到可靠机器人行为的机制——我们概述了弥合这一差距的研究方向。

英文摘要

Video is a scalable observation of physical dynamics: it captures how objects move, how contact unfolds, and how scenes evolve under interaction -- all without requiring robot action labels. Yet translating this temporal structure into reliable robotic control remains an open challenge, because video lacks action supervision and differs from robot experience in embodiment, viewpoint, and physical constraints. This survey reviews methods that exploit non-action-annotated temporal video to learn control interfaces for robotic manipulation. We introduce an interface-centric taxonomy organized by where the video-to-control interface is constructed and what control properties it enables, identifying three families: direct video-action policies, which keep the interface implicit; latent-action methods, which route temporal structure through a compact learned intermediate; and explicit visual interfaces, which predict interpretable targets for downstream control. For each family, we analyze control-integration properties -- how the loop is closed, what can be verified before execution, and where failures enter. A cross-family synthesis reveals that the most pressing open challenges center on the robotics integration layer -- the mechanisms that connect video-derived predictions to dependable robot behavior -- and we outline research directions toward closing this gap.

2602.23312 2026-06-04 cs.HC cs.AI cs.LG cs.RO cs.SY eess.SY 版本更新

Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction

评估小语言模型在领导者-跟随者交互中的零样本和单样本适应

Rafael R. Baptista, André de Lima Salgado, Ricardo V. Godoy, Marcelo Becker, Thiago Boaventura, Gustavo J. G. Lahr

发表机构 * University of Sao Paulo(圣保罗大学) Federal University of Lavras(拉瓦尔联邦大学) Faculdade Israelita de Ensino e Pesquisa Albert Einstein(亚伯拉罕·林克·埃instein教育与研究学院)

AI总结 本文通过微调小语言模型(Qwen2.5-0.5B)在领导者-跟随者交互中实现角色分类,零样本微调达到86.66%准确率且延迟低至22.2毫秒,但单样本模式因上下文长度增加导致性能下降。

详情
AI中文摘要

领导者-跟随者交互是人机交互(HRI)中的一个重要范式。然而,对于资源受限的移动和辅助机器人来说,实时分配角色仍然具有挑战性。虽然大型语言模型(LLMs)在自然通信方面显示出潜力,但其规模和延迟限制了设备端部署。小语言模型(SLMs)提供了一种潜在的替代方案,但它们在HRI中角色分类的有效性尚未得到系统评估。在本文中,我们提出了一个用于领导者-跟随者通信的SLMs基准测试,引入了一个源自已发表数据库的新数据集,并增加了合成样本以捕捉交互特定的动态。我们研究了两种适应策略:提示工程和微调,在零样本和单样本交互模式下进行研究,并与未训练的基线进行比较。使用Qwen2.5-0.5B的实验表明,零样本微调实现了稳健的分类性能(86.66%准确率),同时保持低延迟(每个样本22.2毫秒),显著优于基线和提示工程方法。然而,结果也表明在单样本模式下性能下降,其中增加的上下文长度挑战了模型的架构能力。这些发现表明,微调的SLMs为直接角色分配提供了有效的解决方案,同时突出了边缘端对话复杂性与分类可靠性之间的关键权衡。

英文摘要

Leader-follower interaction is an important paradigm in human-robot interaction (HRI). Yet, assigning roles in real time remains challenging for resource-constrained mobile and assistive robots. While large language models (LLMs) have shown promise for natural communication, their size and latency limit on-device deployment. Small language models (SLMs) offer a potential alternative, but their effectiveness for role classification in HRI has not been systematically evaluated. In this paper, we present a benchmark of SLMs for leader-follower communication, introducing a novel dataset derived from a published database and augmented with synthetic samples to capture interaction-specific dynamics. We investigate two adaptation strategies: prompt engineering and fine-tuning, studied under zero-shot and one-shot interaction modes, compared with an untrained baseline. Experiments with Qwen2.5-0.5B reveal that zero-shot fine-tuning achieves robust classification performance (86.66% accuracy) while maintaining low latency (22.2 ms per sample), significantly outperforming baseline and prompt-engineered approaches. However, results also indicate a performance degradation in one-shot modes, where increased context length challenges the model's architectural capacity. These findings demonstrate that fine-tuned SLMs provide an effective solution for direct role assignment, while highlighting critical trade-offs between dialogue complexity and classification reliability on the edge.

2603.09170 2026-06-04 cs.RO cs.AI 版本更新

ZeroWBC: Learning Natural Whole-Body Humanoid Interaction from Human Egocentric Data

ZeroWBC: 从人类自我中心数据学习自然全身人形交互

Haoran Yang, Jiacheng Bao, Yucheng Xin, Haoming Song, Yuyang Tian, Bin Zhao, Dong Wang, Xuelong Li

发表机构 * University of Science and Technology of China(中国科学技术大学) Shanghai AI Laboratory(上海人工智能实验室) Northwestern Polytechnical University(西北工业大学) Tsinghua University(清华大学) Shanghai Jiao Tong University(上海交通大学) TeleAI, China Telecom(TeleAI,中国电信)

AI总结 提出ZeroWBC框架,利用人类自我中心视频和同步全身运动数据,通过生成-跟踪方法实现无遥操作的人形机器人全身交互控制。

详情
AI中文摘要

由于全身遥操作数据成本高昂,实现多功能且自然的全身人形交互控制仍然具有挑战性。我们提出ZeroWBC,一种无需遥操作的框架,从人类自我中心视频以及同步的全身运动和文本注释中学习人形全身交互。ZeroWBC采用生成-跟踪公式来解决静态场景全身交互控制问题。给定初始自我中心图像和语言指令,微调的视觉-语言模型生成未来人类全身运动标记,这些标记被解码为连续运动并重定向到人形机器人。得到的参考运动,连同根部和关键身体部位轨迹,然后由通用交互运动跟踪策略执行。为了提高交互性能,我们引入了一种面向交互的跟踪奖励,该奖励优先考虑全局根部和关键身体部位轨迹对齐,同时保持自然的全身运动。在Unitree G1人形机器人上的实验表明,ZeroWBC无需机器人遥操作演示即可实现多样化的场景感知行为。这些结果表明了一种从人类自我中心数据学习自然人形全身交互的可扩展范式。

英文摘要

Achieving versatile and natural whole-body humanoid interaction control remains challenging due to the high cost of whole-body teleoperation data. We present ZeroWBC, a teleoperation-free framework that learns humanoid whole-body interaction from human egocentric videos paired with synchronized whole-body motion and text annotations. ZeroWBC adopts a generation-then-tracking formulation to tackle the static scene whole-body interaction control problem. Given an initial egocentric image and a language instruction, a fine-tuned Vision-Language Model generates future human whole-body motion tokens, which are decoded into continuous motions and retargeted to the humanoid. The resulting reference motions, together with root and key body-part trajectories, are then executed by a general interactive motion tracking policy. To improve interaction performance, we introduce an interaction-oriented tracking reward that prioritizes global root and key body-part trajectory alignment while preserving natural whole-body motion. Experiments on the Unitree G1 humanoid robot show that ZeroWBC enables diverse scene-aware behaviors without robot teleoperation demonstrations. These results suggest a scalable paradigm for learning natural humanoid whole-body interaction from human egocentric data.

2510.27191 2026-06-04 cs.RO cs.AI 版本更新

Vectorized Online POMDP Planning

向量化在线POMDP规划

Marcus Hoerger, Muhammad Sudrajat, Hanna Kurniawati

发表机构 * School of Computing, Australian National University(澳大利亚国立大学计算机学院)

AI总结 提出向量化在线POMDP规划器(VOPP),通过全向量化计算消除并行瓶颈,实现大规模并行在线规划,计算效率比现有最先进并行求解器高至少20倍。

Comments 8 pages, 3 figures. Accepted at ICRA 2026

详情
AI中文摘要

部分可观测下的规划是自主机器人的一项基本能力。部分可观测马尔可夫决策过程(POMDP)为部分可观测问题下的规划提供了强大的框架,捕捉了动作的随机效应以及通过噪声观测获得的有限信息。POMDP求解可以极大受益于当今硬件上的大规模并行化,但并行化POMDP求解器一直具有挑战性。大多数求解器依赖于将动作上的数值优化与其价值估计交错进行,这会在并行进程之间产生依赖关系和同步瓶颈,从而抵消并行化的好处。在本文中,我们提出了向量化在线POMDP规划器(VOPP),一种新颖的并行在线求解器,它利用了最近的POMDP公式,该公式解析地解决了优化组件的一部分,将数值计算仅保留为期望的估计。VOPP将所有与规划相关的数据结构表示为张量集合,并将所有规划步骤实现为该表示上的全向量化计算。结果是一个大规模并行的在线求解器,并发进程之间没有依赖关系或同步瓶颈。实验结果表明,与现有的最先进并行在线求解器相比,VOPP在计算近最优解方面的效率至少高出20倍。此外,VOPP优于最先进的顺序在线求解器,同时使用的规划预算小1000倍。

英文摘要

Planning under partial observability is an essential capability of autonomous robots. The Partially Observable Markov Decision Process (POMDP) provides a powerful framework for planning under partial observability problems, capturing the stochastic effects of actions and the limited information available through noisy observations. POMDP solving could benefit tremendously from massive parallelization on today's hardware, but parallelizing POMDP solvers has been challenging. Most solvers rely on interleaving numerical optimization over actions with the estimation of their values, which creates dependencies and synchronization bottlenecks between parallel processes that can offset the benefits of parallelization. In this paper, we propose Vectorized Online POMDP Planner (VOPP), a novel parallel online solver that leverages a recent POMDP formulation which analytically solves part of the optimization component, leaving numerical computations to consist of only estimation of expectations. VOPP represents all data structures related to planning as a collection of tensors, and implements all planning steps as fully vectorized computations over this representation. The result is a massively parallel online solver with no dependencies or synchronization bottlenecks between concurrent processes. Experimental results indicate that VOPP is at least $20\times$ more efficient in computing near-optimal solutions compared to an existing state-of-the-art parallel online solver. Moreover, VOPP outperforms state-of-the-art sequential online solvers, while using a planning budget that is $1000\times$ smaller.

2603.08485 2026-06-04 cs.RO 版本更新

3PoinTr: 3D Point Tracks for Learning Manipulation from Unconstrained Human Videos

3PoinTr: 从无约束人类视频中通过3D点轨迹学习操作

Adam Hung, Bardienus Pieter Duisterhof, Jeffrey Ichnowski

发表机构 * Carnegie Mellon University(卡内基梅隆大学)

AI总结 提出3PoinTr方法,通过预测密集3D点轨迹从无约束人类视频中预训练样本高效的机器人操作策略,在真实和模拟任务中显著优于基线方法。

详情
AI中文摘要

从人类视频中学习操作策略可以大大减少对昂贵的机器人演示的需求,但现有方法通常需要限制性假设,如编排的人类动作、预定义关键点、手动注释或已知的抓取位置。我们提出3PoinTr,一种通过预测密集3D点轨迹从无约束人类视频中预训练样本高效机器人策略的方法。在无约束的人类演示视频中,人类可以自由地遵循他们认为合适的任何轨迹和操作策略,而不是编排他们的动作来模仿机器人。3PoinTr使用轻量级的可见性感知Transformer从人类视频中学习场景点应如何移动,然后训练一个闭环多任务机器人策略,以从这些预测的点轨迹中灵活提取与动作相关的先验知识。仅使用20个带动作标签的机器人演示,3PoinTr在真实世界任务上的平均成功率比最强的行为克隆和视频预训练基线高出25.0个百分点,在模拟中平均成功率高出29.6个百分点。针对性的消融实验支持关键设计选择,并证实了从无动作视频中学习的好处。我们进一步表明,3PoinTr的点轨迹预测Transformer通过保留对部分遮挡点的监督,优于强基线。项目页面:https://adamhung60.github.io/3PoinTr/。

英文摘要

Learning manipulation policies from human videos could greatly reduce the need for expensive robot demonstrations, but existing approaches typically require restrictive assumptions such as choreographed human motions, predefined keypoints, manual annotations, or known grasp locations. We propose 3PoinTr, a method for pretraining sample-efficient robot policies from unconstrained human videos by predicting dense 3D point tracks. In the unconstrained human demonstration videos, humans are free to follow whatever trajectories and manipulation strategies they see fit, rather than choreographing their motions to mimic a robot. 3PoinTr uses a lightweight visibility-aware transformer to learn how scene points should move from human videos, and then trains a closed-loop multitask robot policy to flexibly extract action-relevant priors from those predicted point tracks. With only 20 action-labeled robot demonstrations, 3PoinTr achieves a 25.0 percentage point higher average success rate than the strongest behavior cloning and video-pretraining baselines on real-world tasks, and a 29.6 percentage point higher average success rate in simulation. Targeted ablations support the key design choices and confirm the benefit of learning from actionless videos. We further show that 3PoinTr's point track prediction transformer outperforms a strong baseline by preserving supervision over partially occluded points. Project page: https://adamhung60.github.io/3PoinTr/.

2601.04493 2026-06-04 cs.RO 版本更新

Continuum Robot State Estimation with Actuation Uncertainty

具有驱动不确定性的连续体机器人状态估计

James M. Ferguson, Alan Kuntz, Tucker Hermans

发表机构 * Department of Electrical and Computer Engineering and Department of Computer Science, Vanderbilt University(电气与计算机工程系和计算机科学系,范德比尔特大学) Kahlert School of Computing and Robotics Center, University of Utah(计算与机器人中心,犹他大学) NVIDIA(英伟达)

AI总结 针对连续体机器人在未知交互力和模型不确定性下的形状估计问题,提出一种基于机械原理的驱动先验的离散Cosserat杆模型,联合估计机器人形状、外部载荷和驱动输入,并通过稀疏因子图实现高效非线性优化。

Comments Public preprint for IEEE RAL. Accepted May 2026

详情
AI中文摘要

连续体机器人是柔性、细长的操纵器,非常适合受限的手术环境。在这些环境中,未知的相互作用力和模型不确定性显著影响机器人形状,从而激发了从外部观测进行状态估计的需求。现有的估计方法要么忽略驱动建模,要么依赖于简化的确定性驱动模型。相比之下,我们使用机械原理的驱动先验联合估计机器人形状、外部载荷和驱动输入。为此,我们提出了一种具有分段线性应变积分的离散Cosserat杆公式,该公式提供了高数值精度,同时诱导出稀疏因子图结构以实现高效的非线性优化。我们将该框架扩展到仿真中的腱驱动和并联机器人,并在手术同心管机器人上进行了实验验证。总体而言,我们的方法能够在多种机器人架构上实现原理性的实时估计,同时通过线性化因子图直接访问操纵器雅可比矩阵。

英文摘要

Continuum robots are flexible, slender manipulators well suited for confined surgical environments. In these settings, unknown interaction forces and model uncertainty significantly affect robot shape, motivating state estimation from external observations. Existing estimation methods either neglect actuation modeling or rely on simplified deterministic actuation models. In contrast, we jointly estimate robot shape, external loads, and actuation inputs using mechanically principled actuation priors. To achieve this, we present a discrete Cosserat rod formulation with piecewise-linear strain integration that provides high numerical accuracy while inducing a sparse factor graph structure for efficient nonlinear optimization. We extend the framework to tendon-driven and parallel robots in simulation and validate it experimentally on a surgical concentric tube robot. Overall, our approach enables principled real-time estimation across multiple robot architectures while providing direct access to manipulator Jacobians through the linearized factor graph.

2602.13081 2026-06-04 cs.RO 版本更新

Agentic AI for Robot Control: Flexible but still Fragile

用于机器人控制的智能体AI:灵活但依然脆弱

Oscar Lima, Marc Vinci, Martin Günther, Marian Renz, Alexander Sung, Sebastian Stock, Johannes Brust, Lennart Niecksch, Zongyao Yi, Felix Igelbrink, Benjamin Kisliuk, Martin Atzmueller, Joachim Hertzberg

发表机构 * ETH Zurich(苏黎世联邦理工学院) University of Cambridge(剑桥大学) Technical University of Munich(慕尼黑技术大学) Max Planck Institute for Intelligent Systems(智能系统马克斯·普朗克研究所) German Aerospace Center (DLR)(德国航空航天中心(DLR))

AI总结 本文提出一种基于推理型语言模型的智能体控制系统,通过迭代规划-执行循环选择并调用机器人技能完成任务,在两种物理平台上实验表明系统灵活但存在非确定性行为、指令遵循错误和提示敏感等脆弱性。

详情
AI中文摘要

最近的工作利用生成模型的能力和常识先验进行机器人控制。在本文中,我们提出了一种智能体控制系统,其中具有推理能力的语言模型通过迭代规划器和执行器循环选择并调用机器人技能来规划和执行任务。我们在两种物理机器人平台上部署该系统,分别用于:(i) 室内移动操作(Mobipick)中的桌面抓取、放置和箱子插入,以及 (ii) 自主农业导航和感知(Valdemar)。两种设置都涉及不确定性、部分可观测性、传感器噪声和模糊的自然语言命令。该系统公开了其规划和决策过程的结构化内省,通过显式事件检查对外部事件做出反应,并支持操作员干预以修改或重定向正在进行的执行。在两个平台上,我们的概念验证实验揭示了显著的脆弱性,包括非确定性次优行为、指令遵循错误以及对提示规范的高度敏感性。同时,该架构是灵活的:转移到不同的机器人和任务领域主要需要更新系统提示(领域模型、可供性和动作目录)并将相同的工具接口重新绑定到平台特定的技能API。

英文摘要

Recent work leverages the capabilities and commonsense priors of generative models for robot control. In this paper, we present an agentic control system in which a reasoning-capable language model plans and executes tasks by selecting and invoking robot skills within an iterative planner and executor loop. We deploy the system on two physical robot platforms in two settings: (i) tabletop grasping, placement, and box insertion in indoor mobile manipulation (Mobipick) and (ii) autonomous agricultural navigation and sensing (Valdemar). Both settings involve uncertainty, partial observability, sensor noise, and ambiguous natural-language commands. The system exposes structured introspection of its planning and decision process, reacts to exogenous events via explicit event checks, and supports operator interventions that modify or redirect ongoing execution. Across both platforms, our proof-of-concept experiments reveal substantial fragility, including non-deterministic suboptimal behavior, instruction-following errors, and high sensitivity to prompt specification. At the same time, the architecture is flexible: transfer to a different robot and task domain largely required updating the system prompt (domain model, affordances, and action catalogue) and re-binding the same tool interface to the platform-specific skill API.

2602.12215 2026-06-04 cs.RO 版本更新

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

LDA-1B:通过通用具身数据摄取扩展潜动力学动作模型

Jiangran Lyu, Kai Liu, Xuheng Zhang, Haoran Liao, Yusen Feng, Wenxuan Zhu, Tingrui Shen, Jiayi Chen, Jiazhao Zhang, Yifei Dong, Wenbo Cui, Senmao Qi, Shuo Wang, Yixin Zheng, Mi Yan, Xuesong Shi, Haoran Li, Dongbin Zhao, Ming-Yu Liu, Zhizheng Zhang, Li Yi, Yizhou Wang, He Wang

发表机构 * Peking University(北京大学) Galbot CASIA(中国科学院自动化研究所) BAAI(北京人工智能研究院) Tsinghua University(清华大学) Sun Yat-sen University(中山大学) NVIDIA

AI总结 提出LDA-1B机器人基础模型,通过统一格式的具身交互数据集EI-30k和结构化DINO潜空间中的动力学学习,联合建模动力学、策略和视觉预测,在接触丰富、灵巧和长时任务上分别提升高达21%、48%和23%。

Comments Accepted at RSS 2026, Project Page:https://pku-epic.github.io/LDA

详情
AI中文摘要

最近的机器人基础模型很大程度上依赖于大规模行为克隆,该方法模仿专家动作,但丢弃了嵌入在异构具身数据中的可迁移动力学知识。虽然统一世界模型(UWM)公式有潜力利用这种多样化数据,但由于粗粒度的数据使用和碎片化的数据集,现有实例难以扩展到基础模型级别。我们引入了LDA-1B,一个通过通用具身数据摄取进行扩展的机器人基础模型,它通过联合学习动力学、策略和视觉预测,为不同质量的数据分配不同的角色。为了大规模支持这种机制,我们组装并标准化了EI-30k,一个具身交互数据集,包含超过3万小时的人类和机器人轨迹,采用统一格式。通过结构化DINO潜空间中的预测实现了对这种异构数据的可扩展动力学学习,避免了冗余的像素空间外观建模。作为这种表示的补充,LDA-1B采用多模态扩散变换器来处理异步视觉和动作流,从而在1B参数规模上实现稳定训练。在模拟和真实世界的实验中,LDA-1B在接触丰富、灵巧和长时任务上分别比先前方法(例如π_{0.5})高出高达21%、48%和23%。值得注意的是,LDA-1B实现了数据高效的微调,通过利用通常有害且被丢弃的30%低质量轨迹,获得了10%的提升。

英文摘要

Recent robot foundation models largely rely on large-scale behavior cloning, which imitates expert actions but discards transferable dynamics knowledge embedded in heterogeneous embodied data. While the Unified World Model (UWM) formulation has the potential to leverage such diverse data, existing instantiations struggle to scale to foundation-level due to coarse data usage and fragmented datasets. We introduce LDA-1B, a robot foundation model that scales through universal embodied data ingestion by jointly learning dynamics, policy, and visual forecasting, assigning distinct roles to data of varying quality. To support this regime at scale, we assemble and standardize EI-30k, an embodied interaction dataset comprising over 30k hours of human and robot trajectories in a unified format. Scalable dynamics learning over such heterogeneous data is enabled by prediction in a structured DINO latent space, which avoids redundant pixel-space appearance modeling. Complementing this representation, LDA-1B employs a multi-modal diffusion transformer to handle asynchronous vision and action streams, enabling stable training at the 1B-parameter scale. Experiments in simulation and the real world show LDA-1B outperforms prior methods (e.g., $π_{0.5}$) by up to 21\%, 48\%, and 23\% on contact-rich, dexterous, and long-horizon tasks, respectively. Notably, LDA-1B enables data-efficient fine-tuning, gaining 10\% by leveraging 30\% low-quality trajectories typically harmful and discarded.

2602.03920 2026-06-04 cs.RO cs.HC 版本更新

How Users Understand Robot Foundation Model Performance through Task Success Rates and Beyond

用户如何通过任务成功率及其他方式理解机器人基础模型性能

Isaac Sheidlower, Jindan Huang, James Staley, Bingyu Wu, Qicong Chen, Reuben Aronson, Elaine Short

发表机构 * Brown University(布朗大学) Tufts University(塔夫茨大学)

AI总结 通过用户研究,探讨非机器人专家如何理解机器人基础模型(RFM)评估中的任务成功率(TSR)及其他信息,发现用户不仅按专家预期使用TSR,还重视未常报告的失败案例,并希望获取历史评估数据和机器人对新任务的性能估计。

详情
AI中文摘要

机器人基础模型(RFM)代表了一种开发通用家用机器人的有前景的方法。鉴于RFM的广泛能力,用户不可避免地会要求基于RFM的机器人执行RFM未经训练或评估的任务。在这些情况下,由于失败成本相对较高,用户理解尝试新任务的相关风险至关重要。此外,了解RFM能力的知情用户将知道机器人能够处理哪些情况和任务。在本文中,我们研究非机器人专家如何解释RFM评估中的性能信息。这些评估通常报告任务成功率(TSR)作为主要性能指标。虽然TSR对专家来说是直观的,但有必要验证新手是否也按预期使用这些信息。为此,我们进行了一项研究,用户看到了真实的评估数据,包括TSR、失败案例描述以及来自多个已发表RFM研究项目的视频。结果强调,非专家不仅以与专家预期一致的方式使用TSR,而且还高度重视其他类型的信息,例如RFM评估中通常不报告的失败案例。此外,我们发现用户希望访问RFM先前评估的真实数据以及机器人关于其在新任务上表现如何的估计。

英文摘要

Robot Foundation Models (RFMs) represent a promising approach to developing general-purpose home robots. Given the broad capabilities of RFMs, users will inevitably ask an RFM-based robot to perform tasks that the RFM was not trained or evaluated on. In these cases, it is crucial that users understand the risks associated with attempting novel tasks due to the relatively high cost of failure. Furthermore, an informed user who understands an RFM's capabilities will know what situations and tasks the robot can handle. In this paper, we study how non-roboticists interpret performance information from RFM evaluations. These evaluations typically report task success rate (TSR) as the primary performance metric. While TSR is intuitive to experts, it is necessary to validate whether novices also use this information as intended. Toward this end, we conducted a study in which users saw real evaluation data, including TSR, failure case descriptions, and videos from multiple published RFM research projects. The results highlight that non-experts not only use TSR in a manner consistent with expert expectations but also highly value other information types, such as failure cases that are not often reported in RFM evaluations. Furthermore, we find that users want access to both real data from previous evaluations of the RFM and estimates from the robot about how well it will do on a novel task.

2602.01429 2026-06-04 cs.RO 版本更新

Sem-NaVAE: Semantically-Guided Outdoor Mapless Navigation via Generative Trajectory Priors

Sem-NaVAE: 基于语义引导的室外无地图导航通过生成式轨迹先验

Gonzalo Olguín, Javier Ruiz-del-Solar

发表机构 * Department of Electrical Engineering & the Advanced Mining Technology Center (AMTC), Universidad de Chile(电气工程系及先进采矿技术中心(AMTC)、智利大学)

AI总结 提出Sem-NaVAE方法,结合条件变分自编码器生成多样化轨迹和轻量视觉语言模型进行语义选择,实现室外无地图实时导航,在未见环境中达到90%成功率。

Comments Accepted for publication in IEEE Robotics and Automation Letters (RA-L). 8 pages, 5 figures

详情
AI中文摘要

本工作提出了一种用于室外应用的无地图导航方法。它结合了条件变分自编码器(CVAE)生成轨迹的探索能力和轻量视觉语言模型(VLM)的语义分割能力来选择要执行的轨迹。使用开放词汇分割基于自然语言对生成的轨迹进行评分和选择,并由最先进的局部规划器执行速度命令。该方法的关键特性之一是能够生成大量多样的轨迹并实时选择它们进行导航。在真实世界的室外实验中,Sem-NaVAE在未见环境中的120-240米路线上实现了90%的成功率,比最近的基线高出10%,同时保持在基于地图的上限的7%以内。展示系统实验运行的视频可在https://youtu.be/i3R5ey5O2yk找到。

英文摘要

This work presents a mapless navigation approach for outdoor applications. It combines the exploratory capacity of conditional variational autoencoders (CVAEs) to generate trajectories and the semantic segmentation capabilities of a lightweight visual language model (VLM) to select the trajectory to execute. Open-vocabulary segmentation is used to score and select the generated trajectories based on natural language, and a state-of-the-art local planner executes velocity commands. One of the key features of the proposed approach is its ability to generate a large variability of trajectories and select them to navigate in real-time. In real-world outdoor experiments, Sem-NaVAE achieves a 90% success rate across routes of 120-240m in unseen environments, outperforming the nearest baseline by 10% while remaining within 7% of a map-based upper bound. A video showing an experimental run of the system can be found in https://youtu.be/i3R5ey5O2yk.

2512.24698 2026-06-04 cs.RO 版本更新

Dynamic Policy Learning for Legged Robot with Simplified Model Pretraining and Model-Homotopy-Inspired Transfer

基于简化模型预训练与模型同伦启发迁移的足式机器人动态策略学习

Dongyun Kang, Min-Gyu Kim, Tae-Gyu Song, Hajun Kim, Sehoon Ha, Hae-Won Park

发表机构 * Department of Mechanical Engineering, Korea Advanced Institute of Science and Technology (KAIST)(机械工程系,韩国科学技术院(KAIST)) School of Interactive Computing, Georgia Institute of Technology(交互计算学院,佐治亚理工学院)

AI总结 提出一种延续学习框架,结合简化模型预训练和模型同伦启发迁移,高效生成和优化足式机器人的复杂动态行为,并在真实四足机器人上验证了翻转和墙面辅助等动态任务。

Comments 8 pages

详情
Journal ref
IEEE Robotics and Automation Letters, vol. 11, no. 7, pp. 8068-8075, July 2026
AI中文摘要

为足式机器人生成动态运动仍然是一个具有挑战性的问题。虽然强化学习在各种足式运动任务中取得了显著成功,但产生高度动态的行为通常需要大量的奖励调整或高质量的演示。利用降阶模型有助于缓解这些挑战。然而,当将策略迁移到全身动力学环境时,模型差异构成了重大挑战。在这项工作中,我们引入了一个基于延续的学习框架,该框架结合了简化模型预训练和模型同伦启发迁移,以高效生成和优化复杂的动态行为。首先,我们使用单刚体模型预训练策略,以在简化环境中捕获核心运动模式。接下来,我们采用延续策略逐步将策略迁移到全身环境,以最小化性能损失。为了定义延续路径,我们引入了一条从单刚体模型到全身模型的参数化过渡路径,通过逐步重新分配躯干和腿之间的质量和惯性。与基线方法相比,所提出的方法在迁移过程中实现了更快的收敛并表现出更优的稳定性。我们的框架在包括翻转和墙面辅助机动在内的多种动态任务上得到了验证,并成功部署在真实的四足机器人上。

英文摘要

Generating dynamic motions for legged robots remains a challenging problem. While reinforcement learning has achieved notable success in various legged locomotion tasks, producing highly dynamic behaviors often requires extensive reward tuning or high-quality demonstrations. Leveraging reduced-order models can help mitigate these challenges. However, the model discrepancy poses a significant challenge when transferring policies to full-body dynamics environments. In this work, we introduce a continuation-based learning framework that combines simplified model pretraining and model-homotopy-inspired transfer to efficiently generate and refine complex dynamic behaviors. First, we pretrain the policy using a single rigid body model to capture core motion patterns in a simplified environment. Next, we employ a continuation strategy to progressively transfer the policy to the full-body environment, minimizing performance loss. To define the continuation path, we introduce a parametric transition path from the single rigid body model to the full-body model by gradually redistributing mass and inertia between the trunk and legs. The proposed method achieves faster convergence and demonstrates superior stability during the transfer process compared to baseline methods. Our framework is validated on a range of dynamic tasks, including flips and wall-assisted maneuvers, and is successfully deployed on a real quadrupedal robot.

2512.16919 2026-06-04 cs.CV cs.AI cs.RO 版本更新

DVGT: Driving Visual Geometry Transformer

DVGT: 驾驶视觉几何变换器

Sicheng Zuo, Zixun Xie, Wenzhao Zheng, Shaoqing Xu, Fang Li, Shengyin Jiang, Long Chen, Zhi-Xin Yang, Jiwen Lu

发表机构 * Tsinghua University(清华大学) University of Macau(澳门大学) Xiaomi EV(小米电动车) Peking University(北京大学)

AI总结 提出DVGT,一种从无位姿多视角图像序列重建全局稠密3D点图的视觉几何变换器,通过交替注意力机制学习几何关系,无需相机参数和后处理对齐,在多个驾驶数据集上显著优于现有模型。

Comments Code is available at https://github.com/wzzheng/DVGT

详情
AI中文摘要

从视觉输入中感知和重建3D场景几何对于自动驾驶至关重要。然而,目前仍缺乏一种能够适应不同场景和相机配置的、面向驾驶的稠密几何感知模型。为弥补这一空白,我们提出了驾驶视觉几何变换器(DVGT),它从一系列无位姿的多视角视觉输入中重建全局稠密3D点图。我们首先使用DINO骨干网络为每张图像提取视觉特征,并采用交替的视角内局部注意力、跨视角空间注意力和跨帧时间注意力来推断图像间的几何关系。然后,我们使用多个头解码第一帧自车坐标系下的全局点图以及每帧的自车位姿。与依赖精确相机参数的传统方法不同,DVGT无需显式的3D几何先验,能够灵活处理任意相机配置。DVGT直接从图像序列预测度量尺度的几何,消除了与外部传感器后对齐的需求。在包含nuScenes、OpenScene、Waymo、KITTI和DDAD的大型驾驶数据集混合训练下,DVGT在各种场景中显著优于现有模型。代码可在https://github.com/wzzheng/DVGT获取。

英文摘要

Perceiving and reconstructing 3D scene geometry from visual inputs is crucial for autonomous driving. However, there still lacks a driving-targeted dense geometry perception model that can adapt to different scenarios and camera configurations. To bridge this gap, we propose a Driving Visual Geometry Transformer (DVGT), which reconstructs a global dense 3D point map from a sequence of unposed multi-view visual inputs. We first extract visual features for each image using a DINO backbone, and employ alternating intra-view local attention, cross-view spatial attention, and cross-frame temporal attention to infer geometric relations across images. We then use multiple heads to decode a global point map in the ego coordinate of the first frame and the ego poses for each frame. Unlike conventional methods that rely on precise camera parameters, DVGT is free of explicit 3D geometric priors, enabling flexible processing of arbitrary camera configurations. DVGT directly predicts metric-scaled geometry from image sequences, eliminating the need for post-alignment with external sensors. Trained on a large mixture of driving datasets including nuScenes, OpenScene, Waymo, KITTI, and DDAD, DVGT significantly outperforms existing models on various scenarios. Code is available at https://github.com/wzzheng/DVGT.

2504.03038 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Learning to Adapt Control Barrier Functions Under Epistemic and Aleatoric Uncertainty

在认知不确定性和偶然不确定性下学习自适应控制障碍函数

Taekyung Kim, Robin Inho Kee, Dimitra Panagou

发表机构 * Department of Robotics, University of Michigan(机器人学系,密歇根大学) Charles Stark Draper Laboratory(查尔斯·斯泰克·德拉珀实验室) Department of Aerospace Engineering, University of Michigan(航空航天工程系,密歇根大学)

AI总结 提出在线自适应CBF框架(OA-CBF),通过概率集成神经网络和图注意力编码器动态调整CBF参数,在保证安全性的同时减少保守性。

Comments Extended journal version of the IEEE CDC 2025 paper (available as arXiv:2504.03038v5). Project page: https://www.taekyung.me/oa-cbf

详情
AI中文摘要

控制障碍函数(CBF)为机器人系统强制执行安全约束提供了一种可处理的机制,但其实际性能强烈依赖于类-K函数参数的选择。在输入约束下,保守参数通常以牺牲进展速度为代价保持可行性,而激进参数可能导致基于CBF的优化不可行或不安全。本文提出了在线自适应CBF(OA-CBF),一种在运行时调整CBF参数的框架。我们引入了局部验证的CBF参数概念,该概念在有限预测范围内认证候选参数,并表明当这种验证在连续更新间隔内保持时,安全性得以保留。为了高效识别局部验证的参数,OA-CBF训练一个概率集成神经网络来评估查询的CBF参数,而不是直接预测单个参数。图注意力编码器表示可变大小的障碍物环境,由保形预测校准的认知不确定性门拒绝不可靠的预测,分布鲁棒的CVaR条件筛选偶然风险。在验证的候选参数中,OA-CBF选择具有最佳预测进展度量的参数,并通过MPC-CBF或CBF-QP安全滤波器应用它。在动态独轮车、平面和三维四旋翼、运动学自行车以及VTOL四翼飞机基准上的仿真研究表明,OA-CBF在保持低碰撞率和不可行率的同时,减少了固定参数CBF控制器的保守性。

英文摘要

Control barrier functions (CBFs) provide a tractable mechanism for enforcing safety constraints in robotic systems, but their practical performance depends strongly on the choice of class-K function parameters. Under input constraints, conservative parameters often preserve feasibility at the cost of slow progress, whereas aggressive parameters can make the CBF-based optimization infeasible or unsafe. This paper proposes Online Adaptive CBF (OA-CBF), a framework for adapting CBF parameters at runtime. We introduce the notion of locally validated CBF parameters, which certify candidate parameters over a finite prediction horizon, and show that safety is preserved when such validation is maintained over successive update intervals. To identify locally validated parameters efficiently, OA-CBF trains a probabilistic ensemble neural network to evaluate queried CBF parameters rather than directly predict a single parameter. A graph-attention encoder represents variable-size obstacle environments, an epistemic uncertainty gate calibrated by conformal prediction rejects unreliable predictions, and a distributionally robust CVaR condition screens aleatoric risk. Among the verified candidates, OA-CBF selects the parameter with the best predicted progress metric and applies it through either an MPC-CBF or CBF-QP safety filter. Simulation studies on dynamic unicycle, planar and three-dimensional quadrotor, kinematic bicycle, and VTOL quadplane benchmarks show that OA-CBF reduces the conservatism of fixed-parameter CBF controllers while maintaining low collision and infeasibility rates.

2510.13704 2026-06-04 cs.LG cs.AI cs.RO 版本更新

Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents

单纯形嵌入提升Actor-Critic智能体的样本效率

Johan Obando-Ceron, Walter Mayor, Samuel Lavoie, Scott Fujimoto, Aaron Courville, Pablo Samuel Castro

发表机构 * Mila – Québec AI Institute(魁北克人工智能研究所) Université de Montréal(蒙特利尔大学) McGill University(麦吉尔大学) CIFAR AI Chair(CIFAR人工智能主席)

AI总结 针对大规模环境并行化下Actor-Critic方法仍需大量交互的问题,提出使用单纯形嵌入作为轻量级表示层,通过几何归纳偏置产生稀疏离散特征,稳定评论家引导并强化策略梯度,在FastTD3、FastSAC和PPO中一致提升样本效率和最终性能。

详情
AI中文摘要

最近的工作提出通过大规模环境并行化来加速actor-critic方法的挂钟训练时间;不幸的是,这些方法有时仍需要大量的环境交互才能达到期望的性能水平。注意到结构良好的表示可以改善深度强化学习(RL)智能体的泛化能力和样本效率,我们提出使用单纯形嵌入:将嵌入约束到单纯形结构的轻量级表示层。这种几何归纳偏置产生稀疏且离散的特征,稳定了评论家引导并强化了策略梯度。当应用于FastTD3、FastSAC和PPO时,单纯形嵌入在多种连续和离散控制环境中一致提高了样本效率和最终性能,且不损失运行速度。

英文摘要

Recent works have proposed accelerating the wall-clock training time of actor-critic methods via the use of large-scale environment parallelization; unfortunately, these can sometimes still require large number of environment interactions to achieve a desired level of performance. Noting that well-structured representations can improve the generalization and sample efficiency of deep reinforcement learning (RL) agents, we propose the use of simplicial embeddings: lightweight representation layers that constrain embeddings to simplicial structures. This geometric inductive bias results in sparse and discrete features that stabilize critic bootstrapping and strengthen policy gradients. When applied to FastTD3, FastSAC, and PPO, simplicial embeddings consistently improve sample efficiency and final performance across a variety of continuous- and discrete-control environments, without any loss in runtime speed.

2509.10247 2026-06-04 cs.RO cs.AI 版本更新

DiffAero: A GPU-Accelerated Differentiable Simulation Framework for Efficient Quadrotor Policy Learning

DiffAero: 一种用于高效四旋翼策略学习的GPU加速可微分仿真框架

Xinhong Zhang, Runqing Wang, Yunfan Ren, Jian Sun, Hao Fang, Jie Chen, Gang Wang

发表机构 * State Key Lab of Autonomous Intelligent Unmanned Systems, Beijing Institute of Technology(自主智能无人系统国家重点实验室,北京理工大学) Zhongguancun Academy(中关村academy) Department of Mechanical Engineering, University of Hong Kong(香港大学机械工程系) Harbin Institute of Technology(哈尔滨工业大学)

AI总结 提出DiffAero,一种轻量级、GPU加速且完全可微的仿真框架,通过并行化物理与渲染实现高效四旋翼控制策略学习,并在消费级硬件上数小时内训练出鲁棒策略。

Comments 8 pages, 11 figures, 1 table

详情
AI中文摘要

本文介绍了DiffAero,一种轻量级、GPU加速且完全可微的仿真框架,专为高效的四旋翼控制策略学习而设计。DiffAero支持环境级和智能体级并行,并在统一的GPU原生训练接口中集成了多种动力学模型、可定制的传感器堆栈(IMU、深度相机和LiDAR)以及多样化的飞行任务。通过在GPU上完全并行化物理和渲染,DiffAero消除了CPU-GPU数据传输瓶颈,并在仿真吞吐量上实现了数量级的提升。与现有仿真器相比,DiffAero不仅提供高性能仿真,还作为探索可微和混合学习算法的研究平台。广泛的基准测试和真实世界飞行实验表明,DiffAero与混合学习算法相结合,可以在消费级硬件上数小时内学习到鲁棒的飞行策略。代码可在https://github.com/flyingbitac/diffaero获取。

英文摘要

This letter introduces DiffAero, a lightweight, GPU-accelerated, and fully differentiable simulation framework designed for efficient quadrotor control policy learning. DiffAero supports both environment-level and agent-level parallelism and integrates multiple dynamics models, customizable sensor stacks (IMU, depth camera, and LiDAR), and diverse flight tasks within a unified, GPU-native training interface. By fully parallelizing both physics and rendering on the GPU, DiffAero eliminates CPU-GPU data transfer bottlenecks and delivers orders-of-magnitude improvements in simulation throughput. In contrast to existing simulators, DiffAero not only provides high-performance simulation but also serves as a research platform for exploring differentiable and hybrid learning algorithms. Extensive benchmarks and real-world flight experiments demonstrate that DiffAero and hybrid learning algorithms combined can learn robust flight policies in hours on consumer-grade hardware. The code is available at https://github.com/flyingbitac/diffaero.

2507.21638 2026-06-04 cs.AI cs.LG cs.MA cs.RO 版本更新

Assistax: A Multi-Agent Hardware-Accelerated Reinforcement Learning Benchmark for Assistive Robotics

Assistax: 一个用于辅助机器人的多智能体硬件加速强化学习基准

Leonard Hinckeldey, Elliot Fosong, Rimvydas Rubavicius, Elle Miller, Trevor McInroe, Fan Zhang, Patricia Wollstadt, Stefano V. Albrecht, Subramanian Ramamoorthy

发表机构 * University of California, Berkeley(加州大学伯克利分校) Stanford University(斯坦福大学)

AI总结 提出Assistax基准,利用JAX硬件加速和基于多智能体强化学习的辅助机器人任务,实现高达370倍加速,并测试机器人的零样本协调能力。

Comments Accepted at the Reinforcement Learning Conference 2026

详情
AI中文摘要

强化学习(RL)算法的发展在很大程度上受到具有挑战性的任务和基准的推动。游戏在RL基准中占据主导地位,因为它们呈现了相关的挑战,运行成本低且易于理解。虽然围棋和Atari等游戏带来了许多突破,但它们通常不能直接转化为现实世界的具身应用。在认识到需要多样化RL基准并解决具身交互场景中出现的复杂性的情况下,我们引入了Assistax:一个旨在解决辅助机器人任务中出现的挑战的开源基准。Assistax利用JAX的硬件加速,在基于物理的模拟中实现显著的学习加速。在开环挂钟时间方面,Assistax在向量化训练运行时比基于CPU的替代方案快高达370倍。Assistax使用多智能体RL将辅助机器人与活跃的人类患者之间的交互概念化,以训练一群多样化的伙伴智能体,从而可以测试具身机器人智能体的零样本协调能力。对流行的连续控制RL和MARL算法进行的广泛评估和超参数调优提供了可靠的基线,并将Assistax确立为推进辅助机器人RL研究的实用基准。代码可在以下网址获取:https://github.com/assistive-autonomy/assistax。

英文摘要

The development of reinforcement learning (RL) algorithms has been largely driven by ambitious challenge tasks and benchmarks. Games have dominated RL benchmarks because they present relevant challenges, are inexpensive to run and easy to understand. While games such as Go and Atari have led to many breakthroughs, they often do not directly translate to real-world embodied applications. In recognising the need to diversify RL benchmarks and addressing complexities that arise in embodied interaction scenarios, we introduce Assistax: an open-source benchmark designed to address challenges arising in assistive robotics tasks. Assistax uses JAX's hardware acceleration for significant speed-ups for learning in physics-based simulations. In terms of open-loop wall-clock time, Assistax runs up to $370\times$ faster when vectorising training runs compared to CPU-based alternatives. Assistax conceptualises the interaction between an assistive robot and an active human patient using multi-agent RL to train a population of diverse partner agents against which an embodied robotic agent's zero-shot coordination capabilities can be tested. Extensive evaluation and hyperparameter tuning for popular continuous control RL and MARL algorithms provide reliable baselines and establish Assistax as a practical benchmark for advancing RL research for assistive robotics. The code is available at: https://github.com/assistive-autonomy/assistax.

1905.04235 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Autonomous Locomotion Mode Transition in Quadruped Track-Legged Robots: A Simulation-Based Analysis for Step Negotiation

四足履轮腿机器人自主运动模式切换:基于仿真的步阶跨越分析

Jie Wang, Krispin Davies

发表机构 * University of Cambridge(剑桥大学) ClearPath AI

AI总结 本文提出了一种用于四足混合机器人自主切换运动模式的方法,特别是在跨越不同高度台阶时,通过能量效率评估机制实现平稳过渡。

详情
AI中文摘要

混合履轮腿机器人结合了轮式和腿式运动的优势,通过高效切换滚动和行走模式,在多种地形中实现适应性。然而,自动实现这些切换仍然是重大挑战。本文介绍了一种用于四足混合机器人自主模式切换的方法,特别是在跨越台阶时。我们的方法基于一种决策机制,利用所提出的基于能量的准则评估两种运动模式的能量效率。为了确保平稳跨越台阶,我们结合了两种攀爬步态,用于评估行走运动的能量使用情况。仿真结果验证了该方法的有效性,显示在不同高度的台阶上实现了成功的自主切换。我们提出的方法具有通用性,可以修改以适应类似机械配置的其他混合机器人,前提是其运动能量性能已先进行研究。

英文摘要

Hybrid track/wheel-legged robots combine the advantages of wheel-based and leg-based locomotion, granting adaptability across varied terrains through efficient transitions between rolling and walking modes. However, automating these transitions remains a significant challenge. In this paper, we introduce a method designed for autonomous mode transition in a quadruped hybrid robot with a track/wheel-legged configuration, especially during step negotiation. Our approach hinges on a decision-making mechanism that evaluates the energy efficiency of both locomotion modes using a proposed energy-based criterion. To guarantee a smooth negotiation of steps, we incorporate two climbing gaits designated for the assessment of energy usage in walking locomotion. Simulation results validate the method's effectiveness, showing successful autonomous transitions across steps of diverse heights. Our suggested approach has universal applicability and can be modified to suit other hybrid robots of similar mechanical configuration, provided their locomotion energy performance is studied beforehand.

1702.03433 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Path Assignment Techniques For Vehicle Tracking

车辆跟踪中的路径分配技术

Richard Altendorfer, Sebastian Wirkert

发表机构 * Driver Assistance Systems, ZF TRW(驾驶辅助系统,ZF TRW) Deutsches Krebsforschungszentrum(德国癌症研究中心)

AI总结 本文提出两种路径分配方法,旨在通过延迟处理阶段来过滤测量数据,以避免延迟和其他中间滤波器的伪影,通过生成离散后验概率分布并使用中位数估计器提取路径或车道索引,通过ROC曲线展示方法性能。

Comments 6 pages, 9 figures

详情
Journal ref
Proceedings of the IEEE Intelligent Vehicles Symposium (2014) 1451-1456
AI中文摘要

许多驾驶员辅助系统,如自适应巡航控制系统,需要识别处于主机车辆路径中的最近车辆。这涉及将检测到的车辆分配给主机车辆的路径或邻近路径。在回顾了主机车辆路径估计和车道分配技术的方法后,我们介绍了两种方法,这些方法受到在尽可能晚的处理阶段过滤测量数据的动机,以避免延迟和其他中间滤波器的伪影。这些滤波器生成离散后验概率分布,从中通过中位数估计器提取路径或“车道”索引。通过使用实验数据和标记的地面真实数据,展示了这些方法的相对性能。

英文摘要

Many driver assistance systems such as Adaptive Cruise Control require the identification of the closest vehicle that is in the host vehicle's path. This entails an assignment of detected vehicles to the host vehicle path or neighboring paths. After reviewing approaches to the estimation of the host vehicle path and lane assignment techniques we introduce two methods that are motivated by the rationale to filter measured data as late in the processing stages as possible in order to avoid delays and other artifacts of intermediate filters. These filters generate discrete posterior probability distributions from which a path or "lane" index is extracted by a median estimator. The relative performance of those methods is illustrated by a ROC using experimental data and labeled ground truth data.

1508.04124 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Complete Derivation Of The Association Log-Likelihood Distance For Multi-Object Tracking

多目标跟踪中关联对数似然距离的完整推导

Richard Altendorfer, Sebastian Wirkert

发表机构 * Driver Assistance Systems, ZF TRW(ZF TRW驾驶辅助系统) German Cancer Research Center(德国癌症研究中心)

AI总结 本文基于多目标跟踪中的关联问题,推导了关联对数似然距离,并通过蒙特卡洛模拟验证了其在关联关系准确性上的优越性。

Comments 7 pages, 3 figures

详情
Journal ref
2016 IEEE Intelligent Vehicles Symposium (IV)
AI中文摘要

Mahalanobis距离常用于多目标跟踪中的测量-跟踪关联。从Mahalanobis距离的原始定义出发,我们回顾了其在关联中的应用。由于多目标跟踪中没有原则将Mahalanobis距离视为一种独特的统计距离,我们重新审视了多假设跟踪中的全局关联假设作为最通用的关联设置。这些关联假设诱导出一种用于分配的距离似然量,我们称之为关联对数似然距离。我们比较了Mahalanobis距离与关联对数似然距离在蒙特卡洛模拟中产生正确关联关系的能力。结果表明,基于关联对数似然的距离在平均上比Mahalanobis距离表现更好,证实了最大化全局关联假设比最小化特定统计距离度量是一种更根本的关联方法。

英文摘要

The Mahalanobis distance is commonly used in multi-object trackers for measurement-to-track association. Starting with the original definition of the Mahalanobis distance we review its use in association. Given that there is no principle in multi-object tracking that sets the Mahalanobis distance apart as a distinguished statistical distance we revisit the global association hypotheses of multiple hypothesis tracking as the most general association setting. Those association hypotheses induce a distance-like quantity for assignment which we refer to as association log-likelihood distance. We compare the ability of the Mahalanobis distance to the association log-likelihood distance to yield correct association relations in Monte-Carlo simulations. It turns out that on average the distance based on association log-likelihood performs better than the Mahalanobis distance, confirming that the maximization of global association hypotheses is a more fundamental approach to association than the minimization of a certain statistical distance measure.

1809.07870 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Suspended Load Path Tracking Control Using a Tilt-rotor UAV Based on Zonotopic State Estimation

基于退化拓扑状态估计的倾斜旋翼无人机悬浮负载路径跟踪控制

Brenner S. Rego, Guilherme V. Raffo

发表机构 * Graduate Program in Electrical Engineering, Federal University of Minas Gerais, Belo Horizonte, MG 31270-901, Brazil(巴西联邦矿务工程师学院电气工程研究生项目) Department of Electronics Engineering, Federal University of Minas Gerais, Belo Horizonte, MG 31270-901, Brazil(巴西联邦矿务工程师学院电子工程系)

AI总结 本文研究了利用倾斜旋翼无人机进行悬浮负载路径跟踪控制的问题,通过建立多体机械系统的动力学模型,提出退化拓扑状态估计器来估计负载位置和姿态,并设计了具有极点放置约束的离散时间混合H2/H∞控制器以实现鲁棒的路径跟踪控制。

详情
AI中文摘要

本文解决了一种倾斜旋翼无人机悬浮负载路径跟踪控制的问题。主要挑战来自于负载动态行为,通常通过绳索与无人机连接,增加了系统的未驱动自由度。此外,为了执行负载运输任务,通常需要知道负载的位置信息。由于可用传感器通常嵌入在移动平台上,负载位置的信息可能无法直接获取。为了解决这个问题,本文首先从负载的角度出发,推导了多体机械系统的运动学,利用欧拉-拉格朗日方法推导出详细的动力学模型,得到一个高度耦合、非线性的状态空间表示,输入是仿射的,负载的位置和姿态直接由状态变量表示。提出了一种退化拓扑状态估计器来解决负载位置和姿态的估计问题,该估计器基于飞机上的传感器,具有不同的采样时间,并且测量噪声是未知但有界的。为了解决路径跟踪问题,设计了一个具有极点放置约束的离散时间混合H2/H∞控制器,具有保证的时间响应特性,并对未建模动态、参数不确定性以及外部干扰具有鲁棒性。通过在基于Gazebo模拟器的平台上进行的数值实验以及系统计算机辅助设计(CAD)模型上的实验,验证了退化拓扑状态估计器和设计的控制器的性能。

英文摘要

This work addresses the problem of path tracking control of a suspended load using a tilt-rotor UAV. The main challenge in controlling this kind of system arises from the dynamic behavior imposed by the load, which is usually coupled to the UAV by means of a rope, adding unactuated degrees of freedom to the whole system. Furthermore, to perform the load transportation it is often needed the knowledge of the load position to accomplish the task. Since available sensors are commonly embedded in the mobile platform, information on the load position may not be directly available. To solve this problem in this work, initially, the kinematics of the multi-body mechanical system are formulated from the load's perspective, from which a detailed dynamic model is derived using the Euler-Lagrange approach, yielding a highly coupled, nonlinear state-space representation of the system, affine in the inputs, with the load's position and orientation directly represented by state variables. A zonotopic state estimator is proposed to solve the problem of estimating the load position and orientation, which is formulated based on sensors located at the aircraft, with different sampling times, and unknown-but-bounded measurement noise. To solve the path tracking problem, a discrete-time mixed $\mathcal{H}_2/\mathcal{H}_\infty$ controller with pole-placement constraints is designed with guaranteed time-response properties and robust to unmodeled dynamics, parametric uncertainties, and external disturbances. Results from numerical experiments, performed in a platform based on the Gazebo simulator and on a Computer Aided Design (CAD) model of the system, are presented to corroborate the performance of the zonotopic state estimator along with the designed controller.

1809.03225 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Gait learning for soft microrobots controlled by light fields

基于光场控制的软微机器人步态学习

Alexander von Rohr, Sebastian Trimpe, Alonso Marco, Peer Fischer, Stefano Palagi

发表机构 * Micro, Nano, and Molecular Systems Group, Max Planck Institute for Intelligent Systems(微、纳、分子系统组,人工智能系统马克斯·普朗克研究所) Max Planck ETH Center for Learning Systems(马克斯·普朗克-ETH学习系统中心)

AI总结 本文提出一种基于贝叶斯优化和高斯过程的概率学习方法,用于优化光场控制的软微机器人步态,通过有限实验预算实现高效且鲁棒的运动性能提升。

Comments 8 pages, 7 figures, to appear in the proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems 2018

详情
AI中文摘要

基于光场控制的软微机器人可以生成多种不同的步态。这种内在的灵活性可以用来最大化其在特定环境中的运动性能,并用于适应变化的条件。然而,由于缺乏准确的运动模型以及微机器人之间的固有变异性,分析控制设计是不可能的。另一方面,常见的数据驱动方法需要运行大量的实验,导致非常特定于样本的结果。本文提出了一种基于贝叶斯优化(BO)和高斯过程(GPs)的概率学习方法,用于光场控制的软微机器人。所提出的方法产生了一种学习方案,该方案在数据效率方面表现优异,能够在有限的实验预算下进行步态优化,并且对微机器人样本之间的差异具有鲁棒性。这些特性是通过在半合成数据集上比较不同的GP先验和BO设置来设计学习方案获得的。开发的学习方案在微机器人实验中得到验证,结果在仅20次实验的预算下,使微机器人的运动性能提高了115%。这些令人鼓舞的结果为基于光场控制的软微机器人和概率学习控制的自适应微机器人系统铺平了道路。

英文摘要

Soft microrobots based on photoresponsive materials and controlled by light fields can generate a variety of different gaits. This inherent flexibility can be exploited to maximize their locomotion performance in a given environment and used to adapt them to changing conditions. Albeit, because of the lack of accurate locomotion models, and given the intrinsic variability among microrobots, analytical control design is not possible. Common data-driven approaches, on the other hand, require running prohibitive numbers of experiments and lead to very sample-specific results. Here we propose a probabilistic learning approach for light-controlled soft microrobots based on Bayesian Optimization (BO) and Gaussian Processes (GPs). The proposed approach results in a learning scheme that is data-efficient, enabling gait optimization with a limited experimental budget, and robust against differences among microrobot samples. These features are obtained by designing the learning scheme through the comparison of different GP priors and BO settings on a semi-synthetic data set. The developed learning scheme is validated in microrobot experiments, resulting in a 115% improvement in a microrobot's locomotion performance with an experimental budget of only 20 tests. These encouraging results lead the way toward self-adaptive microrobotic systems based on light-controlled soft microrobots and probabilistic learning control.

1811.04333 2026-06-04 cs.RO cs.FL cs.SY eess.SY 版本更新

Reactive Task and Motion Planning for Robust Whole-Body Dynamic Locomotion in Constrained Environments

面向受限环境的鲁棒全身体动态运动的反应任务与运动规划

Ye Zhao, Yinan Li, Luis Sentis, Ufuk Topcu, Jun Liu

发表机构 * George W. Woodruff School of Mechanical Engineering, Georgia Tech, USA(佐治亚理工学院机械工程学院) Department of Applied Mathematics, University of Waterloo, Canada(滑铁卢大学应用数学系) Department of Aerospace Engineering and Engineering Mechanics, UT Austin, USA(得克萨斯大学奥斯汀分校航空航天工程与工程力学系) Institute for Computational Engineering and Sciences, UT Austin, USA(得克萨斯大学奥斯汀分校计算工程与科学研究所)

AI总结 本文提出了一种基于时序逻辑的游戏框架,用于在受限和动态变化的环境中进行全身体动态运动的任务规划与控制,通过符号系统的方法确保运动行为的正确性。

Comments 49 pages, 23 figures, 1 table

详情
AI中文摘要

基于接触的决策和规划方法越来越重要,以赋予四足机器人更高的自主性。源自符号系统的正式合成方法在推理高层运动决策和实现复杂 maneuvering 行为方面具有巨大潜力。本文首次尝试正式设计由任务规划和全身体动态运动控制组成的架构,在受限和动态变化的环境中。在高层,我们构建了一个多肢体运动规划器与其动态环境之间的双玩家时序逻辑游戏,以合成一个获胜策略,提供符号运动动作。这些运动动作满足由时序逻辑片段表达的期望高层任务规范。这些动作发送到一个稳健的有限状态转换系统,该系统合成一个满足状态可达性约束的运动控制器。此控制器通过低层运动规划器进一步执行,生成可行的运动轨迹。我们构建了一系列动态运动模型用于腿部机器人,作为处理多样化环境事件的模板库。我们设计了一种重新规划策略,考虑突发环境变化或大状态扰动,以提高最终运动行为的鲁棒性。我们正式证明了分层运动框架的正确性,通过运动规划层保证鲁棒实现。在多种环境中的反应运动行为模拟表明,我们的框架有潜力成为智能运动行为的理论基础。

英文摘要

Contact-based decision and planning methods are becoming increasingly important to endow higher levels of autonomy for legged robots. Formal synthesis methods derived from symbolic systems have great potential for reasoning about high-level locomotion decisions and achieving complex maneuvering behaviors with correctness guarantees. This study takes a first step toward formally devising an architecture composed of task planning and control of whole-body dynamic locomotion behaviors in constrained and dynamically changing environments. At the high level, we formulate a two-player temporal logic game between the multi-limb locomotion planner and its dynamic environment to synthesize a winning strategy that delivers symbolic locomotion actions. These locomotion actions satisfy the desired high-level task specifications expressed in a fragment of temporal logic. Those actions are sent to a robust finite transition system that synthesizes a locomotion controller that fulfills state reachability constraints. This controller is further executed via a low-level motion planner that generates feasible locomotion trajectories. We construct a set of dynamic locomotion models for legged robots to serve as a template library for handling diverse environmental events. We devise a replanning strategy that takes into consideration sudden environmental changes or large state disturbances to increase the robustness of the resulting locomotion behaviors. We formally prove the correctness of the layered locomotion framework guaranteeing a robust implementation by the motion planning layer. Simulations of reactive locomotion behaviors in diverse environments indicate that our framework has the potential to serve as a theoretical foundation for intelligent locomotion behaviors.

1710.05465 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Flow: A Modular Learning Framework for Mixed Autonomy Traffic

Flow: 一种用于混合自主性的模块化学习框架

Cathy Wu, Aboudy Kreidieh, Kanaad Parvate, Eugene Vinitsky, Alexandre M Bayen

发表机构 * Laboratory for Information and Decision Systems, Massachusetts Institute of Technology(信息与决策实验室,麻省理工学院) Institute of Data, Systems, and Society, Massachusetts Institute of Technology(数据、系统与社会研究所,麻省理工学院) Department of Mechanical Engineering, University of California, Berkeley(机械工程系,加州大学伯克利分校)

AI总结 本文提出了一种模块化学习框架,利用深度强化学习解决复杂交通动态问题,通过提高系统层面的速度,使学习到的控制法则在仅有4-7%的自动驾驶汽车参与度下,相比人类驾驶性能提升高达57%。此外,在单车道交通中,一个仅使用局部观测的小型神经网络控制法则能够消除拥堵现象,达到近最优性能。

Comments 17 pages, 8 figures, 5 tables. 2021 IEEE Transactions on Robotics (T-RO)

详情
AI中文摘要

自动驾驶车辆(AVs)的快速发展为交通系统带来了巨大的潜力,通过提高安全性和效率以及出行可及性。然而,随着AVs的采用,这些影响的发展进程并不清楚。从分析部分自动驾驶的总体目标来看,出现了许多技术挑战:部分控制和观测、多车辆交互以及现实世界网络所代表的大量场景。为了深入了解近期AV的影响,本文研究了深度强化学习(RL)在低AV采用率环境下克服这些挑战的适用性。本文提出了一种模块化学习框架,利用深度RL来处理复杂的交通动态。模块由多个部分组成,以捕捉常见的交通现象(如停止-启动交通拥堵、车道变换、交叉口)。学习到的控制法则在系统层面的速度上优于人类驾驶性能,仅在4-7%的AVs参与度下,提高了高达57%。此外,在单车道交通中,一个仅使用局部观测的小型神经网络控制法则被发现能够消除停止-启动交通现象,超越了所有已知的基于模型的控制器,达到近最优性能,并且能够推广到非分布交通密度。

英文摘要

The rapid development of autonomous vehicles (AVs) holds vast potential for transportation systems through improved safety, efficiency, and access to mobility. However, the progression of these impacts, as AVs are adopted, is not well understood. Numerous technical challenges arise from the goal of analyzing the partial adoption of autonomy: partial control and observation, multi-vehicle interactions, and the sheer variety of scenarios represented by real-world networks. To shed light into near-term AV impacts, this article studies the suitability of deep reinforcement learning (RL) for overcoming these challenges in a low AV-adoption regime. A modular learning framework is presented, which leverages deep RL to address complex traffic dynamics. Modules are composed to capture common traffic phenomena (stop-and-go traffic jams, lane changing, intersections). Learned control laws are found to improve upon human driving performance, in terms of system-level velocity, by up to 57% with only 4-7% adoption of AVs. Furthermore, in single-lane traffic, a small neural network control law with only local observation is found to eliminate stop-and-go traffic - surpassing all known model-based controllers to achieve near-optimal performance - and generalize to out-of-distribution traffic densities.

1610.04091 2026-06-04 eess.SY cs.DC cs.RO cs.SY math.OC 版本更新

Optimizing Communication and Computation for Multi-UAV Information Gathering Applications

为多UAV信息采集应用优化通信与计算

Mason Thammawichai, Sujit P. Baliyarasimhuni, Eric C. Kerrigan, João B. Sousa

发表机构 * Department of Aeronautics, Imperial College London(帝国理工学院伦敦校区航空系) Department of Electronics and Communications Engineering, Indraprastha Institute of Information Technology(印度拉普拉兹信息技术学院电子与通信工程系) Department of Electrical & Electronic Engineering and the Department of Aeronautics, Imperial College London(帝国理工学院伦敦校区电子与电气工程系及航空系) Department of Electrical and Computer Engineering, University of Porto(葡萄牙波尔图大学电气与计算机工程系)

AI总结 本文针对多UAV系统中通信与计算能耗的优化问题,提出了一种混合整数优化方法,通过数据聚合和多跳分层聚类实现高效的路由方案,以延长系统寿命。

详情
Journal ref
IEEE Transactions on Aerospace and Electronic Systems ( Volume: 54, Issue: 2, April 2018)
AI中文摘要

移动代理网络,如多UAV系统,受到资源限制的约束。特别是,有限的能源直接影响系统性能,如系统寿命。在无线传感器网络文献中已证明,通信能耗主导了计算和传感能耗。因此,通过优化通信数据量可以显著延长多UAV系统的寿命,但会增加计算成本。在本文中,我们旨在取得通信与计算能耗之间的最佳权衡。具体而言,我们提出了一种混合整数优化公式,用于多跳分层聚类基于自组织UAV网络的数据聚合,以获得节能的信息路由方案。所提出的框架在两个应用上进行了测试,即目标跟踪和区域映射。基于仿真结果,我们的方法相比没有数据聚合和聚类方案的基线策略,能显著节省能量。

英文摘要

Mobile agent networks, such as multi-UAV systems, are constrained by limited resources. In particular, limited energy affects system performance directly, such as system lifetime. It has been demonstrated in the wireless sensor network literature that the communication energy consumption dominates the computational and the sensing energy consumption. Hence, the lifetime of the multi-UAV systems can be extended significantly by optimizing the amount of communication data, at the expense of increasing computational cost. In this work, we aim at attaining an optimal trade-off between the communication and the computational energy. Specifically, we propose a mixed-integer optimization formulation for a multi-hop hierarchical clustering-based self-organizing UAV network incorporating data aggregation, to obtain an energy-efficient information routing scheme. The proposed framework is tested on two applications, namely target tracking and area mapping. Based on simulation results, our method can significantly save energy compared to a baseline strategy, where there is no data aggregation and clustering scheme.

1806.06723 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Towards Manipulability of Interactive Lagrangian Systems

面向交互拉格朗日系统的可操作性

Hanlei Wang

发表机构 * Science and Technology on Space Intelligent Control Laboratory(航天智能控制实验室)

AI总结 本文研究了具有参数不确定性和通信/传感约束的交互拉格朗日系统的可操作性,提出了一种新的动态反馈方法,设计了能够实现无限可操作性和对通信/传感约束鲁棒性的自适应控制器,解决了非线性双侧远程操作中的任意未知时变延迟问题。

Comments 15 pages, 15 figures

详情
Journal ref
Automatica, 119: 108913, 2020
AI中文摘要

本文研究了具有参数不确定性和通信/传感约束的交互拉格朗日系统的可操作性。两个标准例子是主从远程操作和机器人教学操作。我们系统地提出了通用动力系统中无限可操作性的概念,并探讨了这种统一动机如何形成一种设计范式,以保证交互动力系统的无限可操作性,并特别促进了交互拉格朗日系统的非线性自适应控制器的设计和分析。具体而言,基于一个新的动态反馈类,我们提出了一种自适应控制器,能够实现受控拉格朗日系统的无限可操作性和对通信/传感约束的鲁棒性,主要归因于由此产生的动态级联框架。所提出的范式在人机交互的网络耦合要求和受控动力学之间实现了理想的平衡。我们还证明了我们主要结果的一个特殊情况解决了长期存在的非线性双侧远程操作问题,即任意未知时变延迟。仿真结果展示了所提出自适应控制器下交互机器人系统的性能。

英文摘要

This paper investigates manipulability of interactive Lagrangian systems with parametric uncertainty and communication/sensing constraints. Two standard examples are teleoperation with a master-slave system and teaching operation of robots. We here systematically formulate the concept of infinite manipulability for general dynamical systems, and investigate how such a unified motivation yields a design paradigm towards guaranteeing the infinite manipulability of interactive dynamical systems and in particular facilitates the design and analysis of nonlinear adaptive controllers for interactive Lagrangian systems. Specifically, based on a new class of dynamic feedback, we propose adaptive controllers that achieve both the infinite manipulability of the controlled Lagrangian systems and the robustness with respect to the communication/sensing constraints, mainly owing to the resultant dynamic-cascade framework. The proposed paradigm yields the desirable balance between network coupling requirements and controlled dynamics of human-system interaction. We also show that a special case of our main result resolves the longstanding nonlinear bilateral teleoperation problem with arbitrary unknown time-varying delay. Simulation results show the performance of the interactive robotic systems under the proposed adaptive controllers.

1710.04465 2026-06-04 cs.RO cs.SY eess.SY stat.CO 版本更新

Markerless visual servoing on unknown objects for humanoid robot platforms

无标记未知物体的人形机器人视觉伺服

Claudio Fantacci, Giulia Vezzani, Ugo Pattacini, Vadim Tikhanoff, Lorenzo Natale

AI总结 本文提出了一种无标记未知物体的人形机器人视觉伺服框架,通过立体视觉计算可抓取物体的体积,利用递归贝叶斯滤波估计末端执行器的6D姿态,结合非线性约束优化问题计算目标姿态,并通过图像基于视觉伺服控制实现末端执行器的精确控制。

详情
Journal ref
IEEE International Conference on Robotics and Automation (ICRA), 2018
AI中文摘要

为了精确地抓住一个物体,人形机器人需要对末端执行器、物体姿态和形状有良好的了解。本文提出了一种无标记未知物体的视觉伺服框架,分为四个主要部分:I) 通过立体视觉建立最小二乘问题来计算机器人手可抓取的物体体积;II) 基于序贯蒙特卡洛(SMC)滤波的递归贝叶斯滤波技术,用于在不使用标记的情况下估计机器人末端执行器的6D姿态;III) 建立非线性约束优化问题来计算关于物体的目标可抓取姿态;IV) 通过图像基于视觉伺服控制命令机器人末端执行器向目标姿态移动。我们通过大量实验在iCub人形机器人平台上验证了该方法的有效性和鲁棒性,实现了实时计算、平滑轨迹和亚像素精度。

英文摘要

To precisely reach for an object with a humanoid robot, it is of central importance to have good knowledge of both end-effector, object pose and shape. In this work we propose a framework for markerless visual servoing on unknown objects, which is divided in four main parts: I) a least-squares minimization problem is formulated to find the volume of the object graspable by the robot's hand using its stereo vision; II) a recursive Bayesian filtering technique, based on Sequential Monte Carlo (SMC) filtering, estimates the 6D pose (position and orientation) of the robot's end-effector without the use of markers; III) a nonlinear constrained optimization problem is formulated to compute the desired graspable pose about the object; IV) an image-based visual servo control commands the robot's end-effector toward the desired pose. We demonstrate effectiveness and robustness of our approach with extensive experiments on the iCub humanoid robot platform, achieving real-time computation, smooth trajectories and sub-pixel precisions.

1807.06172 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Experimental Resilience Assessment of An Open-Source Driving Agent

开放源代码驾驶代理的实验韧性评估

Abu Hasnat Mohammad Rubaiyat, Yongming Qin, Homa Alemzadeh

发表机构 * Department of Electrical and Computer Engineering(电气与计算机工程系)

AI总结 本文提出基于系统理论过程分析(STPA)的故障注入框架,用于评估开放源代码驾驶代理openpilot在不同环境条件和传感器数据故障下的韧性,通过战略性软件故障注入方法提高不安全场景的覆盖率,从而更有效地模拟安全关键故障并测试自动驾驶车辆。

Comments 10 pages, 7 figures

详情
AI中文摘要

自动驾驶车辆(AV)依赖雷达和相机等传感器进行环境感知、路径规划和控制。随着自主性提高和与复杂环境的交互增加,对AV安全性和可靠性的关注日益增长。本文提出一种基于系统理论过程分析(STPA)的故障注入框架,用于评估开放源代码驾驶代理openpilot在不同环境条件和影响传感器数据的故障下的韧性。为了增加测试期间不安全场景的覆盖率,我们采用战略性软件故障注入方法,其中触发故障注入的触发器来源于系统高级危险分析中识别出的不安全场景。实验结果表明,所提出的战略性故障注入方法相比随机故障注入提高了危险覆盖率,从而有助于更有效地模拟安全关键故障并测试AV。此外,本文还提供了关于openpilot安全机制性能及其在及时检测和恢复故障输入能力方面的见解。

英文摘要

Autonomous vehicles (AV) depend on the sensors like RADAR and camera for the perception of the environment, path planning, and control. With the increasing autonomy and interactions with the complex environment, there have been growing concerns regarding the safety and reliability of AVs. This paper presents a Systems-Theoretic Process Analysis (STPA) based fault injection framework to assess the resilience of an open-source driving agent, called openpilot, under different environmental conditions and faults affecting sensor data. To increase the coverage of unsafe scenarios during testing, we use a strategic software fault-injection approach where the triggers for injecting the faults are derived from the unsafe scenarios identified during the high-level hazard analysis of the system. The experimental results show that the proposed strategic fault injection approach increases the hazard coverage compared to random fault injection and, thus, can help with more effective simulation of safety-critical faults and testing of AVs. In addition, the paper provides insights on the performance of openpilot safety mechanisms and its ability in timely detection and recovery from faulty inputs.

1605.04344 2026-06-04 eess.SY cs.RO cs.SY 版本更新

On the Effects of Measurement Uncertainty in Optimal Control of Contact Interactions

关于测量不确定性在接触相互作用最优控制中的影响

Brahayam Ponton, Stefan Schaal, Ludovic Righetti

发表机构 * Max-Planck Institute for Intelligent Systems, Tuebingen-Germany(图灵智能研究所,图宾根-德国) University of Southern California, Los Angeles-USA(南加州大学,洛杉矶-美国)

AI总结 本文研究了在机器人应用中,接触相互作用的不确定性不仅来自过程模型的噪声,还来自对世界知识的不精确,提出了一种基于风险敏感控制的SOC算法,通过引入观测器动态来显式依赖当前测量不确定性,仿真结果显示测量不确定性导致低阻尼行为,与过程噪声导致的刚性行为形成对比。

Comments 17 pages, 5 figures - this version is the one published at WAFR 2016 to fulfill the open access requirements of the EU commission, please refer to the previous version for the complete derivation of the algorithm

详情
AI中文摘要

随机最优控制(SOC)通常仅考虑过程模型中的噪声,即未知干扰。然而,在许多涉及与环境交互的机器人应用中,如运动和操作,不确定性还来自对世界的不精确知识,这并非实际干扰。我们通过开发基于风险敏感控制的SOC算法,分析同时考虑测量模型中的噪声的影响,该算法将观测器动态纳入其中,使得控制律显式依赖于当前的测量不确定性。在简单2D机械臂的仿真结果中,我们观察到测量不确定性导致低阻尼行为,这一结果与过程噪声产生刚性行为的效果形成对比。这表明考虑测量不确定性可能是解决涉及不确定接触相互作用问题的一种很有前途的方法。

英文摘要

Stochastic Optimal Control (SOC) typically considers noise only in the process model, i.e. unknown disturbances. However, in many robotic applications involving interaction with the environment, such as locomotion and manipulation, uncertainty also comes from lack of precise knowledge of the world, which is not an actual disturbance. We analyze the effects of also considering noise in the measurement model, by developing a SOC algorithm based on risk-sensitive control, that includes the dynamics of an observer in such a way that the control law explicitly depends on the current measurement uncertainty. In simulation results on a simple 2D manipulator, we have observed that measurement uncertainty leads to low impedance behaviors, a result in contrast with the effects of process noise that creates stiff behaviors. This suggests that taking into account measurement uncertainty could be a potentially very interesting way to approach problems involving uncertain contact interactions.

1905.12191 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

CARE: Cooperative Autonomy for Resilience and Efficiency of Robot Teams for Complete Coverage of Unknown Environments under Robot Failures

CARE: 机器人团队在未知环境中鲁棒性和效率的协同自主性

Junnan Song, Shalabh Gupta

AI总结 本文提出了一种分布式算法CARE,用于解决未知环境中多机器人覆盖率路径规划问题,该算法在机器人故障情况下提供鲁棒性并提高整体效率,通过事件驱动的重新规划实现任务重新分配,实验结果表明其在故障情况下能够实现完全覆盖、减少覆盖时间和加快目标发现。

详情
Journal ref
Autonomous Robots, volume 44, 2020
AI中文摘要

本文针对未知环境中多机器人覆盖率路径规划(MCPP)问题,特别是在机器人故障情况下,提出了一个分布式算法,称为协同自主性以实现鲁棒性和效率(CARE)。该算法不仅为机器人团队提供故障容忍能力,还通过事件驱动的重新规划提高整体操作效率。算法使用分布式离散事件监督器(DESs),在机器人故障或空闲时触发一组可行玩家之间的游戏,以做出协作决策进行任务重新分配。游戏理论结构通过潜在游戏构建,其中每个玩家的效用与所有玩家的共享目标函数对齐。该算法已在各种复杂场景的高保真机器人模拟器上得到验证,结果表明,与三种替代方法相比,团队在故障情况下实现了完全覆盖,减少了覆盖时间,并加快了目标发现。

英文摘要

This paper addresses the problem of Multi-robot Coverage Path Planning (MCPP) for unknown environments in the presence of robot failures. Unexpected robot failures can seriously degrade the performance of a robot team and in extreme cases jeopardize the overall operation. Therefore, this paper presents a distributed algorithm, called Cooperative Autonomy for Resilience and Efficiency (CARE), which not only provides resilience to the robot team against failures of individual robots, but also improves the overall efficiency of operation via event-driven replanning. The algorithm uses distributed Discrete Event Supervisors (DESs), which trigger games between a set of feasible players in the event of a robot failure or idling, to make collaborative decisions for task reallocations. The game-theoretic structure is built using Potential Games, where the utility of each player is aligned with a shared objective function for all players. The algorithm has been validated in various complex scenarios on a high-fidelity robotic simulator, and the results demonstrate that the team achieves complete coverage under failures, reduced coverage time, and faster target discovery as compared to three alternative methods.

1803.07696 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Inverse Optimal Control from Incomplete Trajectory Observations

从不完整轨迹观测中逆最优控制

Wanxin Jin, Dana Kulić, Shaoshuai Mou, Sandra Hirche

发表机构 * School of Aeronautics and Astronautics, Purdue University(航空与航天学院,普渡大学) Monash University(墨尔本大学) Chair of Information-oriented Control, Technical University of Munich(信息导向控制教授职位,慕尼黑技术大学)

AI总结 本文提出了一种从不完整轨迹观测中学习最优控制系统目标函数的方法,通过恢复矩阵确定候选特征的权重,并开发了增量逆最优控制算法。

Comments Codes: https://github.com/wanxinjin/IOC-from-Incomplete-Trajectory-Observations

详情
Journal ref
The International Journal of Robotics Research. 2021;40(6-7):848-865
AI中文摘要

本文开发了一种方法,使能够从不完整的轨迹观测中学习最优控制系统的的目标函数。假设目标函数是未知权重的特征(或基函数)的加权和,观测数据是系统状态和输入轨迹的一段。所提出的技术引入了恢复矩阵的概念,以建立任何可用轨迹段与给定候选特征权重之间的关系。恢复矩阵的秩表明是否可以在候选特征中找到相关子集,并且可以从段数据中学习相应的权重。恢复矩阵可以迭代获得,其秩非递减的性质表明额外的观测可能有助于目标学习。基于恢复矩阵,建立了一种使用不完整轨迹观测学习所选特征权重的方法,并通过自动寻找所需的最小观测开发了增量逆最优控制算法。该方法的有效性在线性二次调节系统和模拟机器人机械臂上得到了验证。

英文摘要

This article develops a methodology that enables learning an objective function of an optimal control system from incomplete trajectory observations. The objective function is assumed to be a weighted sum of features (or basis functions) with unknown weights, and the observed data is a segment of a trajectory of system states and inputs. The proposed technique introduces the concept of the recovery matrix to establish the relationship between any available segment of the trajectory and the weights of given candidate features. The rank of the recovery matrix indicates whether a subset of relevant features can be found among the candidate features and the corresponding weights can be learned from the segment data. The recovery matrix can be obtained iteratively and its rank non-decreasing property shows that additional observations may contribute to the objective learning. Based on the recovery matrix, a method for using incomplete trajectory observations to learn the weights of selected features is established, and an incremental inverse optimal control algorithm is developed by automatically finding the minimal required observation. The effectiveness of the proposed method is demonstrated on a linear quadratic regulator system and a simulated robot manipulator.

1904.00378 2026-06-04 cs.RO cs.SY eess.SY 版本更新

MAT-Fly: An Educational Platform for Simulating Unmanned Aerial Vehicles Aimed to Detect and Track Moving Objects

MAT-Fly:一种用于模拟无人驾驶航空器的教育平台,旨在检测和跟踪移动物体

Giuseppe Silano, Luigi Iannelli

发表机构 * Faculty of Electrical Engineering, Czech Technical University in Prague(布拉格捷克技术大学电气工程系) Department of Engineering, University of Sannio in Benevento, Piazza Roma 21(巴内维诺萨恩尼奥大学工程系,罗马广场21号)

AI总结 本文提出了一种用于无人驾驶航空器领域特定任务的模拟方法,即视觉检测和跟踪任意移动物体,介绍了MAT-Fly平台,该平台基于Matlab和MathWorks虚拟现实(VR)和计算机视觉系统(CVS)工具箱,用于模拟四旋翼飞行器跟踪沿复杂路径移动的汽车,并开源供教育使用。

Comments 11 pages, 15 figures, journal paper

详情
Journal ref
IEEE Access, 2021
AI中文摘要

本文的主要动机是提出一种针对无人驾驶航空器领域特定任务的模拟方法,即视觉检测和跟踪任意移动物体。特别地,介绍了MAT-Fly,一个具有易用性和控制开发特点的多旋翼飞行器数值模拟平台。该平台基于Matlab和MathWorks虚拟现实(VR)和计算机视觉系统(CVS)工具箱,共同模拟四旋翼飞行器在跟踪沿复杂路径移动的汽车时的行为。VR工具箱被选择是因为学生对Matlab比较熟悉,并且由于其结构简单,用户在学习和开发阶段不需要付出显著的努力。整体架构非常模块化,使得每个模块可以轻松替换,从而简化代码重用和平台定制。一些简单的测试环境被展示以证明该方法的有效性以及平台的工作方式。该模拟器作为开源发布,使用户能够查看系统中的任何部分,并用于教育目的。

英文摘要

The main motivation of this work is to propose a simulation approach for a specific task within the Unmanned Aerial Vehicle (UAV) field, i.e., the visual detection and tracking of arbitrary moving objects. In particular, it is described MAT-Fly, a numerical simulation platform for multi-rotor aircraft characterized by the ease of use and control development. The platform is based on Matlab and the MathWorks Virtual Reality (VR) and Computer Vision System (CVS) toolboxes that work together to simulate the behavior of a quad-rotor while tracking a car that moves along a nontrivial path. The VR toolbox has been chosen due to the familiarity that students have with Matlab and because it does not require a notable effort by the user for the learning and development phase thanks to its simple structure. The overall architecture is quite modular so that each block can be easily replaced with others simplifying the code reuse and the platform customization. Some simple testbeds are presented to show the validity of the approach and how the platform works. The simulator is released as open-source, making it possible to go through any part of the system, and available for educational purposes.

1903.08818 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Contingency Model Predictive Control for Automated Vehicles

应急模型预测控制用于自动驾驶车辆

John P. Alsterda, Matthew Brown, J. Christian Gerdes

发表机构 * Department of Mechanical Engineering, Stanford University(斯坦福大学机械工程系)

AI总结 本文提出了一种新的应急模型预测控制(CMPC)框架,该框架在跟踪期望路径的同时,维护一个替代轨迹以避免已识别的潜在紧急情况。通过在经典递推MPC时间 horizon中并行添加额外的预测时间 horizon,CMPC能够预见可能发生的情况,而不是在紧急情况发生后才反应。通过数学定义框架并实验比较其性能与最先进的确定性MPC,展示了CMPC在应对潜在摩擦损失(如冰面)时的有效性。

Comments American Control Conference, July 2019; 6 pages

详情
Journal ref
IEEE American Control Conference (ACC) (2019) 717-722
AI中文摘要

我们提出了应急模型预测控制(CMPC),一种新颖且可实施的控制框架,该框架在跟踪期望路径的同时,同时维护一个应急计划——一个替代轨迹以避免已识别的潜在紧急情况。通过在经典递推MPC时间 horizon中并行添加额外的预测时间 horizon,CMPC能够预见可能发生的情况,而不是在紧急情况发生后才反应。通过数学定义框架并实验比较其性能与最先进的确定性MPC,展示了CMPC在应对潜在摩擦损失(如冰面)时的有效性。

英文摘要

We present Contingency Model Predictive Control (CMPC), a novel and implementable control framework which tracks a desired path while simultaneously maintaining a contingency plan -- an alternate trajectory to avert an identified potential emergency. In this way, CMPC anticipates events that might take place, instead of reacting when emergencies occur. We accomplish this by adding an additional prediction horizon in parallel to the classical receding MPC horizon. The contingency horizon is constrained to maintain a feasible avoidance solution; as such, CMPC is selectively robust to this emergency while tracking the desired path as closely as possible. After defining the framework mathematically, we demonstrate its effectiveness experimentally by comparing its performance to a state-of-the-art deterministic MPC. The controllers drive an automated research platform through a left-hand turn which may be covered by ice. Contingency MPC prepares for the potential loss of friction by purposefully and intuitively deviating from the prescribed path to approach the turn more conservatively; this deviation significantly mitigates the consequence of encountering ice.

1607.01202 2026-06-04 eess.SY cs.AI cs.RO cs.SY 版本更新

Optimal control for a robotic exploration, pick-up and delivery problem

机器人探索、拾取和配送问题的最优控制

Vladislav Nenchev, Christos G. Cassandras, Jörg Raisch

AI总结 本文研究了机器人在最小时间内寻找并收集有限数量物体并运送到集散地的最优控制问题,采用递推时间窗方案解决混合系统中的最优控制问题,并提出基于运动参数化和梯度优化的事件驱动方法,以提高计算效率。

Comments 14 pages, 23 figures

详情
AI中文摘要

本文解决了一个机器人在最小时间内寻找并收集有限数量物体并运送到集散地的最优控制问题。该机器人具有四阶动力学,其在拾取或放下物体时会瞬间改变。物体被建模为具有预先未知位置的点质量,在有界二维空间中可能包含未知障碍物。对于这种混合系统,通过递推时间窗方案近似求解最优控制问题(OCP),其中推导出的成本到目标的下界在最坏情况和概率情况下进行评估,假设物体位置服从均匀分布。首先,基于时间和位置空间离散化和混合整数规划的时间驱动近似解被提出。由于该解的计算成本较高,提出了一种基于合适运动参数化和梯度优化的事件驱动近似方法。在数值示例中比较了解决方案,表明后一种方法在计算上具有显著优势,同时与前者产生相似的定性结果。这些方法特别适用于各种机器人应用,如自动化清洁、搜索和救援、收割或制造。

英文摘要

This paper addresses an optimal control problem for a robot that has to find and collect a finite number of objects and move them to a depot in minimum time. The robot has fourth-order dynamics that change instantaneously at any pick-up or drop-off of an object. The objects are modeled by point masses with a-priori unknown locations in a bounded two-dimensional space that may contain unknown obstacles. For this hybrid system, an Optimal Control Problem (OCP) is approximately solved by a receding horizon scheme, where the derived lower bound for the cost-to-go is evaluated for the worst and for a probabilistic case, assuming a uniform distribution of the objects. First, a time-driven approximate solution based on time and position space discretization and mixed integer programming is presented. Due to the high computational cost of this solution, an alternative event-driven approximate approach based on a suitable motion parameterization and gradient-based optimization is proposed. The solutions are compared in a numerical example, suggesting that the latter approach offers a significant computational advantage while yielding similar qualitative results compared to the former. The methods are particularly relevant for various robotic applications like automated cleaning, search and rescue, harvesting or manufacturing.

1903.01577 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems

具有控制李雅普诺夫函数的不确定性机器人系统的经验学习

Andrew J. Taylor, Victor D. Dorobantu, Hoang M. Le, Yisong Yue, Aaron D. Ames

发表机构 * California Institute of Technology(加州理工学院)

AI总结 本文提出了一种基于控制李雅普诺夫函数的机器学习框架,用于适应机器人系统中的参数不确定性和未建模动态,通过迭代更新李雅普诺夫函数导数的估计和改进控制器,最终获得一个稳定性的二次规划基于控制器,并在平面Segway模拟中验证了方法的有效性。

详情
AI中文摘要

许多现代非线性控制方法旨在赋予系统保证性质,如稳定性或安全性,并已成功应用于机器人领域。然而,模型不确定性仍然是持续的挑战,削弱了理论保证并导致物理系统中的实施失败。本文开发了一种以控制李雅普诺夫函数(CLFs)为中心的机器学习框架,以适应一般机器人系统中的参数不确定性和未建模动态。我们提出的方法通过迭代更新李雅普诺夫函数导数的估计并改进控制器,最终获得一个基于二次规划的稳定控制器。我们在平面Segway模拟中验证了我们的方法,通过迭代改进基础无模型控制器,展示了显著的性能提升。

英文摘要

Many modern nonlinear control methods aim to endow systems with guaranteed properties, such as stability or safety, and have been successfully applied to the domain of robotics. However, model uncertainty remains a persistent challenge, weakening theoretical guarantees and causing implementation failures on physical systems. This paper develops a machine learning framework centered around Control Lyapunov Functions (CLFs) to adapt to parametric uncertainty and unmodeled dynamics in general robotic systems. Our proposed method proceeds by iteratively updating estimates of Lyapunov function derivatives and improving controllers, ultimately yielding a stabilizing quadratic program model-based controller. We validate our approach on a planar Segway simulation, demonstrating substantial performance improvements by iteratively refining on a base model-free controller.

1710.02066 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Feedback Regularization and Geometric PID Control for Robust Stabilization of a Planar Three-link Hybrid Bipedal Walking Model

反馈校正与几何PID控制用于平面三连杆混合双足步行模型的鲁棒稳定化

W. M. L. T. Weerakoon, T. W. U. Madhushani, D. H. S. Maithripala, J. M. Berg

发表机构 * Department of Mechanical Engineering, University of Peradeniya(珀斯德尼亚大学机械工程系) Postgraduate and Research Unit, Sri Lanka Technological Campus(斯里兰卡科技校园研究生与研究单位) Department of Mechanical Engineering, Texas Tech University(德克萨斯技术大学机械工程系)

AI总结 本文应用一种 recently 开发的几何PID控制器来稳定一个平面三连杆混合动态步行模型。该模型有三个连杆,代表机器人躯干和两个无膝腿,每个髋关节有独立的控制力矩。几何PID控制器是为完全驱动的机械系统开发的,但在摆动相中,三连杆双足机器人有三个自由度但只有两个控制输入。通过选择两个“虚拟约束”来强制执行,解决欠驱动问题,并验证所得到的二维零动力学的稳定性。所得到的受控动力学不具有机械系统的结构,但通过“反馈校正”恢复了这种结构,随后使用几何PID控制来提供对虚拟约束的鲁棒渐近调节。所提出的方法可以容忍更大的倾斜变化,展示了几何方法的价值和积分作用的益处。

Comments Preprint submitted to 2018 American Control Conference

详情
AI中文摘要

This paper applies a recently developed geometric PID controller to stabilize a three-link planar bipedal hybrid dynamic walking model. The three links represent the robot torso and two kneeless legs, with an independent control torque available at each hip joint. The geometric PID controller is derived for fully actuated mechanical systems, however in the swing phase the three-link biped robot has three degrees of freedom and only two controls. Following the bipedal walking literature, underactuation is addressed by choosing two "virtual constraints" to enforce, and verifying the stability of the resulting two-dimensional zero dynamics. The resulting controlled dynamics do not have the structure of a mechanical system, however this structure is restored using "feedback regularization," following which geometric PID control is used to provide robust asymptotic regulation of the virtual constraints. The proposed method can tolerate significantly greater variations in inclination, showing the value of the geometric methods, and the benefit of integral action.

英文摘要

This paper applies a recently developed geometric PID controller to stabilize a three-link planar bipedal hybrid dynamic walking model. The three links represent the robot torso and two kneeless legs, with an independent control torque available at each hip joint. The geometric PID controller is derived for fully actuated mechanical systems, however in the swing phase the three-link biped robot has three degrees of freedom and only two controls. Following the bipedal walking literature, underactuation is addressed by choosing two "virtual constraints" to enforce, and verifying the stability of the resulting two-dimensional zero dynamics. The resulting controlled dynamics do not have the structure of a mechanical system, however this structure is restored using "feedback regularization," following which geometric PID control is used to provide robust asymptotic regulation of the virtual constraints. The proposed method can tolerate significantly greater variations in inclination, showing the value of the geometric methods, and the benefit of integral action.

1904.02851 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Planning under non-rational perception of uncertain spatial costs

在不确定空间成本下的非理性感知规划

Aamodh Suresh, Sonia Martinez

AI总结 本文研究了在不确定空间成本下考虑非理性风险感知的运动规划策略,提出基于累积前景理论(CPT)生成感知风险地图的方法,并通过理论和仿真验证了CPT模型的建模能力,与CVaR等其他风险感知模型相比,展示了在路径规划中的优势。

Comments 12 pages and 10 figures. This revision adds more explanation and clearer figures

详情
AI中文摘要

本工作探讨了设计一种考虑与不确定空间成本相关的风险感知的运动规划策略。我们提出的方法利用累积前景理论(CPT)来生成给定环境中的感知风险地图。CPT-like感知风险和路径长度指标被结合以定义一个符合采样运动规划器(RRT*)渐近最优要求的成本函数。通过理论和仿真展示了CPT的建模能力,并与其他风险感知模型如条件价值-at-风险(CVaR)进行了比较。理论上,我们定义了风险感知模型的表达性概念,并证明CPT的表达性高于CVaR和期望风险。然后我们展示了这种表达性在路径规划设置中的转化,其中我们观察到一个配备CPT和同时扰动随机近似(SPSA)方法的规划器可以更好地近似任意环境中的路径。此外,我们通过仿真展示了我们的规划器能够捕捉一组丰富的有意义路径,代表了不同风险感知的自定义环境。然后我们通过在拥挤和动态环境中的仿真比较了我们的规划器与T-RRT*(连续成本空间的规划器)和Risk-RRT*(动态人类障碍物的风险感知规划器)的性能,展示了我们所提规划器的优势。

英文摘要

This work investigates the design of risk-perception-aware motion-planning strategies that incorporate non-rational perception of risks associated with uncertain spatial costs. Our proposed method employs the Cumulative Prospect Theory (CPT) to generate a perceived risk map over a given environment. CPT-like perceived risks and path-length metrics are then combined to define a cost function that is compliant with the requirements of asymptotic optimality of sampling-based motion planners (RRT*). The modeling power of CPT is illustrated in theory and in simulation, along with a comparison to other risk perception models like Conditional Value at Risk (CVaR). Theoretically, we define a notion of expressiveness for a risk perception model and show that CPT's is higher than that of CVaR and expected risk. We then show that this expressiveness translates to our path planning setting, where we observe that a planner equipped with CPT together with a simultaneous perturbation stochastic approximation (SPSA) method can better approximate arbitrary paths in an environment. Additionally, we show in simulation that our planner captures a rich set of meaningful paths, representative of different risk perceptions in a custom environment. We then compare the performance of our planner with T-RRT* (a planner for continuous cost spaces) and Risk-RRT* (a risk-aware planner for dynamic human obstacles) through simulations in cluttered and dynamic environments respectively, showing the advantage of our proposed planner.

1902.10139 2026-06-04 eess.SY cs.LG cs.RO cs.SY 版本更新

Learning Dynamic-Objective Policies from a Class of Optimal Trajectories

从一类最优轨迹中学习动态目标策略

Christopher Iliffe Sprague, Dario Izzo, Petter Ögren

AI总结 本文提出了一种新颖且简单的方法,通过轨迹优化、同伦持续和模仿学习相结合,合成能够在线切换目标函数的最优状态反馈控制器,并在倒立摆摆起和航天器轨道转移问题中验证了其有效性。

Comments Accepted to the 59th IEEE Conference on Decision and Control (CDC)

详情
AI中文摘要

最优状态反馈控制器,能够切换不同的目标函数,在可能遇到意外情况的系统中具有优势。然而,即使对于单一目标函数,合成此类控制器也是极具挑战性的。本文提出了一种新颖且简单的方法,通过轨迹优化、同伦持续和模仿学习相结合,来合成这些策略。我们使用数值持续法高效地生成多个目标函数和边界条件下的最优演示,并利用这些演示来训练我们的策略。此外,我们展示了我们的策略能够有效学习一系列最优状态反馈控制器,这些控制器可以在线切换目标函数。我们通过两个轨迹优化问题,即倒立摆摆起和航天器轨道转移,展示了该方法,并证明在仿真中合成的策略产生的轨迹接近最优。这些结果表明,轨迹优化和同伦持续对动态目标情境下的控制器合成具有益处。

英文摘要

Optimal state-feedback controllers, capable of changing between different objective functions, are advantageous to systems in which unexpected situations may arise. However, synthesising such controllers, even for a single objective, is a demanding process. In this paper, we present a novel and straightforward approach to synthesising these policies through a combination of trajectory optimisation, homotopy continuation, and imitation learning. We use numerical continuation to efficiently generate optimal demonstrations across several objectives and boundary conditions, and use these to train our policies. Additionally, we demonstrate the ability of our policies to effectively learn families of optimal state-feedback controllers, which can be used to change objective functions online. We illustrate this approach across two trajectory optimisation problems, an inverted pendulum swingup and a spacecraft orbit transfer, and show that the synthesised policies, when evaluated in simulation, produce trajectories that are near-optimal. These results indicate the benefit of trajectory optimisation and homotopy continuation to the synthesis of controllers in dynamic-objective contexts.

1710.09691 2026-06-04 eess.SY cs.LG cs.RO cs.SY 版本更新

Iterative Machine Learning for Precision Trajectory Tracking with Series Elastic Actuators

迭代机器学习用于系列弹性执行器的高精度轨迹跟踪

Nathan Banka, W. Tony Piaskowy, Joseph Garbini, Santosh Devasia

发表机构 * Ultra-Precision Controls Lab(超精密控制实验室) University of Washington(华盛顿大学)

AI总结 本文研究了在系列弹性执行器中使用迭代学习方法提高位置跟踪精度的问题,通过迭代学习生成前馈命令,利用复值高斯过程回归技术估计局部系统模型,从而减少跟踪误差。

Comments 9 pages, 16 figure. Submitted to AMC Workshop

详情
Journal ref
2018 IEEE 15th International Workshop on Advanced Motion Control (AMC), Tokyo, 2018, pp. 234-239
AI中文摘要

当机器人在未知环境中操作时,位置的小误差可能导致接触力的大幅变化,尤其是对于典型的高阻抗设计。这可能会损坏周围环境或机器人本身。系列弹性执行器(SEAs)是一种减少机器人手臂输出阻抗以提高对环境施加力的控制能力的流行方法。然而,这种增加的力控制能力伴随着较低的位置精度和带宽。本文探讨了使用迭代学习的前馈命令来改进使用SEAs时的位置跟踪。在每次迭代中,系统对量化输入的输出响应被用来估计线性化的局部系统模型。这些估计的模型是通过复值高斯过程回归(cGPR)技术获得的,然后用于基于前一次迭代的误差生成新的前馈输入命令。本文展示了该迭代机器学习(IML)技术在双自由度(2-DOF)机器人手臂上的应用,并证明了IML方法能够成功收敛以减少跟踪误差。

英文摘要

When robots operate in unknown environments small errors in postions can lead to large variations in the contact forces, especially with typical high-impedance designs. This can potentially damage the surroundings and/or the robot. Series elastic actuators (SEAs) are a popular way to reduce the output impedance of a robotic arm to improve control authority over the force exerted on the environment. However this increased control over forces with lower impedance comes at the cost of lower positioning precision and bandwidth. This article examines the use of an iteratively-learned feedforward command to improve position tracking when using SEAs. Over each iteration, the output responses of the system to the quantized inputs are used to estimate a linearized local system models. These estimated models are obtained using a complex-valued Gaussian Process Regression (cGPR) technique and then, used to generate a new feedforward input command based on the previous iteration's error. This article illustrates this iterative machine learning (IML) technique for a two degree of freedom (2-DOF) robotic arm, and demonstrates successful convergence of the IML approach to reduce the tracking error.

1905.03419 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Testing Scenario Library Generation for Connected and Automated Vehicles, Part I: Methodology

连接和自动化车辆测试场景库生成方法研究,第一部分:方法论

Shuo Feng, Yiheng Feng, Chunhui Yu, Yi Zhang, Henry X. Liu

发表机构 * University of Michigan(密歇根大学)

AI总结 本文提出了一种系统框架,用于生成连接和自动化车辆(CAVs)的测试场景库(TSLG),考虑不同的操作设计域(ODDs)、CAV模型和性能指标,通过引入新的场景关键性度量标准和多起始优化方法,提高测试效率。

Comments 11 pages,3 figures

详情
Journal ref
IEEE Transactions on Intelligent Transportation Systems, 2020
AI中文摘要

在连接和自动化车辆(CAVs)的发展和部署过程中,测试和评估是一个关键步骤,但目前尚无系统框架用于生成测试场景库。本研究旨在提供一种通用框架,用于解决测试场景库生成(TSLG)问题,考虑不同的操作设计域(ODDs)、CAV模型和性能指标。给定一个ODD,测试场景库被定义为一组关键场景,可用于CAV测试。每个测试场景通过新提出的度量标准——场景关键性进行评估,该度量标准可以计算为机动挑战和暴露频率的组合。为了寻找关键场景,设计了辅助目标函数,并应用了多起始优化方法和种子填充技术。所提出的框架在理论上被证明可以以远少的测试次数获得准确的评估结果,与道路测试方法相比。在本研究的第二部分中,通过三个案例研究来演示所提出的方法。基于强化学习的技术被应用于在高维场景下增强搜索方法。

英文摘要

Testing and evaluation is a critical step in the development and deployment of connected and automated vehicles (CAVs), and yet there is no systematic framework to generate testing scenario library. This study aims to provide a general framework for the testing scenario library generation (TSLG) problem with different operational design domains (ODDs), CAV models, and performance metrics. Given an ODD, the testing scenario library is defined as a critical set of scenarios that can be used for CAV test. Each testing scenario is evaluated by a newly proposed measure, scenario criticality, which can be computed as a combination of maneuver challenge and exposure frequency. To search for critical scenarios, an auxiliary objective function is designed, and a multi-start optimization method along with seed-filling is applied. The proposed framework is theoretically proved to obtain accurate evaluation results with much fewer number of tests, if compared with the on-road test method. In part II of the study, three case studies are investigated to demonstrate the proposed methodologies. Reinforcement learning based technique is applied to enhance the searching method under high-dimensional scenarios.

1810.00182 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.OC 版本更新

Collaborative target-tracking control using multiple autonomous fixed-wing UAVs with constant speeds

使用多架自主固定翼无人机进行协同目标跟踪控制

Zhiyong Sun, Hector Garcia de Marina, Brian D. O. Anderson, Changbin Yu

发表机构 * Department of Electrical Engineering, Eindhoven University of Technology(埃因霍温理工大学电子工程系) Universidad Complutense de Madrid(马德里complutense大学) Research School of Electrical, Energy and Material Engineering, Australian National University(澳大利亚国立大学电气、能源和材料工程研究学校) Optus-Curtin Centre of Excellence in Artificial Intelligence, Curtin University(Curtin大学人工智能卓越中心)

AI总结 本文研究了使用多架固定翼无人机进行协同跟踪控制的问题,通过设计控制器使无人机群体能够协同跟踪目标的位置和速度,提出了相对速度条件和参考速度构造方法,并讨论了常速约束下的控制器设计和性能限制。

Comments 33 pages (single column). To be published in the AIAA Journal of Guidance, Dynamics, and Control

详情
AI中文摘要

本文考虑了使用一组具有恒定且非相同速度的固定翼无人飞行器(UAV)进行协同跟踪控制的问题。固定翼UAV的动力学通过具有非完整约束的单轮型方程建模,假设UAV在名义操作模式下以恒定高度飞行。控制器设计使得所有固定翼UAV作为群体能够协同跟踪目标的位置和速度。我们首先提出了跟踪UAV与目标之间相对速度的条件,以确保在UAV受恒定速度约束时跟踪目标的可行性。我们构造了一个包含目标速度和位置作为反馈的参考速度,该参考速度由群体质心跟踪。通过这种方式,所有车辆的航向被控制,使群体质心沿着成功跟踪目标轨迹的参考轨迹移动。进一步设计了一个间距控制器,以确保所有车辆靠近群体质心轨迹。还详细讨论了控制器设计中的权衡以及由于恒速约束导致的目标跟踪控制性能限制。提供了三架固定翼UAV跟踪目标旋翼机的实验结果。

英文摘要

This paper considers a collaborative tracking control problem using a group of fixed-wing unmanned aerial vehicles (UAVs) with constant and non-identical speeds. The dynamics of fixed-wing UAVs are modelled by unicycle-type equations with nonholonomic constraints, assuming that UAVs fly at constant altitudes in the nominal operation mode. The controller is designed such that all fixed-wing UAVs as a group can collaboratively track a desired target's position and velocity. We first present conditions on the relative speeds of tracking UAVs and the target to ensure that the tracking objective can be achieved when UAVs are subject to constant speed constraints. We construct a reference velocity that includes both the target's velocity and position as feedback, which is to be tracked by the group centroid. In this way, all vehicles' headings are controlled such that the group centroid follows a reference trajectory that successfully tracks the target's trajectory. A spacing controller is further devised to ensure that all vehicles stay close to the group centroid trajectory. Trade-offs in the controller design and performance limitations of the target tracking control due to the constant-speed constraint are also discussed in detail. Experimental results with three fixed-wing UAVs tracking a target rotorcraft are provided.

1802.09099 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Pareto optimal multi-robot motion planning

Pareto最优多机器人运动规划

Guoxiang Zhao, Minghui Zhu

发表机构 * School of Electrical Engineering and Computer Science, Pennsylvania State University(宾夕法尼亚州立大学电气工程与计算机科学学院)

AI总结 本文研究了一类多机器人协调问题,目标是使机器人以最短时间到达目标区域并避免碰撞。提出了一种新的数值算法来识别Pareto最优解,确保没有机器人可以单方面减少旅行时间而不延长其他机器人的。通过室内多机器人平台和计算机模拟实验,展示了该算法的 anytime 特性。

详情
AI中文摘要

本文研究了一类多机器人协调问题,其中一组机器人旨在以最短时间到达目标区域并避免与障碍物和其他机器人碰撞。提出了一种新的数值算法来识别Pareto最优解,其中没有机器人可以单方面减少其旅行时间而不延长其他机器人的。通过集合值数值分析保证了算法在epigraphical profile意义下的一致近似。在室内多机器人平台和计算机模拟上的实验显示了所提出算法的anytime特性;即,它能够快速返回一个安全的控制策略,使机器人安全地到达目标区域,并且在给予更多时间的情况下,持续改进策略的最优性。

英文摘要

This paper studies a class of multi-robot coordination problems where a team of robots aim to reach their goal regions with minimum time and avoid collisions with obstacles and other robots. A novel numerical algorithm is proposed to identify the Pareto optimal solutions where no robot can unilaterally reduce its traveling time without extending others'. The consistent approximation of the algorithm in the epigraphical profile sense is guaranteed using set-valued numerical analysis. Experiments on an indoor multi-robot platform and computer simulations show the anytime property of the proposed algorithm; i.e., it is able to quickly return a feasible control policy that safely steers the robots to their goal regions and it keeps improving policy optimality if more time is given.

1806.04225 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY math.OC 版本更新

PAC-Bayes Control: Learning Policies that Provably Generalize to Novel Environments

PAC-Bayes 控制:学习能够证明在新环境中泛化的能力的策略

Anirudha Majumdar, Alec Farid, Anoopkumar Sonar

发表机构 * Department of Mechanical and Aerospace Engineering(1,2 机械与航空航天工程系) Department of Computer Science Princeton University(3 计算机科学系 纽约大学普林斯顿分校)

AI总结 本文提出了一种基于PAC-Bayes框架的机器人策略学习方法,通过在新环境中泛化能力的理论分析,为机器人系统提供强泛化保证。

Comments Extended version of paper presented at the 2018 Conference on Robot Learning (CoRL)

详情
AI中文摘要

我们的目标是学习能够证明在新环境中泛化能力的机器人控制策略,给定一组示例环境的数据集。我们方法的关键技术思想是利用机器学习中的泛化理论工具,通过精确的类比(以缩减形式呈现)将控制策略在新环境中的泛化与监督学习中的假设泛化联系起来。特别是,我们利用Probably Approximately Correct (PAC)-Bayes框架,这使我们能够获得在新环境中(随机)控制策略预期成本的上界。我们提出策略学习算法,明确寻求最小化此上界。相应的优化问题可以在有限策略空间的设置中通过凸优化(特别是相对熵编程)解决。在更一般的情况下,对于连续参数化策略(例如神经网络策略),我们使用随机梯度下降来最小化此上界。我们展示了所提出方法应用于学习(1)反应性障碍物回避策略和(2)基于神经网络的抓取策略的模拟结果。我们还展示了Parrot Swing无人机在不同障碍物环境中的硬件结果。我们的例子展示了该方法在具有连续状态和动作空间、复杂(例如非线性)动态、丰富感官输入(例如深度图像)和基于神经网络的策略的机器人系统中提供强泛化保证的潜力。

英文摘要

Our goal is to learn control policies for robots that provably generalize well to novel environments given a dataset of example environments. The key technical idea behind our approach is to leverage tools from generalization theory in machine learning by exploiting a precise analogy (which we present in the form of a reduction) between generalization of control policies to novel environments and generalization of hypotheses in the supervised learning setting. In particular, we utilize the Probably Approximately Correct (PAC)-Bayes framework, which allows us to obtain upper bounds that hold with high probability on the expected cost of (stochastic) control policies across novel environments. We propose policy learning algorithms that explicitly seek to minimize this upper bound. The corresponding optimization problem can be solved using convex optimization (Relative Entropy Programming in particular) in the setting where we are optimizing over a finite policy space. In the more general setting of continuously parameterized policies (e.g., neural network policies), we minimize this upper bound using stochastic gradient descent. We present simulated results of our approach applied to learning (1) reactive obstacle avoidance policies and (2) neural network-based grasping policies. We also present hardware results for the Parrot Swing drone navigating through different obstacle environments. Our examples demonstrate the potential of our approach to provide strong generalization guarantees for robotic systems with continuous state and action spaces, complicated (e.g., nonlinear) dynamics, rich sensory inputs (e.g., depth images), and neural network-based policies.

1903.11483 2026-06-04 cs.LG cs.NE cs.RO cs.SY eess.SY stat.ML 版本更新

Constructing Parsimonious Analytic Models for Dynamic Systems via Symbolic Regression

通过符号回归构建动态系统的简洁解析模型

Erik Derner, Jiří Kubalík, Nicola Ancona, Robert Babuška

发表机构 * Czech Institute of Informatics, Robotics, and Cybernetics(捷克信息学、机器人学与自动化研究所) Czech Technical University in Prague(布拉格捷克技术大学) Department of Control Engineering, Faculty of Electrical Engineering(电气工程系控制工程系) Delft University of Technology(代尔夫特理工大学)

AI总结 本文提出利用符号回归构建动态系统的简洁解析模型,通过两种先进的符号回归算法在状态空间域和输入输出域中应用,展示了在模拟示例和真实系统中的优越性能。

详情
Journal ref
Applied Soft Computing, Volume 94, September 2020, 106432
AI中文摘要

构建动态系统的数学模型对于许多工程和科学学科至关重要。模型有助于模拟、分析系统行为、决策制定和自动控制算法的设计。即使像强化学习(RL)这样的无模型控制技术也已被证明能从使用模型中受益,通常这些模型是在线学习的。任何模型构建方法都必须处理模型的准确性和复杂性之间的权衡,这很难做到。本文提出利用符号回归(SR)来构建由解析方程描述的简洁过程模型。我们为方法配备了两种最先进的符号回归算法,它们自动搜索适合测量数据的方程:单节点遗传编程(SNGP)和多基因遗传编程(MGGP)。除了状态空间域中的标准问题表述外,我们还展示了该方法如何应用于非线性自回归加外生输入(NARX)类型的输入输出模型。我们展示了该方法在三个模拟示例中的应用,这些示例的状态空间最高可达14维:倒立摆、移动机器人和双足行走机器人。与深度神经网络和局部线性回归的比较表明,SR在大多数情况下优于这些常用替代方法。我们在真实摆系统上展示了解析模型的发现使RL控制器能够成功完成摆起任务,该模型仅基于100个数据样本构建。

英文摘要

Developing mathematical models of dynamic systems is central to many disciplines of engineering and science. Models facilitate simulations, analysis of the system's behavior, decision making and design of automatic control algorithms. Even inherently model-free control techniques such as reinforcement learning (RL) have been shown to benefit from the use of models, typically learned online. Any model construction method must address the tradeoff between the accuracy of the model and its complexity, which is difficult to strike. In this paper, we propose to employ symbolic regression (SR) to construct parsimonious process models described by analytic equations. We have equipped our method with two different state-of-the-art SR algorithms which automatically search for equations that fit the measured data: Single Node Genetic Programming (SNGP) and Multi-Gene Genetic Programming (MGGP). In addition to the standard problem formulation in the state-space domain, we show how the method can also be applied to input-output models of the NARX (nonlinear autoregressive with exogenous input) type. We present the approach on three simulated examples with up to 14-dimensional state space: an inverted pendulum, a mobile robot, and a bipedal walking robot. A comparison with deep neural networks and local linear regression shows that SR in most cases outperforms these commonly used alternative methods. We demonstrate on a real pendulum system that the analytic model found enables a RL controller to successfully perform the swing-up task, based on a model constructed from only 100 data samples.

1809.04048 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Accurate Tracking of Aggressive Quadrotor Trajectories using Incremental Nonlinear Dynamic Inversion and Differential Flatness

使用增量非线性动态逆反与微分平坦性准确跟踪攻击性四旋翼轨迹

Ezra Tal, Sertac Karaman

AI总结 本文提出了一种新的控制律,用于跟踪四旋翼的位置、偏航角及其至四阶导数,包括速度、加速度、加加速度和加加加速度以及偏航速率和偏航加速度。通过基于四旋翼动力学的微分平坦性,使用前馈输入跟踪加加速度和加加加速度。通过闭环电机速度控制,直接控制机体扭矩以实现加加加速度跟踪。控制器利用增量非线性动态逆反(INDI)在外部干扰如空气动力学阻力的情况下实现对线性和角加速度的鲁棒跟踪。通过响应分析严格分析所提出的控制律,并通过实验演示。控制器使四旋翼无人机能够跟踪复杂的3D轨迹,达到最高12.9 m/s的速度和2.1g的加速度,同时保持均方根跟踪误差仅为6.6 cm,在约18m×7m×3m的飞行体积内。此外,通过在飞行测试中附加拖板和在悬停时拉拽无人机来演示控制器的鲁棒性。

Comments To be published in IEEE Transactions on Control Systems Technology. Revision: new set of experiments at increased speed (up to 12.9 m/s), updated controller design using quaternion representation, new video available at https://youtu.be/K15lNBAKDCs

详情
AI中文摘要

自主无人机(UAV)能够执行攻击性(即高速和高加速度)动作已经引起了极大的关注。本文聚焦于准确跟踪攻击性四旋翼轨迹。我们提出了一种新的控制律,用于跟踪位置和偏航角及其导数至四阶,具体包括速度、加速度、加加速度和加加加速度以及偏航速率和偏航加速度。通过基于四旋翼动力学的微分平坦性,使用前馈输入跟踪加加速度和加加加速度。加加加速度跟踪需要直接控制机体扭矩,我们通过闭环电机速度控制,基于安装在电机上的光学编码器的测量来实现。控制器利用增量非线性动态逆反(INDI)在外部干扰如空气动力学阻力的情况下实现对线性和角加速度的鲁棒跟踪。因此,不需要先验建模空气动力学效应。我们通过响应分析严格分析所提出的控制律,并通过实验演示。控制器使四旋翼UAV能够跟踪复杂的3D轨迹,达到最高12.9 m/s的速度和2.1g的加速度,同时保持根均方跟踪误差仅为6.6 cm,在约18m×7m×3m的飞行体积内。我们还通过在飞行测试中附加拖板和在悬停时拉拽UAV来演示控制器的鲁棒性。

英文摘要

Autonomous unmanned aerial vehicles (UAVs) that can execute aggressive (i.e., high-speed and high-acceleration) maneuvers have attracted significant attention in the past few years. This paper focuses on accurate tracking of aggressive quadcopter trajectories. We propose a novel control law for tracking of position and yaw angle and their derivatives of up to fourth order, specifically, velocity, acceleration, jerk, and snap along with yaw rate and yaw acceleration. Jerk and snap are tracked using feedforward inputs for angular rate and angular acceleration based on the differential flatness of the quadcopter dynamics. Snap tracking requires direct control of body torque, which we achieve using closed-loop motor speed control based on measurements from optical encoders attached to the motors. The controller utilizes incremental nonlinear dynamic inversion (INDI) for robust tracking of linear and angular accelerations despite external disturbances, such as aerodynamic drag forces. Hence, prior modeling of aerodynamic effects is not required. We rigorously analyze the proposed control law through response analysis, and we demonstrate it in experiments. The controller enables a quadcopter UAV to track complex 3D trajectories, reaching speeds up to 12.9 m/s and accelerations up to 2.1g, while keeping the root-mean-square tracking error down to 6.6 cm, in a flight volume that is roughly 18 m by 7 m and 3 m tall. We also demonstrate the robustness of the controller by attaching a drag plate to the UAV in flight tests and by pulling on the UAV with a rope during hover.

1904.02209 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

The Green Choice: Learning and Influencing Human Decisions on Shared Roads

绿色选择:学习和影响共享道路上的人类决策

Erdem Bıyık, Daniel A. Lazar, Dorsa Sadigh, Ramtin Pedarsani

发表机构 * Department of Electrical Engineering, Stanford University(斯坦福大学电气工程系) Department of Electrical and Computer Engineering, UC Santa Barbara(加州大学圣芭芭拉分校电气与计算机工程系) Department of Computer Science, Stanford University(斯坦福大学计算机科学系)

AI总结 本文研究如何通过设计价格策略来影响人类在共享道路上的决策,以最大化道路使用效率并减少交通延误,核心方法是基于用户偏好算法和交通基本图模型的规划优化。

Comments Submitted to CDC 2019

详情
AI中文摘要

自动驾驶车辆通过编队有潜力提高道路容量,即使在与人类驾驶员共享道路的情况下。然而,当道路网络用户自私地选择路线时,所产生的交通配置可能非常低效。为此,我们考虑如何影响人类决策以减少道路上的拥堵。我们考虑一个由平行道路组成的网络,有两种交通模式:(i) 人类驾驶员会选择他们可用的最快路线,(ii) 滚动出行服务提供一系列自动驾驶车辆出行选项,每种选项有不同的价格。在本工作中,我们试图设计这些价格,使得当自动驾驶服务用户选择这些选项,而人类驾驶员自私地选择他们的路线时,道路使用最大化,交通延误最小化。为此,我们正式化了自动驾驶服务用户在不同价格/延误值路线之间做出选择的模型。开发基于偏好的算法来学习用户的偏好,并使用与交通基本图相关的车辆流模型,我们制定了一种规划优化以最大化社会目标,并展示了所提出路线和学习方案的好处。

英文摘要

Autonomous vehicles have the potential to increase the capacity of roads via platooning, even when human drivers and autonomous vehicles share roads. However, when users of a road network choose their routes selfishly, the resulting traffic configuration may be very inefficient. Because of this, we consider how to influence human decisions so as to decrease congestion on these roads. We consider a network of parallel roads with two modes of transportation: (i) human drivers who will choose the quickest route available to them, and (ii) ride hailing service which provides an array of autonomous vehicle ride options, each with different prices, to users. In this work, we seek to design these prices so that when autonomous service users choose from these options and human drivers selfishly choose their resulting routes, road usage is maximized and transit delay is minimized. To do so, we formalize a model of how autonomous service users make choices between routes with different price/delay values. Developing a preference-based algorithm to learn the preferences of the users, and using a vehicle flow model related to the Fundamental Diagram of Traffic, we formulate a planning optimization to maximize a social objective and demonstrate the benefit of the proposed routing and learning scheme.

1904.01068 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Efficient and Safe Exploration in Deterministic Markov Decision Processes with Unknown Transition Models

在未知转移模型的确定性马尔可夫决策过程中实现高效且安全的探索

Erdem Bıyık, Jonathan Margoliash, Shahrouz Ryan Alimo, Dorsa Sadigh

发表机构 * Stanford University(斯坦福大学) Jet Propulsion Laboratory(喷气推进实验室) California Institute of Technology(加州理工学院)

AI总结 本文提出了一种安全探索算法,通过利用Lipschitz连续性确保在探索过程中不访问危险状态,该算法在确定性马尔可夫决策过程中提供了确定性的安全保证,并通过模拟导航任务验证了其性能。

Comments Proceedings of the American Control Conference (ACC), July 2019. The first two authors have equal contribution

详情
AI中文摘要

我们提出了一种安全探索算法,用于具有未知转移模型的确定性马尔可夫决策过程。我们的算法通过利用Lipschitz连续性来保证安全性,确保在探索过程中不访问不安全的状态。与许多其他现有技术不同,所提供的安全保证是确定性的。我们的算法被优化以减少探索安全空间所需的操作次数。我们在导航任务的模拟中将我们的算法与基线方法进行了比较,以展示其性能。

英文摘要

We propose a safe exploration algorithm for deterministic Markov Decision Processes with unknown transition models. Our algorithm guarantees safety by leveraging Lipschitz-continuity to ensure that no unsafe states are visited during exploration. Unlike many other existing techniques, the provided safety guarantee is deterministic. Our algorithm is optimized to reduce the number of actions needed for exploring the safe space. We demonstrate the performance of our algorithm in comparison with baseline methods in simulation on navigation tasks.

1905.10706 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Interactive Differentiable Simulation

交互式可微分模拟

Eric Heiden, David Millard, Hejia Zhang, Gaurav S. Sukhatme

发表机构 * University of Southern California(南加州大学)

AI总结 本文提出交互式可微分模拟(IDS),一种能够高效准确推断刚体系统物理属性的可微分物理引擎,通过视觉输入实现系统识别,从而建立具有物理意义的世界模型,并在非线性动态系统中实现自动任务机器人设计和参数估计,显著提升了非线性控制领域的样本效率。

详情
AI中文摘要

智能体需要对世界有物理理解才能预测其未来行动的影响。虽然基于学习的环境动力学模型在样本效率上相比无模型强化学习算法有所改进,但通常无法泛化到训练数据之外的系统状态,且往往依赖于非解释性的潜在变量。我们引入交互式可微分模拟(IDS),一种可微分的物理引擎,能够高效准确地推断刚体系统的物理属性。将模型集成到深度学习架构中,该模型能够利用视觉输入实现系统识别,从而建立具有物理意义的世界模型。我们展示了通过自动计算IDS中的梯度,实现非线性动态系统的自动任务机器人设计和参数估计。当与自适应模型预测控制算法结合时,我们的方法在具有挑战性的非线性控制领域中,相比无模型强化学习算法显示出数量级的样本效率提升。

英文摘要

Intelligent agents need a physical understanding of the world to predict the impact of their actions in the future. While learning-based models of the environment dynamics have contributed to significant improvements in sample efficiency compared to model-free reinforcement learning algorithms, they typically fail to generalize to system states beyond the training data, while often grounding their predictions on non-interpretable latent variables. We introduce Interactive Differentiable Simulation (IDS), a differentiable physics engine, that allows for efficient, accurate inference of physical properties of rigid-body systems. Integrated into deep learning architectures, our model is able to accomplish system identification using visual input, leading to an interpretable model of the world whose parameters have physical meaning. We present experiments showing automatic task-based robot design and parameter estimation for nonlinear dynamical systems by automatically calculating gradients in IDS. When integrated into an adaptive model-predictive control algorithm, our approach exhibits orders of magnitude improvements in sample efficiency over model-free reinforcement learning algorithms on challenging nonlinear control domains.

1804.08871 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Representing the Unknown - Impact of Uncertainty on the Interaction between Decision Making and Trajectory Generation

表示未知 - 不确定性对决策制定与轨迹生成交互的影响

Marcus Nolte, Susanne Ernst, Jan Richelmann, Markus Maurer

发表机构 * Institute of Control Engineering(控制工程研究所)

AI总结 本文探讨了不确定性对自动驾驶车辆运动规划问题参数和环境模型要求的影响,强调了决策制定与轨迹生成之间明确接口的重要性。

详情
AI中文摘要

尽管自动驾驶车辆的运动规划已经讨论了超过二十年,但该领域仍然是高度活跃的研究领域,近年来发表了各种不同的方法。当考虑SAE Level 3+车辆的市场引入时,运动规划主题很可能在安全性和用户接受性之间经历更加详细的讨论。本文将讨论运动规划问题的参数和环境模型的要求。重点放在不同类型的不确定性(如传感器遮挡)的表示上,论证了决策制定与轨迹生成之间明确接口的重要性。

英文摘要

Even though motion planning for automated vehicles has been extensively discussed for more than two decades, it is still a highly active field of research with a variety of different approaches having been published in the recent years. When considering the market introduction of SAE Level 3+ vehicles, the topic of motion planning will most likely be subject to even more detailed discussions between safety and user acceptance. This paper shall discuss parameters of the motion planning problem and requirements to an environment model. The focus is put on the representation of different types of uncertainty at the example of sensor occlusion, arguing the importance of a well-defined interface between decision making and trajectory generation.

1810.03345 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Bounded Collision Force by the Sobolev Norm

通过Sobolev范数限制碰撞力

Kevin Haninger, Dragoljub Surdilovic

AI总结 本文提出利用Sobolev范数作为系统范数,为动态系统中的最大碰撞力提供严格界限,通过实验和仿真验证了该方法在分析机器人碰撞力与控制策略、关节灵活性及末端执行器柔顺性之间的关系方面的有效性。

Comments Accepted ICRA2019, suppoprted by EU H2020 programme, Grant #820689

详情
AI中文摘要

机器人与环境或人类接触时可能产生安全风险,包括过大的碰撞力。尽管文献中有关于机器人惯性、相对速度和界面刚度对碰撞影响的实验研究,但关于最大碰撞力的分析模型仍局限于简化质量-弹簧机器人模型。该简化模型限制了对控制(力/扭矩、阻抗或顺应性)或柔顺机器人(关节和末端执行器柔顺性)的分析。本文将Sobolev范数适配为系统范数,为一般动态系统中刚度元件的最大力提供严格界限,允许使用更精确的模型和反馈控制进行碰撞研究。Sobolev范数可通过转换系统的$\mathcal{H}_2$范数找到,允许高效计算、连接现有控制理论并合成控制器以最小化碰撞力。Sobolev范数首先通过顺应性控制机器人进行实验验证,然后在模拟线性柔性关节机器人中进行验证。随后,它被用于研究控制、关节灵活性和末端执行器柔顺性对碰撞的影响,并展示了碰撞性能与环境估计不确定性之间的权衡。

英文摘要

A robot making contact with an environment or human presents potential safety risks, including excessive collision force. While experiments on the effect of robot inertia, relative velocity, and interface stiffness on collision are in literature, analytical models for maximum collision force are limited to a simplified mass-spring robot model. This simplified model limits the analysis of control (force/torque, impedance, or admittance) or compliant robots (joint and end-effector compliance). Here, the Sobolev norm is adapted to be a system norm, giving rigorous bounds on the maximum force on a stiffness element in a general dynamic system, allowing the study of collision with more accurate models and feedback control. The Sobolev norm can be found through the $\mathcal{H}_2$ norm of a transformed system, allowing efficient computation, connection with existing control theory, and controller synthesis to minimize collision force. The Sobolev norm is validated, first experimentally with an admittance-controlled robot, then in simulation with a linear flexible-joint robot. It is then used to investigate the impact of control, joint flexibility and end-effector compliance on collision, and a trade-off between collision performance and environmental estimation uncertainty is shown.

1602.04450 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Bayesian Optimization with Safety Constraints: Safe and Automatic Parameter Tuning in Robotics

具有安全约束的贝叶斯优化:机器人中的安全自动参数调节

Felix Berkenkamp, Andreas Krause, Angela P. Schoellig

发表机构 * 1 Learning \& Adaptive Systems Group, Department of Computer Science, ETH Zurich, Switzerland. 2 Dynamic Systems Lab, Institute for Aerospace Studies, University of Toronto, Canada.

AI总结 本文提出了一种通用算法,允许在目标函数之外存在多个独立的安全约束。该算法在给定初始安全参数的情况下,最大化性能,但仅评估满足所有安全约束的参数。通过利用高斯过程先验的正则性假设,该算法仔细探索参数空间,并展示了如何利用上下文变量安全地将知识转移到新任务中。

详情
AI中文摘要

机器人算法通常依赖于各种参数,这些参数的选择对机器人的性能有显著影响。虽然初始参数猜测可以从机器人的动态模型中获得,但通常需要在真实系统上手动调整参数以达到最佳性能。优化算法,如贝叶斯优化,已被用来自动化这一过程。然而,这些方法在优化过程中可能会评估不安全的参数,导致安全关键系统的故障。最近,一种称为SafeOpt的安全贝叶斯优化算法已被开发,该算法保证系统性能永远不会低于临界值;即,安全性是基于性能函数定义的。然而,在机器人中,将性能和安全性结合往往并不理想。例如,高增益控制器可能实现低平均跟踪误差(性能),但可能会超调并违反输入约束。在本文中,我们提出了一种通用算法,允许在目标函数之外存在多个独立的安全约束。给定初始的安全参数集,该算法最大化性能,但只评估满足所有约束的参数,以高概率。为此,它通过利用高斯过程先验的正则性假设来仔细探索参数空间。此外,我们展示了如何利用上下文变量安全地将知识转移到新情况和任务中。我们提供了理论分析,并证明所提出的算法能够实现快速、自动和安全的参数调节优化,在四旋翼飞行器的实验中得到了验证。

英文摘要

Robotic algorithms typically depend on various parameters, the choice of which significantly affects the robot's performance. While an initial guess for the parameters may be obtained from dynamic models of the robot, parameters are usually tuned manually on the real system to achieve the best performance. Optimization algorithms, such as Bayesian optimization, have been used to automate this process. However, these methods may evaluate unsafe parameters during the optimization process that lead to safety-critical system failures. Recently, a safe Bayesian optimization algorithm, called SafeOpt, has been developed, which guarantees that the performance of the system never falls below a critical value; that is, safety is defined based on the performance function. However, coupling performance and safety is often not desirable in robotics. For example, high-gain controllers might achieve low average tracking error (performance), but can overshoot and violate input constraints. In this paper, we present a generalized algorithm that allows for multiple safety constraints separate from the objective. Given an initial set of safe parameters, the algorithm maximizes performance but only evaluates parameters that satisfy safety for all constraints with high probability. To this end, it carefully explores the parameter space by exploiting regularity assumptions in terms of a Gaussian process prior. Moreover, we show how context variables can be used to safely transfer knowledge to new situations and tasks. We provide a theoretical analysis and demonstrate that the proposed algorithm enables fast, automatic, and safe optimization of tuning parameters in experiments on a quadrotor vehicle.

1712.01491 2026-06-04 eess.SY cs.RO cs.SY 版本更新

TrackerBots: Autonomous Unmanned Aerial Vehicle for Real-Time Localization and Tracking of Multiple Radio-Tagged Animals

TrackerBots: 用于实时定位和跟踪多个无线电标签动物的自主无人机

Hoa Van Nguyen, Michael Chesser, Lian Pin Koh, S. Hamid Rezatofighi, Damith C. Ranasinghe

发表机构 * Wiley(威立出版集团)

AI总结 本文提出了一种名为TrackerBots的自主无人机系统,利用RSSI测量实现低成本轻量级的多只无线电标签动物的实时定位和跟踪,通过粒子滤波和POMDP动态路径规划提高导航效率并节省电量。

Comments The accepted version to the Journal of Field Robotics, published after the embargo period (12 months)

详情
Journal ref
Journal of Field Robotics. 2019; 36: 617 - 635
AI中文摘要

自主空中机器人为通过高效收集时间空间粒度信息研究濒危物种的栖息地和行为提供了新可能。我们提出了一种新的自主空中车辆系统-TrackerBots,用于跟踪和定位多个无线电标签动物。利用非常规场常用的高频(VHF)无线电颈环接收信号强度指示(RSSI)值的简单性,实现了一种低成本轻量级的跟踪平台,适合与无人机(UAV)集成。由于基于RSSI测量的系统存在不确定性和非线性,我们的跟踪和规划方法整合了粒子滤波用于跟踪和定位;部分可观测马尔可夫决策过程(POMDP)用于动态路径规划。这种方法允许无人机在最大信息增益方向上自主导航以定位多个移动动物并减少探索时间;从而节省机载电池电量。我们还采用了搜索终止标准的概念以在空中系统的功率限制内最大化所定位动物的数量。我们通过广泛的模拟和使用两个移动VHF无线电标签的实地实验验证了我们的实时和在线方法。

英文摘要

Autonomous aerial robots provide new possibilities to study the habitats and behaviors of endangered species through the efficient gathering of location information at temporal and spatial granularities not possible with traditional manual survey methods. We present a novel autonomous aerial vehicle system-TrackerBots-to track and localize multiple radio-tagged animals. The simplicity of measuring the received signal strength indicator (RSSI) values of very high frequency (VHF) radio-collars commonly used in the field is exploited to realize a low cost and lightweight tracking platform suitable for integration with unmanned aerial vehicles (UAVs). Due to uncertainty and the nonlinearity of the system based on RSSI measurements, our tracking and planning approaches integrate a particle filter for tracking and localizing; a partially observable Markov decision process (POMDP) for dynamic path planning. This approach allows autonomous navigation of a UAV in a direction of maximum information gain to locate multiple mobile animals and reduce exploration time; and, consequently, conserve onboard battery power. We also employ the concept of a search termination criteria to maximize the number of located animals within power constraints of the aerial system. We validated our real-time and online approach through both extensive simulations and field experiments with two mobile VHF radio-tags.

1711.00493 2026-06-04 eess.SY cs.RO cs.SY eess.SP 版本更新

Event-Triggered Diffusion Kalman Filters

事件触发扩散卡尔曼滤波器

Amr Alanwar, Hazem Said, Ankur Mehta, Matthias Althoff

发表机构 * Technical University of Munich(慕尼黑技术大学) Ain Shams University(艾因夏姆斯大学) University of California, Los Angeles(加州大学洛杉矶分校)

AI总结 本文提出了一种事件触发的扩散卡尔曼滤波器,通过本地信号指示估计误差来收集测量并交换信息,从而减少资源消耗并提高分布式状态估计的有效性。

详情
AI中文摘要

分布式状态估计强烈依赖于协作信号处理,这通常需要在资源受限的传感器节点上执行过多的通信和计算。为了解决这个问题,我们提出了一种事件触发的扩散卡尔曼滤波器,该滤波器根据本地信号指示的估计误差收集测量并交换信息。在此基础上,我们开发了一种节能的状态估计算法,该算法调节无线网络中的资源消耗,并确保每个消耗的资源的有效性。所提出的算法不需要节点共享其本地协方差矩阵,从而允许显著减少传输信息的数量。为了验证其效率,我们将所提出算法应用于分布式的同时定位和时间同步问题,并在移动四旋翼节点和静止的定制超宽带无线设备的物理测试台上进行评估。获得的实验结果表明,所提出的算法在通信开销方面节省了与原始扩散卡尔曼滤波器相关的86%,而仅导致性能下降16%。我们在线提供了Matlab代码和实际测试数据。

英文摘要

Distributed state estimation strongly depends on collaborative signal processing, which often requires excessive communication and computation to be executed on resource-constrained sensor nodes. To address this problem, we propose an event-triggered diffusion Kalman filter, which collects measurements and exchanges messages between nodes based on a local signal indicating the estimation error. On this basis, we develop an energy-aware state estimation algorithm that regulates the resource consumption in wireless networks and ensures the effectiveness of every consumed resource. The proposed algorithm does not require the nodes to share its local covariance matrices, and thereby allows considerably reducing the number of transmission messages. To confirm its efficiency, we apply the proposed algorithm to the distributed simultaneous localization and time synchronization problem and evaluate it on a physical testbed of a mobile quadrotor node and stationary custom ultra-wideband wireless devices. The obtained experimental results indicate that the proposed algorithm allows saving 86% of the communication overhead associated with the original diffusion Kalman filter while causing deterioration of performance by 16% only. We make the Matlab code and the real testing data available online.

1612.04704 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Distributed area coverage control with imprecise robot localization: Simulation and experimental studies

分布式区域覆盖控制与不精确机器人定位:仿真与实验研究

Sotiris Papatheodorou, Anthony Tzes, Konstantinos Giannousakis, Yiannis Stergiopoulos

AI总结 本文研究了具有不精确定位的移动机器人网络的区域覆盖问题,提出了一种基于保证Voronoi原理的分布式控制方法,通过仿真和实验验证了其有效性。

Comments 12 pages, 28 figures, submitted to IEEE Transactions on Automatic Control in 14 December 2016

详情
AI中文摘要

本文探讨了具有不精确定位的移动机器人网络的区域覆盖问题。每个机器人具有均匀的径向感知能力,由一阶动力学模型控制。凸空间基于保证Voronoi(GV)原理进行划分,每个机器人的责任区域对应其GV-cell,由双曲线弧界定。所提出的控制律是分布式的,需要其GV-Delaunay邻居的位置信息。提供了仿真和实验研究以突出所提控制律的有效性。

英文摘要

This article examines the area coverage problem for a network of mobile robots with imprecise agents' localization. Each robot has uniform radial sensing ability, governed by first order kinodynamics. The convex-space is partitioned based on the Guaranteed Voronoi (GV) principle and each robot's area of responsibility corresponds to its GV-cell, bounded by hyperbolic arcs. The proposed control law is distributed, demanding the positioning information about its GV-Delaunay neighbors. Simulation and experimental studies are offered to highlight the efficiency of the proposed control law.

1904.13317 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

A data-efficient geometrically inspired polynomial kernel for robot inverse dynamics

一种数据高效且受几何启发的多项式核用于机器人逆动力学

Alberto Dalla Libera, Ruggero Carli

AI总结 本文提出了一种基于高斯过程回归的数据驱动逆动力学估计器,引入了几何启发多项式核(GIP),该核在合适输入空间上将逆动力学描述为多项式函数,并证明其定义了有限维的再生核希尔伯特空间,包含刚体动力学计算的逆动力学函数,实验表明该方法在数据效率和泛化能力上优于其他数据驱动方法,同时相比模型驱动方法需要更少的先验信息且不受模型偏差影响。

详情
Journal ref
IEEE Robotics and Automation Letters, vol. 5, no. 1, pp. 24-31, Jan. 2020
AI中文摘要

在本文中,我们介绍了一种基于高斯过程回归的新数据驱动逆动力学估计器。受逆动力学可以描述为合适输入空间上的多项式函数的启发,我们提出了一个名为几何启发多项式核(GIP)的新核。所得到的估计器在数据效率方面与基于模型的方法相似。事实上,我们证明了GIP核定义了一个有限维的再生核希尔伯特空间,该空间包含通过刚体动力学计算的逆动力学函数。所提出的核基于最近引入的乘法多项式核,这是经典多项式核的重新定义,配备了允许更高正则化的参数集。我们已在模拟环境和UR10机器人的真实实验中测试了所提出的方法。获得的结果证实,与其它数据驱动估计器相比,所提出的方法在数据效率和泛化能力上更优。相反,与基于模型的估计器相比,我们的方法需要更少的先验信息且不受模型偏差影响。

英文摘要

In this paper, we introduce a novel data-driven inverse dynamics estimator based on Gaussian Process Regression. Driven by the fact that the inverse dynamics can be described as a polynomial function on a suitable input space, we propose the use of a novel kernel, called Geometrically Inspired Polynomial Kernel (GIP). The resulting estimator behaves similarly to model-based approaches as concerns data efficiency. Indeed, we proved that the GIP kernel defines a finite-dimensional Reproducing Kernel Hilbert Space that contains the inverse dynamics function computed through the Rigid Body Dynamics. The proposed kernel is based on the recently introduced Multiplicative Polynomial Kernel, a redefinition of the classical polynomial kernel equipped with a set of parameters that allows for a higher regularization. We tested the proposed approach in a simulated environment, and also in real experiments with a UR10 robot. The obtained results confirm that, compared to other data-driven estimators, the proposed approach is more data-efficient and exhibits better generalization properties. Instead, with respect to model-based estimators, our approach requires less prior information and is not affected by model bias.

1905.03416 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Prioritized Inverse Kinematics: Nonsmoothness, Trajectory Existence, Task Convergence, Stability

优先级逆运动学:非光滑性、轨迹存在性、任务收敛性、稳定性

Sang-ik An, Dongheui Lee

发表机构 * German Excellence Initiative(德国卓越计划) Institute of Robotics and Mechatronics, German Aerospace Center(机器人与机电研究所,德国航空航天中心)

AI总结 本文研究了一类优先级逆运动学(PIK)解的理论性质,探讨了其作为动态系统优先级多输出的输出调节或跟踪控制律的特性。首先,开发了研究PIK解非光滑性的工具,并发现非光滑性的充分条件。这表明经典定理无法保证满足PIK解的关节轨迹存在性和唯一性。因此,构建了一个利用PIK解结构信息的替代存在性和唯一性定理。接着,将PIK解的类别缩小到所有任务都遵循某些期望任务轨迹的情况,并发现与任务收敛性相关的性质。进一步分析了当所有任务都达到某些期望任务位置时,由PIK解作为右侧的微分方程的平衡点的稳定性。最后,通过一个双臂机械臂的例子,展示了如何利用这些发现来分析由PIK解生成的关节轨迹行为。

Comments 18 pages

详情
AI中文摘要

在本文中,我们研究了一类优先级逆运动学(PIK)解的各种理论性质,这些解可以被视为动态系统优先级多输出的输出调节或跟踪控制律。我们首先开发了研究PIK解非光滑性的工具,并发现非光滑性的充分条件。这表明经典定理无法保证满足PIK解的关节轨迹存在性和唯一性。因此,我们构建了一个利用PIK解结构信息的替代存在性和唯一性定理。接着,我们将PIK解的类别缩小到所有任务都遵循某些期望任务轨迹的情况,并发现与任务收敛性相关的性质。研究进一步分析了当所有任务都达到某些期望任务位置时,由PIK解作为右侧的微分方程的平衡点的稳定性。最后,我们通过一个双臂机械臂的例子,展示了如何利用这些发现来分析由PIK解生成的关节轨迹行为。

英文摘要

In this paper, we study various theoretical properties of a class of prioritized inverse kinematics (PIK) solutions that can be considered as a class of (output regulation or tracking) control laws of a dynamical system with prioritized multiple outputs. We first develop tools to investigate nonsmoothness of PIK solutions and find a sufficient condition for nonsmoothness. It implies that existence and uniqueness of a joint trajectory satisfying a PIK solution cannot be guaranteed by the classical theorems. So, we construct an alternative existence and uniqueness theorem that uses structural information of PIK solutions. Then, we narrow the class of PIK solutions down to the case that all tasks are designed to follow some desired task trajectories and discover a few properties related to task convergence. The study goes further to analyze stability of equilibrium points of the differential equation whose right hand side is a PIK solution when all tasks are designed to reach some desired task positions. Finally, we furnish an example with a two-link manipulator that shows how our findings can be used to analyze the behavior of a joint trajectory generated from a PIK solution.

1808.08252 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Inverse Statics Optimization for Compound Tensegrity Robots

复合张力结构机器人的逆静态优化

Andrew P. Sabelhaus, Albert H. Li, Kimberly A. Sover, Jacob Madden, Andrew Barkan, Adrian K. Agogino, Alice M. Agogino

发表机构 * Department of Mechanical Engineering, University of California Berkeley(加州大学伯克利分校机械工程系) Intelligent Systems Division, NASA Ames Research Center(NASA阿姆斯研究中心智能系统部)

AI总结 本文提出了一种方法,用于计算复合张力结构机器人的电缆张力,以实现静态平衡。该方法基于改进的力密度法,通过二次优化问题解决电缆张力计算,并通过仿真和硬件实验验证了其在脊柱机器人和四足机器人设计与控制中的有效性。

详情
AI中文摘要

由电缆驱动的张力结构(张力-完整性)机器人具有软机器人许多优点,如灵活性和鲁棒性,同时仍遵循简单的静态和动态模型。然而,现有的张力结构建模方法无法原生描述具有任意刚体的张力网络的机器人。本文提出了一种方法,用于计算此类张力结构机器人的电缆张力,这里定义为复合张力结构。首先,将复合张力结构机器人的静态平衡模型重新表述为用于其他张力结构的标准力密度法。接下来,我们提出了在所提出模型下计算机器人电缆张力的问题。提出了解决方案作为带有实际约束的二次优化问题。仿真展示了如何利用该逆静态优化问题来设计和控制两种不同的复合张力结构应用:由该脊柱制成的脊柱机器人和四足机器人。最后,通过硬件实验验证了逆静态模型的准确性,证明了使用所提出方法进行低误差开环控制的可行性。

英文摘要

Robots built from cable-driven tensegrity (`tension-integrity') structures have many of the advantages of soft robots, such as flexibility and robustness, while still obeying simple statics and dynamics models. However, existing tensegrity modeling approaches cannot natively describe robots with arbitrary rigid bodies in their tension network. This work presents a method to calculate the cable tensions in static equilibrium for such tensegrity robots, here defined as compound tensegrity. First, a static equilibrium model for compound tensegrity robots is reformulated from the standard force density method used with other tensegrity structures. Next, we pose the problem of calculating tension forces in the robot's cables under our proposed model. A solution is proposed as a quadratic optimization problem with practical constraints. Simulations illustrate how this inverse statics optimization problem can be used for both the design and control of two different compound tensegrity applications: a spine robot and a quadruped robot built from that spine. Finally, we verify the accuracy of the inverse statics model through a hardware experiment, demonstrating the feasibility of low-error open-loop control using our proposed methodology.

1705.05415 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Robotic Wireless Sensor Networks

机器人无线传感器网络

Pradipta Ghosh, Andrea Gasparri, Jiong Jin, Bhaskar Krishnamachari

发表机构 * University of Southern California(南加州大学) Università degli studi "Roma Tre"(罗马三大学) Swinburne University of Technology(斯winburne技术大学)

AI总结 本文综述了机器人与无线传感器网络交叉领域的新兴研究领域,探讨了多机器人系统在满足通信性能要求的同时实现传感目标的协同控制、学习和适应方法,并指出现有文献中存在的一些研究空白和未来研究方向。

详情
AI中文摘要

在本章中,我们介绍了机器人与无线传感器网络(WSN)交叉领域的新兴、前沿且跨学科的研究领域,称为机器人无线传感器网络(RWSN)。我们定义RWSN为一种自主的多机器人网络系统,旨在通过协同控制、学习和适应,实现特定的传感目标,同时满足和维持一定的通信性能要求。尽管机器人和WSN这两个领域都非常知名且研究充分,但这两个领域交叉处存在大量新的机会和研究方向,这些方向要么相对未被探索,要么完全未被探索。例如,使用一组机器人路由器来建立发送方和接收方之间的临时通信路径,利用受控的移动性优势来改进数据路由。我们发现,直接归类为RWSN相关研究的文献数量非常有限,而机器人和WSN文献中存在一些相关于这一新研究领域的文章。为了连接这些点,我们首先识别了与RWSN相关的核心问题和研究趋势,如连通性、定位、路由和信息的鲁棒流动。接着,我们根据第一步中识别的问题和趋势,将现有的RWSN研究以及机器人和WSN社区的相关最新进展进行分类。最后,我们分析现有文献中缺失的部分,并确定未来需要更多研究关注的主题。

英文摘要

In this chapter, we present a literature survey of an emerging, cutting-edge, and multi-disciplinary field of research at the intersection of Robotics and Wireless Sensor Networks (WSN) which we refer to as Robotic Wireless Sensor Networks (RWSN). We define a RWSN as an autonomous networked multi-robot system that aims to achieve certain sensing goals while meeting and maintaining certain communication performance requirements, through cooperative control, learning and adaptation. While both of the component areas, i.e., Robotics and WSN, are very well-known and well-explored, there exist a whole set of new opportunities and research directions at the intersection of these two fields which are relatively or even completely unexplored. One such example would be the use of a set of robotic routers to set up a temporary communication path between a sender and a receiver that uses the controlled mobility to the advantage of packet routing. We find that there exist only a limited number of articles to be directly categorized as RWSN related works whereas there exist a range of articles in the robotics and the WSN literature that are also relevant to this new field of research. To connect the dots, we first identify the core problems and research trends related to RWSN such as connectivity, localization, routing, and robust flow of information. Next, we classify the existing research on RWSN as well as the relevant state-of-the-arts from robotics and WSN community according to the problems and trends identified in the first step. Lastly, we analyze what is missing in the existing literature, and identify topics that require more research attention in the future.

1905.08314 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Longitudinal Dynamic versus Kinematic Models for Car-Following Control Using Deep Reinforcement Learning

纵向动态模型与运动学模型在使用深度强化学习的汽车跟随控制中的比较

Yuan Lin, John McPhee, Nasser L. Azad

发表机构 * University of Waterloo, Ontario, Canada(加拿大温哥华大学)

AI总结 本文研究了在考虑车辆动力学的情况下,使用深度强化学习的纵向汽车跟随控制问题,通过引入延迟的控制输入和实际车辆加速度到强化学习环境状态中,改进了DRL框架,从而在考虑车辆动力学时实现了接近最优的控制性能。

Comments Accepted to 2019 IEEE Intelligent Transportation Systems Conference

详情
AI中文摘要

目前大多数关于通过深度强化学习(DRL)实现自动驾驶车辆控制的研究都使用点质量运动学模型,忽略了车辆动力学,包括加速度延迟和加速度命令动力学。加速度延迟源于传感和执行延迟,导致控制输入执行延迟。加速度命令动力学决定了实际车辆加速度不会立即达到期望的命令加速度,因为存在动力学限制。在本工作中,我们研究了将使用车辆运动学模型训练的DRL控制器应用于更现实的驾驶控制中的可行性。我们考虑了一个特定的纵向汽车跟随控制问题,即自适应巡航控制系统(ACC),该问题通过使用点质量运动学模型的DRL解决。当此类控制器应用于具有车辆动力学的汽车跟随时,我们观察到显著退化的汽车跟随性能。因此,我们重新设计DRL框架,通过将延迟的控制输入和实际车辆加速度分别添加到强化学习环境状态中,以适应加速度延迟和加速度命令动力学。训练结果表明,改进后的DRL控制器在考虑车辆动力学时的汽车跟随控制性能接近最优,与动态规划解决方案相比。

英文摘要

The majority of current studies on autonomous vehicle control via deep reinforcement learning (DRL) utilize point-mass kinematic models, neglecting vehicle dynamics which includes acceleration delay and acceleration command dynamics. The acceleration delay, which results from sensing and actuation delays, results in delayed execution of the control inputs. The acceleration command dynamics dictates that the actual vehicle acceleration does not rise up to the desired command acceleration instantaneously due to dynamics. In this work, we investigate the feasibility of applying DRL controllers trained using vehicle kinematic models to more realistic driving control with vehicle dynamics. We consider a particular longitudinal car-following control, i.e., Adaptive Cruise Control (ACC), problem solved via DRL using a point-mass kinematic model. When such a controller is applied to car following with vehicle dynamics, we observe significantly degraded car-following performance. Therefore, we redesign the DRL framework to accommodate the acceleration delay and acceleration command dynamics by adding the delayed control inputs and the actual vehicle acceleration to the reinforcement learning environment state, respectively. The training results show that the redesigned DRL controller results in near-optimal control performance of car following with vehicle dynamics considered when compared with dynamic programming solutions.

1903.12311 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Mesh-based Tools to Analyze Deep Reinforcement Learning Policies for Underactuated Biped Locomotion

基于网格的工具用于分析深度强化学习策略在欠驱动双足运动中的稳定性与鲁棒性

Nihar Talele, Katie Byl

AI总结 本文提出了一种基于网格的方法,用于分析通过深度强化学习获得的双足运动策略的稳定性与鲁棒性,通过量化评估策略鲁棒性的程度,提供更高效的工具来评估此类策略的鲁棒性特性。

详情
AI中文摘要

在本文中,我们提出了一种基于网格的方法,用于分析通过深度强化学习获得的五连杆平面模型各种双足步态策略的稳定性与鲁棒性。直观上,人们可能会认为在训练过程中包含扰动和/或其它类型的噪声会导致最终的控制策略更加鲁棒。然而,人们也希望有一个定量且计算高效的手段来评估这种可能性的程度。而不是依赖蒙特卡洛模拟,这种模拟在量化性能指标时可能会变得计算负担很重,我们的目标是提供更复杂的工具来评估此类策略的鲁棒性特性。我们的工作受双重视假的启发,即当动态收缩可行时,可以简化所需控制策略的复杂性,并且通过深度学习获得的控制策略可能会倾向于收缩到全状态空间中的低维流形,从而产生这种倾向。本文中网格工具的可操作性提供了一些证据表明这可能是正确的。

英文摘要

In this paper, we present a mesh-based approach to analyze stability and robustness of the policies obtained via deep reinforcement learning for various biped gaits of a five-link planar model. Intuitively, one would expect that including perturbations and/or other types of noise during training would likely result in more robustness of the resulting control policy. However, one would also like to have a quantitative and computationally-efficient means of evaluating the degree to which this might be so. Rather than relying on Monte Carlo simulations, which can become quite computationally burdensome in quantifying performance metrics, our goal is to provide more sophisticated tools to assess robustness properties of such policies. Our work is motivated by the twin hypotheses that contraction of dynamics, when achievable, can simplify the required complexity of a control policy and that control policies obtained via deep learning may therefore exhibit tendency to contract to lower-dimensional manifolds within the full state space, as a result. The tractability of our mesh-based tools in this work provides some evidence that this may be so.

1904.11898 2026-06-04 cs.RO cs.CV cs.LG cs.SY eess.SY 版本更新

Perceptual Attention-based Predictive Control

基于感知注意力的预测控制

Keuntaek Lee, Gabriel Nakajima An, Viacheslav Zakharov, Evangelos A. Theodorou

发表机构 * Georgia Institute of Technology(佐治亚理工学院)

AI总结 本文提出了一种新的信息处理架构,用于安全的深度学习视觉导航系统,通过模型预测控制(MPC)、卷积神经网络(CNNs)和不确定性量化方法,实现基于感知注意力的预测控制算法,提高了系统对不安全状况的快速检测能力。

详情
AI中文摘要

在本文中,我们提出了一种新的信息处理架构,用于安全的基于深度学习的视觉导航自主系统。所提出的信息处理架构用于支持一种基于感知注意力的预测控制算法,该算法利用模型预测控制(MPC)、卷积神经网络(CNNs)和不确定性量化方法。我们的方法新颖之处在于利用MPC学习如何在视觉输入的相关区域上放置注意力,从而最终使系统能够更快速地检测到不安全状况。我们通过使用MPC学习如何选择输入图像中的感兴趣区域,这些区域用于输出控制动作以及在注意力感知的视觉输入中的epistemic和aleatoric不确定性估计。我们使用这些不确定性估计来量化在当前导航条件下网络控制器的安全性。所提出的架构和算法在1:5比例的陆地车辆上进行了测试。实验结果表明,所提出的算法在早期检测不安全状况方面优于先前的方法,例如当导航环境中出现新障碍物时。所提出的架构是向在安全关键领域使用基于深度学习的感知控制策略迈出的第一步。

英文摘要

In this paper, we present a novel information processing architecture for safe deep learning-based visual navigation of autonomous systems. The proposed information processing architecture is used to support a perceptual attention-based predictive control algorithm that leverages model predictive control (MPC), convolutional neural networks (CNNs), and uncertainty quantification methods. The novelty of our approach lies in using MPC to learn how to place attention on relevant areas of the visual input, which ultimately allows the system to more rapidly detect unsafe conditions. We accomplish this by using MPC to learn to select regions of interest in the input image, which are used to output control actions as well as estimates of epistemic and aleatoric uncertainty in the attention-aware visual input. We use these uncertainty estimates to quantify the safety of our network controller under the current navigation condition. The proposed architecture and algorithm is tested on a 1:5 scale terrestrial vehicle. Experimental results show that the proposed algorithm outperforms previous approaches on early detection of unsafe conditions, such as when novel obstacles are present in the navigation environment. The proposed architecture is the first step towards using deep learning-based perceptual control policies in safety-critical domains.

1809.03674 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Planar Cooperative Extremum Seeking with Guaranteed Convergence Using A Three-Robot Formation

平面协作极值搜索与保证收敛的三机器人编队

Anna Skobeleva, Baris Fidan, V. Ugrinovskii, Ian R. Petersen

发表机构 * Mechanical and Mechatronics Engineering Department, University of Waterloo(滑铁卢大学机械与机电工程系)

AI总结 本文提出了一种结合编队获取和协作极值搜索控制的方案,用于在平面上移动的三机器人团队。极值搜索任务是找到一个未知二维函数在平面上的最大值点。该函数表示由于位于最大值点的源产生的信号强度场,并假设该函数在最大值点附近局部凹凸且随距离源点的增加而单调递减。通过在特定领头机器人位置和最大值点处对场函数进行泰勒展开,并结合基于机器人信号强度测量的梯度估计器,设计并分析了所提出的控制方案。所提出的方案被证明可以指数收敛并同时(i)获取指定的几何编队,(ii)将领头机器人驱动到指定的最大值点附近的盘内,该盘的半径取决于指定的期望编队大小以及场函数的海森矩阵范数上限。通过一组仿真实验评估了所提出控制方案的性能。

Comments Presented at the 2018 IEEE Conference on Decision and Control (CDC), Miami Beach, FL, USA

详情
AI中文摘要

本文提出了一种结合编队获取和协作极值搜索控制的方案,用于在平面上移动的三机器人团队。极值搜索任务是找到一个未知二维函数在平面上的最大值点。该函数表示由于位于最大值点的源产生的信号强度场,并假设该函数在最大值点附近局部凹凸且随距离源点的增加而单调递减。通过在特定领头机器人位置和最大值点处对场函数进行泰勒展开,并结合基于机器人信号强度测量的梯度估计器,设计并分析了所提出的控制方案。所提出的方案被证明可以指数收敛并同时(i)获取指定的几何编队,(ii)将领头机器人驱动到指定的最大值点附近的盘内,该盘的半径取决于指定的期望编队大小以及场函数的海森矩阵范数上限。通过一组仿真实验评估了所提出控制方案的性能。

英文摘要

In this paper, a combined formation acquisition and cooperative extremum seeking control scheme is proposed for a team of three robots moving on a plane. The extremum seeking task is to find the maximizer of an unknown two-dimensional function on the plane. The function represents the signal strength field due to a source located at maximizer, and is assumed to be locally concave around maximizer and monotonically decreasing in distance to the source location. Taylor expansions of the field function at the location of a particular lead robot and the maximizer are used together with a gradient estimator based on signal strength measurements of the robots to design and analyze the proposed control scheme. The proposed scheme is proven to exponentially and simultaneously (i) acquire the specified geometric formation and (ii) drive the lead robot to a specified neighborhood disk around maximizer, whose radius depends on the specified desired formation size as well as the norm bounds of the Hessian of the field function. The performance of the proposed control scheme is evaluated using a set of simulation experiments.

1402.3735 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Decentralized Goal Assignment and Safe Trajectory Generation in Multi-Robot Networks via Multiple Lyapunov Functions

多机器人网络中基于多重李雅普诺夫函数的去中心化目标分配与安全轨迹生成

Dimitra Panagou, Matthew Turpin, Vijay Kumar

发表机构 * GRASP Lab, School of Engineering and Applied Science, University of Pennsylvania(宾夕法尼亚大学工程与应用科学学院GRASP实验室)

AI总结 本文研究了在仅本地通信可用的情况下多机器人网络的去中心化目标分配与轨迹生成问题,提出了一种基于切换系统和集合不变性方法的解决方案。通过使用一组李雅普诺夫-like函数编码候选目标分配的局部决策,使连接的代理选择最短总距离到目标的分配。当最优分配可能导致碰撞轨迹时,使用另一组李雅普诺夫-like屏障函数来维持系统安全并保持收敛保证。所提出的切换策略产生计算高效且可扩展的反馈控制策略,适用于在有限信息共享下机器人网络的快速响应部署。

详情
AI中文摘要

本文考虑了在仅本地通信可用的情况下多机器人网络的去中心化目标分配与轨迹生成问题,并提出了一种基于与切换系统和集合不变性相关的方法的解决方案。采用一组李雅普诺夫-like函数来编码候选目标分配之间的(局部)决策,使得一组连接的代理选择导致到目标的最短总距离的分配。在最优分配可能导致碰撞轨迹的情况下,激活另一组李雅普诺夫-like屏障函数,从而在保持系统安全的同时保留收敛保证。所提出的切换策略产生了计算高效且可扩展的反馈控制策略,因此适用于在有限信息共享下机器人网络的快速响应部署。通过模拟结果和六台地面机器人的实验验证了所提出方法的有效性。

英文摘要

This paper considers the problem of decentralized goal assignment and trajectory generation for multi-robot networks when only local communication is available, and proposes an approach based on methods related to switched systems and set invariance. A family of Lyapunov-like functions is employed to encode the (local) decision making among candidate goal assignments, under which a group of connected agents chooses the assignment that results in the shortest total distance to the goals. An additional family of Lyapunov-like barrier functions is activated in the case when the optimal assignment may lead to colliding trajectories, maintaining thus system safety while preserving the convergence guarantees. The proposed switching strategies give rise to feedback control policies that are computationally efficient and scalable with the number of agents, and therefore suitable for applications including first-response deployment of robotic networks under limited information sharing. The efficacy of the proposed method is demonstrated via simulation results and experiments with six ground robots.

1811.12819 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Structure-Preserving Constrained Optimal Trajectory Planning of a Wheeled Inverted Pendulum

保持结构的约束最优轨迹规划 of 一个轮式倒置摆

Klaus Albert, Karmvir Singh Phogat, Felix Anhalt, Ravi N Banavar, Debasish Chatterjee, Boris Lohmann

发表机构 * Systems and Control Engineering(系统与控制工程)

AI总结 本文研究了轮式倒置摆的约束最优轨迹规划问题,通过离散力学推导了离散时间模型,并利用离散时间约束最优控制问题生成最优轨迹,同时设计了非线性连续时间模型和闭环LQ控制器,通过实验验证了非线性模型和控制方案的有效性。

Comments 12 pages, 8 figures, 1 table. arXiv admin note: text overlap with arXiv:1710.10932

详情
AI中文摘要

轮式倒置摆(WIP)是一个欠驱动、非完整机电系统,已被商业化为Segway。设计一个控制律进行运动规划,同时考虑状态和控制约束,并尊重配置流形,是一个具有挑战性的问题。本文通过离散力学推导了WIP系统的离散时间模型,并通过求解离散时间约束最优控制问题生成WIP系统的最优轨迹。进一步,我们描述了一个带有参数的非线性连续时间模型,用于设计闭环LQ控制器。所设计的最优轨迹作为参考提供给机器人,同时最优控制轨迹作为前馈控制作用,反馈模式下的LQ控制器用于抑制噪声和干扰,以确保WIP系统的稳定运动。在进行涉及剧烈操作和较为急转弯的WIP系统实验时,我们发现设计的最优轨迹与机器人跟踪这些轨迹时所走的路径高度一致。这证实了非线性模型和控制方案的有效性。最后,这些实验展示了WIP系统的高度非线性特性和控制方案的鲁棒性。

英文摘要

The Wheeled Inverted Pendulum (WIP) is an underactuated, nonholonomic mechatronic system, and has been popularized commercially as the Segway. Designing a control law for motion planning, that incorporates the state and control constraints, while respecting the configuration manifold, is a challenging problem. In this article we derive a discrete-time model of the WIP system using discrete mechanics and generate optimal trajectories for the WIP system by solving a discrete-time constrained optimal control problem. Further, we describe a nonlinear continuous-time model with parameters for designing a closed loop LQ-controller. A dual control architecture is implemented in which the designed optimal trajectory is then provided as a reference to the robot with the optimal control trajectory as a feedforward control action, and an LQ-controller in the feedback mode is employed to mitigate noise and disturbances for ensuing stable motion of the WIP system. While performing experiments on the WIP system involving aggressive maneuvers with fairly sharp turns, we found a high degree of congruence in the designed optimal trajectories and the path traced by the robot while tracking these trajectories. This corroborates the validity of the nonlinear model and the control scheme. Finally, these experiments demonstrate the highly nonlinear nature of the WIP system and robustness of the control scheme.

1802.00714 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Incremental Control and Guidance of Hybrid Aircraft Applied to a Tailsitter UAV

增量控制与引导用于混合式飞行器的尾旋无人机

E. J. J. Smeur, M. Bronz, G. C. H. E. de Croon

发表机构 * Delft University of Technology(代尔夫特理工大学) ENAC, MAIAA, University of Toulouse(法国图卢兹大学)

AI总结 本文提出了一种增量非线性动态逆控制方法,用于混合式飞行器的姿态和位置控制,实现了一个连续的控制器,能够跨飞行包线跟踪飞行器的期望加速度,并在尾旋无人机上进行了多场户外实验验证。

Comments 20 pages, 26 figures

详情
Journal ref
Journal of Guidance, Control and Dynamics, September 2019 [online]
AI中文摘要

混合无人飞行器可以显著提高微空气车辆的潜力,因为它们结合了悬停能力和机翼以实现快速高效的前进飞行。然而,这些车辆很难控制,因为其空气动力学难以建模且容易受到风切变影响。这通常导致复杂的复合控制器,具有悬停、过渡和前进飞行的不同模式。在本文中,我们提出了一种增量非线性动态逆控制用于姿态和位置控制。结果是一个单一的连续控制器,能够跨飞行包线跟踪飞行器的期望加速度。所提出的控制器在Cyclone混合无人机上实现。进行了多次户外实验,显示未建模的力和力矩被增量控制结构有效补偿。最后,我们提供了一种全面的程序,用于在其他类型的混合无人机上实现该控制器。

英文摘要

Hybrid unmanned aircraft can significantly increase the potential of micro air vehicles, because they combine hovering capability with a wing for fast and efficient forward flight. However, these vehicles are very difficult to control, because their aerodynamics are hard to model and they are susceptible to wind gusts. This often leads to composite and complex controllers, with different modes for hover, transition and forward flight. In this paper, we propose incremental nonlinear dynamic inversion control for the attitude and position control. The result is a single, continuous controller, that is able to track the desired acceleration of the vehicle across the flight envelope. The proposed controller is implemented on the Cyclone hybrid UAV. Multiple outdoor experiments are performed, showing that unmodeled forces and moments are effectively compensated by the incremental control structure. Finally, we provide a comprehensive procedure for the implementation of the controller on other types of hybrid UAVs.

1904.03830 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Hybrid Compositional Approach to Optimal Mission Planning for Multi-rotor UAVs using Metric Temporal Logic

一种基于度量时序逻辑的混合组合方法用于多旋翼无人机最优任务规划

Usman A. Fiaz, John S. Baras

AI总结 本文提出了一种混合组合方法,用于多旋翼无人机在受约束环境中的时间敏感搜索救援任务规划,通过将度量时序逻辑形式化描述任务规范,并利用混合模型捕捉无人机的各种操作模式,将任务分解为子任务并使用混合整数线性规划求解器求解最优路径,从而降低计算复杂度并实现实时应用。

Comments 8 pages, 5 figures, 1 table. Fixed typos, added new references

详情
AI中文摘要

本文研究了一种混合组合方法用于多旋翼无人机的最优任务规划。我们考虑了一个时间敏感的搜索和救援场景,其中两个四旋翼无人机在一个受约束的环境中运行。度量时序逻辑(MTL)用于正式描述任务规范。为了捕捉无人机的各种操作模式,我们使用一个混合模型,该模型在不同操作点周围线性化动力学。我们通过利用各种任务规范的不变性质,即路径上的安全性和时间约束的相互独立性以及机器人不同模式(即动力学)来将任务分解为多个子任务。对于每个子任务,我们将MTL公式转换为线性约束,并使用混合整数线性规划(MILP)求解器求解所需的路径最优控制问题。完整的路径由各个最优子路径的组合构成。我们证明所得到的轨迹满足任务规范,并且所提出的方法显著降低了问题的计算复杂度,使其能够实现实时应用。

英文摘要

This paper investigates a hybrid compositional approach to optimal mission planning for multi-rotor Unmanned Aerial Vehicles (UAVs). We consider a time critical search and rescue scenario with two quadrotors in a constrained environment. Metric Temporal Logic (MTL) is used to formally describe the task specifications. In order to capture the various modes of UAV operation, we utilize a hybrid model for the system with linearized dynamics around different operating points. We divide the mission into several sub-tasks by exploiting the invariant nature of various task specifications i.e., the mutual independence of safety and timing constraints along the way, and the different modes (i,e., dynamics) of the robot. For each sub-task, we translate the MTL formulae into linear constraints, and solve the associated optimal control problem for desired path using a Mixed Integer Linear Program (MILP) solver. The complete path is constructed by the composition of individual optimal sub-paths. We show that the resulting trajectory satisfies the task specifications, and the proposed approach leads to significant reduction in computational complexity of the problem, making it possible to implement in real-time.

1610.04391 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

A guiding vector field algorithm for path following control of nonholonomic mobile robots

一种基于引导矢量场的路径跟随控制非holonomic移动机器人的算法

Yuri A. Kapitanyuk, Anton V. Proskurnikov, Ming Cao

发表机构 * Delft Center for Systems and Control at Delft University of Technology(代尔夫特理工大学系统与控制中心)

AI总结 本文提出了一种基于引导矢量场(GVF)思想的非holonomic移动机器人路径跟随控制算法,该算法能够将任意光滑曲线作为期望路径,通过设计引导矢量场使其积分曲线收敛到轨迹,并通过非线性运动控制器使机器人沿该积分曲线运动,最终到达期望路径,同时通过实验验证了算法的全局收敛性和有效性。

Comments under review in IEEE Transactions on Control Systems Technology

详情
AI中文摘要

在本文中,我们提出了一种基于引导矢量场(GVF)思想的非holonomic移动机器人路径跟随控制算法。期望路径可以是任意光滑曲线,即预定义光滑函数的等高线。利用该函数和机器人的运动学模型,我们设计了一个GVF,其积分曲线收敛到轨迹。随后提出一个非线性运动控制器,使机器人沿该积分曲线运动,最终到达期望路径。我们建立了该算法的全局收敛条件,并通过真实轮式机器人实验验证了其适用性和性能。

英文摘要

In this paper we propose an algorithm for path following control of the nonholonomic mobile robot based on the idea of the guiding vector field (GVF). The desired path may be an arbitrary smooth curve in its implicit form, that is, a level set of a predefined smooth function. Using this function and the robot's kinematic model, we design a GVF, whose integral curves converge to the trajectory. A nonlinear motion controller is then proposed which steers the robot along such an integral curve, bringing it to the desired path. We establish global convergence conditions for our algorithm and demonstrate its applicability and performance by experiments with real wheeled robots.

1905.05574 2026-06-04 cs.IT cs.DC cs.PF cs.RO cs.SY eess.SY math.IT 版本更新

Coded Distributed Tracking

编码分布式追踪

Albin Severinson, Eirik Rosnes, Alexandre Graell i Amat

发表机构 * Department of Electrical Engineering, Chalmers University of Technology(电气工程系,查尔姆斯理工大学)

AI总结 本文提出了一种云辅助的分布式追踪方案,利用编码分布式计算方法提高追踪的及时性和准确性,并通过MDS码进一步提升大规模更新间隔下的估计精度,同时揭示了信息年龄与估计精度之间的权衡。

Comments Accepted for publication at IEEE GLOBECOM 2019

详情
AI中文摘要

我们考虑在分布式环境中跟踪随时间演化的过程状态的问题,其中多个观察者各自观测状态的部分。我们提出了一种云辅助方案,其中追踪在云上进行。为了提供及时且准确的更新,并缓解云计算中的straggler问题,我们提出了一种编码分布式计算方法,其中编码的观测被分布到多个工作者上。所提出的方案基于一种编码版本的卡尔曼滤波器,该滤波器在使用擦除纠正码编码的数据上运行,使得状态可以从部分由子集工作者计算的更新中进行估计。我们将所提出的方法应用于跟踪多个车辆的问题。我们显示,复制实现了比相应未编码方案显著更高的精度。使用最大距离可分离(MDS)码进一步在较大的更新间隔下提高精度。在两种情况下,所提出的方案在更新间隔足够大时接近理想集中式方案的精度。最后,我们观察到MDS码在信息年龄和估计精度之间存在权衡。

英文摘要

We consider the problem of tracking the state of a process that evolves over time in a distributed setting, with multiple observers each observing parts of the state, which is a fundamental information processing problem with a wide range of applications. We propose a cloud-assisted scheme where the tracking is performed over the cloud. In particular, to provide timely and accurate updates, and alleviate the straggler problem of cloud computing, we propose a coded distributed computing approach where coded observations are distributed over multiple workers. The proposed scheme is based on a coded version of the Kalman filter that operates on data encoded with an erasure correcting code, such that the state can be estimated from partial updates computed by a subset of the workers. We apply the proposed scheme to the problem of tracking multiple vehicles. We show that replication achieves significantly higher accuracy than the corresponding uncoded scheme. The use of maximum distance separable (MDS) codes further improves accuracy for larger update intervals. In both cases, the proposed scheme approaches the accuracy of an ideal centralized scheme when the update interval is large enough. Finally, we observe a trade-off between age-of-information and estimation accuracy for MDS codes.

1903.12605 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Stable, Concurrent Controller Composition for Multi-Objective Robotic Tasks

多目标机器人任务的稳定并发控制器组合

Anqi Li, Ching-An Cheng, Byron Boots, Magnus Egerstedt

发表机构 * Institute for Robotics and Intelligent Machines, Georgia Institute of Technology(机器人与智能机器研究所,佐治亚理工学院)

AI总结 本文提出了一种稳定且并发的控制器组合方法,用于多目标机器人任务,通过分解任务为子任务并独立设计子任务控制器,再利用RMPflow框架结合生成整体控制策略,同时通过CLF分析确保系统稳定性。

Comments The 58th IEEE Conference on Decision and Control (CDC), 2019

详情
AI中文摘要

机器人系统往往需要同时考虑多个任务。这一挑战要求控制器合成算法能够在满足多个控制规范的同时保持系统稳定性。本文将多目标任务分解为子任务,其中每个子任务控制器独立设计,然后结合生成整体控制策略。特别地,我们采用最近提出的机器人控制器结构Riemannian Motion Policies(RMPs),以及其相关的计算框架RMPflow,用于结合RMP控制器。我们通过严格的控制Lyapunov函数(CLF)处理重新建立并扩展了RMPflow的稳定性结果。然后我们证明RMPflow能够稳定地结合满足一定CLF约束的独立设计的子任务控制器。这一新见解导致了一种高效的基于CLF的计算框架,用于生成同时考虑所有子任务的稳定控制器。与原始使用RMPflow相比,我们的框架为用户提供了一种通过名义控制器将设计启发式融入子任务的灵活性。我们通过数值模拟和机器人实现验证了所提出的计算框架。

英文摘要

Robotic systems often need to consider multiple tasks concurrently. This challenge calls for controller synthesis algorithms that fulfill multiple control specifications while maintaining the stability of the overall system. In this paper, we decompose multi-objective tasks into subtasks, where individual subtask controllers are designed independently and then combined to generate the overall control policy. In particular, we adopt Riemannian Motion Policies (RMPs), a recently proposed controller structure in robotics, and, RMPflow, its associated computational framework for combining RMP controllers. We re-establish and extend the stability results of RMPflow through a rigorous Control Lyapunov Function (CLF) treatment. We then show that RMPflow can stably combine individually designed subtask controllers that satisfy certain CLF constraints. This new insight leads to an efficient CLF-based computational framework to generate stable controllers that consider all the subtasks simultaneously. Compared with the original usage of RMPflow, our framework provides users the flexibility to incorporate design heuristics through nominal controllers for the subtasks. We validate the proposed computational framework through numerical simulation and robotic implementation.

1905.01261 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Lyapunov-Based Approach to Exploit Asymmetries in Robotic Dual-Arm Task Resolution

基于李雅普诺夫的方法利用双臂机器人任务解决中的不对称性

Diogo Almeida, Yiannis Karayiannidis

发表机构 * Dept. of Electrical Eng., Chalmers University of Technology(电气工程系,查尔姆斯理工大学)

AI总结 本文提出了一种基于李雅普诺夫的方法,用于设计绝对运动任务的控制律并更新双臂之间的相对任务分布,通过数值实验展示了该方法相较于对称分布的优势。

Comments Accepted for publication at CDC 2019

详情
AI中文摘要

双臂操作任务可以以机器人末端执行器的期望绝对和相对运动来规定。这些可以代表,例如,共同携带刚体或执行装配任务。当两种类型的运动需要同时执行时,相对运动在双臂之间的对称分布会防止任务冲突。相反,相对运动任务的非对称解将导致与绝对任务的冲突。在本文中,我们解决了设计绝对运动任务的控制律以及更新双臂之间相对任务分布的问题。通过一组数值结果,我们将我们的方法与经典的相对运动任务对称分布进行对比,以说明我们方法的优势。

英文摘要

Dual-arm manipulation tasks can be prescribed to a robotic system in terms of desired absolute and relative motion of the robot's end-effectors. These can represent, e.g., jointly carrying a rigid object or performing an assembly task. When both types of motion are to be executed concurrently, the symmetric distribution of the relative motion between arms prevents task conflicts. Conversely, an asymmetric solution to the relative motion task will result in conflicts with the absolute task. In this work, we address the problem of designing a control law for the absolute motion task together with updating the distribution of the relative task among arms. Through a set of numerical results, we contrast our approach with the classical symmetric distribution of the relative motion task to illustrate the advantages of our method.

1905.01248 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Asymmetric Dual-Arm Task Execution using an Extended Relative Jacobian

使用扩展相对雅可比的非对称双臂任务执行

Diogo Almeida, Yiannis Karayiannidis

发表机构 * Division of Robotics, Perception and Learning, KTH Royal Institute of Technology(机器人、感知与学习 division,皇家理工学院) Dept. of Electrical Eng., Chalmers University of Technology(电气工程系,查尔默斯技术大学)

AI总结 本文提出了一种基于扩展相对雅可比的非对称双臂任务执行方法,通过定义非对称相对运动空间,允许用户在不指定绝对运动目标的情况下设置任务执行的不对称程度,同时保留绝对运动作为功能性冗余。

Comments Accepted for presentation at ISRR19. 16 Pages

详情
AI中文摘要

协调的双臂操作任务可以大致分为具有绝对和相对运动成分。特别是相对运动任务,其在如何分配给末端执行器方面本质上是冗余的。在本工作中,我们从非对称解决相对运动任务的角度分析协作操作。我们讨论了现有方法如何使相对运动任务的非对称执行成为可能,并展示了如何定义非对称相对运动空间。我们利用这一结果提出了一种扩展的相对雅可比来建模协作系统,这使用户能够在不指定绝对运动目标的情况下设置任务执行的具体不对称程度。这无需规定绝对运动目标,而是将绝对运动保留为系统的功能性冗余。我们通过一种新的微分逆运动学算法的数值模拟来展示所提出雅可比的性质。

英文摘要

Coordinated dual-arm manipulation tasks can be broadly characterized as possessing absolute and relative motion components. Relative motion tasks, in particular, are inherently redundant in the way they can be distributed between end-effectors. In this work, we analyse cooperative manipulation in terms of the asymmetric resolution of relative motion tasks. We discuss how existing approaches enable the asymmetric execution of a relative motion task, and show how an asymmetric relative motion space can be defined. We leverage this result to propose an extended relative Jacobian to model the cooperative system, which allows a user to set a concrete degree of asymmetry in the task execution. This is achieved without the need for prescribing an absolute motion target. Instead, the absolute motion remains available as a functional redundancy to the system. We illustrate the properties of our proposed Jacobian through numerical simulations of a novel differential Inverse Kinematics algorithm.

1807.08229 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Optimal Continuous State POMDP Planning with Semantic Observations: A Variational Approach

基于语义观测的最优连续状态POMDP规划:一种变分方法

Luke Burks, Ian Loefgren, Nisar Ahmed

AI总结 本文提出了一种基于变分方法的最优规划策略,针对语义观测下的连续状态部分可观测马尔可夫决策过程(CPOMDP)进行改进,通过变分贝叶斯方法解决混合连续-离散概率模型的表示和推理问题,提升了动态决策任务的效率和鲁棒性。

Comments Final version accepted to IEEE Transactions on Robotics (in press as of August 2019)

详情
AI中文摘要

本文开发了用于利用语义观测进行最优规划的新策略,使用连续状态部分可观测马尔可夫决策过程(CPOMDP)。在高斯混合(GM)CPOMDP策略近似方法方面,提出了两项主要创新。尽管现有方法具有许多有益的理论性质,但它们无法高效地表示和推理混合连续-离散概率模型。第一项主要创新是通过softmax模型推导出连续-离散语义观测概率的变分贝叶斯GM近似,用于点基值迭代贝尔曼策略备份。这种方法的关键优势是可以在复杂的非高斯不确定性下进行动态决策,同时利用连续动态状态空间模型(从而避免繁琐且昂贵的离散化)。第二项主要创新是一种基于聚类的混合物凝聚技术,能够很好地扩展到非常大的GM策略函数和信念函数。针对目标搜索和拦截任务的仿真结果表明,这些创新所产生的GM策略比其他最先进的策略近似方法产生的策略更有效,但需要显著较少的建模开销和在线运行时间成本。此外,结果还显示该方法对模型误差具有鲁棒性,并能扩展到更高维度。

英文摘要

This work develops novel strategies for optimal planning with semantic observations using continuous state partially observable markov decision processes (CPOMDPs). Two major innovations are presented in relation to Gaussian mixture (GM) CPOMDP policy approximation methods. While existing methods have many desirable theoretical properties, they are unable to efficiently represent and reason over hybrid continuous-discrete probabilistic models. The first major innovation is the derivation of closed-form variational Bayes GM approximations of Point-Based Value Iteration Bellman policy backups, using softmax models of continuous-discrete semantic observation probabilities. A key benefit of this approach is that dynamic decision-making tasks can be performed with complex non-Gaussian uncertainties, while also exploiting continuous dynamic state space models (thus avoiding cumbersome and costly discretization). The second major innovation is a new clustering-based technique for mixture condensation that scales well to very large GM policy functions and belief functions. Simulation results for a target search and interception task with semantic observations show that the GM policies resulting from these innovations are more effective than those produced by other state of the art policy approximations, but require significantly less modeling overhead and online runtime cost. Additional results show the robustness of this approach to model errors and scaling to higher dimensions.

1905.04835 2026-06-04 cs.LG cs.CV cs.MA cs.RO cs.SY eess.SY stat.ML 版本更新

Multi-Agent Image Classification via Reinforcement Learning

通过强化学习进行多智能体图像分类

Hossein K. Mousavi, Mohammadreza Nazari, Martin Takáč, Nader Motee

AI总结 本文研究了利用多个能够收集未知环境部分姿态依赖观测的移动智能体进行图像分类的问题,提出了一种网络架构,用于指导智能体形成局部信念、采取局部行动并从原始部分观测中提取相关特征,通过与邻居智能体交换信息更新自身信念,并利用强化学习技术实现分类问题的去中心化实现。

Comments Preprint of the paper to be published in IROS'19 proceedings

详情
AI中文摘要

我们研究了使用多个能够收集未知环境部分姿态依赖观测的移动智能体进行分类问题。目标是在有限的时间范围内对图像进行分类。我们提出了一种网络架构,用于指导智能体如何形成局部信念、采取局部行动并从原始部分观测中提取相关特征。智能体被允许与邻居智能体交换信息以更新自身信念。证明了如何利用强化学习技术通过运行去中心化共识协议来实现分类问题的去中心化实现。我们在MNIST手写数字数据集上的实验结果展示了我们所提框架的有效性。

英文摘要

We investigate a classification problem using multiple mobile agents capable of collecting (partial) pose-dependent observations of an unknown environment. The objective is to classify an image over a finite time horizon. We propose a network architecture on how agents should form a local belief, take local actions, and extract relevant features from their raw partial observations. Agents are allowed to exchange information with their neighboring agents to update their own beliefs. It is shown how reinforcement learning techniques can be utilized to achieve decentralized implementation of the classification problem by running a decentralized consensus protocol. Our experimental results on the MNIST handwritten digit dataset demonstrates the effectiveness of our proposed framework.

1801.09627 2026-06-04 cs.LG cs.RO cs.SY eess.SY 版本更新

Barrier-Certified Adaptive Reinforcement Learning with Applications to Brushbot Navigation

具有应用的障碍证书自适应强化学习:Brushbot导航

Motoya Ohnishi, Li Wang, Gennaro Notomista, Magnus Egerstedt

发表机构 * School of Electrical Engineering, Royal Institute of Technology(皇家理工学院电气工程学院) Georgia Institute of Technology(佐治亚理工学院) RIKEN Center for Advanced Intelligence Project(日本理化学研究所高级智能研究中心) School of Mechanical Engineering(机械工程学院)

AI总结 本文提出了一种安全学习框架,结合自适应模型学习算法和障碍证书,用于具有可能非平稳智能体动态的系统。通过稀疏优化技术提取模型的动态结构,并利用学习的模型结合控制障碍证书来约束策略(反馈控制器),以保持安全性,即避免特定的不利状态空间区域。在某些条件下,保证了在安全被非平稳性破坏后,以李雅普诺夫稳定性的方式恢复安全。此外,将动作-价值函数近似重新公式化,使任何基于内核的非线性函数估计方法都能应用于我们的自适应学习框架。最后,保证了障碍证书策略优化的解是全局最优的,确保在温和条件下进行贪心策略改进。所得到的框架通过四旋翼无人机的模拟进行验证,该无人机此前在安全学习文献中被假设为平稳性,然后在动态未知、高度复杂且非平稳的Brushbot机器人上进行测试。

Comments ©2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

详情
Journal ref
Published in IEEE Transactions on Robotics, 2019
AI中文摘要

本文提出了一种安全学习框架,该框架结合了自适应模型学习算法和障碍证书,用于具有可能非平稳智能体动态的系统。为了提取模型的动态结构,我们使用了稀疏优化技术。我们利用学习的模型结合控制障碍证书,以约束策略(反馈控制器)从而保持安全性,即避免特定的状态空间区域中的不利区域。在某些条件下,恢复安全性的保证是在安全被非平稳性破坏后以李雅普诺夫稳定性的方式恢复。此外,我们重新公式化了动作-价值函数近似,使任何基于内核的非线性函数估计方法都能应用于我们的自适应学习框架。最后,保证了障碍证书策略优化的解是全局最优的,确保在温和条件下进行贪心策略改进。所得到的框架通过四旋翼无人机的模拟进行验证,该无人机此前在安全学习文献中被假设为平稳性,然后在动态未知、高度复杂且非平稳的Brushbot机器人上进行测试。

英文摘要

This paper presents a safe learning framework that employs an adaptive model learning algorithm together with barrier certificates for systems with possibly nonstationary agent dynamics. To extract the dynamic structure of the model, we use a sparse optimization technique. We use the learned model in combination with control barrier certificates which constrain policies (feedback controllers) in order to maintain safety, which refers to avoiding particular undesirable regions of the state space. Under certain conditions, recovery of safety in the sense of Lyapunov stability after violations of safety due to the nonstationarity is guaranteed. In addition, we reformulate an action-value function approximation to make any kernel-based nonlinear function estimation method applicable to our adaptive learning framework. Lastly, solutions to the barrier-certified policy optimization are guaranteed to be globally optimal, ensuring the greedy policy improvement under mild conditions. The resulting framework is validated via simulations of a quadrotor, which has previously been used under stationarity assumptions in the safe learnings literature, and is then tested on a real robot, the brushbot, whose dynamics is unknown, highly complex and nonstationary.

1903.11683 2026-06-04 stat.ML cs.CV cs.LG cs.RO cs.SY eess.SY stat.AP 版本更新

Outlier-Robust Spatial Perception: Hardness, General-Purpose Algorithms, and Guarantees

抗异常的空域感知:难度、通用算法和保证

Vasileios Tzoumas, Pasquale Antonante, Luca Carlone

AI总结 本文研究了空域感知中异常数据的影响,提出了一种通用算法来有效去除异常,并提供了对算法性能的理论保证。

详情
AI中文摘要

空域感知是许多机器人应用的核心,涵盖了定位与建图、点云对齐和从相机图像中估计相对姿态等广泛的研究问题。异常数据的存在会威胁到空域感知的鲁棒性,而一般情况下,异常值是主要问题。尽管已有处理异常值的技术,但它们可能以不可预测的方式失败(例如RANSAC、鲁棒估计器),或具有指数级的运行时间(例如分支界限法)。在本文中,我们通过三个贡献推动了异常拒绝的前沿。首先,我们证明了即使是最简单的线性异常拒绝实例也是近似不可行的:在最坏情况下,无法设计出一个准多项式时间算法来高效计算近似解。我们的第二个贡献是提供第一个实例级的次优界限,以评估给定异常拒绝结果的近似质量。我们的第三个贡献是提出了一种简单的通用算法,称为自适应修剪,用于去除异常值。我们的算法利用了最近提出的一类全局求解器,能够解决无异常的问题,并通过迭代去除误差较大的测量值。我们在三个空域感知问题上展示了所提出的算法:三维配准、双视几何和SLAM。结果表明,我们的算法在各种应用中优于几种最先进的方法,同时是一种通用的方法。

英文摘要

Spatial perception is the backbone of many robotics applications, and spans a broad range of research problems, including localization and mapping, point cloud alignment, and relative pose estimation from camera images. Robust spatial perception is jeopardized by the presence of incorrect data association, and in general, outliers. Although techniques to handle outliers do exist, they can fail in unpredictable manners (e.g., RANSAC, robust estimators), or can have exponential runtime (e.g., branch-and-bound). In this paper, we advance the state of the art in outlier rejection by making three contributions. First, we show that even a simple linear instance of outlier rejection is inapproximable: in the worst-case one cannot design a quasi-polynomial time algorithm that computes an approximate solution efficiently. Our second contribution is to provide the first per-instance sub-optimality bounds to assess the approximation quality of a given outlier rejection outcome. Our third contribution is to propose a simple general-purpose algorithm, named adaptive trimming, to remove outliers. Our algorithm leverages recently-proposed global solvers that are able to solve outlier-free problems, and iteratively removes measurements with large errors. We demonstrate the proposed algorithm on three spatial perception problems: 3D registration, two-view geometry, and SLAM. The results show that our algorithm outperforms several state-of-the-art methods across applications while being a general-purpose method.

1902.07747 2026-06-04 eess.SY cs.AI cs.DC cs.RO cs.SY 版本更新

Lookup Table-Based Consensus Algorithm for Real-Time Longitudinal Motion Control of Connected and Automated Vehicles

基于查找表的共识算法用于连接和自动化车辆的实时纵向运动控制

Ziran Wang, Kyuntae Han, BaekGyu Kim, Guoyuan Wu, Matthew J. Barth

AI总结 本文提出了一种基于查找表的共识算法,用于实时控制连接和自动化车辆的纵向运动,通过动态生成查找表来实时寻找最佳控制增益,优于之前的工作和线性反馈算法。

Comments 2019 American Control Conference (ACC)Philadelphia, PA, USA, July 10-12, 2019978-1-5386-7928-9

详情
AI中文摘要

连接和自动化车辆(CAV)技术是解决当前交通系统安全、机动性和可持续性问题的有前途的解决方案。具体而言,控制算法在CAV系统中起重要作用,因为它执行由前一步生成的命令,如通信、感知和规划。在本研究中,我们提出了一种共识算法,用于实时控制CAV的纵向运动。与该领域之前的研究不同,这些研究中的共识算法的控制增益是预先确定并固定的,我们开发了算法来构建查找表,实时寻找不同CAV初始条件下的理想控制增益。数值模拟显示,所提出的基于查找表的共识算法在四种不同场景中,针对各种CAV初始条件,在收敛时间和最大 jerk 方面均优于作者之前的工作以及van Arem的基于线性反馈的纵向运动控制算法。

英文摘要

Connected and automated vehicle (CAV) technology is one of the promising solutions to addressing the safety, mobility and sustainability issues of our current transportation systems. Specifically, the control algorithm plays an important role in a CAV system, since it executes the commands generated by former steps, such as communication, perception, and planning. In this study, we propose a consensus algorithm to control the longitudinal motion of CAVs in real time. Different from previous studies in this field where control gains of the consensus algorithm are pre-determined and fixed, we develop algorithms to build up a lookup table, searching for the ideal control gains with respect to different initial conditions of CAVs in real time. Numerical simulation shows that, the proposed lookup table-based consensus algorithm outperforms the authors' previous work, as well as van Arem's linear feedback-based longitudinal motion control algorithm in all four different scenarios with various initial conditions of CAVs, in terms of convergence time and maximum jerk of the simulation run.

1808.00649 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Robust Tracking with Model Mismatch for Fast and Safe Planning: an SOS Optimization Approach

具有模型不匹配的鲁棒跟踪:一种SOS优化方法

Sumeet Singh, Mo Chen, Sylvia L. Herbert, Claire J. Tomlin, Marco Pavone

发表机构 * Dept. of Aeronautics and Astronautics, Stanford University(航空航天系,斯坦福大学) Dept. of Electrical Engineering and Computer Science, University of California, Berkeley(电气工程与计算机科学系,加州大学伯克利分校)

AI总结 本文提出了一种基于SOS优化的方法,用于在快速且安全的规划中处理模型不匹配问题,通过设计反馈跟踪控制器和跟踪界限,以在不精确模型下保证系统安全性。

Comments Presented at WAFR 2018; final version v2 -- fixed typos

详情
AI中文摘要

在实时运动规划中,通常的做法是通过在简化低维动态模型上运行规划算法计算轨迹,然后使用反馈跟踪控制器来跟踪该轨迹,该控制器考虑了完整的高维系统动态。虽然这种规划与模型不匹配的策略通常能带来快速的计算时间,但缺乏动态可行性保证,这阻碍了其在安全关键系统中的应用。基于最近通过汉密尔顿-雅可比(HJ)可达性视角解决此问题的工作,我们提出了一种算法框架,用于离线计算一对“规划器”(即低维)和“跟踪器”(即高维)模型的反馈跟踪控制器及其相关跟踪界限。该界限随后用作在通过低维模型生成运动计划时的安全边际。具体而言,我们利用求和平方(SOS)编程的计算工具,设计了一个双线性优化算法,用于计算反馈跟踪控制器及其相关跟踪界限。该算法通过数值实验进行演示,重点研究SOS带来的计算可扩展性增加与内在保守性之间的权衡。总体而言,我们的结果使规划与模型不匹配的有吸引力策略能够扩展到HJ分析无法触及的系统,同时保持安全性保证。

英文摘要

In the pursuit of real-time motion planning, a commonly adopted practice is to compute a trajectory by running a planning algorithm on a simplified, low-dimensional dynamical model, and then employ a feedback tracking controller that tracks such a trajectory by accounting for the full, high-dimensional system dynamics. While this strategy of planning with model mismatch generally yields fast computation times, there are no guarantees of dynamic feasibility, which hampers application to safety-critical systems. Building upon recent work that addressed this problem through the lens of Hamilton-Jacobi (HJ) reachability, we devise an algorithmic framework whereby one computes, offline, for a pair of "planner" (i.e., low-dimensional) and "tracking" (i.e., high-dimensional) models, a feedback tracking controller and associated tracking bound. This bound is then used as a safety margin when generating motion plans via the low-dimensional model. Specifically, we harness the computational tool of sum-of-squares (SOS) programming to design a bilinear optimization algorithm for the computation of the feedback tracking controller and associated tracking bound. The algorithm is demonstrated via numerical experiments, with an emphasis on investigating the trade-off between the increased computational scalability afforded by SOS and its intrinsic conservativeness. Collectively, our results enable scaling the appealing strategy of planning with model mismatch to systems that are beyond the reach of HJ analysis, while maintaining safety guarantees.

1809.00975 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Minimum Violation Control Synthesis on Cyber-Physical Systems under Attacks

在攻击下的网络物理系统最小违规控制综合

Luyao Niu, Jie Fu, Andrew Clark

AI总结 本文研究了在存在攻击的情况下,如何在线性时序逻辑约束下综合出最小违规控制器,通过构建Stackelberg博弈模型并提出非线性规划问题来解决最优控制策略,最后通过数值案例验证了方法的有效性。

详情
AI中文摘要

网络物理系统正日益执行复杂任务,通常使用时序逻辑等形式化语言进行建模。恶意攻击者通过智能攻击可能会削弱系统执行所需任务的能力。然而,目前在存在此类攻击的情况下,综合研究受到关注有限。特别是,当由于对抗性攻击无法完全满足所需规范时,控制器综合问题尚未被研究。本文聚焦于在存在对抗者的情况下,对随机有限状态离散时间系统进行线性时序逻辑约束下的最小违规控制综合问题。最小违规控制策略是在用户定义的重要任务上满足,而违反次要任务。我们通过构建控制器与攻击者之间的并发Stackelberg博弈模型,并提出非线性规划问题来制定和解决最优控制策略。为了减少计算努力,我们开发了一种启发式算法,高效地解决该问题,并通过数值案例研究展示了所提出方法。

英文摘要

Cyber-physical systems are conducting increasingly complex tasks, which are often modeled using formal languages such as temporal logic. The system's ability to perform the required tasks can be curtailed by malicious adversaries that mount intelligent attacks. At present, however, synthesis in the presence of such attacks has received limited research attention. In particular, the problem of synthesizing a controller when the required specifications cannot be satisfied completely due to adversarial attacks has not been studied. In this paper, we focus on the minimum violation control synthesis problem under linear temporal logic constraints of a stochastic finite state discrete-time system with the presence of an adversary. A minimum violation control strategy is one that satisfies the most important tasks defined by the user while violating the less important ones. We model the interaction between the controller and adversary using a concurrent Stackelberg game and present a nonlinear programming problem to formulate and solve for the optimal control policy. To reduce the computation effort, we develop a heuristic algorithm that solves the problem efficiently and demonstrate our proposed approach using a numerical case study.

1809.07012 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Enhancing the settling time estimation of a class of fixed-time stable systems

增强一类固定时间稳定系统的 settling 时间估计

R. Aldana-López, D. Gómez-Gutiérrez, E. Jiménez-Rodríguez, J. D. Sánchez-Torres, M. Defoort

发表机构 * Multi-agent autonomous systems lab, Intel Labs, Intel Tecnología de M\'exico, Av. del Bosque 1001, Colonia El Bajío, Zapopan, 45019, Jalisco, M\'exico. Research Laboratory on Optimal Design, Devices Advanced Materials -OPTIMA-, Department of Mathematics LAMIH, CNRS UMR 8201, Univ. Valenciennes, Valenciennes 59313, France.

AI总结 本文研究了一类固定时间稳定系统的收敛时间分析,提出了一种新的非保守上界用于估计其 settling 时间,通过改进方法提供了更精确的上界,并展示了预定义时间控制器在第一和第二阶系统中的应用。

详情
Journal ref
International Journal of Robust and Nonlinear Control, Volume29, Issue12, Pages 4135-4148,2019
AI中文摘要

本文研究了一类固定时间稳定系统的收敛时间分析,旨在提供一种新的非保守上界用于其 settling 时间的估计。我们的贡献包括四个方面:首先,重新审视已知的固定时间稳定系统类,展示经典上界估计的保守性;其次,提供一个统一上界,该上界适用于系统中任意轨迹的 settling 时间;第三,通过略微修改之前的固定时间系统类,提出了一种新的预定义时间收敛算法,其中 settling 时间的上界作为系统参数预先设定;最后,介绍了用于第一阶和第二阶系统的预定义时间控制器。一些仿真结果展示了所提方案在 settling 时间估计方面的性能,与现有方法相比具有优势。

英文摘要

This paper deals with the convergence time analysis of a class of fixed-time stable systems with the aim to provide a new non-conservative upper bound for its settling time. Our contribution is fourfold. First, we revisit the well-known class of fixed-time stable systems, given in (Polyakov et al.,2012}, while showing the conservatism of the classical upper estimate of the settling time. Second, we provide the smallest constant that uniformly upper bounds the settling time of any trajectory of the system under consideration. Third, introducing a slight modification of the previous class of fixed-time systems, we propose a new predefined-time convergent algorithm where the least upper bound of the settling time is set a priori as a parameter of the system. At last, predefined-time controllers for first order and second order systems are introduced. Some simulation results highlight the performance of the proposed scheme in terms of settling time estimation compared to existing methods.

1903.02531 2026-06-04 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY 版本更新

Combining Optimal Control and Learning for Visual Navigation in Novel Environments

将最优控制与学习相结合用于新环境中的视觉导航

Somil Bansal, Varun Tolani, Saurabh Gupta, Jitendra Malik, Claire Tomlin

发表机构 * University of California, Berkeley(加州大学伯克利分校) Facebook AI Research(脸书人工智能研究)

AI总结 本文提出了一种结合模型控制与学习感知的方法,用于在新环境中实现可靠的视觉导航,通过生成无碰撞路径的 waypoints,使机器人能够高效地到达目标位置,同时在低帧率和仿真到现实的迁移中表现良好。

Comments Project website: https://vtolani95.github.io/WayPtNav/

详情
AI中文摘要

基于模型的控制是机器人导航的流行范式,因为它可以利用已知的动力学模型来高效地规划鲁棒的机器人轨迹。然而,在环境事先未知且只能通过机器人上的传感器部分观测的情况下,使用基于模型的方法具有挑战性。在本工作中,我们通过将基于模型的控制与基于学习的感知相结合来解决这一不足。基于学习的感知模块生成一系列 waypoints,通过无碰撞路径引导机器人到达目标。这些 waypoints 被用于基于模型的规划器生成平滑且动态可行的轨迹,该轨迹通过反馈控制在物理系统上执行。我们在模拟的真实世界复杂环境中以及在实际地面车辆上的实验表明,与纯几何映射或端到端学习方法相比,所提出的方法在新环境中能够更可靠、更高效地到达目标位置。我们的方法不依赖于详细的显式 3D 环境地图,能够与低帧率工作,并且在仿真到现实的迁移中表现良好。描述我们方法和实验的视频可在项目网站上获得。

英文摘要

Model-based control is a popular paradigm for robot navigation because it can leverage a known dynamics model to efficiently plan robust robot trajectories. However, it is challenging to use model-based methods in settings where the environment is a priori unknown and can only be observed partially through on-board sensors on the robot. In this work, we address this short-coming by coupling model-based control with learning-based perception. The learning-based perception module produces a series of waypoints that guide the robot to the goal via a collision-free path. These waypoints are used by a model-based planner to generate a smooth and dynamically feasible trajectory that is executed on the physical system using feedback control. Our experiments in simulated real-world cluttered environments and on an actual ground vehicle demonstrate that the proposed approach can reach goal locations more reliably and efficiently in novel environments as compared to purely geometric mapping-based or end-to-end learning-based alternatives. Our approach does not rely on detailed explicit 3D maps of the environment, works well with low frame rates, and generalizes well from simulation to the real world. Videos describing our approach and experiments are available on the project website.

1805.00222 2026-06-04 eess.SY cs.RO cs.SY eess.SP 版本更新

Model-Free Active Input-Output Feedback Linearization of a Single-Link Flexible Joint Manipulator: An Improved ADRC Approach

无模型主动输入输出反馈线性化单连杆柔性关节机械臂:一种改进的ADRC方法

Wameedh Riyadh Abdul Adheem, Ibraheem Kasim Ibraheem

发表机构 * Electrical Engineering Department(电气工程系) College of Engineering, Baghdad University(巴格达大学工程学院)

AI总结 本文提出了一种基于改进主动扰动抑制控制(IADRC)范式的无模型主动输入输出反馈线性化(AIOFL)技术,用于设计具有已知相对次数的非线性系统的反馈线性化控制律,通过改进非线性扩展状态观测器(INLESO)估计广义扰动并结合改进非线性状态误差反馈(INLSEF)生成名义控制律,实现实时消除所有不需要的动力学、外源扰动和系统不确定性,将系统转化为积分链,仅需非线性系统的相对次数信息,并通过李雅普诺夫函数分析证明了系统的渐近稳定性。

详情
AI中文摘要

传统输入输出反馈线性化(IOFL)要求系统动力学的完整知识,并假设输入通道无扰动且系统无不确定性。本文提出了一种基于改进主动扰动抑制控制(IADRC)范式的无模型主动输入输出反馈线性化(AIOFL)技术,用于设计具有已知相对次数的非线性系统的反馈线性化控制律。线性化控制律(LCL)由改进非线性扩展状态观测器(INLESO)估计的具有饱和行为的广义扰动缩放估计和由改进非线性状态误差反馈(INLSEF)生成的名义控制律组成。所提出的AIOFL能够实时消除代表所有不需要的动力学、外源扰动和系统不确定性的广义扰动,并将系统转化为积分链,直到系统的相对次数。通过李雅普诺夫函数进行了稳定性分析,证明了INLESO的收敛性和闭环系统的渐近稳定性。通过将所提出的AIOFL技术应用于柔性关节单连杆机械臂(SLFJM)验证了结果。仿真结果验证了基于IADRC的所提出AIOFL工具的有效性,与传统ADRC基于AIOFL和传统IOFL技术相比。

英文摘要

Traditional Input-Output Feedback Linearization (IOFL) requires full knowledge of system dynamics and assumes no disturbance at the input channel and no system's uncertainties. In this paper, a model-free Active Input-Output Feedback Linearization (AIOFL) technique based on an Improved Active Disturbance Rejection Control (IADRC) paradigm is proposed to design feedback linearization control law for a generalized nonlinear system with known relative degree. The Linearization Control Law(LCL) is composed of a scaled generalized disturbance estimated by an Improved Nonlinear Extended State Observer (INLESO) with saturation-like behavior and the nominal control law produced by an Improved Nonlinear State Error Feedback (INLSEF). The proposed AIOFL cancels in real-time fashion the generalized disturbances which represent all the unwanted dynamics, exogenous disturbances, and system uncertainties and transforms the system into a chain of integrators up to the relative degree of the system, the only information required about the nonlinear system. Stability analysis has been conducted based on Lyapunov functions and revealed the convergence of the INLESO and the asymptotic stability of the closed-loop system. Verification of the outcomes has been achieved by applying the proposed AIOFL technique on the Flexible Joint Single Link Manipulator (SLFJM). The simulations results validated the effectiveness of the proposed AIOFL tool based on IADRC as compared to the conventional ADRC based AIOFL and the traditional IOFL techniques.

1802.08953 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robust Target-relative Localization with Ultra-Wideband Ranging and Communication

利用超宽带测距与通信的稳健目标相对定位

Thien-Minh Nguyen, Abdul Hanif Zaini, Chen Wang, Kexin Guo, Lihua Xie

AI总结 本文提出了一种方法,利用四旋翼无人机上的超宽带测距传感器实现目标的相对定位和跟踪,通过策略性安装传感器融合相对位置和方位信息,采用扩展卡尔曼滤波器估计器融合UWB测距数据和 onboard 传感器数据,并利用UWB通信能力传输目标的方位以处理目标方位与测距数据的耦合,实验结果表明无人机能够自主控制其相对于静态和移动目标的位置。

Comments 2018 International Conference on Robotics and Automation (ICRA 2018)

详情
Journal ref
2018 IEEE International Conference on Robotics and Automation (ICRA)
AI中文摘要

在本文中,我们提出了一种方法,利用超宽带(UWB)测距传感器实现四旋翼无人机对目标的相对定位和跟踪。这些传感器被策略性安装,以帮助获取四旋翼无人机与目标之间的相对位置和方位。为了实现即使在目标速度不确定的情况下也能进行稳健定位的自主飞行,开发了两个主要特征。首先,开发了一个基于扩展卡尔曼滤波器(EKF)的估计器,用于融合UWB测距测量数据与机载传感器数据,包括惯性测量单元(IMU)、气压计和光学流数据。其次,为了正确处理目标的方位与测距测量的耦合,利用UWB通信能力将目标的方位传输到四旋翼无人机。实验结果表明,四旋翼无人机在目标静止和移动的情况下都能自主控制其相对于目标的位置。

英文摘要

In this paper we propose a method to achieve relative positioning and tracking of a target by a quadcopter using Ultra-wideband (UWB) ranging sensors, which are strategically installed to help retrieve both relative position and bearing between the quadcopter and target. To achieve robust localization for autonomous flight even with uncertainty in the speed of the target, two main features are developed. First, an estimator based on Extended Kalman Filter (EKF) is developed to fuse UWB ranging measurements with data from onboard sensors including inertial measurement unit (IMU), altimeters and optical flow. Second, to properly handle the coupling of the target's orientation with the range measurements, UWB based communication capability is utilized to transfer the target's orientation to the quadcopter. Experiment results demonstrate the ability of the quadcopter to control its position relative to the target autonomously in both cases when the target is static and moving.

1904.04968 2026-06-04 cs.RO cs.SY eess.SY math.OC 版本更新

Asymptotic Optimality of a Time Optimal Path Parametrization Algorithm

时间最优路径参数化算法的渐近最优性

Igor Spasojevic, Varun Murali, Sertac Karaman

AI总结 本文研究了时间最优路径参数化问题,证明了线性时间算法在所有由凸优化方法最优解决的问题中都是渐近最优的,并且刻画了该问题的最优解。

详情
AI中文摘要

时间最优路径参数化问题是 minimizing the time interval during which an actuation constrained agent can traverse a given path. Recently, an efficient linear-time algorithm for solving this problem was proposed. However, its optimality was proved for only a strict subclass of problems solved optimally by more computationally intensive approaches based on convex programming. In this paper, we prove that the same linear-time algorithm is asymptotically optimal for all problems solved optimally by convex optimization approaches. We also characterize the optimum of the Time Optimal Path Parametrization Problem, which may be of independent interest.

英文摘要

Time Optimal Path Parametrization is the problem of minimizing the time interval during which an actuation constrained agent can traverse a given path. Recently, an efficient linear-time algorithm for solving this problem was proposed. However, its optimality was proved for only a strict subclass of problems solved optimally by more computationally intensive approaches based on convex programming. In this paper, we prove that the same linear-time algorithm is asymptotically optimal for all problems solved optimally by convex optimization approaches. We also characterize the optimum of the Time Optimal Path Parametrization Problem, which may be of independent interest.

1903.05073 2026-06-04 cs.RO cs.LO cs.SY eess.SY 版本更新

A Formal Safety Net for Waypoint Following in Ground Robots

为地面机器人路径跟随提供形式安全网

Brandon Bohrer, Yong Kiam Tan, Stefan Mitsch, Andrew Sogokon, André Platzer

AI总结 本文提出一个可重用的形式验证安全网,为具有容差和加速度的Dubins型地面机器人提供端到端的安全性和活跃性保证,通过形式化方法验证安全性和活跃性属性,并合成监控器以确保运行时的模型合规性。

详情
AI中文摘要

我们提出一个可重用的形式验证安全网,为具有容差和加速度的Dubins型地面机器人提供端到端的安全性和活跃性保证。我们:i) 用微分动态逻辑(dL)建模机器人,并对控制器和机器人运动学制定假设;ii) 证明具有速度限制的路径跟随形式安全性和活跃性属性;iii) 合成一个监控器,该监控器被自动证明在运行时强制执行模型合规性;iv) 我们使用VeriPhy工具链使这些保证延伸到不受信任的控制器、环境和计划的机器代码层面。安全网的保证适用于任何机器人,只要路径被安全选择且其模型中的物理假设成立。实验表明这些假设在实践中成立,存在合规性与性能之间的内在权衡。

英文摘要

We present a reusable formally verified safety net that provides end-to-end safety and liveness guarantees for 2D waypoint-following of Dubins-type ground robots with tolerances and acceleration. We: i) Model a robot in differential dynamic logic (dL), and specify assumptions on the controller and robot kinematics, ii) Prove formal safety and liveness properties for waypoint-following with speed limits, iii) Synthesize a monitor, which is automatically proven to enforce model compliance at runtime, and iv) Our use of the VeriPhy toolchain makes these guarantees carry over down to the level of machine code with untrusted controllers, environments, and plans. The guarantees for the safety net apply to any robot as long as the waypoints are chosen safely and the physical assumptions in its model hold. Experiments show these assumptions hold in practice, with an inherent trade-off between compliance and performance.

1605.00604 2026-06-04 eess.SY cs.LO cs.RO cs.SY 版本更新

Formal Verification of Obstacle Avoidance and Navigation of Ground Robots

地面机器人的障碍回避与导航的正式验证

Stefan Mitsch, Khalil Ghorbal, David Vogelbacher, André Platzer

发表机构 * Computer Science Department, Carnegie Mellon University(卡内基梅隆大学计算机科学系) INRIA(法国国家信息与自动化研究所) Karlsruhe Institute of Technology(卡尔斯鲁厄理工学院)

AI总结 本文研究了移动机器人在动态环境中安全性的核心问题,通过形式化验证方法验证了控制器在回避静态和移动障碍物时的安全性、被动安全性、主动友好安全性及被动方向安全性,并证明了在传感器不确定性和执行器扰动下仍能保证安全性的结论。

详情
Journal ref
International Journal of Robotics Research. 36(12), pp. 1312-1340, 2017
AI中文摘要

移动机器人在动态环境中的安全性依赖于确保其不与障碍物碰撞。为了支持此类安全论证,我们分析并形式化验证了一系列日益强大的控制器安全属性:(i) 静态安全性,确保不会与静态障碍物发生碰撞;(ii) 被动安全性,确保在机器人移动时不会与静态或移动障碍物发生碰撞;(iii) 更强的被动友好安全性,其中机器人进一步保持足够的机动距离以避免障碍物碰撞;(iv) 被动方向安全性,允许机器人传感器覆盖不完美,即机器人意识到其环境并非所有内容都可见。我们补充了这些可证明正确的安全属性以活化属性:我们证明可证明安全的运动足够灵活,使机器人仍能导航航点并通过交叉口。我们使用混合系统模型和定理证明技术,描述并形式化验证机器人离散的控制决策及其连续的物理运动。此外,我们正式证明了在传感器不确定性和执行器扰动,以及引入更激进的操控选择时,安全性仍能得到保证。我们的验证结果是通用的,因为它们不限于特定控制算法的选择,而是识别出使它们同时适用于广泛控制算法类别的条件。

英文摘要

The safety of mobile robots in dynamic environments is predicated on making sure that they do not collide with obstacles. In support of such safety arguments, we analyze and formally verify a series of increasingly powerful safety properties of controllers for avoiding both stationary and moving obstacles: (i) static safety, which ensures that no collisions can happen with stationary obstacles, (ii) passive safety, which ensures that no collisions can happen with stationary or moving obstacles while the robot moves, (iii) the stronger passive friendly safety in which the robot further maintains sufficient maneuvering distance for obstacles to avoid collision as well, and (iv) passive orientation safety, which allows for imperfect sensor coverage of the robot, i. e., the robot is aware that not everything in its environment will be visible. We complement these provably correct safety properties with liveness properties: we prove that provably safe motion is flexible enough to let the robot still navigate waypoints and pass intersections. We use hybrid system models and theorem proving techniques that describe and formally verify the robot's discrete control decisions along with its continuous, physical motion. Moreover, we formally prove that safety can still be guaranteed despite sensor uncertainty and actuator perturbation, and when control choices for more aggressive maneuvers are introduced. Our verification results are generic in the sense that they are not limited to the particular choices of one specific control algorithm but identify conditions that make them simultaneously apply to a broad class of control algorithms.

1904.05728 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Technical Report: Safe, Aggressive Quadrotor Flight via Reachability-based Trajectory Design

技术报告:基于可达性的安全激进四旋翼飞行轨迹设计

Shreyas Kousik, Patrick Holmes, Ramanarayan Vasudevan

发表机构 * Mechanical Engineering, University of Michigan, Ann Arbor, MI(机械工程,密歇根大学,安娜堡,MI)

AI总结 本文提出了一种基于可达性的四旋翼飞行轨迹设计方法,通过在线规划中创新性地使用Zonotopes来确保在存在轨迹依赖性跟踪误差的情况下飞行安全,实验显示在500个随机复杂环境中实现了最高5m/s的激进飞行且无碰撞。

Comments 12 Pages, 3 Figures, 1 Table

详情
AI中文摘要

四旋翼可以提供基础设施检查和搜索救援等服务,需要在复杂环境中自主操作。通常通过滚动时间规划实现自主性,即在执行短期计划的同时计算新的计划,因为传感器在任何时候都只能获得有限的信息。为了确保安全并防止机器人丢失,计划必须被验证为在存在不确定性(例如跟踪误差)的情况下仍然无碰撞。现有的样条基规划器通过均匀扩张障碍物来补偿不确定性,这可能会过于保守。另一方面,基于可达性的规划器可以将轨迹依赖性的不确定性作为轨迹函数来考虑。本文将基于可达性的轨迹设计(RTD)应用于规划四旋翼轨迹,以在存在轨迹依赖性跟踪误差的情况下保持安全。这通过在在线规划中以新颖的方式使用Zonotopes来实现。模拟显示在500个随机复杂环境中实现了最高5m/s的激进飞行且无碰撞。

英文摘要

Quadrotors can provide services such as infrastructure inspection and search-and-rescue, which require operating autonomously in cluttered environments. Autonomy is typically achieved with receding-horizon planning, where a short plan is executed while a new one is computed, because sensors receive limited information at any time. To ensure safety and prevent robot loss, plans must be verified as collision free despite uncertainty (e.g, tracking error). Existing spline-based planners dilate obstacles uniformly to compensate for uncertainty, which can be conservative. On the other hand, reachability-based planners can include trajectory-dependent uncertainty as a function of the planned trajectory. This work applies Reachability-based Trajectory Design (RTD) to plan quadrotor trajectories that are safe despite trajectory-dependent tracking error. This is achieved by using zonotopes in a novel way for online planning. Simulations show aggressive flight up to 5 m/s with zero crashes in 500 cluttered, randomized environments.

1805.09703 2026-06-04 eess.SY cs.RO cs.SY 版本更新

No More Differentiator in PID:Development of Nonlinear Lead for Precision Mechatronics

PID中不再需要微分器:非线性前置器在精密机电系统中的开发

Arun Palanikumar, Niranjan Saikumar, S. Hassan HosseinNia

AI总结 本文提出了一种新型非线性前置器,该前置器在精度和稳定性方面优于传统线性前置器,可用于替代传统前置器。通过洛伦兹驱动的纳米级精度平台验证了该概念,并展示了在精度、跟踪和带宽方面的改进。

Comments European Control Conference 2018

详情
AI中文摘要

工业PID控制系统由三个组成部分构成:滞后(积分器)、前置(微分器)和低通滤波器(LPF)。由于PID是一种线性控制方法,因此其固有地受到水床效应的限制,导致精度与跟踪之间存在权衡,一方面由滞后和LPF提供,另一方面由前置器提供稳定性与鲁棒性。应用于滞后和LPF元素的非线性重置策略在减少这种权衡方面非常有效。然而,开发重置前置器的研究却很少。在本文中,我们开发了一种新型前置器,其精度和稳定性均优于传统线性前置器,可作为其替代品使用。该概念在洛伦兹驱动的纳米级精度平台上进行了呈现和验证。通过两个独立设计展示了精度、跟踪和带宽的改进。性能在时域和频域中均得到验证,以确保实际设置中达到的相位裕度与设计理论相匹配。

英文摘要

Industrial PID consists of three elements: Lag (integrator), Lead (Differentiator) and Low Pass Filters (LPF). PID being a linear control method is inherently bounded by the waterbed effect due to which there exists a trade-off between precision \& tracking, provided by Lag and LPF on one side and stability \& robustness, provided by Lead on the other side. Nonlinear reset strategies applied in Lag and LPF elements have been very effective in reducing this trade-off. However, there is lack of study in developing a reset Lead element. In this paper, we develop a novel lead element which provides higher precision and stability compared to the linear lead filter and can be used as a replacement for the same. The concept is presented and validated on a Lorentz-actuated nanometer precision stage. Improvements in precision, tracking and bandwidth are shown through two separate designs. Performance is validated in both time and frequency domain to ensure that phase margin achieved on the practical setup matches design theories.

1709.04906 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

On the interaction between Autonomous Mobility-on-Demand systems and the power network: models and coordination algorithms

关于自主出行-on-demand系统与电力网络相互作用:模型和协调算法

Federico Rossi, Ramon Iglesias, Mahnoosh Alizadeh, Marco Pavone

AI总结 本文研究了自主出行-on-demand系统与电力网络的相互作用,提出了一种模型来捕捉两者之间的耦合关系,并通过联合优化算法协调两个系统。

Comments Extended version of the paper presented at Robotics: Science and Systems XIV and accepted by TCNS. In Version 4, the body of the paper is largely rewritten for clarity and consistency, and new numerical simulations are presented. All source code is available (MIT) at https://dx.doi.org/10.5281/zenodo.3241651

详情
AI中文摘要

我们研究了由电动自动驾驶车辆提供按需运输请求的车队(称为自主出行-on-demand系统,或AMoD系统)与电力网络之间的相互作用。我们提出了一种模型,该模型捕捉了由于车辆充电需求而产生的两个系统之间的耦合,同时捕捉时间变化的客户需求、电力生成成本、道路拥堵、电池损耗以及电力传输和分配约束。随后,我们利用该模型联合优化两个系统的运行。我们设计了一种算法程序,通过捆绑客户需求来无损地减少问题规模,使该问题能够被现成的线性规划求解器高效求解。接下来,我们证明了联合问题的社会最优解可以作为一般均衡来强制执行,并提供了一种对偶分解算法,使自利的代理能够计算市场清算价格而无需共享私人信息。我们通过研究达拉斯-沃思堡的假设AMoD系统及其对德克萨斯电力网络的影响来评估该模型的性能。AMoD系统与电力网络之间缺乏协调会导致达拉斯-沃思堡电力价格增加4.4%;相反,AMoD系统与电力网络之间的协调可以减少电力支出,即使电力需求增加,也比没有汽车的情况更节省1.47亿美元/年。最后,我们提供了一种滚动时域实现,并通过基于代理的模拟评估其性能。本文的结果为电力驱动的AMoD系统与电力网络之间的相互作用提供了首次系统性的描述,并进一步揭示了AMoD的经济和社会价值。

英文摘要

We study the interaction between a fleet of electric, self-driving vehicles servicing on-demand transportation requests (referred to as Autonomous Mobility-on-Demand, or AMoD, system) and the electric power network. We propose a model that captures the coupling between the two systems stemming from the vehicles' charging requirements and captures time-varying customer demand and power generation costs, road congestion, battery depreciation, and power transmission and distribution constraints. We then leverage the model to jointly optimize the operation of both systems. We devise an algorithmic procedure to losslessly reduce the problem size by bundling customer requests, allowing it to be efficiently solved by off-the-shelf linear programming solvers. Next, we show that the socially optimal solution to the joint problem can be enforced as a general equilibrium, and we provide a dual decomposition algorithm that allows self-interested agents to compute the market clearing prices without sharing private information. We assess the performance of the mode by studying a hypothetical AMoD system in Dallas-Fort Worth and its impact on the Texas power network. Lack of coordination between the AMoD system and the power network can cause a 4.4% increase in the price of electricity in Dallas-Fort Worth; conversely, coordination between the AMoD system and the power network could reduce electricity expenditure compared to the case where no cars are present (despite the increased demand for electricity) and yield savings of up $147M/year. Finally, we provide a receding-horizon implementation and assess its performance with agent-based simulations. Collectively, the results of this paper provide a first-of-a-kind characterization of the interaction between electric-powered AMoD systems and the power network, and shed additional light on the economic and societal value of AMoD.

1905.05926 2026-06-04 cs.NI cs.RO cs.SY eess.SY 版本更新

Connectivity-Aware UAV Path Planning with Aerial Coverage Maps

具有空中覆盖图的连接意识UAV路径规划

Hongyu Yang, Jun Zhang, S. H. Song, Khaled B. Lataief

发表机构 * Department of ECE, Hong Kong University of Science and Technology, Hong Kong(香港科技大学电子工程系) Department of EIE, The Hong Kong Polytechnic University, Hong Kong(香港理工大学电子工程系)

AI总结 本文提出了一种利用UAV可控移动来克服地面用户优化的蜂窝网络在空中覆盖不连续问题的连接意识UAV路径规划方法,通过引入两个新指标量化UAV路径的蜂窝连接质量,并利用空中覆盖图提供准确的散射覆盖孔位置,将UAV路径规划问题转化为在连接约束下寻找最短路径的问题。

Comments This paper has been accepted by IEEE WCNC 2019

详情
AI中文摘要

蜂窝网络有潜力支持无人机的有效无线通信,有助于实现各种远程无人机应用。然而,这些网络是为地面用户提供优化的,因此无法保证无缝的空中覆盖。在本文中,我们通过利用UAV的可控移动来克服这一困难,并研究连接意识的UAV路径规划。为了明确将通信需求施加于UAV路径规划上,我们引入了两个新的指标来量化UAV路径的蜂窝连接质量。此外,空中覆盖图用于提供复杂传播环境中散落覆盖孔的准确位置。我们将UAV路径规划问题 formulation 为在连接约束下寻找最短路径的问题。基于图搜索方法,提出了一种新颖的连接意识路径规划算法,具有较低的复杂度。通过弗吉尼亚一个城区的空中覆盖图(通过射线追踪构建)验证了所提算法的有效性和优越性。仿真结果也展示了UAV路径长度和连接质量之间的权衡。

英文摘要

Cellular networks are promising to support effective wireless communications for unmanned aerial vehicles (UAVs), which will help to enable various long-range UAV applications. However, these networks are optimized for terrestrial users, and thus do not guarantee seamless aerial coverage. In this paper, we propose to overcome this difficulty by exploiting controllable mobility of UAVs, and investigate connectivity-aware UAV path planning. To explicitly impose communication requirements on UAV path planning, we introduce two new metrics to quantify the cellular connectivity quality of a UAV path. Moreover, aerial coverage maps are used to provide accurate locations of scattered coverage holes in the complicated propagation environment. We formulate the UAV path planning problem as finding the shortest path subject to connectivity constraints. Based on graph search methods, a novel connectivity-aware path planning algorithm with low complexity is proposed. The effectiveness and superiority of our proposed algorithm are demonstrated using the aerial coverage map of an urban section in Virginia, which is built by ray tracing. Simulation results also illustrate a tradeoff between the path length and connectivity quality of UAVs.

1905.12240 2026-06-04 eess.SY cs.HC cs.RO cs.SY 版本更新

Research on fuzzy PID Shared control method of small brain-controlled uav

小脑控无人机模糊PID共享控制方法研究

Na Dong, Wen-qi Zhang, Zhong-ke Gao

发表机构 * School of Electrical and Information Engineering, Tianjin University(天津大学电气与信息工程学院)

AI总结 本文针对脑控无人机在信号识别精度、时间限制和命令数量方面的不足,提出基于共享控制的模糊PID辅助控制器,实现自动控制与脑控制的协同,通过评估当前飞行状态和设置切换率来决定自动控制与脑控制的切换机制,提升系统控制性能,并通过矩形轨迹跟踪实验验证算法有效性。

详情
AI中文摘要

脑控无人机是一种通过BCI分析人类脑电信号来获取飞行指令的无人机。脑控无人机研究有助于推动脑机接口的整合,并具有广阔的应用前景。目前,BCI仍存在一些问题,例如识别精度有限、识别时间有限以及在分析EEG信号以获取控制指令时识别命令数量有限。因此,仅由脑控制的四旋翼无人机的控制性能并不理想。基于共享控制的概念,本文设计了一个辅助控制器,使用模糊PID控制,实现了自动控制与脑控制的协同控制。通过评估当前飞行状态并设置切换率,可以决定自动控制与脑控制的切换机制,以提高系统控制性能。最后,为小型四旋翼无人机设计了一个相同高度的矩形轨迹跟踪控制实验,以验证算法的有效性。

英文摘要

Brain-controlled unmanned aerial vehicle (uav) is a uav that can analyze human brain electrical signals through BCI to obtain flight commands. The research of brain-controlled uav can promote the integration of brain-computer and has a broad application prospect. At present, BCI still has some problems, such as limited recognition accuracy, limited recognition time and small number of recognition commands in the acquisition of control commands by analyzing eeg signals. Therefore, the control performance of the quadrotor which is controlled only by brain is not ideal. Based on the concept of Shared control, this paper designs an assistant controller using fuzzy PID control, and realizes the cooperative control between automatic control and brain control. By evaluating the current flight status and setting the switching rate, the switching mechanism of automatic control and brain control can be decided to improve the system control performance. Finally, a rectangular trajectory tracking control experiment of the same height is designed for small quadrotor to verify the algorithm.

1905.11176 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Temporally Coupled Dynamical Movement Primitives in Cartesian Space

笛卡尔空间中耦合的动态运动素体

Martin Karlsson, Anders Robertsson, Rolf Johansson

AI总结 本文提出了一种在笛卡尔空间中耦合动态运动素体(DMPs)的控制方法,利用单位四元数表示姿态,并证明单位四元数集减去一个点是可缩的,从而设计出连续且全局渐近稳定的反馈控制系统。

详情
AI中文摘要

机器人在笛卡尔空间中的姿态控制涉及一些困难,因为旋转群SO(3)不是可缩的,只有全局可缩的状态空间才能支持连续且全局渐近稳定的反馈控制系统。本文利用单位四元数来表示姿态,并首次证明单位四元数集减去一个单点是可缩的。这一结果被用于设计笛卡尔空间中耦合动态运动素体(DMPs)的控制系统。该控制系统的功能在工业机器人上进行了实验验证。

英文摘要

Control of robot orientation in Cartesian space implicates some difficulties, because the rotation group SO(3) is not contractible, and only globally contractible state spaces support continuous and globally asymptotically stable feedback control systems. In this paper, unit quaternions are used to represent orientations, and it is first shown that the unit quaternion set minus one single point is contractible. This is used to design a control system for temporally coupled dynamical movement primitives (DMPs) in Cartesian space. The functionality of the control system is verified experimentally on an industrial robot.

1905.11130 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Autonomous Interpretation of Demonstrations for Modification of Dynamical Movement Primitives

自主解释示范以修改动力学运动原语

Martin Karlsson, Anders Robertsson, Rolf Johansson

发表机构 * LCCC Linnaeus Center(LCCC林纳尤斯中心) ELLIIT Excellence Center(ELLIIT卓越中心)

AI总结 本文提出了一种框架,使机器人操作员能够直观地调整动力学运动原语(DMPs)。通过使用引导通过编程,操作员可以演示纠正轨迹,从而生成一个结合故障轨迹前部分和纠正轨迹后部分的修改DMP。

详情
Journal ref
IEEE International Conference on Robotics and Automation (ICRA), 2017, Singapore
AI中文摘要

动力学运动原语(DMPs)的概念已成为建模运动的流行方法,广泛应用于机器人。本文提出了一种框架,使机器人操作员能够以直观的方式调整DMPs。给定一个具有故障最后部分的生成轨迹,操作员可以使用引导通过编程来演示纠正轨迹。通过将故障轨迹的前部分和纠正轨迹的后部分结合,形成一个修改后的DMP。本文还展示了实时应用并通过实验进行了验证。

英文摘要

The concept of dynamical movement primitives (DMPs) has become popular for modeling of motion, commonly applied to robots. This paper presents a framework that allows a robot operator to adjust DMPs in an intuitive way. Given a generated trajectory with a faulty last part, the operator can use lead-through programming to demonstrate a corrective trajectory. A modified DMP is formed, based on the first part of the faulty trajectory and the last part of the corrective one. A real-time application is presented and verified experimentally.

1905.11129 2026-06-04 cs.RO cs.SY eess.SY 版本更新

On Motion Control and Machine Learning for Robotic Assembly

关于机器人装配的运动控制与机器学习

Martin Karlsson

发表机构 * Department of Automatic Control, Lund University(自动控制系,卢恩大学)

AI总结 本研究通过减少机器人编程所需的工程工作量并增强机器人应对意外事件的能力,提出了新的方法,从而加快编程速度并使非工程背景用户也能使用机器人。

详情
Journal ref
Licentiate Thesis (2017)
AI中文摘要

工业机器人通常需要非常结构化和可预测的工作环境以及显式编程,以发挥良好性能。因此,昂贵且耗时的工程工作是将任务传达给机器人时的主要障碍。本论文提出了减少机器人编程所需工程工作量并提高机器人处理意外事件能力的方法。这有两个主要优势:首先,编程可以更快完成,其次,非工程背景的用户也能使用。尽管这些方法可用于各种类型的机器人应用,但本论文专注于机器人装配任务。

英文摘要

Industrial robots typically require very structured and predictable working environments, and explicit programming, in order to perform well. Therefore, expensive and time-consuming engineering work is a major obstruction when mediating tasks to robots. This thesis presents methods that decrease the amount of engineering work required for robot programming, and increase the ability of robots to handle unforeseen events. This has two main benefits: Firstly, the programming can be done faster, and secondly, it becomes accessible to users without engineering experience. Even though these methods could be used for various types of robot applications, this thesis is focused on robotic assembly tasks.

1905.09396 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Predictive Control for Chasing a Ground Vehicle using a UAV

使用无人机追击地面车辆的预测控制

Jaeseung Byun, Karan P. Jain, Siddharth H. Nair, Haoyun Xu, Jiaming Zha

AI总结 本文提出了一种多旋翼无人机追击地面车辆的高层规划器,同时满足各种状态和输入约束。假设地面车辆的最小运动学模型,利用在线收集的数据在模型预测控制框架内生成预测,通过仿真和稳定四旋翼平台的实验验证了该方案。

详情
AI中文摘要

我们提出了一种多旋翼无人机追击地面车辆的高层规划器,同时满足各种状态和输入约束。假设地面车辆的最小运动学模型,我们利用在线收集的数据在模型预测控制框架内生成预测。我们的解决方案通过仿真和在稳定四旋翼平台上进行的实验得到了验证。

英文摘要

We propose a high-level planner for a multirotor to chase a ground vehicle, while simultaneously respecting various state and input constraints. Assuming a minimal kinematic model for the ground vehicle, we use data collected online to generate predictions for our planner within a model predictive control framework. Our solution is demonstrated, both via simulations and experiments on a stable quadcopter platform.

1905.06978 2026-06-04 eess.SY cs.LG cs.RO cs.SY stat.AP 版本更新

Randomized Algorithms for Data-Driven Stabilization of Stochastic Linear Systems

数据驱动的随机算法用于随机线性系统的稳定化

Mohamad Kazem Shirani Faradonbeh, Ambuj Tewari, George Michailidis

AI总结 本文提出两种随机算法用于数据驱动的随机线性系统稳定化,通过数值分析研究了随机反馈和随机参数方法的稳定速度和失败概率,证明在统计独立随机化数量不小时可以保证快速稳定化。

详情
AI中文摘要

数据驱动的控制策略在动态系统中广泛应用,尤其是在参数未知的情况下。一个关键问题是防止随机线性系统因决策者对动态参数不确定而失稳。本文提出了两种随机算法来解决这个问题,但其性能尚未充分研究。此外,算法中的关键参数,如随机化应用的幅度和频率的影响目前尚不明确。本文研究了数据驱动过程的稳定速度和失败概率。我们对两种方法:随机反馈和随机参数的性能进行了数值分析。所呈现的结果表明,只要统计独立的随机化数量不太多,就可以保证快速稳定化。

英文摘要

Data-driven control strategies for dynamical systems with unknown parameters are popular in theory and applications. An essential problem is to prevent stochastic linear systems becoming destabilized, due to the uncertainty of the decision-maker about the dynamical parameter. Two randomized algorithms are proposed for this problem, but the performance is not sufficiently investigated. Further, the effect of key parameters of the algorithms such as the magnitude and the frequency of applying the randomizations is not currently available. This work studies the stabilization speed and the failure probability of data-driven procedures. We provide numerical analyses for the performance of two methods: stochastic feedback, and stochastic parameter. The presented results imply that as long as the number of statistically independent randomizations is not too small, fast stabilization is guaranteed.

1905.05946 2026-06-04 cs.RO cs.CV cs.SY eess.SY 版本更新

Depth map estimation methodology for detecting free-obstacle navigation areas

用于自由障碍区域检测的深度图估计方法

Sergio Trejo, Karla Martinez, Gerardo Flores

AI总结 本文提出了一种基于视觉的方法,利用立体相机和一维LiDAR估计四旋翼导航中的自由障碍区域。通过加权最小二乘滤波器过滤深度图,并通过卡尔曼滤波算法融合信息,确定四旋翼可通过的足够大自由空间区域。整个过程在Jetson TX2嵌入式计算机上用ROS实现。

详情
Journal ref
ICUAS'19 The 2019 International Conference on Unmanned Aircraft Systems
AI中文摘要

本文提出了一种基于视觉的方法,利用立体相机和一维LiDAR估计四旋翼导航中的自由障碍区域。所提出的方法利用立体相机提供的深度图和一维LiDAR的测距信息。在对深度图进行加权最小二乘滤波器(WLS)过滤后,通过卡尔曼滤波算法融合信息。通过使用卡尔曼滤波器的输出信息,在视差图中标记一个区域,以确定是否存在足够大的自由空间供四旋翼通过。整个过程在Jetson TX2嵌入式计算机上用机器人操作系统(ROS)实现。实验展示了该方法的有效性。

英文摘要

This paper presents a vision-based methodology which makes use of a stereo camera rig and a one dimension LiDAR to estimate free obstacle areas for quadrotor navigation. The presented approach fuses information provided by a depth map from a stereo camera rig, and the sensing distance of the 1D-LiDAR. Once the depth map is filtered with a Weighted Least Squares filter (WLS), the information is fused through a Kalman filter algorithm. To determine if there is a free space large enough for the quadrotor to pass through, our approach marks an area inside the disparity map by using the Kalman Filter output information. The whole process is implemented in an embedded computer Jetson TX2 and coded in the Robotic Operating System (ROS). Experiments demonstrate the effectiveness of our approach.

1811.01516 2026-06-04 cs.RO cs.SY eess.SY 版本更新

SLAMBooster: An Application-aware Controller for Approximation in SLAM

SLAMBooster: 一种面向应用的近似控制算法用于SLAM

Yan Pei, Swarnendu Biswas, Donald S. Fussell, Keshav Pingali

发表机构 * University of Texas at Austin, USA(德克萨斯大学奥斯汀分校) Indian Institute of Technology Kanpur, India(印度理工学院坎普尔分校)

AI总结 本文提出SLAMBooster,一种面向应用的在线控制算法,用于在SLAM中实现近似计算,通过动态调整近似参数来减少计算时间和能耗,同时保持定位精度。

Comments 13 pages

详情
AI中文摘要

同时定位与建图(SLAM)是构建移动代理环境地图的同时定位代理的问题。密集SLAM算法在像素粒度上执行重建和定位。这些算法需要大量计算资源,这限制了它们在低功耗资源受限设备上的应用。近似计算可以用于加速SLAM实现,只要近似不会阻止代理正确导航通过环境。先前的SLAM近似研究假设代理的整个轨迹在开始前已知,并且专注于离线控制器,在轨迹开始时设置近似参数。在实践中,轨迹并不事先已知,允许在运行时动态调整参数提供了更多减少计算时间和能耗的机会。我们描述了SLAMBooster,一种面向应用的在线控制系统,用于密集SLAM,它在代理运动过程中自适应地控制近似参数。SLAMBooster基于比例-积分-微分(PID)控制器技术,但我们的实验表明这种通用控制器导致定位精度显著下降。为了解决这个问题,SLAMBooster还利用领域知识通过执行平滑表面检测和姿态校正来控制近似。我们实现了SLAMBooster在开源的SLAMBench框架中,并在多个轨迹上进行了评估。我们的实验表明,在嵌入式平台上,SLAMBooster平均减少了72%的计算时间和35%的能量消耗,同时保持定位精度在合理范围内。这些改进使得在更广泛设备上部署SLAM成为可能。

英文摘要

Simultaneous Localization and Mapping (SLAM) is the problem of constructing a map of a mobile agent's environment while localizing the agent within the map. Dense SLAM algorithms perform reconstruction and localization at pixel granularity. These algorithms require a lot of computational power, which has hindered their use on low-power resource-constrained devices. Approximate computing can be used to speed up SLAM implementations as long as the approximations do not prevent the agent from navigating correctly through the environment. Previous studies of approximation in SLAM have assumed that the entire trajectory of the agent is known before the agent starts, and they have focused on offline controllers that set approximation knobs at the start of the trajectory. In practice, the trajectory is not known ahead of time, and allowing knob settings to change dynamically opens up more opportunities for reducing computation time and energy. We describe SLAMBooster, an application-aware, online control system for dense SLAM that adaptively controls approximation knobs during the motion of the agent. SLAMBooster is based on a control technique called proportional-integral-derivative (PID) controller but our experiments showed this application-agnostic controller led to an unacceptable reduction in localization accuracy. To address this problem, SLAMBooster also exploits domain knowledge for controlling approximation by performing smooth surface detection and pose correction. We implemented SLAMBooster in the open-source SLAMBench framework and evaluated it on several trajectories. Our experiments show that on the average, SLAMBooster reduces the computation time by 72% and energy consumption by 35% on an embedded platform, while maintaining the accuracy of localization within reasonable bounds. These improvements make it feasible to deploy SLAM on a wider range of devices.

1810.03076 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Online Center of Mass Estimation for a Humanoid Wheeled Inverted Pendulum Robot

人形轮式反重力摆机器人在线质心估计

Munzir Zafar, Akash Patel, Bogdan Vlahov, Nathaniel Glaser, Sergio Aguillera, Seth Hutchinson

AI总结 本文提出了一种新颖的鲁棒控制与在线学习结合的方法,用于平衡具有n自由度的轮式反重力摆人形机器人,通过在线学习更新质量模型以获得更准确的质心估计,实验表明该方法提升了整体控制效率。

详情
AI中文摘要

我们提出了一种新颖的鲁棒控制和在线学习应用,用于平衡具有n个自由度(DoF)的轮式反重力摆(WIP)人形机器人。我们的技术将质量模型的不准确性转化为质心(CoM)误差,并在存在误差的情况下实现平衡,同时利用在线学习更新质量模型以获得更好的质心估计。使用我们机器人的模拟模型,我们元学习了一组激励关节姿态,使我们的梯度下降算法快速收敛到准确的(CoM)估计。该模拟流程完全在线执行,使用主动扰动抵消来解决由于持续演变的质量模型所产生的质量误差。在19个自由度的WIP上进行了实验,我们手动获取了学习姿态集的数据,并展示了由梯度下降产生的质量模型产生的质心估计能够提高整体控制和效率。本工作为Golem Krang人形机器人整体控制贡献了更丰富的文献。

英文摘要

We present a novel application of robust control and online learning for the balancing of a n Degree of Freedom (DoF), Wheeled Inverted Pendulum (WIP) humanoid robot. Our technique condenses the inaccuracies of a mass model into a Center of Mass (CoM) error, balances despite this error, and uses online learning to update the mass model for a better CoM estimate. Using a simulated model of our robot, we meta-learn a set of excitory joint poses that makes our gradient descent algorithm quickly converge to an accurate (CoM) estimate. This simulated pipeline executes in a fully online fashion, using active disturbance rejection to address the mass errors that result from a steadily evolving mass model. Experiments were performed on a 19 DoF WIP, in which we manually acquired the data for the learned set of poses and show that the mass model produced by a gradient descent produces a CoM estimate that improves overall control and efficiency. This work contributes to a greater corpus of whole body control on the Golem Krang humanoid robot.

1809.06715 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Controller Synthesis for Discrete-time Hybrid Polynomial Systems via Occupation Measures

通过占用测度进行离散时间混合多项式系统的控制器合成

Weiqiao Han, Russ Tedrake

AI总结 本文提出了一种基于占用测度的控制器合成方法,用于稳定具有多个与环境接触的刚体系统,通过解决有限维半正定规划问题来近似无限维线性规划问题,以实现系统稳定。

Comments Accepted by ICRA 2019. Some text overlap with arXiv:1803.09022 in introducing standard notations and preliminary knowledge. arXiv admin note: text overlap with arXiv:1803.09022

详情
AI中文摘要

我们考虑通过与环境建立和断开多个接触来稳定刚体系统的设计问题,不预先指定接触的时间或发生次数。我们将此类系统建模为离散时间混合多项式系统,其中状态-输入空间被划分为若干多边形区域,每个区域关联不同的多项式动力学方程。基于占用测度的概念,我们提出了一种新的控制器合成方法,通过求解有限维半正定规划问题来近似无限维线性规划问题以稳定系统。该优化公式简单且凸,对于任何固定的近似次数,计算复杂度与状态和控制输入的维度呈多项式关系。我们通过一些机器人示例来说明我们的方法。

英文摘要

We consider the feedback design for stabilizing a rigid body system by making and breaking multiple contacts with the environment without prespecifying the timing or the number of occurrence of the contacts. We model such a system as a discrete-time hybrid polynomial system, where the state-input space is partitioned into several polytopic regions with each region associated with a different polynomial dynamics equation. Based on the notion of occupation measures, we present a novel controller synthesis approach that solves finite-dimensional semidefinite programs as approximations to an infinite-dimensional linear program to stabilize the system. The optimization formulation is simple and convex, and for any fixed degree of approximations the computational complexity is polynomial in the state and control input dimensions. We illustrate our approach on some robotics examples.

1905.03349 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A New Hybrid Control Architecture to Attenuate Large Horizontal Wind Disturbance for a Small-Scale Unmanned Helicopter

一种新型混合控制架构用于抑制小型无人驾驶直升机的大水平风扰动

Xiaorui Zhu, Wenwu Zeng, Zexiang Li, Chunyang Zheng

发表机构 * School of Mechanical Engineering and Automation, Harbin Institute of Technology Shenzhen Graduate School(哈尔滨工业大学机械工程与自动化学院深圳研究生院) Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology(香港理工大学电子与计算机工程系)

AI总结 本文提出了一种结合风洞实验数据和反步算法的新方法,用于抑制小型无人驾驶直升机的大水平风扰动,通过混合控制架构实现更精确快速的扰动抑制。

详情
AI中文摘要

本文提出了一种新颖的方法,用于抑制小型无人驾驶自主直升机的大水平风扰动,结合风洞实验数据和反步算法。大水平风扰动对自主直升机有害,尤其是小型直升机,由于其低惯性以及多输入间的强交叉耦合效应。为实现更精确和快速的扰动抑制,提出了一种新的混合控制架构,利用基于风洞实验数据的直接力/力矩补偿。在该架构中,大水平风扰动被视为控制系统的附加输入,而非平衡状态附近的微小扰动。然后设计反步算法以保证直升机稳定收敛至期望位置。所提出的技术最终在HIROBO Eagle平台上通过仿真进行评估,并与传统风速补偿方法进行比较。

英文摘要

This paper presents a novel method to attenuate large horizontal wind disturbance for a small-scale unmanned autonomous helicopter combining wind tunnel-based experimental data and a backstepping algorithm. Large horizontal wind disturbance is harmful to autonomous helicopters, especially to small ones because of their low inertia and the high cross-coupling effects among the multiple inputs. In order to achieve more accurate and faster attenuation of large wind disturbance, a new hybrid control architecture is proposed to take advantage of the direct force/moment compensation based on the wind tunnel experimental data. In this architecture, large horizontal wind disturbance is treated as an additional input to the control system instead of a small perturbation around the equilibrium state. A backstepping algorithm is then designed to guarantee the stable convergence of the hilicopter to the desired position. The proposed technique is finally evaluated in simulation on the platform, HIROBO Eagle, compared with a traditional wind velocity compensation method.

1905.03131 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Vision-based Unscented FastSLAM for Mobile Robot

基于视觉的无迹快速SLAM

Chunxin Qiu, Xiaorui Zhu, Xiaobing Zhao

AI总结 本文提出一种结合无迹粒子滤波和无迹卡尔曼滤波的视觉无迹快速SLAM算法,通过双目视觉检测地标实现定位与建图,利用无迹快速SLAM提升定位与建图的精度和鲁棒性。

详情
AI中文摘要

本文提出一种基于视觉的无迹快速SLAM(UFastSLAM)算法,结合了 Rao-Blackwellized 粒子滤波和无迹卡尔曼滤波(UKF)。地标通过双目视觉检测来整合定位与建图。由于双目视觉系统通常继承较大的测量误差,因此采用无迹快速SLAM来提高定位与建图的性能。无迹快速SLAM利用UKF代替非线性函数的线性近似,有效粒子数作为标准以减少粒子退化。通过仿真和实验证明,无迹快速SLAM算法在视觉系统中比FastSLAM2.0算法在精度和鲁棒性方面表现更优。

英文摘要

This paper presents a vision-based Unscented FastSLAM (UFastSLAM) algorithm combing the Rao-Blackwellized particle filter and Unscented Kalman filte(UKF). The landmarks are detected by a binocular vision to integrate localization and mapping. Since such binocular vision system generally inherits larger measurement errors, it is suitable to adopt Unscented FastSLAM to improve the performance of localization and mapping. Unscented FastSLAM takes advantage of UKF instead of the linear approximations of the nonlinear function where the effective number of particles is used as the criteria to reduce the particle degeneration. Simulations and experiments are carried out to demonstrate that the Unscented FastSLAM algorithm can achieve much better performance in the vision-based system than FastSLAM2.0 algorithm on the accuracy and robustness.

1905.03130 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Cooperative Distributed Robust Control of Modular Mobile Robots with Bounded Curvature and Velocity

模块化移动机器人协同分布式鲁棒控制:具有有限曲率和速度的控制

Xiaorui Zhu, Youngshik Kim, Mark A. Minor

发表机构 * University of Utah(犹他大学)

AI总结 本文研究了一种新型的运动控制系统,用于具有柔性框架轮式模块化移动机器人(CFMMR)。该系统结合了基于有限曲率的运动控制器和非线性阻尼动态控制器,通过多种控制器交互形式评估不同控制结构,实验结果验证了姿态调节的有效性。

详情
AI中文摘要

本文研究了一种新型的运动控制系统,用于Compliant Framed wheeled Modular Mobile Robots (CFMMR)。该类轮式移动机器人通过柔性框架模块耦合刚性轴,提供全面的悬挂和增强的转向能力,而无需额外硬件。所提出的控制系统是通过结合基于有限曲率的运动控制器和非线性阻尼动态控制器开发的。特别是,研究了多种控制器交互形式。使用双轴侦察CFMMR配置来评估不同的控制结构。实验结果验证了高效的运动控制性能。

英文摘要

A novel motion control system for Compliant Framed wheeled Modular Mobile Robots (CFMMR) is studied in this paper. This type of wheeled mobile robot uses rigid axles coupled by compliant frame modules to provide both full suspension and enhanced steering capability without additional hardware. The proposed control system is developed by combining a bounded curvature-based kinematic controller and a nonlinear damping dynamic controller. In particular, multiple forms of controller interaction are examined. A twoaxle scout CFMMR configuration is used to evaluate the different control structures. Experimental results verify efficient motion control of posture regulation.

1905.03051 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Bayesian Optimization for Polynomial Time Probabilistically Complete STL Trajectory Synthesis

基于多项式时间概率完备STL轨迹综合的贝叶斯优化

Vince Kurtz, Hai Lin

AI总结 本文提出了一种基于贝叶斯优化的STL轨迹综合方法,替代传统的混合整数线性规划方法,实现了多项式时间复杂度和线性预测数量的高效轨迹综合,同时保证了概率完备性。

Comments CDC 2019 Submission

详情
AI中文摘要

近年来,信号临时逻辑(STL)作为机器人和网络物理系统控制目标编码的实用且表达能力强的手段获得了广泛关注。STL轨迹综合的最新进展是将问题形式化为混合整数线性规划(MILP)。MILP方法在有界规范下是正确且完整的,但这种强正确性保证以指数复杂度和规范的时间界为代价。在本工作中,我们提出了一种替代的综合范式,依赖于贝叶斯优化而不是混合整数规划。这将完备性保证放松为概率完备性,但显著更高效:我们的方法在STL时间界上呈多项式复杂度,在预测数量上呈线性复杂度。我们证明我们的方法是正确且概率完备的,并通过一个非平凡的例子展示了其可扩展性。

英文摘要

In recent years, Signal Temporal Logic (STL) has gained traction as a practical and expressive means of encoding control objectives for robotic and cyber-physical systems. The state-of-the-art in STL trajectory synthesis is to formulate the problem as a Mixed Integer Linear Program (MILP). The MILP approach is sound and complete for bounded specifications, but such strong correctness guarantees come at the price of exponential complexity in the number of predicates and the time bound of the specification. In this work, we propose an alternative synthesis paradigm that relies on Bayesian optimization rather than mixed integer programming. This relaxes the completeness guarantee to probabilistic completeness, but is significantly more efficient: our approach scales polynomially in the STL time-bound and linearly in the number of predicates. We prove that our approach is sound and probabilistically complete, and demonstrate its scalability with a nontrivial example.

1905.01028 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Robust Cooperative Formation Control of Fixed-Wing Unmanned Aerial Vehicles

固定翼无人飞行器鲁棒协同编队控制

Qingrui Zhang, Hugh H. T. Liu

发表机构 * Institute for Aerospace Studies, University of Toronto(航空航天研究学院,多伦多大学)

AI总结 本文研究了固定翼无人机在紧密编队飞行中用于节能的鲁棒协同编队控制问题,提出了一种新的协同控制方法,通过虚拟结构概念解决大规模无人机编队中虚拟领导者设计的难题,并采用协作者滤波器生成平滑的参考信号,同时利用不确定性与扰动观测器补偿气动耦合导致的模型不确定性,最终实现了鲁棒的编队控制。

详情
AI中文摘要

本文研究了固定翼无人飞行器在紧密编队飞行中用于节能的鲁棒协同编队控制问题。提出了一种新的协同控制方法。采用虚拟结构的概念来解决在编队飞行中为大量无人机设计虚拟领导者所面临的困难。为了提高暂态性能,期望轨迹通过一组协作者滤波器生成平滑的参考信号,即虚拟领导者的状态。利用不确定性与扰动观测器估计并补偿由于无人机之间气动耦合导致的模型不确定性。因此,整个设计包含三个主要组成部分:用于运动规划的协作者滤波器、基准协同控制以及不确定性与扰动观测。所提出的编队控制器至少可以保证编队跟踪的最终有界控制性能。如果满足某些条件,则可以实现渐近编队跟踪控制。本文的主要贡献在于两个方面:1)通过虚拟结构概念解决了设计虚拟领导者的问题;2)提出了一种针对大量无人机之间存在气动耦合的紧密编队飞行的鲁棒协同控制器。所提出设计的效率将通过五架无人机紧密编队飞行的数值模拟来展示。

英文摘要

Robust cooperative formation control is investigated in this paper for fixed-wing unmanned aerial vehicles in close formation flight to save energy. A novel cooperative control method is developed. The concept of virtual structure is employed to resolve the difficulty in designing virtual leaders for a large number of UAVs in formation flight. To improve the transient performance, desired trajectories are passed through a group of cooperative filters to generate smooth reference signals, namely the states of the virtual leaders. Model uncertainties due to aerodynamic couplings among UAVs are estimated and compensated using uncertainty and disturbance observers. The entire design, therefore, contains three major components: cooperative filters for motion planning, baseline cooperative control, and uncertainty and disturbance observation. The proposed formation controller could at least secure ultimate bounded control performance for formation tracking. If certain conditions are satisfied, asymptotic formation tracking control could be obtained. Major contributions of this paper lie in two aspects: 1) the difficulty in designing virtual leaders is resolved in terms of the virtual structure concept; 2) a robust cooperative controller is proposed for close formation flight of a large number of UAVs suffering from aerodynamic couplings in between. The efficiency of the proposed design will be demonstrated using numerical simulations of five UAVs in close formation flight.

1809.07874 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Task-Driven Estimation and Control via Information Bottlenecks

基于信息瓶颈的任务驱动估计与控制

Vincent Pacelli, Anirudha Majumdar

发表机构 * Princeton University(普林斯顿大学)

AI总结 本文提出了一种基于信息瓶颈理论的任务驱动估计与控制框架,通过任务相关的变量构建高效表示,并设计了在线算法进行状态估计以实现鲁棒的控制策略。

Comments 9 pages, 4 figures, abridged version accepted to ICRA2019; Incorporates changes in final conference submission

详情
AI中文摘要

我们的目标是开发一种原理明确且通用的算法框架,用于机器人系统的任务驱动估计与控制。最先进的机器人控制系统通常依赖于准确估计机器人完整状态(例如,运行中的机器人可能估计关节角度和速度、躯干状态以及相对于目标的位置)。然而,完整状态表示通常对于特定任务来说过于丰富,可能导致显著的计算不高效和对状态估计误差的脆弱性。相反,我们提出了一种方法,避免使用这种丰富的表示,并试图创建任务驱动的表示。关键的技术洞察是利用信息瓶颈理论来形式化“任务驱动表示”的概念,以信息理论量度来衡量表示的最小性。我们提出了新的迭代算法,用于自动合成(离线)任务驱动表示(以一组任务相关变量(TRVs)给出)和一个以TRVs为函数的高效控制策略。我们还提出了在线算法来估计TRVs,以便应用控制策略。我们证明了我们的方法在理论和彻底的模拟实验(包括向目标位置奔跑的弹簧加载倒立摆)中都对未建模的测量不确定性具有显著的鲁棒性。

英文摘要

Our goal is to develop a principled and general algorithmic framework for task-driven estimation and control for robotic systems. State-of-the-art approaches for controlling robotic systems typically rely heavily on accurately estimating the full state of the robot (e.g., a running robot might estimate joint angles and velocities, torso state, and position relative to a goal). However, full state representations are often excessively rich for the specific task at hand and can lead to significant computational inefficiency and brittleness to errors in state estimation. In contrast, we present an approach that eschews such rich representations and seeks to create task-driven representations. The key technical insight is to leverage the theory of information bottlenecks}to formalize the notion of a "task-driven representation" in terms of information theoretic quantities that measure the minimality of a representation. We propose novel iterative algorithms for automatically synthesizing (offline) a task-driven representation (given in terms of a set of task-relevant variables (TRVs)) and a performant control policy that is a function of the TRVs. We present online algorithms for estimating the TRVs in order to apply the control policy. We demonstrate that our approach results in significant robustness to unmodeled measurement uncertainty both theoretically and via thorough simulation experiments including a spring-loaded inverted pendulum running to a goal location.

1905.01683 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Path Planning for Autonomous Bus Driving in Urban Environments

城市环境中自动驾驶巴士的路径规划

Rui Oliveira, Pedro F. Lima, Gonçalo Collares Pereira, Jonas Mårtensson, Bo Wahlberg

AI总结 本文提出了一种针对城市环境中巴士驾驶的路径规划框架,通过优化问题解决巴士的复杂驾驶任务,利用道路对齐车辆模型,并考虑巴士的过道特性以实现安全的避障约束。

Comments 6 pages, 8 figures

详情
AI中文摘要

在城市环境中驾驶往往面临需要专家操作车辆的困难情况。当考虑大型车辆,如巴士时,这些情况变得更加具有挑战性。我们提出了一种路径规划框架,以解决巴士在城市区域中的复杂驾驶任务。该方法使用道路对齐的车辆模型进行建模。道路对齐的坐标系引入了对车辆本体和障碍物的扭曲,促使开发新的近似方法来捕捉这种扭曲。这些近似方法允许安全且非保守的碰撞避免约束的制定。与其他路径规划方法不同,我们的方法利用了 curb 和其他可扫过的区域,这些区域巴士在执行某些操作时必须扫过。此外,它充分利用了巴士的特定特性,即过道,这是车辆底盘的升高部分,可以扫过 curb。进行了模拟,展示了所提出方法的适用性和优势。

英文摘要

Driving in urban environments often presents difficult situations that require expert maneuvering of a vehicle. These situations become even more challenging when considering large vehicles, such as buses. We present a path planning framework that addresses the demanding driving task of buses in urban areas. The approach is formulated as an optimization problem using the road-aligned vehicle model. The road-aligned frame introduces a distortion on the vehicle body and obstacles, motivating the development of novel approximations that capture this distortion. These approximations allow for the formulation of safe and non-conservative collision avoidance constraints. Unlike other path planning approaches, our method exploits curbs and other sweepable regions, which a bus must often sweep over in order to manage certain maneuvers. Furthermore, it takes full advantage of the particular characteristics of buses, namely the overhangs, an elevated part of the vehicle chassis, that can sweep over curbs. Simulations are presented, showing the applicability and benefits of the proposed method.

1903.10623 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Attitude- and Cruise Control of a VTOL Tiltwing UAV

VTOL 倾转翼无人机的姿态与巡航控制

David Rohr, Thomas Stastny, Sebastian Verling, Roland Siegwart

发表机构 * IEEE

AI总结 本文提出了一种过量驱动的垂直起降(VTOL)倾转翼无人机的数学建模、控制器设计和飞行测试方法,通过简化空气动力学和第一原理,建立了能够捕捉关键空气动力学效应的动态模型,包括螺旋桨滑流对机翼的影响和机翼后失速特性。通过优化如功率最优调平等方法解决了无人机的过量驱动问题,并开发了由低层姿态控制器和高层巡航控制器组成的控制系统,通过系统线性化和查找表确定飞行包线内调平的强非线性变化,通过广泛的飞行测试验证了控制系统在所有飞行阶段(悬停、过渡、巡航)的性能。

Comments 8 pages

详情
AI中文摘要

本文提出了一种垂直起降(VTOL)倾转翼无人机的数学建模、控制器设计和飞行测试方法。基于简化空气动力学和第一原理,建立了能够捕捉关键空气动力学效应的动态模型,包括螺旋桨滑流对机翼的影响和机翼后失速特性。基于该模型分析了稳态飞行包线和相应的调平作用,并通过优化如功率最优调平等方法解决了无人机的过量驱动问题。所开发的控制系统由两个控制器组成:首先是一个基于动态反向和串行方法的低层姿态控制器,用于处理冗余执行器的分配。其次是一个高层巡航控制器,用于跟踪所需的垂直速度。该控制器基于系统线性化和查找表,以确定飞行包线内调平的强非线性变化。我们通过广泛的飞行测试验证了该控制系统的性能,适用于所有飞行阶段(悬停、过渡、巡航)。

英文摘要

This paper presents the mathematical modeling, controller design, and flight-testing of an over-actuated Vertical Take-off and Landing (VTOL) tiltwing Unmanned Aerial Vehicle (UAV). Based on simplified aerodynamics and first-principles, a dynamical model of the UAV is developed which captures key aerodynamic effects including propeller slipstream on the wing and post-stall characteristics of the airfoils. The model-based steady-state flight envelope and the corresponding trim-actuation is analyzed and the overactuation of the UAV solved by optimizing for, e.g., power-optimal trims. The developed control system is composed of two controllers: First, a low-level attitude controller based on dynamic inversion and a daisy-chaining approach to handle allocation of redundant actuators. Secondly, a higher-level cruise controller to track a desired vertical velocity. It is based on a linearization of the system and look-up tables to determine the strong and nonlinear variation of the trims throughout the flight-envelope. We demonstrate the performance of the control-system for all flight phases (hover, transition, cruise) in extensive flight-tests.

1905.01150 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

A Right-of-Way Based Strategy to Implement Safe and Efficient Driving at Non-Signalized Intersections for Automated Vehicles

基于通行权的非信号交叉口安全高效自动驾驶策略

Yadong Xing, Can Zhao, ZhiHeng Li, Yi Zhang, Li Li, Fei-Yue Wang, Xiao Wang, Yujing Wang, Yuelong Su, Dongpu Cao

发表机构 * Graduate School at Shenzhen, Tsinghua University(清华大学深圳研究生院) Department of Automation, BNRist, Tsinghua University(清华大学自动化系,北京理工大学) traffic management solution division, traffic throughput through simulation experiments(交通管理解决方案部)

AI总结 本文提出一种基于通行权分配的策略,用于在非信号交叉口实现安全高效的自动驾驶,通过测试比较现有策略,证明该策略在交通效率上优于原有策略,但受限于通信范围和缺乏长期规划,不如协作驾驶策略,但通信成本更低。

Comments 6 pages, 7 figures

详情
AI中文摘要

非信号交叉口是连接和自动化车辆(CAVs)的典型和常见场景。如何平衡安全性和效率仍然是研究人员的难题。为了改进原有的责任敏感安全(RSS)驾驶策略在非信号交叉口的表现,本文提出了一种基于通行权分配(RWA)的新策略。对RSS策略、协作驾驶策略和基于RWA的策略进行了测试和比较。测试结果表明,我们的策略在交通效率上优于RSS策略,但因通信范围有限和缺乏长期规划,不如协作驾驶策略。然而,我们的新策略所需的车辆间通信成本要少得多。

英文摘要

Non-signalized intersection is a typical and common scenario for connected and automated vehicles (CAVs). How to balance safety and efficiency remains difficult for researchers. To improve the original Responsibility Sensitive Safety (RSS) driving strategy on the non-signalized intersection, we propose a new strategy in this paper, based on right-of-way assignment (RWA). The performances of RSS strategy, cooperative driving strategy, and RWA based strategy are tested and compared. Testing results indicate that our strategy yields better traffic efficiency than RSS strategy, but not satisfying as the cooperative driving strategy due to the limited range of communication and the lack of long-term planning. However, our new strategy requires much fewer communication costs among vehicles.

1904.12394 2026-06-04 math.DS cs.NA cs.RO math.NA 版本更新

Stability conditions of an ODE arising in human motion and its numerical simulation

来自人体运动的ODE的稳定性条件及其数值模拟

Takahiro Kosugi, Hitoshi Kino, Masaaki Goto, Yuki Matsutani

发表机构 * Department of Intelligent Mechanical Engineering, Faculty of Engineering, Fukuoka Institute of Technology(福冈技术学院智能机械工程系) Department of Robotics, Faculty of Engineering, Kindai University(近畿大学机器人系)

AI总结 本文研究了一个来自肌骨系统前馈位置控制的ODE的平衡点稳定性,提出了一种渐近稳定的充分条件,并通过惩罚ODE的数值模拟和实验结果验证了该条件。

Comments 15 pages, 7 figures

详情
AI中文摘要

本文讨论了一个来自肌骨系统前馈位置控制的常微分方程(ODE)的平衡点稳定性。所研究的系统包含一个连杆、一个关节和两条具有路由点的肌肉。系统的运动收敛性强烈依赖于肌骨系统中肌肉的排列方式。本文获得了渐近稳定的充分条件。此外,还描述了惩罚ODE的数值模拟和实验结果。

英文摘要

This paper discusses the stability of an equilibrium point of an ordinary differential equation (ODE) arising from a feed-forward position control for a musculoskeletal system. The studied system has a link, a joint and two muscles with routing points. The motion convergence of the system strongly depends on the muscular arrangement of the musculoskeletal system. In this paper, a sufficient condition for asymptotic stability is obtained. Furthermore, numerical simulations of the penalized ODE and experimental results are described.

1808.07921 2026-06-04 cs.RO cs.AI cs.PL cs.SE cs.SY eess.SY 版本更新

SOTER: A Runtime Assurance Framework for Programming Safe Robotics Systems

SOTER:一种用于安全机器人系统编程的运行时保证框架

Ankush Desai, Shromona Ghosh, Sanjit A. Seshia, Natarajan Shankar, Ashish Tiwari

发表机构 * University of California at Berkeley, CA, USA(加州大学伯克利分校) SRI International(SRI国际) Microsoft(微软)

AI总结 本文提出SOTER框架,通过一种编程语言和集成的运行时保证系统,为安全机器人系统提供保障,确保在使用未经认证组件时仍能满足安全要求。

详情
AI中文摘要

近年来,机器人实现更高自主性和智能性的趋势导致了高度复杂性。自主机器人越来越多地依赖第三方现成组件和复杂的机器学习技术。这种趋势使得提供强设计时认证的正确操作变得具有挑战性。为了解决这些挑战,我们提出了SOTER,一种机器人编程框架,包含两个关键组件:(1)一种用于实现和测试高层反应式机器人软件的编程语言;(2)一个集成的运行时保证(RTA)系统,该系统帮助在使用未经认证的组件时仍能提供安全保证。SOTER提供了语言原语,用于声明性地构建RTA模块,该模块包含一个高级高性能控制器(未经认证)、一个安全但性能较低的控制器(认证)以及期望的安全规范。该框架提供正式保证,确保一个良好的RTA模块始终满足安全规范,而无需完全牺牲性能,通过在安全时使用高性能未经认证的组件。SOTER允许复杂的机器人软件堆栈作为RTA模块的组合来构建,其中每个未经认证的组件都通过RTA模块进行保护。为了证明我们框架的有效性,我们考虑了一个现实世界案例研究,即构建一个安全的无人机监视系统。我们的实验在模拟和实际无人机上均表明,SOTER启用的RTA确保了系统的安全性,包括在不可信的第三方组件有bug或偏离预期行为时。

英文摘要

The recent drive towards achieving greater autonomy and intelligence in robotics has led to high levels of complexity. Autonomous robots increasingly depend on third party off-the-shelf components and complex machine-learning techniques. This trend makes it challenging to provide strong design-time certification of correct operation. To address these challenges, we present SOTER, a robotics programming framework with two key components: (1) a programming language for implementing and testing high-level reactive robotics software and (2) an integrated runtime assurance (RTA) system that helps enable the use of uncertified components, while still providing safety guarantees. SOTER provides language primitives to declaratively construct a RTA module consisting of an advanced, high-performance controller (uncertified), a safe, lower-performance controller (certified), and the desired safety specification. The framework provides a formal guarantee that a well-formed RTA module always satisfies the safety specification, without completely sacrificing performance by using higher performance uncertified components whenever safe. SOTER allows the complex robotics software stack to be constructed as a composition of RTA modules, where each uncertified component is protected using a RTA module. To demonstrate the efficacy of our framework, we consider a real-world case-study of building a safe drone surveillance system. Our experiments both in simulation and on actual drones show that the SOTER-enabled RTA ensures the safety of the system, including when untrusted third-party components have bugs or deviate from the desired behavior.

1904.08833 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Passivity-based Nonlinear Admittance Control with Application to Powered Upper-limb Control under Unknown Environmental Interactions

一种基于被动性的非线性阻抗控制及其在未知环境交互下的动力上肢控制应用

Min Jun Kim, Woongyong Lee, Jae Yeon Choi, Goobong Chung, Kyung-Lyong Han, Il Seop Choi, Christian Ott, Wan Kyun Chung

AI总结 本文提出了一种基于被动性理论的动力上肢外骨骼机器人的阻抗控制器,通过非线性运动方程建模,利用被动性理论将人类操作员和环境交互纳入控制回路,通过力/扭矩传感器与人类交互,通过末端执行器与环境交互,尽管环境交互无法被任何传感器检测到(未知),但被动性允许自然交互。分析表明,当控制增益趋于无穷大时,实际系统的行为与名义模型相似,表明所提出的方法是一种阻抗控制器。然而,由于实际中控制增益无法无限增长,根据可实现的控制增益性能限制也被分析。分析结果表明,按无限范数意义,性能与控制增益成线性关系。在实验中,使用1自由度测试台验证了所提出的方法,并用实际的动力上肢外骨骼设备来提升和操控未知负载。

Comments Accepted in IEEE/ASME Transactions on Mechatronics (T-MECH)

详情
AI中文摘要

本文提出了一种基于被动性理论的动力上肢外骨骼机器人的阻抗控制器,该机器人由非线性运动方程支配。被动性允许我们将人类操作员和环境交互纳入控制回路。机器人通过F/T传感器与人类操作员交互,主要通过末端执行器与环境交互。尽管环境交互无法被任何传感器检测到(因此未知),被动性允许我们实现自然交互。分析表明,当控制增益趋于无穷大时,实际系统的行为与名义模型相似,这表明所提出的方法是一种阻抗控制器。然而,由于实际中控制增益无法无限增长,根据可实现的控制增益的性能限制也被分析。分析结果表明,按无限范数意义,性能与控制增益成线性关系。在实验中,所提出的方法通过1自由度测试台进行了验证,并使用实际的动力上肢外骨骼设备来提升和操控未知负载。

英文摘要

This paper presents an admittance controller based on the passivity theory for a powered upper-limb exoskeleton robot which is governed by the nonlinear equation of motion. Passivity allows us to include a human operator and environmental interaction in the control loop. The robot interacts with the human operator via F/T sensor and interacts with the environment mainly via end-effectors. Although the environmental interaction cannot be detected by any sensors (hence unknown), passivity allows us to have natural interaction. An analysis shows that the behavior of the actual system mimics that of a nominal model as the control gain goes to infinity, which implies that the proposed approach is an admittance controller. However, because the control gain cannot grow infinitely in practice, the performance limitation according to the achievable control gain is also analyzed. The result of this analysis indicates that the performance in the sense of infinite norm increases linearly with the control gain. In the experiments, the proposed properties were verified using 1 degree-of-freedom testbench, and an actual powered upper-limb exoskeleton was used to lift and maneuver the unknown payload.

1904.08361 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems

基于解耦数据的方法用于学习控制非线性动力学系统

Ran Wang, Karthikeya Parunandi, Dan Yu, Dileep Kalathil, Suman Chakravorty

发表机构 * College of Astronautics, Nanjing University and hence, run into the curse of dimensionality(南京大学航天学院) Department of Electrical and Computer Engineering, Texas A&M University, Texas, USA(德克萨斯A&M大学电气与计算机工程系)

AI总结 本文提出了一种解耦数据基于的方法,用于学习控制具有连续状态空间、连续动作空间和未知动态的非线性随机动力学系统,通过解耦的开环-闭环方法,利用黑盒仿真模型解决开环确定性轨迹优化问题,并通过线性化动态在该名义轨迹上开发闭环控制,从而使用线性二次调节器算法,证明了该方法的性能近似最优,并在训练时间上显著优于其他先进算法。

详情
AI中文摘要

本文解决了一个非线性随机动力学系统学习最优控制策略的问题,该系统具有连续状态空间、连续动作空间和未知动态。此类问题通常在随机自适应控制和强化学习文献中使用基于模型和无模型的方法分别解决。这两种方法都依赖于解决动态规划问题,无论是直接还是间接,以找到最优闭环控制策略。动态规划方法固有的'维度灾难'使这些方法也变得计算上困难。本文提出了一种新颖的解耦数据基于控制(D2C)算法,通过解耦的'开环-闭环'方法解决这个问题。首先,使用动力学系统的黑盒仿真模型解决一个开环确定性轨迹优化问题。然后,通过在该名义轨迹上线性化动态,开发围绕该开环轨迹的闭环控制。通过线性化,可以使用基于线性二次调节器的算法来实现该闭环控制。我们证明了D2C算法的性能近似最优。此外,仿真性能表明,与其它先进算法相比,训练时间显著减少。

英文摘要

This paper addresses the problem of learning the optimal control policy for a nonlinear stochastic dynamical system with continuous state space, continuous action space and unknown dynamics. This class of problems are typically addressed in stochastic adaptive control and reinforcement learning literature using model-based and model-free approaches respectively. Both methods rely on solving a dynamic programming problem, either directly or indirectly, for finding the optimal closed loop control policy. The inherent `curse of dimensionality' associated with dynamic programming method makes these approaches also computationally difficult. This paper proposes a novel decoupled data-based control (D2C) algorithm that addresses this problem using a decoupled, `open loop - closed loop', approach. First, an open-loop deterministic trajectory optimization problem is solved using a black-box simulation model of the dynamical system. Then, a closed loop control is developed around this open loop trajectory by linearization of the dynamics about this nominal trajectory. By virtue of linearization, a linear quadratic regulator based algorithm can be used for this closed loop control. We show that the performance of D2C algorithm is approximately optimal. Moreover, simulation performance suggests significant reduction in training time compared to other state of the art algorithms.

1904.07479 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Robust nonlinear control of close formation flight

紧密编队飞行的鲁棒非线性控制

Qingrui Zhang, Hugh H. T. Liu

发表机构 * University of Toronto(多伦多大学)

AI总结 本文研究了在动态飞行操作中,为跟随飞行器在受领航机生成的尾涡影响下实现精确位置控制的问题,提出了一种鲁棒非线性编队控制算法,通过基线控制律和扰动观测器实现精确编队跟踪控制,通过数值模拟验证了设计的有效性。

Comments 33 pages, 20 figures

详情
AI中文摘要

本文研究了紧密编队飞行的鲁棒非线性控制问题。其目标是在动态飞行操作中,为跟随飞行器在受领航机生成的尾涡影响下实现精确位置控制。一个关键问题是如何确保在存在不确定性和扰动的情况下,位置误差保持有界。本文提出了一种鲁棒非线性编队控制算法,以实现精确的紧密编队跟踪控制。所提出的控制算法由基线控制律和扰动观测器组成。基线控制律用于稳定紧密编队飞行的非线性动力学,而扰动观测器用于补偿系统不确定性和编队相关的空气动力学扰动。通过所提出的设计,可以保证位置控制性能在期望的有界范围内,从而为在紧密编队中使用该设计获得足够的阻力减少。所提出设计的有效性通过两架飞行器的紧密编队飞行数值模拟得以验证。

英文摘要

This paper investigates the robust nonlinear close formation control problem. It aims to achieve precise position control at dynamic flight operation for a follower aircraft under the aerodynamic impact due to the trailing vortices generated by a leader aircraft. One crucial concern is the control robustness that ensures the boundedness of position error subject to uncertainties and disturbances to be regulated with accuracy. This paper develops a robust nonlinear formation control algorithm to fulfill precise close formation tracking control. The proposed control algorithm consists of baseline control laws and disturbance observers. The baseline control laws are employed to stabilize the nonlinear dynamics of close formation flight, while the disturbance observers are introduced to compensate system uncertainties and formation-related aerodynamic disturbances. The position control performance can be guaranteed within the desired boundedness to harvest enough drag reduction for a follower aircraft in close formation using the proposed design. The efficacy of the proposed design is demonstrated via numerical simulations of close formation flight of two aircraft.

1904.06892 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control

学习引导:基于深度元学习和模型预测路径积分控制的引导法

Chen Liang, Weihong Wang, Zhenghua Liu, Chao Lai, Benchun Zhou

发表机构 * School of Automation Science and Electrical Engineering, Beihang University(北京航空航天大学自动化科学与电气工程学院) Navigation and Control Technology Research Institute of China North Industries Group Corporation(中国北方工业集团有限公司导航与控制技术研究院)

AI总结 本文提出了一种基于模型驱动深度强化学习的新型引导方案,通过将深度神经网络作为引导动力学的预测模型融入模型预测路径积分(MPPI)控制框架中,利用元学习技术使深度神经动力学模型能够在线适应环境变化,从而缓解标准MPPI控制因实际环境与训练数据差异导致的性能下降,并构建了在存在作动器故障时拦截机动目标的新型引导律。

Comments Code available at https://github.com/tccliangchen/deep_meta-learning_guidance_law . in IEEE Access 2019

详情
AI中文摘要

在本文中,我们提出了一种基于模型驱动深度强化学习(RL)技术的新型引导方案。利用模型驱动深度RL方法,训练一个深度神经网络作为引导动力学的预测模型,并将其纳入模型预测路径积分(MPPI)控制框架中。然而,传统的MPPI框架假设实际环境与训练数据集相似,这在实践中由于目标机动、其他扰动和作动器故障等因素而难以实现。为了解决这个问题,我们的方法利用元学习技术,使深度神经动力学模型能够在线适应这些变化。通过这种方法,我们可以减轻标准MPPI控制因实际环境与训练数据差异导致的性能下降。然后,基于上述技术,构建了一种新型引导律,用于在存在作动器故障的情况下拦截具有期望终端撞击角的机动目标。不同情况下的仿真和实验结果表明,所提出的引导律在实现成功拦截机动目标方面具有有效性与鲁棒性。

英文摘要

In this paper, we present a novel guidance scheme based on model-based deep reinforcement learning (RL) technique. With model-based deep RL method, a deep neural network is trained as a predictive model of guidance dynamics which is incorporated into a model predictive path integral (MPPI) control framework. However the traditional MPPI framework assumes the actual environment similar to the training dataset for the deep neural network which is impractical in practice with different maneuvering of target, other perturbations and actuator failures. To address this problem, our method utilize meta-learning technique to make the deep neural dynamics model adapt to such changes online. With this approach we can alleviate the performance deterioration of standard MPPI control caused by the difference between actual environment and training data. Then, a novel guidance law for a varying velocity interceptor intercepting maneuvering target with desired terminal impact angle under actuator failure is constructed based on aforementioned techniques. Simulation and experiment results under different cases show the effectiveness and robustness of the proposed guidance law in achieving successful interceptions of maneuvering target.

1904.06680 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Online Sampling in the Parameter Space of a Neural Network for GPU-accelerated Motion Planning of Autonomous Vehicles

神经网络参数空间中的在线采样用于自动驾驶车辆的GPU加速运动规划

Mogens Graf Plessen

发表机构 * MPG

AI总结 本文提出了一种用于自动驾驶车辆GPU加速运动规划的神经网络参数空间在线采样方法,通过神经网络作为控制器参数化来处理非线性和非凸系统,并在预测时间范围内保持参数化不变,同时通过变化的特征向量确定转向和纵向加速度控制。

Comments 8 pages, 8 figures, 3 tables, conference paper

详情
AI中文摘要

本文提出了一种用于自动驾驶车辆GPU加速运动规划的神经网络参数空间在线采样方法。神经网络被用作控制器参数化,因为它们能够处理非线性和非凸系统,且其复杂性不随预测时间长度而增加。在网络参数化在每个采样时间点被采样后,在预测时间范围内保持不变。由于输入到网络的特征向量在预测时间范围内变化,因此控制仍然在预测时间范围内变化。全维车辆通过多面体建模。在障碍物点数据假设下,并在其预测时间范围内以恒定速度假设进行外推的情况下,碰撞避免减少为线性不等式检查。转向和纵向加速度控制同时确定。所提出的方法设计用于并行化,因此非常适合从GPU等硬件的持续进步中受益。所提出方法的特点在5个数值模拟实验中得到说明,包括动态障碍物避让、需要交替前进和倒车的路径点跟踪以及倒车停车场景。

英文摘要

This paper proposes online sampling in the parameter space of a neural network for GPU-accelerated motion planning of autonomous vehicles. Neural networks are used as controller parametrization since they can handle nonlinear non-convex systems and their complexity does not scale with prediction horizon length. Network parametrizations are sampled at each sampling time and then held constant throughout the prediction horizon. Controls still vary over the prediction horizon due to varying feature vectors fed to the network. Full-dimensional vehicles are modeled by polytopes. Under the assumption of obstacle point data, and their extrapolation over a prediction horizon under constant velocity assumption, collision avoidance reduces to linear inequality checks. Steering and longitudinal acceleration controls are determined simultaneously. The proposed method is designed for parallelization and therefore well-suited to benefit from continuing advancements in hardware such as GPUs. Characteristics of proposed method are illustrated in 5 numerical simulation experiments including dynamic obstacle avoidance, waypoint tracking requiring alternating forward and reverse driving with maximal steering, and a reverse parking scenario.

1904.06524 2026-06-04 cs.RO cs.SY eess.SY 版本更新

On Model Adaptation for Sensorimotor Control of Robots

关于机器人传感器运动控制的模型适应

David Navarro-Alarcon, Andrea Cherubini, Xiang Li

发表机构 * The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong(香港理工大学) University of Montpellier / LIRMM, Montpellier, France(蒙彼利埃大学 / LIRMM) The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong(香港中文大学)

AI总结 本文研究了如何计算用于引导具有不确定动作-感知关系的机器人系统运动的自适应传感器运动模型,通过两个案例研究展示了所提出的方法:变形物体的形状控制和未校准传感器下的超声探头软操控。

Comments 38th Chinese Control Conference

详情
AI中文摘要

在本文中,我们解决了计算可用于引导具有不确定动作-感知关系的机器人系统运动的自适应传感器运动模型的问题。首先提出了未校准的传感器基于控制问题的 formulations,然后推导并分析了各种构建自适应传感器运动模型的计算方法。所提出的方法通过两个案例研究进行了示例:(i) 未知属性的变形物体的形状控制,以及 (ii) 未校准传感器下的超声探头软操控。

英文摘要

In this article, we address the problem of computing adaptive sensorimotor models that can be used for guiding the motion of robotic systems with uncertain action-to-perception relations. The formulation of the uncalibrated sensor-based control problem is first presented, then, various computational methods for building adaptive sensorimotor models are derived and analysed. The proposed methodology is exemplified with two cases of study: (i) shape control of deformable objects with unknown properties, and (ii) soft manipulation of ultrasonic probes with uncalibrated sensors.

1904.05814 2026-06-04 cs.CV cs.GR cs.LG cs.NA cs.RO math.NA 版本更新

Probabilistic Permutation Synchronization using the Riemannian Structure of the Birkhoff Polytope

利用Birkhoff多面体的Riemannian结构的概率排列同步

Tolga Birdal, Umut Şimşekli

AI总结 本文提出了一种新的几何和概率方法,用于在多个对象或图像集合之间同步对应关系。核心方法包括基于Birkhoff-Riemannian L-BFGS优化放松后的循环一致性损失,以及基于Birkhoff-Riemannian Langevin Monte Carlo生成Birkhoff多面体样本并估计解的置信度。

Comments To appear as oral presentation at CVPR 2019. 20 pages including the supplementary material

详情
AI中文摘要

我们提出了一种全新的几何和概率方法,用于在多个对象或图像集合之间同步对应关系。具体而言,我们提出了两个算法:(1) Birkhoff-Riemannian L-BFGS用于以系统化的方式优化放松后的循环一致性损失的松弛版本;(2) Birkhoff-Riemannian Langevin Monte Carlo用于在Birkhoff多面体上生成样本并估计找到的解的置信度。为此,我们首先介绍了最近发展出的Birkhoff多面体的Riemannian几何。接着,我们引入了一种新的概率同步模型,形式为马尔可夫随机场(MRF)。最后,基于一阶retraction算子,我们将问题 formulation 为模拟随机微分方程,并设计了新的积分器。我们在合成和真实数据集上展示,我们能够以更快的收敛速度和可靠的置信度/不确定性估计获得高质量的多图匹配结果。

英文摘要

We present an entirely new geometric and probabilistic approach to synchronization of correspondences across multiple sets of objects or images. In particular, we present two algorithms: (1) Birkhoff-Riemannian L-BFGS for optimizing the relaxed version of the combinatorially intractable cycle consistency loss in a principled manner, (2) Birkhoff-Riemannian Langevin Monte Carlo for generating samples on the Birkhoff Polytope and estimating the confidence of the found solutions. To this end, we first introduce the very recently developed Riemannian geometry of the Birkhoff Polytope. Next, we introduce a new probabilistic synchronization model in the form of a Markov Random Field (MRF). Finally, based on the first order retraction operators, we formulate our problem as simulating a stochastic differential equation and devise new integrators. We show on both synthetic and real datasets that we achieve high quality multi-graph matching results with faster convergence and reliable confidence/uncertainty estimates.

1904.05423 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Game-Theoretic Modeling of Multi-Vehicle Interactions at Uncontrolled Intersections

多车辆在无信号交叉口交互的博弈建模

Nan Li, Yu Yao, Ilya Kolmanovsky, Ella Atkins, Anouck Girard

发表机构 * Robotics Institute, University of Michigan(密歇根大学机器人研究所)

AI总结 本文提出了一种基于博弈论的框架,用于建模自动驾驶和人工驾驶车辆在无信号交叉口的交互行为,通过参数化交叉口布局和几何结构,展示了模型在交通场景中的合理性和计算效率。

Comments 18 pages, 13 figures, 1 table

详情
AI中文摘要

受开发用于验证和验证自动驾驶系统在包含自动驾驶和人工驾驶车辆的交通中的仿真工具的需要所驱动,我们提出了一种用于建模无信号交叉口车辆交互的框架。所提出的交互建模方法基于博弈论,包含多个并发的领导者-追随者对,并考虑了常见的交通规则。我们参数化交叉口布局和几何结构以建模具有各种配置的无信号交叉口,并应用所提出的方法来建模这些交叉口处车辆的交互行为。基于各种交通场景的仿真结果,我们表明该模型表现出预期的交通行为,包括能够再现从真实世界交通数据中提取的场景以及在解决交通冲突方面的合理性能。该模型进一步基于服务水平交通质量评级系统进行验证,并展示了与传统多玩家博弈论模型相比的可管理计算复杂性。

英文摘要

Motivated by the need to develop simulation tools for verification and validation of autonomous driving systems operating in traffic consisting of both autonomous and human-driven vehicles, we propose a framework for modeling vehicle interactions at uncontrolled intersections. The proposed interaction modeling approach is based on game theory with multiple concurrent leader-follower pairs, and accounts for common traffic rules. We parameterize the intersection layouts and geometries to model uncontrolled intersections with various configurations, and apply the proposed approach to model the interactive behavior of vehicles at these intersections. Based on simulation results in various traffic scenarios, we show that the model exhibits reasonable behavior expected in traffic, including the capability of reproducing scenarios extracted from real-world traffic data and reasonable performance in resolving traffic conflicts. The model is further validated based on the level-of-service traffic quality rating system and demonstrates manageable computational complexity compared to traditional multi-player game-theoretic models.

1904.05271 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Indoor Testing and Simulation Platform for Close-distance Visual Inspection of Complex Structures using Micro Quadrotor UAV

复杂结构近距离视觉检测用微四旋翼无人机室内测试与仿真平台

Zhexiong Shang, Zhigang Shen

发表机构 * Durham School of Architectural Engineering & Construction, University of Nebraska-Lincoln(达勒姆建筑工程与建设学院,内布拉斯加大学林肯分校)

AI总结 本文提出了一种基于微四旋翼无人机的低成本实验平台,用于测试无人机路径规划算法,通过室内环境实现高效、安全的近距离视觉检测,验证了现有路径规划算法在实际应用中的有效性。

Comments 6 pages, 6 figures, accepted in ICCCBE 2018

详情
AI中文摘要

近年来,使用无人机(也称为无人驾驶飞行器,UAV)进行近距离视觉检测已成为多个学科中的活跃研究领域。然而,在实现自主检测之前,仍有许多挑战需要克服,尤其是在检测复杂结构时。复杂的民用结构,如桥梁、水坝和风力涡轮机,规模庞大且几何复杂。这要求使用复杂的路径规划算法来实现近距离检测,同时避免碰撞。在实践中,直接在这些结构上部署路径规划结果容易出错、成本高且充满危险。本文基于微四旋翼无人机,提出了一种经济实惠的实验平台,用于测试基于无人机的路径规划结果。该平台允许用户随时进行多种路径规划实验,而无需担心昂贵且耗时的户外飞行测试。该平台基于Crazyflie套件开发,包括Crazyflie 2.0四旋翼、Crazyradio和定位系统(LPS)。该平台配备 onboard 微型FPV相机,飞行过程中可实时流式传输视觉数据到主机计算机。该平台明确设计了手动配置和航点控制功能,以提高其在路径跟随和调试方面的灵活性和性能。为了评估所提出测试平台的实用性,测试了两种现有的基于无人机的路径规划算法。结果表明,尽管存在一定程度的误差,视觉数据的质量和路径跟随的准确性足以模拟大多数实际检测应用。

英文摘要

In recent years, using drone, also known as unmanned aerial vehicle (UAV), in close-distance visual inspection has became an active area in many disciplines. However, many challenges still remain before we can achieve autonomous inspection, especially when inspecting complex structures. The complex civil structures, such as bridges, dams and wind turbines, are large-scale and geometrical complicated. It requires sophisticated path planning algorithms to achieve close-distance inspection and, at the same time, avoid collisions. In practice, directly deploying the path planning result on such structures is error prone, costly, and full of hazards. In this paper, rely on micro quadrotor UAV, the authors present an affordable experimental platform for testing drone-based path planning result. The platform allows the users to conduct many path planning experiments at any time without worrying expensive and time consuming outdoor test flying. This platform is developed based on the bundle of Crazyflie, which includes Crazyflie 2.0 quadrotor, Crazyradio and loco positioning system (LPS). Equipped with an onboard micro FPV camera, the visual data can be lively streamed to the host computer during flight. The functions of manual configuration and waypoints control are explicitly designed in this platform to increase its flexibility and performance on path following and debugging. To evaluate the practicability of the proposed test platform, two existing drone-based path planning algorithms are tested. The results show that even though certain level of error existed, the quality of visual data and accuracy of path following are high enough for simulating most practical inspection applications.

1904.05072 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Differential Dynamic Programming for Multi-Phase Rigid Contact Dynamics

多相刚体接触动力学中的微分动态规划

Rohan Budhiraja, Justin Carpentier, Carlos Mastalli, Nicolas Mansard

发表机构 * CNRS, LAAS(法国国家科学研究中心,拉拉斯研究所) INRIA, France(法国国家信息与自动化研究所,法国)

AI总结 本文提出使用微分动态规划算法来优化多相刚体接触动力学的全身轨迹,通过利用角动量提高运动效率,减少力和冲击,并在无外力情况下实现姿态控制。

Comments 6 pages, IEEE RAS International Conference on Humanoid Robots

详情
AI中文摘要

当今生成高效运动的常见策略是将问题分解为两个连续步骤:第一步生成接触序列和质心轨迹,第二步计算遵循质心模式的全身轨迹。然而,第二步通常由简单的程序如逆运动学求解器处理。相反,我们提出使用局部最优控制求解器,即微分动态规划(DDP),来计算全身轨迹。我们的方法通过利用角动量产生更高效的运动,具有较低的力和较小的冲击。为此,我们提出了一种原始的DDP公式,利用刚体接触模型的Kuhn-Tucker约束。通过在真实HRP-2机器人上执行大步行走和无外力情况下的姿态控制问题,我们实验性地展示了这种方法的重要性。

英文摘要

A common strategy today to generate efficient locomotion movements is to split the problem into two consecutive steps: the first one generates the contact sequence together with the centroidal trajectory, while the second one computes the whole-body trajectory that follows the centroidal pattern. Yet the second step is generally handled by a simple program such as an inverse kinematics solver. In contrast, we propose to compute the whole-body trajectory by using a local optimal control solver, namely Differential Dynamic Programming (DDP). Our method produces more efficient motions, with lower forces and smaller impacts, by exploiting the Angular Momentum (AM). With this aim, we propose an original DDP formulation exploiting the Karush-Kuhn-Tucker constraint of the rigid contact model. We experimentally show the importance of this approach by executing large steps walking on the real HRP-2 robot, and by solving the problem of attitude control under the absence of external forces.

1904.04600 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Hierarchical Planning of Dynamic Movements without Scheduled Contact Sequences

无调度接触序列的动态运动分层规划

Carlos Mastalli, Ioannis Havoutis, Michele Focchi, Darwin G. Caldwell, Claudio Semini

发表机构 * Department of Advanced Robotics, Istituto Italiano di Tecnologia(意大利技术研究院先进机器人部) Robot Learning and Interaction Group, Idiap Research Institute(伊迪普研究所机器人学习与交互小组)

AI总结 本文提出了一种分层轨迹优化方法,用于规划无需调度接触序列的动态运动,通过计算能够实现无法通过刚体运动达到的目标的全身运动,首先根据机器人的质心动力学找到可行的质心运动,然后通过应用完整的动力学模型进行优化,利用可行的质心轨迹作为预热起点,通过互补约束描述接触模型,即环境几何和非滑动主动接触,两个优化阶段均作为互补约束数学规划问题(MPCC)进行求解,实验表明该规划方法在一系列具有挑战性的任务中表现出色。

Comments 6 pages, IEEE International Conference on Robotics and Automation (ICRA)

详情
AI中文摘要

大多数动物和人类完成复杂任务的运动行为涉及动态运动和丰富的接触交互。事实上,复杂操作需要同时考虑动态运动和接触事件。我们提出了一种分层轨迹优化方法,用于规划具有无调度接触序列的动态运动。我们计算出能够实现无法通过刚体运动达到的目标的全身运动。首先,我们根据机器人的质心动力学找到可行的质心运动。然后,我们通过应用机器人的完整动力学模型来优化解决方案,其中可行的质心轨迹用作预热起点。为了实现无调度的接触行为,我们使用互补约束来描述接触模型,即环境几何和非滑动主动接触。两个优化阶段均被提出为互补约束数学规划问题(MPCC)。实验测试展示了我们的规划方法在一系列具有挑战性的任务中的性能。

英文摘要

Most animal and human locomotion behaviors for solving complex tasks involve dynamic motions and rich contact interaction. In fact, complex maneuvers need to consider dynamic movement and contact events at the same time. We present a hierarchical trajectory optimization approach for planning dynamic movements with unscheduled contact sequences. We compute whole-body motions that achieve goals that cannot be reached in a kinematic fashion. First, we find a feasible CoM motion according to the centroidal dynamics of the robot. Then, we refine the solution by applying the robot's full-dynamics model, where the feasible CoM trajectory is used as a warm-start point. To accomplish the unscheduled contact behavior, we use complementarity constraints to describe the contact model, i.e. environment geometry and non-sliding active contacts. Both optimization phases are posed as Mathematical Program with Complementarity Constraints (MPCC). Experimental trials demonstrate the performance of our planning approach in a set of challenging tasks.

1904.04595 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Simultaneous Contact, Gait and Motion Planning for Robust Multi-Legged Locomotion via Mixed-Integer Convex Optimization

通过混合整数凸优化实现鲁棒多足运动的同步接触、步态和运动规划

Bernardo Aceituno-Cabezas, Carlos Mastalli, Hongkai Dai, Michele Focchi, Andreea Radulescu, Darwin G. Caldwell, Jose Cappelletto, Juan C. Grieco, Gerardo Fernandez-Lopez, Claudio Semini

发表机构 * School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332 USA(电气与计算机工程系,佐治亚理工学院,亚特兰大,GA 30332 USA) Twentieth Century Fox, Springfield, USA(二十世纪福克斯,斯普林菲尔德,USA) Starfleet Academy, San Francisco, CA 96678 USA(星际舰队学院,旧金山,CA 96678 USA) Tyrell Inc., 123 Replicant Street, Los Angeles, CA 90210 USA(泰勒尔公司,123 复制人街,洛杉矶,CA 90210 USA)

AI总结 本文提出了一种混合整数凸优化方法,用于同时规划多足机器人的接触位置、步态转换和运动,以提高运动的通用性并保持低计算时间。

Comments 8 pages, IEEE Robotics and Automation Letters

详情
AI中文摘要

传统多足运动规划方法将问题分为多个阶段,如接触搜索和轨迹生成。然而,同时考虑接触和运动对于生成复杂的全身行为至关重要。目前,将这些问题耦合在一起需要假设固定的步态序列和平坦地形条件,或者使用非凸优化,计算时间不可行。本文提出了一种混合整数凸公式,以高效的方式同时规划接触位置、步态转换和运动。与之前的工作不同,我们的方法不限于平坦地形或预设的步态序列。相反,我们纳入摩擦锥稳定性边际,近似机器人扭矩限制,并使用混合整数凸约束规划步态。我们通过在HyQ机器人上实验验证了我们的方法,穿越了不同具有挑战性的地形,其中非凸性和平坦地形假设可能导致次优或不稳定计划。我们的方法在保持低计算时间的同时提高了运动的通用性。

英文摘要

Traditional motion planning approaches for multi-legged locomotion divide the problem into several stages, such as contact search and trajectory generation. However, reasoning about contacts and motions simultaneously is crucial for the generation of complex whole-body behaviors. Currently, coupling theses problems has required either the assumption of a fixed gait sequence and flat terrain condition, or non-convex optimization with intractable computation time. In this paper, we propose a mixed-integer convex formulation to plan simultaneously contact locations, gait transitions and motion, in a computationally efficient fashion. In contrast to previous works, our approach is not limited to flat terrain nor to a pre-specified gait sequence. Instead, we incorporate the friction cone stability margin, approximate the robot's torque limits, and plan the gait using mixed-integer convex constraints. We experimentally validated our approach on the HyQ robot by traversing different challenging terrains, where non-convexity and flat terrain assumptions might lead to sub-optimal or unstable plans. Our method increases the motion generality while keeping a low computation time.

1904.03742 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Nonlinear Model Predictive Control for 3D Formation of Multirotor Micro Aerial Vehicles with Relative Sensing in Local Coordinates

多旋翼微型飞行器在局部坐标系中基于相对感知的三维编队非线性模型预测控制

I. Kagan Erunsal, Rodrigo Ventura, Alcherio Martinoli

发表机构 * Institute for Systems and Robotics (ISR), IST, Lisbon, Portugal(系统与机器人研究所(ISR),IST,里斯本,葡萄牙)

AI总结 本文提出了一种基于相对感知信息的多旋翼微型飞行器三维编队控制方法,采用集中式非线性模型预测控制策略,通过引入六自由度数学模型实现了对编队的鲁棒控制。

Comments 8 pages, 10 figures, IROS'2019 (submitted)

详情
AI中文摘要

复杂的任务如监视、建设、搜索和救援可以受益于多旋翼微型飞行器(MAVs)的机动性,以获得稳健、协作的系统行为和编队控制是这些复杂任务的重要组成部分。本文聚焦于利用仅有的相对传感信息实现多旋翼MAVs的三维编队控制。它提出了一种集中式非线性模型预测控制(NMPC)方法,在领导者-追随者方案中。介绍了一个现实的六自由度数学模型,并用于控制律。问题的制定基于NMPC和相对传感框架,相对于机器人的局部坐标系。这种制定使编队不依赖于全局或共同参考系的完整知识以及昂贵的全局定位传感器。通过考虑新的制定,提出了基于实时迭代(RTI)的最优控制问题(OCP)的解决方案。设计了一个广泛的场景来测试和验证该策略。结果评估表明,在模型不确定性和本地传感器噪声以及编队动态突然变化的情况下,取得了令人满意的鲁棒性能。

英文摘要

The complex tasks such as surveillance, construction, search and rescue can benefit of the maneuverability of multirotor Micro Aerial Vehicles (MAVs) to obtain robust, cooperative system behavior and formation control is a prominent component of the these complex tasks. This work focuses on the problem of three-dimensional formation control of multirotor MAVs by using exclusively relative sensory information. It proposes a centralized Nonlinear Model Predictive Control (NMPC) approach in a leader-follower scheme. A realistic six degrees of freedom mathematical model of a multirotor MAVs is introduced and leveraged in the control laws. The formulation of the problem is performed based on NMPC and relative sensing framework with respect to local coordinate frames of the robots. This type of formulation makes the formation independent of the full knowledge of global or common reference frames and the utilization of expensive global localization sensors. Real-time Iteration (RTI) based solution to optimal control problem (OCP) is proposed by taking the novel formulation into account. An extensive scenario is designed to test and validate the strategy. Evaluation of the results suggests that satisfactory robust performance is achieved and maintained under model uncertainty and noise in local sensors and even in cases where the dynamics of the formation suddenly changes.

1904.03665 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Learning to Control Highly Accelerated Ballistic Movements on Muscular Robots

学习控制高加速度的球形运动在肌肉机器人上

Dieter Büchler, Roberto Calandra, Jan Peters

发表机构 * Max Planck Institute for Intelligent Systems(智能系统马克斯·普朗克研究所) Facebook AI Research(脸书人工智能研究)

AI总结 本文研究了如何通过学习方法提高肌肉机器人在高速高加速度运动中的控制精度,提出了一种四自由度的机器人臂,利用气动人工肌肉实现高关节角加速度,并通过贝叶斯优化直接在硬件上调整控制参数,从而在快速轨迹上实现了优于以往的结果。

Comments 12 pages, preprint submitted to Journal of Robotics and Autonomous Systems

详情
AI中文摘要

高速和高加速度的运动本质上很难控制。在人形机器人臂上应用学习方法来控制此类运动可以提高控制的准确性,但可能会损害系统。学习方法的内在探索可能导致不稳定性和机器人在高速下达到关节极限。因此,具有安全探索高速和高加速度运动硬件的需求是必要的。为了解决这个问题,我们提出使用由气动人工肌肉(PAMs)驱动的机器人。在本文中,我们展示了一种四自由度的机器人臂,能够达到高达28000度/秒²的关节角加速度,同时通过拮抗驱动和空气压力范围限制避免危险的关节极限。利用这种机器人臂,我们能够通过贝叶斯优化直接在硬件上调整控制参数,而无需额外的安全考虑。在快速轨迹上的跟踪性能超过了以往在类似PAM驱动机器人上的结果。我们还展示了由于电缆弯曲最小、轻量级动力学和PAMs与链接之间的最小接触等精心设计考虑,系统能够使用PID控制器在慢速轨迹上良好控制。最后,我们提出了一种新的技术来控制拮抗肌肉对的协同收缩。实验结果表明,选择最佳的协同收缩水平对于达到更好的跟踪性能至关重要。通过使用PAM驱动机器人和学习,我们朝着未来能够实现更像人类运动的机器人发展迈出了小一步。

英文摘要

High-speed and high-acceleration movements are inherently hard to control. Applying learning to the control of such motions on anthropomorphic robot arms can improve the accuracy of the control but might damage the system. The inherent exploration of learning approaches can lead to instabilities and the robot reaching joint limits at high speeds. Having hardware that enables safe exploration of high-speed and high-acceleration movements is therefore desirable. To address this issue, we propose to use robots actuated by Pneumatic Artificial Muscles (PAMs). In this paper, we present a four degrees of freedom (DoFs) robot arm that reaches high joint angle accelerations of up to 28000 deg/s^2 while avoiding dangerous joint limits thanks to the antagonistic actuation and limits on the air pressure ranges. With this robot arm, we are able to tune control parameters using Bayesian optimization directly on the hardware without additional safety considerations. The achieved tracking performance on a fast trajectory exceeds previous results on comparable PAM-driven robots. We also show that our system can be controlled well on slow trajectories with PID controllers due to careful construction considerations such as minimal bending of cables, lightweight kinematics and minimal contact between PAMs and PAMs with the links. Finally, we propose a novel technique to control the the co-contraction of antagonistic muscle pairs. Experimental results illustrate that choosing the optimal co-contraction level is vital to reach better tracking performance. Through the use of PAM-driven robots and learning, we do a small step towards the future development of robots capable of more human-like motions.

1811.07049 2026-06-04 cs.RO cs.SY eess.SY 版本更新

RMPflow: A Computational Graph for Automatic Motion Policy Generation

RMPflow:一种用于自动运动策略生成的计算图

Ching-An Cheng, Mustafa Mukadam, Jan Issac, Stan Birchfield, Dieter Fox, Byron Boots, Nathan Ratliff

发表机构 * NVIDIA, Seattle Robotics Lab(NVIDIA西雅图机器人实验室) Georgia Institute of Technology, Robot Learning Lab(佐治亚理工学院机器人学习实验室) University of Washington, Robotics and State Estimation Lab(华盛顿大学机器人与状态估计实验室)

AI总结 本文提出了一种基于Riemannian Motion Policies几何一致变换的新型策略合成算法RMPflow,通过结合局部策略生成全局策略并利用稀疏结构提高计算效率,同时研究了其几何性质并验证了在高自由度 manipulation 系统中通过障碍物规划的简化效果。

Comments WAFR 2018

详情
AI中文摘要

我们开发了一种基于Riemannian Motion Policies(RMPs)几何一致变换的新型策略合成算法RMPflow。RMPs是一类用于参数化非欧几里得行为为动力系统在本质上非线性任务空间中的反应性运动策略。给定一组为个别任务设计的RMPs,RMPflow可以一致地将这些局部策略组合起来生成具有表达力的全局策略,同时利用稀疏结构提高计算效率。我们研究了RMPflow的几何性质,并提供了稳定性充分条件。最后,我们通过实验证明,考虑任务策略的几何特性可以简化传统上困难的问题,例如在高自由度 manipulation 系统中通过障碍物的规划。

英文摘要

We develop a novel policy synthesis algorithm, RMPflow, based on geometrically consistent transformations of Riemannian Motion Policies (RMPs). RMPs are a class of reactive motion policies designed to parameterize non-Euclidean behaviors as dynamical systems in intrinsically nonlinear task spaces. Given a set of RMPs designed for individual tasks, RMPflow can consistently combine these local policies to generate an expressive global policy, while simultaneously exploiting sparse structure for computational efficiency. We study the geometric properties of RMPflow and provide sufficient conditions for stability. Finally, we experimentally demonstrate that accounting for the geometry of task policies can simplify classically difficult problems, such as planning through clutter on high-DOF manipulation systems.

1809.02472 2026-06-04 eess.SY cs.RO cs.SY 版本更新

An Analytical Design Optimization Method for Electric Propulsion Systems of Multicopter UAVs with Desired Hovering Endurance

多旋翼无人机电推进系统分析设计优化方法

Xunhua Dai, Quan Quan, Jinrui Ren, Kai-Yuan Cai

AI总结 本文提出了一种简单实用的方法,帮助设计师根据给定的设计要求找到最优的推进系统,通过研究四个基本组件的建模方法,将整体优化设计问题分解为多个子问题,从而获得各组件的最佳参数,并快速从相应数据库中确定最优产品。

详情
Journal ref
IEEE/ASME Transactions on Mechatronics, 2019, 24: 228-239
AI中文摘要

多旋翼无人机在民用和军事领域正变得越来越重要。目前,大多数多旋翼推进系统是通过经验和试错实验设计的,这成本高且效率低。本文提出了一种简单实用的方法,帮助设计师根据给定的设计要求找到最优的推进系统。首先,分别研究了推进系统四个基本组件(螺旋桨、电机、电子调速器和电池)的建模方法。其次,将整体优化设计问题简化并分解为几个子问题。通过解决这些子问题,可以分别获得各组件的最佳参数。最后,基于获得的最佳组件参数,可以快速从相应数据库中确定每个组件的最优产品。实验和统计分析证明了所提方法的有效性。

英文摘要

Multicopters are becoming increasingly important in both civil and military fields. Currently, most multicopter propulsion systems are designed by experience and trial-and-error experiments, which are costly and ineffective. This paper proposes a simple and practical method to help designers find the optimal propulsion system according to the given design requirements. First, the modeling methods for four basic components of the propulsion system including propellers, motors, electric speed controls, and batteries are studied respectively. Secondly, the whole optimization design problem is simplified and decoupled into several sub-problems. By solving these sub-problems, the optimal parameters of each component can be obtained respectively. Finally, based on the obtained optimal component parameters, the optimal product of each component can be quickly located and determined from the corresponding database. Experiments and statistical analyses demonstrate the effectiveness of the proposed method.

1904.02765 2026-06-04 cs.RO cs.LG cs.SY eess.SY math.OC 版本更新

Intent-Aware Probabilistic Trajectory Estimation for Collision Prediction with Uncertainty Quantification

意图感知的概率轨迹估计用于碰撞预测与不确定性量化

Andrew Patterson, Arun Lakshmanan, Naira Hovakimyan

发表机构 * Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校机械科学与工程系)

AI总结 本文提出了一种基于高斯过程的概率轨迹估计方法,用于在不确定环境中预测碰撞,通过概率方法替代确定性假设,以考虑更广泛的障碍物类型,并通过案例研究展示了在有限障碍物行为知识下预测碰撞的能力。

详情
AI中文摘要

在动态和未知的环境中,碰撞预测依赖于对环境变化的理解。许多碰撞预测方法依赖于对障碍物运动的确定性知识,但完全确定性的障碍物运动知识往往不可用。本文提出了一种基于高斯过程的预测方法,用概率知识替代对每个障碍物未来行为的确定性假设,以考虑更广泛的障碍物。该方法仅依赖位置和速度测量来预测与动态障碍物的碰撞。我们证明,障碍物位置的不确定性区域可以表示为通过高斯过程回归生成的多项式的组合。为了控制任意时间范围内不确定性的增长,假设概率障碍物意图作为障碍物位置和速度的分布,这可以自然地包含在高斯过程框架中。我们的方法在两个案例研究中得到验证:(i) 障碍物超越代理;(ii) 障碍物垂直穿过代理的路径。在这些模拟中,我们展示了即使在有限的障碍物行为知识下也能预测碰撞。

英文摘要

Collision prediction in a dynamic and unknown environment relies on knowledge of how the environment is changing. Many collision prediction methods rely on deterministic knowledge of how obstacles are moving in the environment. However, complete deterministic knowledge of the obstacles' motion is often unavailable. This work proposes a Gaussian process based prediction method that replaces the assumption of deterministic knowledge of each obstacle's future behavior with probabilistic knowledge, to allow a larger class of obstacles to be considered. The method solely relies on position and velocity measurements to predict collisions with dynamic obstacles. We show that the uncertainty region for obstacle positions can be expressed in terms of a combination of polynomials generated with Gaussian process regression. To control the growth of uncertainty over arbitrary time horizons, a probabilistic obstacle intention is assumed as a distribution over obstacle positions and velocities, which can be naturally included in the Gaussian process framework. Our approach is demonstrated in two case studies in which (i), an obstacle overtakes the agent and (ii), an obstacle crosses the agent's path perpendicularly. In these simulations we show that the collision can be predicted despite having limited knowledge of the obstacle's behavior.

1904.02341 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Online Risk-Bounded Motion Planning for Autonomous Vehicles in Dynamic Environments

在线风险受限的自主车辆动态环境中的运动规划

Xin Huang, Sungkweon Hong, Andreas Hofmann, Brian C. Williams

发表机构 * MIT Computer Science and Artificial Intelligence Laboratory(麻省理工学院计算机科学与人工智能实验室)

AI总结 本文提出了一种在线风险受限的运动规划方法,通过结合意图识别算法和POMDP求解器,生成安全高效的路径规划方案,尤其在无保护左转和变道等复杂环境中表现更优。

Comments Accepted at ICAPS'19. 10 pages, 6 figures, 1 table

详情
AI中文摘要

高效且稳健的自主车辆运动规划面临的关键挑战是理解周围代理的意图。忽略动态环境中其他代理的意图会导致风险或过于保守的规划。本文将运动规划问题建模为部分可观测马尔可夫决策过程(POMDP),并提出一个在线系统,结合意图识别算法和POMDP求解器,为自主车辆生成风险受限的路径规划。意图识别算法利用贝叶斯过滤和预学习的机动运动模型,预测每个代理车辆在有限时间 horizon 内的混合运动状态。我们实时更新POMDP模型,并使用启发式搜索算法求解,生成具有碰撞概率上界保证的策略。我们证明,与基线方法相比,我们的系统在多个具有挑战性的环境中,能够生成更高效和安全的运动规划。

英文摘要

A crucial challenge to efficient and robust motion planning for autonomous vehicles is understanding the intentions of the surrounding agents. Ignoring the intentions of the other agents in dynamic environments can lead to risky or over-conservative plans. In this work, we model the motion planning problem as a partially observable Markov decision process (POMDP) and propose an online system that combines an intent recognition algorithm and a POMDP solver to generate risk-bounded plans for the ego vehicle navigating with a number of dynamic agent vehicles. The intent recognition algorithm predicts the probabilistic hybrid motion states of each agent vehicle over a finite horizon using Bayesian filtering and a library of pre-learned maneuver motion models. We update the POMDP model with the intent recognition results in real time and solve it using a heuristic search algorithm which produces policies with upper-bound guarantees on the probability of near colliding with other dynamic agents. We demonstrate that our system is able to generate better motion plans in terms of efficiency and safety in a number of challenging environments including unprotected intersection left turns and lane changes as compared to the baseline methods.

1903.09749 2026-06-04 cs.RO cs.SY eess.SY math.DS math.OC 版本更新

Passivity guaranteed stiffness control with multiple frequency band specifications for a cable-driven series elastic actuator

具有多频率带规范的电缆驱动串联弹性执行器的被动保证刚度控制

Ningbo Yu, Wulin Zou, Yubo Sun

发表机构 * Tianjin Key Laboratory of Intelligent Robotics, Nankai University(天津智能机器人重点实验室,南开大学)

AI总结 本文针对电缆驱动串联弹性执行器的刚度控制问题,提出了一种基于H∞综合方法的改进方案,通过在特定频率带内满足被动性、执行器限制、扰动抑制和噪声抑制等约束条件,提升了刚度控制的精度和鲁棒性。

Comments 10 pages, already published in Mechanical Systems and Signal Processing

详情
AI中文摘要

阻抗控制和特别是刚度控制被广泛应用于物理人机交互。串联弹性执行器(SEA)提供了固有的柔顺性、安全性和进一步的好处。本文旨在改进电缆驱动SEA的刚度控制性能。现有的阻抗控制器是在全频域内设计的,尽管人机交互通常发生在低频范围内。我们通过制定的被动性、执行器限制、扰动抑制和特定频率范围内的噪声抑制约束条件来增强刚度渲染性能。首先,我们将多频率带优化问题重新公式化为H∞综合框架。然后,性能目标通过各自受限频域规范作为范数界限来定量描述。进一步,直接综合出一个结构化的控制器以满足所有竞争性性能要求。仿真和实验结果表明,所生成的控制器能够为每个期望的刚度(从0到1倍的物理弹簧常数)提供良好的交互性能。与基于被动性的PID方法相比,所提出的H∞综合方法在保证被动性的前提下实现了更精确和鲁棒的刚度控制性能。

英文摘要

Impedance control and specifically stiffness control are widely applied for physical human-robot interaction. The series elastic actuator (SEA) provides inherent compliance, safety and further benefits. This paper aims to improve the stiffness control performance of a cable-driven SEA. Existing impedance controllers were designed within the full frequency domain, though human-robot interaction commonly falls in the low frequency range. We enhance the stiffness rendering performance under formulated constraints of passivity, actuator limitation, disturbance attenuation, noise rejection at their specific frequency ranges. Firstly, we reformulate this multiple frequency-band optimization problem into the $H_\infty$ synthesis framework. Then, the performance goals are quantitatively characterized by respective restricted frequency-domain specifications as norm bounds. Further, a structured controller is directly synthesized to satisfy all the competing performance requirements. Both simulation and experimental results showed that the produced controller enabled good interaction performance for each desired stiffness varying from 0 to 1 times of the physical spring constant. Compared with the passivity-based PID method, the proposed $H_\infty$ synthesis method achieved more accurate and robust stiffness control performance with guaranteed passivity.

1904.00035 2026-06-04 cs.RO cs.LG cs.SY eess.SY stat.ML 版本更新

Autonomous Highway Driving using Deep Reinforcement Learning

使用深度强化学习实现自动驾驶高速公路驾驶

Subramanya Nageshrao, Eric Tseng, Dimitar Filev

发表机构 * Ford Greenfield Labs(福特绿谷实验室) Ford Research and Innovation Center(福特研究与创新中心)

AI总结 本文提出了一种基于强化学习的方法,通过与模拟交通直接交互,使自动驾驶车辆在复杂和多变的环境中做出决策,解决了传统规则和预设成本函数在实时优化中的不足,提高了学习效率和安全性。

详情
AI中文摘要

自动驾驶车辆的操作空间可以是多样的,并且可能显著变化。这可能导致设计阶段未预料到的场景。因此,基于规则的决策者选择动作可能并不理想。同样,设计一个先验成本函数然后在实时中求解最优控制问题可能也不够有效。为了应对这些问题并避免在遇到意外场景时出现异常行为,我们提出了一种基于强化学习(RL)的方法,其中自动驾驶车辆通过与模拟交通直接交互来学习决策。决策者由深度神经网络实现,根据给定的系统状态提供动作选择。在关键应用如驾驶中,没有明确安全概念的RL代理可能无法收敛,或者需要极大量的样本才能找到可靠的策略。为了更好地解决这个问题,本文将强化学习与额外的短时间安全检查(SC)相结合。在关键场景中,安全检查还将为代理提供替代的安全动作,如果存在的话。这导致了两个新的贡献。首先,它扩展了可能导致不良“接近事件”或“碰撞”的状态。其次,安全检查的加入可以提供一个安全且稳定的训练环境。这显著提高了学习效率,同时不抑制有意义的探索,以确保安全和最优的学习行为。我们展示了所开发算法在高速公路驾驶场景中的性能,其中训练好的自动驾驶车辆在高速公路环境下遇到不同交通密度的情况。

英文摘要

The operational space of an autonomous vehicle (AV) can be diverse and vary significantly. This may lead to a scenario that was not postulated in the design phase. Due to this, formulating a rule based decision maker for selecting maneuvers may not be ideal. Similarly, it may not be effective to design an a-priori cost function and then solve the optimal control problem in real-time. In order to address these issues and to avoid peculiar behaviors when encountering unforeseen scenario, we propose a reinforcement learning (RL) based method, where the ego car, i.e., an autonomous vehicle, learns to make decisions by directly interacting with simulated traffic. The decision maker for AV is implemented as a deep neural network providing an action choice for a given system state. In a critical application such as driving, an RL agent without explicit notion of safety may not converge or it may need extremely large number of samples before finding a reliable policy. To best address the issue, this paper incorporates reinforcement learning with an additional short horizon safety check (SC). In a critical scenario, the safety check will also provide an alternate safe action to the agent provided if it exists. This leads to two novel contributions. First, it generalizes the states that could lead to undesirable "near-misses" or "collisions ". Second, inclusion of safety check can provide a safe and stable training environment. This significantly enhances learning efficiency without inhibiting meaningful exploration to ensure safe and optimal learned behavior. We demonstrate the performance of the developed algorithm in highway driving scenario where the trained AV encounters varying traffic density in a highway setting.

1903.11204 2026-06-04 cs.RO cs.SY eess.SY math.OC 版本更新

Priority Maps for Surveillance and Intervention of Wildfires and other Spreading Processes

优先地图用于监视和干预野火及其他扩散过程

Vera L. J. Somers, Ian R. Manchester

发表机构 * Australian Centre for Field Robotics(澳大利亚田径场机器人研究中心) University of Sydney(悉尼大学)

AI总结 本文提出了一种生成优先地图的方法,用于监视或干预动态扩散过程,如野火。该方法利用正系统性质,特别是价值函数的分离结构,提供可扩展的算法。通过16和1000节点示例展示了方法如何响应系统动态变化,并结合旅行商问题进行无人机路径规划。

Comments Accepted for ICRA 2019

详情
AI中文摘要

无人驾驶航空器(UAV)路径规划算法通常假设一个知识奖励函数或优先地图,指示最重要的区域。本文提出了一种方法,用于生成监视或干预动态扩散过程(如野火)的优先地图。所提出的优化框架利用正系统的性质,特别是价值函数(成本到目标)的分离结构,提供可扩展的监视和干预算法。我们展示了16和1000节点示例的结果,并说明了优先地图如何响应系统动态的变化。1000节点的更大示例代表一个虚构景观,展示了该方法如何整合野火扩散动态、景观和风条件。最后,我们给出将所提方法与旅行商问题结合用于野火干预的无人机路径规划示例。

英文摘要

Unmanned Aerial Vehicle (UAV) path planning algorithms often assume a knowledge reward function or priority map, indicating the most important areas to visit. In this paper we propose a method to create priority maps for monitoring or intervention of dynamic spreading processes such as wildfires. The presented optimization framework utilizes the properties of positive systems, in particular the separable structure of value (cost-to-go) functions, to provide scalable algorithms for surveillance and intervention. We present results obtained for a 16 and 1000 node example and convey how the priority map responds to changes in the dynamics of the system. The larger example of 1000 nodes, representing a fictional landscape, shows how the method can integrate bushfire spreading dynamics, landscape and wind conditions. Finally, we give an example of combining the proposed method with a travelling salesman problem for UAV path planning for wildfire intervention.

1903.09890 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Physics-Based Freely Scalable Continuum Deformation for UAS Traffic Coordination

基于物理的自由可扩展连续变形用于无人机交通协调

Hossein Rastgoftar, Ella Atkins

AI总结 本文提出了一种新的基于物理的交通协调方法,并将其应用于无人机(UAS)交通管理。该方法扩展了之前应用于高速公路和城市街道1-D交通流的基于物理的方法,以支持更高维度空域中的交通协调模型,特别是在没有预定义路径的情况下。论文将空域视为有限控制体积,而无人机协调(视为连续变形)在空域边界处进行控制。通过将空域划分为计划和非计划空间,论文将计划空域中的名义协调建模为带有时空参数的偏微分方程的解。此外,本文改进了对车辆故障的韧性,通过一种鲁棒边界控制算法来更新计划空间的几何形状,当无人机问题威胁到现有可航行空域通道的安全协调时。为了支持微观层面的无人机协调,我们提出根据车辆性能限制对车辆进行聚类。无人机集群,每个无人机视为虚拟刚体的粒子,使用领导者-追随者包容策略来获取宏观期望轨迹。

Comments 11 pages; 9 figures; submitted for publication

详情
AI中文摘要

本文开发了一种新颖的基于物理的交通协调方法,并将其应用于无人机(UAS)交通管理。我们扩展了之前应用于高速公路和城市街道1-D交通流的基于物理的方法,以支持更高维度空域中的交通协调模型,特别是在没有预定义路径的情况下。论文将空域视为有限控制体积,而无人机协调(视为连续变形)在空域边界处进行控制。通过将空域划分为计划和非计划空间,论文将计划空域中的名义协调建模为带有时空参数的偏微分方程的解。本文还改进了对车辆故障的韧性,通过一种鲁棒边界控制算法来更新计划空间的几何形状,当无人机问题威胁到现有可航行空域通道的安全协调时。为了支持微观层面的无人机协调,我们提出根据车辆性能限制对车辆进行聚类。无人机集群,每个无人机视为虚拟刚体的粒子,使用领导者-追随者包容策略来获取宏观期望轨迹。

英文摘要

This paper develops a novel physics-inspired traffic coordination approach and applies it to Unmanned Aircraft System (UAS) traffic management. We extend available physics-inspired approaches previously applied to 1-D traffic flow on highways and urban streets to support models of traffic coordination in higher dimension airspace for cases where no predefined paths exist. The paper considers airspace as a finite control volume while UAS coordination, treated as continuum deformation, is controlled at the airspace boundaries. By partitioning airspace into planned and unplanned spaces, the paper models nominal coordination in the planned airspace as the solution of a partial differential equation with spatiotemporal parameters. This paper also improves resilience to vehicle failures with a resilient boundary control algorithm to update the geometry of the planned space when UAS problems threaten safe coordination in existing navigable airspace channels. To support UAS coordination at the microscopic level, we propose clustering vehicles based on vehicle performance limits. UAS clusters, with each UAS treated as a particle of a virtual rigid body, use leader-follower containment to acquire the macroscopic desired trajectory.

1903.09748 2026-06-04 cs.RO cs.SY eess.SY math.DS math.OC 版本更新

Impedance control of a cable-driven SEA with mixed $H_2/H_\infty$ synthesis

电缆驱动串联弹性执行器的阻抗控制:混合H2/H∞综合方法

Ningbo Yu, Wulin Zou

发表机构 * Institute of Robotics and Automatic Information Systems, Nankai University(机器人与自动信息系统研究所,南开大学) Tianjin Key Laboratory of Intelligent Robotics, Nankai University(天津智能机器人重点实验室,南开大学)

AI总结 本文提出了一种基于混合H2/H∞综合和放松被动性的电缆驱动串联弹性执行器的阻抗控制方法,用于物理人机交互。

Comments 11 pages, already published in Assembly Automation

详情
Journal ref
Assembly Automation, Vol. 37, Issue: 3, pp.296-303, 2017
AI中文摘要

目的:本文提出了一种混合H2/H∞综合和放松被动性的阻抗控制方法,用于电缆驱动串联弹性执行器,以应用于物理人机交互。设计/方法/研究途径:为了使系统的阻抗匹配所需的动态模型,将阻抗控制问题重新公式化为阻抗匹配结构。所需的竞争性能要求以及来自物理系统的约束可以通过针对各自信号的加权函数来表征。考虑到人类运动的频率特性,被动约束对于稳定的人机交互,其在整个频谱上要求,可能会带来保守的解决方案,已被放松成仅限制低频带。因此,阻抗控制成为混合H2/H∞综合问题,并可以得到动态输出反馈控制器。发现:所提出的阻抗控制策略已针对各种期望的阻抗进行了测试,包括在电缆驱动串联弹性执行器平台上进行的仿真和实验。实际的交互扭矩在期望的范数范围内良好跟踪了期望的扭矩,且控制输入被调节在电机速度限制以下。闭环系统可以在低频上保证放松的被动性。仿真和实验结果都验证了所提出方法的可行性和有效性。原创性/价值:这种基于混合H2/H∞综合和放松被动性的阻抗控制策略提供了一种新颖、有效且更少保守的方法用于物理人机交互控制。

英文摘要

Purpose: This paper presents an impedance control method with mixed $H_2/H_\infty$ synthesis and relaxed passivity for a cable-driven series elastic actuator to be applied for physical human-robot interaction. Design/methodology/approach: To shape the system's impedance to match a desired dynamic model, the impedance control problem was reformulated into an impedance matching structure. The desired competing performance requirements as well as constraints from the physical system can be characterized with weighting functions for respective signals. Considering the frequency properties of human movements, the passivity constraint for stable human-robot interaction, which is required on the entire frequency spectrum and may bring conservative solutions, has been relaxed in such a way that it only restrains the low frequency band. Thus, impedance control became a mixed $H_2/H_\infty$ synthesis problem, and a dynamic output feedback controller can be obtained. Findings: The proposed impedance control strategy has been tested for various desired impedance with both simulation and experiments on the cable-driven series elastic actuator platform. The actual interaction torque tracked well the desired torque within the desired norm bounds, and the control input was regulated below the motor velocity limit. The closed loop system can guarantee relaxed passivity at low frequency. Both simulation and experimental results have validated the feasibility and efficacy of the proposed method. Originality/value: This impedance control strategy with mixed $H_2/H_\infty$ synthesis and relaxed passivity provides a novel, effective and less conservative method for physical human-robot interaction control.

1903.09673 2026-06-04 cs.RO cs.SY eess.SY math.DS math.OC 版本更新

Compliance Shaping for Control of Strength Amplification Exoskeletons with Elastic Cuffs

合规性塑形用于具有弹性围裙的强度放大外骨骼控制

Gray Cortright Thomas, Jeremiah M. Coholich, Luis Sentis

发表机构 * Human Centered Robotics Lab in the University of Texas at Austin(德克萨斯大学奥斯汀分校人本机器人实验室)

AI总结 本文提出了一种双合规性塑形方法,通过在力敏感围裙中串联弹簧来设计外骨骼的合规行为,以实现高放大比下的稳定性和鲁棒性,同时引入反馈控制器和增益调节方法,并通过单自由度肘部外骨骼验证了方法的有效性。

Comments 8 pages, 9 figures, conference

详情
AI中文摘要

能够放大操作者力量的外骨骼可以实现对未知物体的重载操作。然而,这种行为难以实现,因为外骨骼需要感知并放大操作者交互力的同时保持稳定。然而,放大与连接到操作者时的鲁棒稳定性目标本质上是冲突的。为此,我们引入了一种设计,在力敏感围裙中串联一个弹簧。这允许我们设计出名义上被动的外骨骼合规行为,即使具有高放大比。实际上,时间延迟和离散时间滤波器阻止我们的策略实际上实现被动性,但设计的合规性仍使外骨骼对弹簧状人类行为更具鲁棒性。我们的外骨骼由串联弹性执行器(SEA)驱动,这向系统引入了另一个弹簧。我们表明,为外骨骼塑形围裙的合规性可以近似转化为对SEA弹簧合规性的塑形问题。因此,我们引入了一种反馈控制器和增益调节方法,利用现有的SEAs合规性塑形技术。我们称之为“双合规性塑形”方法。在大放大比下,此控制器倾向于放大非线性传动摩擦效应,因此我们还提出了“传动扰动观测器”以缓解这一缺点。我们的方法在单自由度肘部外骨骼上进行了验证。

英文摘要

Exoskeletons which amplify the strength of their operators can enable heavy-duty manipulation of unknown objects. However, this type of behavior is difficult to accomplish; it requires the exoskeleton to sense and amplify the operator's interaction forces while remaining stable. But, the goals of amplification and robust stability when connected to the operator fundamentally conflict. As a solution, we introduce a design with a spring in series with the force sensitive cuff. This allows us to design an exoskeleton compliance behavior which is nominally passive, even with high amplification ratios. In practice, time delay and discrete time filters prevent our strategy from actually achieving passivity, but the designed compliance still makes the exoskeleton more robust to spring-like human behaviors. Our exoskeleton is actuated by a series elastic actuator (SEA), which introduces another spring into the system. We show that shaping the cuff compliance for the exoskeleton can be made into approximately the same problem as shaping the spring compliance of an SEA. We therefore introduce a feedback controller and gain tuning method which takes advantage of an existing compliance shaping technique for SEAs. We call our strategy the "double compliance shaping" method. With large amplification ratios, this controller tends to amplify nonlinear transmission friction effects, so we additionally propose a "transmission disturbance observer" to mitigate this drawback. Our methods are validated on a single-degree-of-freedom elbow exoskeleton.

1903.08781 2026-06-04 cs.AR cs.DC cs.RO cs.SY eess.SY 版本更新

Fault-Tolerant Nanosatellite Computing on a Budget

在预算内实现容错的纳卫星计算

Christian M. Fuchs, Nadia Murillo, Aske Plaat, Erik Van der Kouwe, Daniel Harsono, Todor Stefanov

发表机构 * Leiden Institute of Advanced Computer Science(莱顿先进计算机科学研究所) Leiden Observatory(莱顿天文台) Leiden University(莱顿大学) European Space Agency(欧洲航天局) Netherlands Research School for Astronomy(荷兰天文研究学校) Royal Netherlands Academy of Arts and Sciences(荷兰皇家艺术与科学学院)

AI总结 本文提出了一种基于线程级粗粒度锁步的软件容错方法,通过故障注入验证,利用FPGA实现 tiled MPSoC 架构,以满足未来科学和商业航天任务的高性能需求,同时提供强故障覆盖保障。

详情
Journal ref
Conference on Radiation Effects on Components and Systems 2018 (RADECS)
AI中文摘要

微卫星和纳卫星已成为各种商业和科学应用的流行平台,但目前主要适用于短时和低优先级空间任务,由于其可靠性较低。部分原因在于其依赖于便宜、低功能尺寸的COTS组件,这些组件最初为嵌入式和移动市场设计,传统硬件投票概念对此无效。软件容错概念已被证明对这类系统有效,但因成熟度低而被航天行业忽视,因为大多数仅在理论中研究。在实践中,载荷仪器和微型卫星的设计者通常被迫牺牲可靠性以满足尖端科学和创新商业应用所需的性能水平。因此,我们开发了一种基于线程级粗粒度锁步的软件容错方法,通过故障注入验证。为了提供强长期故障覆盖,我们的架构实现为FPGA上的tiled MPSoC,利用部分重新配置以及混合关键性。该架构可以在极低成本下满足当前和未来科学和商业航天任务的高性能需求,同时为长期任务提供必要的强故障覆盖保障。该架构是为一个为期4年的ESA项目开发的。与两家工业合作伙伴一起,我们正在开发原型,然后进行辐射测试。

英文摘要

Micro- and nanosatellites have become popular platforms for a variety of commercial and scientific applications, but today are considered suitable mainly for short and low-priority space missions due to their low reliability. In part, this can be attributed to their reliance upon cheap, low-feature size, COTS components originally designed for embedded and mobile-market applications, for which traditional hardware-voting concepts are ineffective. Software-fault-tolerance concepts have been shown effective for such systems, but have largely been ignored by the space industry due to low maturity, as most have only been researched in theory. In practice, designers of payload instruments and miniaturized satellites are usually forced to sacrifice reliability in favor deliver the level of performance necessary for cutting-edge science and innovative commercial applications. Thus, we developed a software-fault-tolerance-approach based upon thread-level coarse-grain lockstep, which was validated using fault-injection. To offer strong long-term fault coverage, our architecture is implemented as tiled MPSoC on an FPGA, utilizing partial reconfiguration, as well as mixed criticality. This architecture can satisfy the high performance requirements of current and future scientific and commercial space missions at very low cost, while offering the strong fault-coverage guarantees necessary for platform control even for missions with a long duration. This architecture was developed for a 4-year ESA project. Together with two industrial partners, we are developing a prototype to then undergo radiation testing.

1903.07353 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Joint axis estimation for fast and slow movements using weighted gyroscope and acceleration constraints

利用加权陀螺仪和加速度约束的快速和慢速运动联合轴估计

Fredrik Olsson, Thomas Seel, Dustin Lehmann, Kjartan Halvorsen

发表机构 * 1 4 Department of Information Technology, Uppsala University, Uppsala, Sweden 2 3 Department of Electrical Engineering 4 Department of Mecatronics, Tecnol \'o gico de Monterrey, Mexico City, Mexico

AI总结 本文提出了一种结合加速度和陀螺仪约束的新型方法,用于准确估计关节轴,通过调整残差权重和非线性加速度范数差,实现快速和慢速运动中的高精度估计。

Comments 8 pages, 4 figures, 1 table

详情
AI中文摘要

传感器到肢体的校准是惯性运动跟踪中的关键步骤。当两个肢体通过铰链关节连接时,例如在人体膝关节和手指关节以及许多机械臂中,必须在内在传感器坐标系中确定关节轴向量。存在通过求解基于运动关节约束的优化问题来确定这些坐标的办法,这些约束涉及测量的加速度或角速度。在本文中,我们证明仅使用其中一个约束会导致在快速或慢速运动中估计不准确。我们提出了一种基于结合两种约束的成本函数的新方法。通过仅使用加速度计和陀螺仪读数,避免了对均匀磁场的严格假设。为了结合两种传感器的优点,残差权重根据估计信号方差和非线性加速度范数差自动调整。该方法使用来自上肢外骨骼九种不同运动的真实数据进行评估。结果表明,与以往方法不同,所提出的方法在仅五秒后即可对所有快速和慢速运动实现准确的关节轴估计。

英文摘要

Sensor-to-segment calibration is a crucial step in inertial motion tracking. When two segments are connected by a hinge joint, for example in human knee and finger joints as well as in many robotic limbs, then the joint axis vector must be identified in the intrinsic sensor coordinate systems. There exist methods that identify these coordinates by solving an optimization problem that is based on kinematic joint constraints, which involve either the measured accelerations or the measured angular rates. In the current paper we demonstrate that using only one of these constraints leads to inaccurate estimates at either fast or slow motions. We propose a novel method based on a cost function that combines both constraints. The restrictive assumption of a homogeneous magnetic field is avoided by using only accelerometer and gyroscope readings. To combine the advantages of both sensor types, the residual weights are adjusted automatically based on the estimated signal variances and a nonlinear weighting of the acceleration norm difference. The method is evaluated using real data from nine different motions of an upper limb exoskeleton. Results show that, unlike previous approaches, the proposed method yields accurate joint axis estimation after only five seconds for all fast and slow motions.

1807.06614 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Rapid Trajectory Optimization Using C-FROST with Illustration on a Cassie-Series Dynamic Walking Biped

利用C-FROST实现快速轨迹优化及在Cassie系列动态步行双足机器人上的示例

Ayonga Hereid, Omar Harib, Ross Hartley, Yukai Gong, Jessy W. Grizzle

发表机构 * College of Engineering and the Robotics Institute, University of Michigan(密歇根大学工程学院和机器人研究所)

AI总结 本文提出了一种无需去除或合并自由度的方法,通过C-FROST和多线程技术快速确定人类oids的解决方案,并在20自由度的Cassie系列双足机器人浮基模型上进行数值计算和物理实验验证

详情
AI中文摘要

低维模型在步态设计中的一个主要吸引力是能够快速计算解决方案,而其一个缺点是将解决方案映射回目标机器人存在困难。本文提出了一套工具,用于快速确定“人类oids”的解决方案而无需去除或合并自由度。主要工具包括(1)C-FROST,一个开源的C++接口,用于FROST,一种直接配分优化工具;和(2)多线程。结果将在20自由度的Cassie系列双足机器人浮基模型上通过数值计算和物理实验进行示例展示。

英文摘要

One of the big attractions of low-dimensional models for gait design has been the ability to compute solutions rapidly, whereas one of their drawbacks has been the difficulty in mapping the solutions back to the target robot. This paper presents a set of tools for rapidly determining solutions for ``humanoids'' without removing or lumping degrees of freedom. The main tools are (1) C-FROST, an open-source C++ interface for FROST, a direct collocation optimization tool; and (2) multi-threading. The results will be illustrated on a 20-DoF floating-base model for a Cassie-series bipedal robot through numerical calculations and physical experiments.

1903.04706 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Control Barrier Functions for Systems with High Relative Degree

用于高相对次数系统的控制屏障函数

Wei Xiao, Calin Belta

发表机构 * Boston University(波士顿大学)

AI总结 本文扩展了控制屏障函数(CBFs)到高阶控制屏障函数(HOCBFs),用于处理高相对次数约束。提出HOCBFs比最近提出的(指数)HOCBFs更通用。我们介绍了高阶屏障函数(HOBF),并展示了其满足Lyapunov-like条件会使得一系列集合的交集向前不变。然后引入HOCBF,并展示任何满足HOCBF约束的控制输入会使一系列集合的交集向前不变。我们提出了具有HOCBF和控制Lyapunov函数(CLF)约束的优化控制问题,并分析了在定义HOCBF时所用的$\mathcal{K}$类函数的选择对可行控制区域大小的影响。我们还提供了一种有前途的方法来解决HOCBF约束与控制限制之间的冲突,通过惩罚$\mathcal{K}$类函数。我们通过自适应巡航控制问题展示了所提出的方法。

Comments 9 pages, 7 figures, submitted to CDC19

详情
AI中文摘要

本文扩展了控制屏障函数(CBFs)到高阶控制屏障函数(HOCBFs),用于处理高相对次数约束。所提出的HOCBFs比最近提出的(指数)HOCBFs更通用。我们介绍了高阶屏障函数(HOBF),并展示了其满足Lyapunov-like条件会使得一系列集合的交集向前不变。然后引入HOCBF,并展示任何满足HOCBF约束的控制输入会使一系列集合的交集向前不变。我们提出了具有HOCBF和控制Lyapunov函数(CLF)约束的优化控制问题,并分析了在定义HOCBF时所用的$\mathcal{K}$类函数的选择对可行控制区域大小的影响。我们还提供了一种有前途的方法来解决HOCBF约束与控制限制之间的冲突,通过惩罚$\mathcal{K}$类函数。我们通过自适应巡航控制问题展示了所提出的方法。

英文摘要

This paper extends control barrier functions (CBFs) to high order control barrier functions (HOCBFs) that can be used for high relative degree constraints. The proposed HOCBFs are more general than recently proposed (exponential) HOCBFs. We introduce high order barrier functions (HOBF), and show that their satisfaction of Lyapunov-like conditions implies the forward invariance of the intersection of a series of sets. We then introduce HOCBF, and show that any control input that satisfies the HOCBF constraints renders the intersection of a series of sets forward invariant. We formulate optimal control problems with constraints given by HOCBF and control Lyapunov functions (CLF) and analyze the influence of the choice of the class $\mathcal{K}$ functions used in the definition of the HOCBF on the size of the feasible control region. We also provide a promising method to address the conflict between HOCBF constraints and control limitations by penalizing the class $\mathcal{K}$ functions. We illustrate the proposed method on an adaptive cruise control problem.

1903.05355 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

A Framework for On-line Learning of Underwater Vehicles Dynamic Models

在线学习水下机器人动态模型的框架

Bilal Wehbe, Marc Hildebrandt, Frank Kirchner

发表机构 * DFKI - Robotic Innovation Center(DFKI机器人创新中心)

AI总结 本文提出了一种在线学习水下机器人动态模型的框架,通过增量支持向量回归方法从数据流中逐步学习模型,并结合增量学习策略来改进模型在整体状态空间上的泛化能力。

Comments 8 pages, 6 figures, ICRA 2019 authors preprint

详情
AI中文摘要

从数据中学习机器人的动力学有助于实现更精确的跟踪控制器,或帮助其导航算法。然而,当由于外部条件变化导致机器人实际动力学变化时,需要在线调整其模型以保持高性能。本文提出了一种在线学习机器人动力学的框架,以适应此类变化。所提出的框架采用增量支持向量回归方法,从数据流中逐步学习模型。结合增量学习,开发了包括和遗忘数据的策略,以在整体状态空间上获得更好的泛化能力。该框架在仿真和真实实验场景中进行了测试,展示了其适应机器人动力学变化的能力。

英文摘要

Learning the dynamics of robots from data can help achieve more accurate tracking controllers, or aid their navigation algorithms. However, when the actual dynamics of the robots change due to external conditions, on-line adaptation of their models is required to maintain high fidelity performance. In this work, a framework for on-line learning of robot dynamics is developed to adapt to such changes. The proposed framework employs an incremental support vector regression method to learn the model sequentially from data streams. In combination with the incremental learning, strategies for including and forgetting data are developed to obtain better generalization over the whole state space. The framework is tested in simulation and real experimental scenarios demonstrating its adaptation capabilities to changes in the robot's dynamics.

1903.03318 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Development of an Autonomous Sanding Robot with Structured-Light Technology

基于结构光技术的自主打磨机器人开发

Yingxin Huo, Diancheng Chen, Xiang Li, Peng Li, Yun-Hui Liu

发表机构 * CUHK T Stone Robotics Institute(CUHK T Stone机器人研究所) Innovation and Technology Commission of Hong Kong(香港创新及科技委员会) Harbin Institute of Technology(哈尔滨工业大学)

AI总结 本文提出了一种能够自主完成未知物体打磨工作的机器人,通过结构光相机扫描建模、优化运动规划和阻抗模型控制,实现了无需人工干预的自主打磨。

Comments 7 pages, 11 figures, IEEE/RSJ International Conference on Intelligent Robots and Systems 2019

详情
AI中文摘要

机器人和自动化的大需求在打磨工作中得到了体现,因为当前的手动操作是劳动密集型的,缺乏一致的质量,且存在安全和健康问题。虽然已经开发出几种自动化打磨工作的机器,但现有解决方案的自主能力相对较低,仍需要大量的人工协助或监督来校准目标物体或规划机器人运动和任务。本文提出了一种自主打磨机器人,能够自动对未知物体进行打磨,无需任何先前的校准或人工干预。该机器人工作流程如下:首先,使用结构光相机扫描并建模目标物体。其次,规划机器人运动以覆盖物体的所有表面,采用优化的过渡序列。第三,控制机器人在期望的阻抗模型下对物体进行打磨。制造了一个打磨机器人的原型,并在打磨一批木箱的任务中验证了其性能。凭借足够的自由度(DOFs)和末端执行器的模块化设计,该机器人能够为许多其他不同物体的自主打磨提供通用解决方案。

英文摘要

Large demand for robotics and automation has been reflected in the sanding works, as current manual operations are labor-intensive, without consistent quality, and also subject to safety and health issues. While several machines have been developed to automate one or two steps in the sanding works, the autonomous capability of existing solutions is relatively low, and the human assistance or supervision is still heavily required in the calibration of target objects or the planning of robot motion and tasks. This paper presents the development of an autonomous sanding robot, which is able to perform the sanding works on an unknown object automatically, without any prior calibration or human intervention. The developed robot works as follows. First, the target object is scanned then modeled with the structured-light camera. Second, the robot motion is planned to cover all the surfaces of the object with an optimized transition sequence. Third, the robot is controlled to perform the sanding on the object under the desired impedance model. A prototype of the sanding robot is fabricated and its performance is validated in the task of sanding a batch of wooden boxes. With sufficient degrees of freedom (DOFs) and the module design for the end effector, the developed robot is able to provide a general solution to the autonomous sanding on many other different objects.

1811.07834 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Safely Probabilistically Complete Real-Time Planning and Exploration in Unknown Environments

在未知环境中安全的概率完备实时规划与探索

David Fridovich-Keil, Jaime F. Fisac, Claire J. Tomlin

发表机构 * UC-Philippine-California Advanced Research Institute(加州-菲律宾-加州高级研究机构) ONR MURI(国防高级研究计划局(ONR)MURI) SRC CONIX Center(SRC CONIX中心)

AI总结 本文提出了一种新的运动规划框架,该框架围绕现有的动力学规划器构建,在事先未知的静态环境中保证递归可行性。通过利用来自可达性分析的鲁棒控制器,该方法对整体安全性和碰撞避免做出了强保证。运动计划始终保持在初始状态的安全后向可达集内,同时安全地探索空间。这保证了初始状态的安全性,并确保在安全探索过程中最终能找到目标。

Comments 7 pages, accepted to ICRA 2019

详情
AI中文摘要

我们提出了一种新的运动规划框架,该框架围绕现有的动力学规划器构建,并在事先未知的静态环境中保证递归可行性。我们的方法通过利用来自可达性分析的鲁棒控制器,对整体安全性和碰撞避免做出了强保证。我们确保运动计划始终不离开初始状态的安全后向可达集,同时安全地探索空间。这保持了初始状态的安全性,并保证在安全探索过程中最终能够找到目标。我们将在机器人操作系统(ROS)软件环境中实现该框架,并在实时模拟中进行演示。

英文摘要

We present a new framework for motion planning that wraps around existing kinodynamic planners and guarantees recursive feasibility when operating in a priori unknown, static environments. Our approach makes strong guarantees about overall safety and collision avoidance by utilizing a robust controller derived from reachability analysis. We ensure that motion plans never exit the safe backward reachable set of the initial state, while safely exploring the space. This preserves the safety of the initial state, and guarantees that that we will eventually find the goal if it is possible to do so while exploring safely. We implement our framework in the Robot Operating System (ROS) software environment and demonstrate it in a real-time simulation.

1903.02219 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Training in Task Space to Speed Up and Guide Reinforcement Learning

在任务空间中训练以加速和引导强化学习

Guillaume Bellegarda, Katie Byl

AI总结 本文提出在任务空间中训练以提高强化学习的效率和稳定性,通过简化高自由度系统模型、利用正逆运动学以及在笛卡尔空间中学习运动策略,从而减少样本复杂度和训练时间。

详情
AI中文摘要

最近强化学习(RL)领域的突破在学习和部署真实世界机器人系统策略方面取得了显著进展。然而,即使使用当前最先进的算法和计算资源,这些算法仍然面临高样本复杂度的问题,导致训练时间长,尤其是对于高自由度(DOF)系统。此外,新兴策略缺乏感知稳定性和鲁棒性保证也引发了担忧。本文旨在通过以下方法缓解这些缺点:(1)用一个代表性简单的模型来建模复杂的高DOF系统;(2)明确使用正逆运动学,而不需要让RL算法自行学习;(3)在笛卡尔空间中学习运动策略,而不是关节空间。本文将这些方法应用于JPL的Robosimian,但可以轻松应用于任何具有基座和末端执行器的系统。这些运动策略可以在几分钟内生成,并在单台笔记本电脑上训练。我们比较了所学策略的鲁棒性与其他控制方法的鲁棒性。本文的配套视频可在https://youtu.be/xDxxSw5ahnc找到。

英文摘要

Recent breakthroughs in the reinforcement learning (RL) community have made significant advances towards learning and deploying policies on real world robotic systems. However, even with the current state-of-the-art algorithms and computational resources, these algorithms are still plagued with high sample complexity, and thus long training times, especially for high degree of freedom (DOF) systems. There are also concerns arising from lack of perceived stability or robustness guarantees from emerging policies. This paper aims at mitigating these drawbacks by: (1) modeling a complex, high DOF system with a representative simple one, (2) making explicit use of forward and inverse kinematics without forcing the RL algorithm to "learn" them on its own, and (3) learning locomotion policies in Cartesian space instead of joint space. In this paper these methods are applied to JPL's Robosimian, but can be readily used on any system with a base and end effector(s). These locomotion policies can be produced in just a few minutes, trained on a single laptop. We compare the robustness of the resulting learned policies to those of other control methods. An accompanying video for this paper can be found at https://youtu.be/xDxxSw5ahnc .

1903.00927 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Topological Information-Theoretic Belief Space Planning with Optimality Guarantees

具有最优性保证的拓扑信息论信念空间规划

Andrej Kitanov, Vadim Indelman

AI总结 本文提出了一种高效确定t-bsp误差界限的方法,从而为该方法提供全局最优性保证或解的不确定性边际,该方法基于信息论BSP的最优解和之前引入的拓扑度量。

详情
AI中文摘要

在高维状态空间中确定信念空间规划(BSP)的全局最优解计算成本很高,因为它需要对每个候选动作进行信念传播和目标函数评估。我们最近引入的拓扑信念空间规划t-bsp则仅考虑因子图的拓扑结构来做出决策。在本文中,我们为这一领域贡献了一种新的方法,用于高效确定t-bsp的误差界限,从而提供全局最优性保证或解的不确定性边际。这些界限是基于信息论BSP的最优解,并考虑了之前引入的拓扑度量,该度量基于生成树的数量。在现实和合成模拟中,我们分析了这些界限的紧致性,并展示了该度量如何与另一种计算上更高效的t-bsp度量紧密相关,即图的von Neumann熵近似值,后者可以实现在线性能。

英文摘要

Determining a globally optimal solution of belief space planning (BSP) in high-dimensional state spaces is computationally expensive, as it involves belief propagation and objective function evaluation for each candidate action. Our recently introduced topological belief space planning t-bsp instead performs decision making considering only topologies of factor graphs that correspond to posterior future beliefs. In this paper we contribute to this body of work a novel method for efficiently determining error bounds of t-bsp, thereby providing global optimality guarantees or uncertainty margin of its solution. The bounds are given with respect to an optimal solution of information theoretic BSP considering the previously introduced topological metric which is based on the number of spanning trees. In realistic and synthetic simulations, we analyze tightness of these bounds and show empirically how this metric is closely related to another computationally more efficient t-bsp metric, an approximation of the von Neumann entropy of a graph, which can achieve online performance.

1902.08705 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

A General Framework for Structured Learning of Mechanical Systems

结构机械系统学习的通用框架

Jayesh K. Gupta, Kunal Menda, Zachary Manchester, Mykel J. Kochenderfer

发表机构 * Stanford University(斯坦福大学)

AI总结 本文提出了一种通用框架,用于结构化学习机械系统,通过结合先验知识和训练表达式近似器来提高模型的准确性和效率。

Comments 10 pages, 7 figures. First two authors contributed equally. Submitted to IROS/RA-L. Code at https://github.com/sisl/mechamodlearn/

详情
AI中文摘要

学习准确的动力学模型对于优化和顺应性控制机器人系统至关重要。当前使用解析参数化进行白盒建模或使用神经网络进行黑盒建模的方法可能会产生高偏差或高方差。我们提出了一个灵活的灰盒模型,可以无缝地结合可用的先验知识,并在没有时训练具有表达能力的函数近似器。我们提出使用神经网络参数化机械系统,以建模其拉格朗日量和作用在其上的广义力。我们在模拟的驱动双摆上测试了我们的方法。我们展示了我们的方法在数据效率以及基于模型的强化学习中的性能优于朴素的黑盒模型。我们还系统地研究了我们的方法在结合可用的系统先验知识以提高数据效率方面的能力。

英文摘要

Learning accurate dynamics models is necessary for optimal, compliant control of robotic systems. Current approaches to white-box modeling using analytic parameterizations, or black-box modeling using neural networks, can suffer from high bias or high variance. We address the need for a flexible, gray-box model of mechanical systems that can seamlessly incorporate prior knowledge where it is available, and train expressive function approximators where it is not. We propose to parameterize a mechanical system using neural networks to model its Lagrangian and the generalized forces that act on it. We test our method on a simulated, actuated double pendulum. We show that our method outperforms a naive, black-box model in terms of data-efficiency, as well as performance in model-based reinforcement learning. We also conduct a systematic study of our method's ability to incorporate available prior knowledge about the system to improve data efficiency.

1903.00220 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Approximate Robust Control of Uncertain Dynamical Systems

不确定动力系统近似鲁棒控制

Edouard Leurent, Yann Blanco, Denis Efimov, Odalric-Ambrym Maillard

发表机构 * INRIA Lille(INRIA里尔) Renault(雷诺) Non-A team, INRIA Lille SequeL team, INRIA Lille(非A团队,INRIA里尔SequeL团队,INRIA里尔)

AI总结 本文研究了在不确定环境中大型非线性系统安全控制策略的设计,提出两种可处理的鲁棒控制方法,应用于自动驾驶问题。

详情
Journal ref
32nd Conference on Neural Information Processing Systems (NeurIPS 2018) Workshop, Dec 2018, Montr{é}al, Canada
AI中文摘要

本文研究了在不确定环境中大型非线性系统安全控制策略的设计。在这样的情况下,鲁棒控制框架是一种安全性的原理性方法,旨在最大化系统的最坏情况性能。然而,由此产生的优化问题通常对于具有连续状态的非线性系统来说是不可行的。为了解决这个问题,我们引入了两种可处理的方法,这些方法基于采样或对鲁棒目标的保守近似。所提出的方法应用于自动驾驶问题。

英文摘要

This work studies the design of safe control policies for large-scale non-linear systems operating in uncertain environments. In such a case, the robust control framework is a principled approach to safety that aims to maximize the worst-case performance of a system. However, the resulting optimization problem is generally intractable for non-linear systems with continuous states. To overcome this issue, we introduce two tractable methods that are based either on sampling or on a conservative approximation of the robust objective. The proposed approaches are applied to the problem of autonomous driving.

1806.07115 2026-06-04 cs.RO cs.SY eess.SY 版本更新

ConFusion: Sensor Fusion for Complex Robotic Systems using Nonlinear Optimization

ConFusion:利用非线性优化的复杂机器人系统传感器融合

Timothy Sandy, Lukas Stadelmann, Simon Kerscher, Jonas Buchli

发表机构 * Agile & Dexterous Robotics Lab, ETH Zurich(敏捷与灵活机器人实验室,苏黎世联邦理工学院)

AI总结 本文提出ConFusion,一种用于机器人应用的开源传感器融合框架,通过非线性优化实现灵活的传感器融合设计,并展示了其在视觉惯性跟踪和移动机械臂上的性能。

详情
Journal ref
IEEE Robotics and Automation Letters, 2019, Volume 4, Number 2, Pages 1093-1100
AI中文摘要

我们介绍了ConFusion,一种用于机器人应用的开源在线传感器融合包。ConFusion是一种模块化的框架,用于在移动时间窗口估计器内融合多种异构传感器的测量。ConFusion比基于滤波的系统在传感器融合问题设计上具有更大的灵活性,并且能够根据可用的计算能力调整在线估计的质量。我们通过与迭代扩展卡尔曼滤波器在视觉惯性跟踪中的性能比较,展示了其在移动机械臂上的整体传感器融合的适应性。

英文摘要

We present ConFusion, an open-source package for online sensor fusion for robotic applications. ConFusion is a modular framework for fusing measurements from many heterogeneous sensors within a moving horizon estimator. ConFusion offers greater flexibility in sensor fusion problem design than filtering-based systems and the ability to scale the online estimate quality with the available computing power. We demonstrate its performance in comparison to an iterated extended Kalman filter in visual-inertial tracking, and show its versatility through whole-body sensor fusion on a mobile manipulator.

1902.11015 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Mobile Formation Coordination and Tracking Control for Multiple Non-holonomic Vehicles

多非holonomic车辆的移动编队协调与跟踪控制

Xiuhui Peng, Zhiyong Sun, Kexin Guo, Zhiyong Geng

发表机构 * Department of Automatic Control, Lund University(自动化系,吕勒奥大学)

AI总结 本文针对非holonomic车辆在SE(2)上的轨迹跟踪和移动编队协调问题,提出了一种基于中间姿态变量的分阶段控制方法,并证明了严格刚体运动的编队条件,同时通过仿真和实验验证了所提控制器的性能。

详情
AI中文摘要

本文针对在SE(2)上进行轨迹跟踪和移动编队协调的非holonomic车辆的前进运动控制问题。首先,通过构建一个包含车辆位置信息和期望姿态的中间姿态变量,设计了分阶段的平移和旋转控制输入以解决轨迹跟踪问题。其次,深入探讨了非holonomic车辆之间的相对位置和航向的协调关系,以维持具有刚体运动约束的移动编队。证明了除了平行编队和纯 translation 直线编队的情况外,只要每个车辆的线速度与角速度的比值为常数,就可以实现严格刚体运动的编队。还展示了具有弱刚体运动的移动编队的运动特性。之后,基于所提的轨迹跟踪方法,在有向树图上设计了分布式移动编队控制律。所提控制器的性能通过数值仿真和实验进行了验证。

英文摘要

This paper addresses forward motion control for trajectory tracking and mobile formation coordination for a group of non-holonomic vehicles on SE(2). Firstly, by constructing an intermediate attitude variable which involves vehicles' position information and desired attitude, the translational and rotational control inputs are designed in two stages to solve the trajectory tracking problem. Secondly, the coordination relationships of relative positions and headings are explored thoroughly for a group of non-holonomic vehicles to maintain a mobile formation with rigid body motion constraints. We prove that, except for the cases of parallel formation and translational straight line formation, a mobile formation with strict rigid-body motion can be achieved if and only if the ratios of linear speed to angular speed for each individual vehicle are constants. Motion properties for mobile formation with weak rigid-body motion are also demonstrated. Thereafter, based on the proposed trajectory tracking approach, a distributed mobile formation control law is designed under a directed tree graph. The performance of the proposed controllers is validated by both numerical simulations and experiments.

1810.03749 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Balancing Global Exploration and Local-connectivity Exploitation with Rapidly-exploring Random disjointed-Trees

在快速探索随机断树中平衡全局探索与局部连通性利用

Tin Lai, Fabio Ramos, Gilad Francis

发表机构 * NVIDIA, USA(美国NVIDIA公司)

AI总结 本文提出了一种名为RRdT*的增量最优多查询规划器,通过使用多个断树来利用空间的局部连通性,通过马尔可夫链随机采样,并在局部连通性利用失败时主动探索全局空间,将局部利用与全局探索的平衡转化为多臂老虎机问题,从而提高采样效率。

Comments Submitted to IEEE International Conference on Robotics and Automation (ICRA) 2019

详情
AI中文摘要

在高度受限的环境中,采样效率长期以来一直是采样规划器的主要挑战。在本文中,我们提出了快速探索随机断树(RRdT*),一种增量最优多查询规划器。RRdT*使用多个断树来通过马尔可夫链随机采样来利用空间的局部连通性,该方法利用来自先前成功和失败样本的邻居信息。为了平衡局部利用,当局部连通性利用失败时,RRdT*会主动探索未见过的全局空间。局部利用与全局探索之间的主动权衡被公式化为一个多臂老虎机问题。我们主张,主动平衡全局探索与局部利用是提高采样规划器采样效率的关键。我们为这一新方法提供了严谨的完整性和最优收敛性证明。此外,我们通过实验展示了RRdT*的局部探索树在规划中提供改进可见性的有效性。因此,RRdT*在高度受限的环境中优于现有的最先进增量规划器。

英文摘要

Sampling efficiency in a highly constrained environment has long been a major challenge for sampling-based planners. In this work, we propose Rapidly-exploring Random disjointed-Trees* (RRdT*), an incremental optimal multi-query planner. RRdT* uses multiple disjointed-trees to exploit local-connectivity of spaces via Markov Chain random sampling, which utilises neighbourhood information derived from previous successful and failed samples. To balance local exploitation, RRdT* actively explore unseen global spaces when local-connectivity exploitation is unsuccessful. The active trade-off between local exploitation and global exploration is formulated as a multi-armed bandit problem. We argue that the active balancing of global exploration and local exploitation is the key to improving sample efficient in sampling-based motion planners. We provide rigorous proofs of completeness and optimal convergence for this novel approach. Furthermore, we demonstrate experimentally the effectiveness of RRdT*'s locally exploring trees in granting improved visibility for planning. Consequently, RRdT* outperforms existing state-of-the-art incremental planners, especially in highly constrained environments.

1902.10320 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A New Simulation Metric to Determine Safe Environments and Controllers for Systems with Unknown Dynamics

一种新的仿真度量用于确定具有未知动态系统的安全环境和控制器

Shromona Ghosh, Somil Bansal, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia, Claire J. Tomlin

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 本文提出了一种基于规范的仿真度量(SPEC),用于在系统动态未知的情况下,通过更严格的规范修改来合成安全控制器,从而扩大安全环境集。

Comments 22nd ACM International Conference on Hybrid Systems: Computation and Control (2019)

详情
AI中文摘要

我们考虑了在已知状态和控制空间但动态未知的情况下,提取安全环境和控制器以满足可达-避免目标的问题。在给定的环境中,通常通过从抽象或系统模型(可能从数据中学习)中合成控制器。然而,在许多情况下,模型的动力学与实际系统的关系并不明确,因此难以为系统提供安全保证。在这种情况下,标准仿真度量(SSM)定义为模型与系统输出轨迹之间最坏情况的范数距离,可以用来将系统的可达-避免规范修改为更严格的抽象规范。然而,获得的距离以及修改后的规范可能相当保守,这限制了能够获得安全控制器的环境集。我们提出SPEC,一种基于规范的仿真度量,通过仅计算违反系统规范的轨迹来克服这些限制。我们证明,使用SPEC修改可达-避免规范可以比SSM合成更大环境集的安全控制器。我们还提出了一种概率方法来计算一般系统的SPEC。使用四旋翼和自动驾驶汽车的仿真器进行的案例研究展示了所提出度量在确定安全环境集和控制器方面的优势。

英文摘要

We consider the problem of extracting safe environments and controllers for reach-avoid objectives for systems with known state and control spaces, but unknown dynamics. In a given environment, a common approach is to synthesize a controller from an abstraction or a model of the system (potentially learned from data). However, in many situations, the relationship between the dynamics of the model and the \textit{actual system} is not known; and hence it is difficult to provide safety guarantees for the system. In such cases, the Standard Simulation Metric (SSM), defined as the worst-case norm distance between the model and the system output trajectories, can be used to modify a reach-avoid specification for the system into a more stringent specification for the abstraction. Nevertheless, the obtained distance, and hence the modified specification, can be quite conservative. This limits the set of environments for which a safe controller can be obtained. We propose SPEC, a specification-centric simulation metric, which overcomes these limitations by computing the distance using only the trajectories that violate the specification for the system. We show that modifying a reach-avoid specification with SPEC allows us to synthesize a safe controller for a larger set of environments compared to SSM. We also propose a probabilistic method to compute SPEC for a general class of systems. Case studies using simulators for quadrotors and autonomous cars illustrate the advantages of the proposed metric for determining safe environment sets and controllers.

1902.09626 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Learning Extreme Hummingbird Maneuvers on Flapping Wing Robots

在扑翼机器人上学习极端蜂鸟动作

Fan Fei, Zhan Tu, Jian Zhang, Xinyan Deng

AI总结 研究通过模仿蜂鸟的极端机动动作,开发了一种混合控制策略,利用模型驱动的非线性控制和模型无关的强化学习,实现了在12克仿生蜂鸟机器人上实现快速逃避机动。

Comments 6 pages, accepted at ICRA 2019

详情
AI中文摘要

生物学研究表明,蜂鸟在快速逃避时可以执行极端空战动作。在悬停时突然出现的视觉刺激下,蜂鸟会启动快速的后退平移并伴随180度的偏转,随后在不到10次振翅之间完成瞬间姿态稳定。考虑到振翅频率为40Hz,这种激进的动作仅在0.2秒内完成。受蜂鸟在这些极端动作中接近最大性能的启发,我们开发了一种飞行控制系统,并实验表明,这种机动性可通过配备两个执行器的12克仿生蜂鸟机器人实现。所提出的混合控制策略结合了基于模型的非线性控制和无模型强化学习。我们使用基于模型的非线性控制进行正常飞行控制,因为这些条件下的动态模型相对准确。然而,在极端机动中,建模误差变得无法控制。通过在仿真中训练的无模型强化学习策略被优化以'破坏'系统并最大化机动期间的性能。混合策略表现出接近蜂鸟观察到的机动动作。直接仿真到现实的转移得以实现,证明了仿生蜂鸟机器人上蜂鸟式的快速逃避机动。

英文摘要

Biological studies show that hummingbirds can perform extreme aerobatic maneuvers during fast escape. Given a sudden looming visual stimulus at hover, a hummingbird initiates a fast backward translation coupled with a 180-degree yaw turn, which is followed by instant posture stabilization in just under 10 wingbeats. Consider the wingbeat frequency of 40Hz, this aggressive maneuver is carried out in just 0.2 seconds. Inspired by the hummingbirds' near-maximal performance during such extreme maneuvers, we developed a flight control strategy and experimentally demonstrated that such maneuverability can be achieved by an at-scale 12-gram hummingbird robot equipped with just two actuators. The proposed hybrid control policy combines model-based nonlinear control with model-free reinforcement learning. We use model-based nonlinear control for nominal flight control, as the dynamic model is relatively accurate for these conditions. However, during extreme maneuver, the modeling error becomes unmanageable. A model-free reinforcement learning policy trained in simulation was optimized to 'destabilize' the system and maximize the performance during maneuvering. The hybrid policy manifests a maneuver that is close to that observed in hummingbirds. Direct simulation-to-real transfer is achieved, demonstrating the hummingbird-like fast evasive maneuvers on the at-scale hummingbird robot.

1812.06120 2026-06-04 eess.SY cs.AI cs.RO cs.SY 版本更新

Simulation to Scaled City: Zero-Shot Policy Transfer for Traffic Control via Autonomous Vehicles

模拟到缩放城市:通过自动驾驶车辆实现交通控制的零样本策略迁移

Kathy Jang, Eugene Vinitsky, Behdad Chalaki, Ben Remer, Logan Beaver, Andreas Malikopoulos, Alexandre Bayen

发表机构 * University of California, Berkeley(加州大学伯克利分校) University of Delaware(德克萨斯大学)

AI总结 本文通过深度强化学习训练自动驾驶车辆在环形交叉口的控制策略,并将训练好的策略迁移至缩放智能城市进行测试,发现注入噪声的策略在迁移后表现更佳,实现了交通流的优化。

Comments To be published at the International Conference on Cyber Physical Systems (ICCPS) 2019. 10 pages, 9 figures

详情
AI中文摘要

使用深度强化学习,我们训练了自动驾驶车辆在车队中通过环形交叉口的控制策略。使用Flow库,我们在微仿真器中训练了两种策略:一种在状态和动作空间中注入噪声,另一种则没有。在模拟中,自动驾驶车辆为两种策略都学习出一种涌现的引导行为,即减速以实现更流畅的合并。随后,我们将该策略直接迁移至德雷克塞尔大学缩放智能城市(UDSSC)测试平台,该平台是连接和自动化车辆的1:25比例测试场。我们对两种策略在缩放城市中的性能进行了表征。结果显示,无噪声策略经常导致碰撞,仅偶尔实现引导;而注入噪声的策略则始终表现出引导行为且无碰撞,表明噪声有助于零样本策略迁移。此外,迁移后的噪声注入策略在UDSSC中使平均行程时间减少了5%,最大行程时间减少了22%。控制器的视频可在https://sites.google.com/view/iccps-policy-transfer查看。

英文摘要

Using deep reinforcement learning, we train control policies for autonomous vehicles leading a platoon of vehicles onto a roundabout. Using Flow, a library for deep reinforcement learning in micro-simulators, we train two policies, one policy with noise injected into the state and action space and one without any injected noise. In simulation, the autonomous vehicle learns an emergent metering behavior for both policies in which it slows to allow for smoother merging. We then directly transfer this policy without any tuning to the University of Delaware Scaled Smart City (UDSSC), a 1:25 scale testbed for connected and automated vehicles. We characterize the performance of both policies on the scaled city. We show that the noise-free policy winds up crashing and only occasionally metering. However, the noise-injected policy consistently performs the metering behavior and remains collision-free, suggesting that the noise helps with the zero-shot policy transfer. Additionally, the transferred, noise-injected policy leads to a 5% reduction of average travel time and a reduction of 22% in maximum travel time in the UDSSC. Videos of the controllers can be found at https://sites.google.com/view/iccps-policy-transfer.

1711.09048 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

A Compression-Inspired Framework for Macro Discovery

一种受压缩启发的宏发现框架

Francisco M. Garcia, Bruno C. da Silva, Philip S. Thomas

发表机构 * College of Information and Computer Sciences(信息与计算机科学学院) Department of Computer Science(计算机科学系) University of Massachusetts Amherst(马萨诸塞大学阿默斯特分校) Federal University Rio Grande do Sul(里约格朗德杜斯阿鲁斯联邦大学)

AI总结 本文提出了一种受压缩启发的宏发现框架,通过识别高性能策略获得的轨迹中的重复模式,帮助强化学习代理利用早期经验快速解决相关新任务。

Comments Accepted as Extended Abstract, AAMAS, 2019

详情
AI中文摘要

在本文中,我们考虑了强化学习代理在解决一组相关马尔可夫决策过程时,如何利用早期获得的知识来提高其快速解决新但相关任务的能力。一种利用这种经验的方法是通过识别从高性能策略中获得的轨迹中的重复模式。我们提出一个三步框架:代理1) 通过压缩来自近最优策略的轨迹生成一组候选开环宏;2) 评估每个宏的价值;3) 选择一个最大化多样性的宏子集,覆盖通常用于解决相关任务集的策略空间。我们的实验表明,将识别出的宏扩展到代理的原始原始动作集,使其能够更快速地在未见过但相似的MDPs中学习到最优策略。

英文摘要

In this paper we consider the problem of how a reinforcement learning agent tasked with solving a set of related Markov decision processes can use knowledge acquired early in its lifetime to improve its ability to more rapidly solve novel, but related, tasks. One way of exploiting this experience is by identifying recurrent patterns in trajectories obtained from well-performing policies. We propose a three-step framework in which an agent 1) generates a set of candidate open-loop macros by compressing trajectories drawn from near-optimal policies; 2) evaluates the value of each macro; and 3) selects a maximally diverse subset of macros that spans the space of policies typically required for solving the set of related tasks. Our experiments show that extending the original primitive action-set of the agent with the identified macros allows it to more rapidly learn an optimal policy in unseen, but similar MDPs.

1902.07708 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Stability Analysis for the Acceleration-based Robust Position Control of Robot Manipulators via Disturbance Observer

基于扰动观测器的机器人操作臂加速度鲁棒位置控制的稳定性分析

Emre Sariyildiz, Hiromu Sekiguchi, Takahiro Nozaki, Barkan Ugurlu, Kouhei Ohnishi

发表机构 * University of Wollongong(沃林戈大学) Keio University(庆应大学)

AI总结 本文提出了一种新的非线性稳定性分析方法,用于通过扰动观测器(DOb)实现机器人操作臂的加速度鲁棒位置控制,证明了在适当调节名义惯性矩阵时,位置误差在调节控制中渐近趋于零,在轨迹跟踪控制中是均匀最终有界的。随着DOb的带宽和名义惯性矩阵的增加,误差的界减小,即位置控制系统的鲁棒稳定性和性能得到改善。然而,由于实际设计限制,DOb的带宽和名义惯性矩阵不能随意增加,例如,当它们增加时,鲁棒位置控制器会变得更加噪声敏感。所提出的稳定性分析为DOb基于鲁棒运动控制系统动态行为提供了见解。理论和实验证明了名义惯性矩阵的非对角元素对稳定性和调整鲁棒性与噪声敏感性之间的权衡是有帮助的。该提议的有效性通过仿真和实验结果得到验证。

Comments 9 pages, 9 figures, Journal

详情
Journal ref
IEEE/ASME Transactions On Mechatronics, Vol. 23, No. 5, October 2018
AI中文摘要

本文提出了一种新的非线性稳定性分析方法,用于通过扰动观测器(DOb)实现机器人操作臂的加速度鲁棒位置控制。证明了如果在DOb的设计中适当调节名义惯性矩阵,则在调节控制中位置误差渐近趋于零,在轨迹跟踪控制中是均匀最终有界的。随着DOb的带宽和名义惯性矩阵的增加,误差的界减小,即位置控制系统的鲁棒稳定性和性能得到改善。然而,由于实际设计限制,DOb的带宽和名义惯性矩阵不能随意增加,例如,当它们增加时,鲁棒位置控制器会变得更加噪声敏感。所提出的稳定性分析为DOb基于鲁棒运动控制系统动态行为提供了见解。理论和实验证明了名义惯性矩阵的非对角元素对稳定性和调整鲁棒性与噪声敏感性之间的权衡是有帮助的。该提议的有效性通过仿真和实验结果得到验证。

英文摘要

This paper proposes a new nonlinear stability analysis for the acceleration-based robust position control of robot manipulators by using Disturbance Observer (DOb). It is shown that if the nominal inertia matrix is properly tuned in the design of DOb, then the position error asymptotically goes to zero in regulation control and is uniformly ultimately bounded in trajectory tracking control. As the bandwidth of DOb and the nominal inertia matrix are increased, the bound of error shrinks, i.e., the robust stability and performance of the position control system are improved. However, neither the bandwidth of DOb nor the nominal inertia matrix can be freely increased due to practical design constraints, e.g., the robust position controller becomes more noise sensitive when they are increased. The proposed stability analysis provides insights regarding the dynamic behavior of DOb-based robust motion control systems. It is theoretically and experimentally proved that non-diagonal elements of the nominal inertia matrix are useful to improve the stability and adjust the trade-off between the robustness and noise sensitivity. The validity of the proposal is verified by simulation and experimental results.

1812.07084 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Learning Constraints from Demonstrations

从示范中学习约束

Glen Chou, Dmitry Berenson, Necmiye Ozay

发表机构 * Dept. of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI, 48109, USA(电气工程与计算机科学系,密歇根大学,安娜堡,MI,48109,美国)

AI总结 该研究提出了一种从示范中学习未知约束的方法,通过任务示范、成本函数和系统动力学与控制约束,利用hit-and-run采样获取低成本但不安全的轨迹,并通过整数规划获得一致的不安全集表示,同时理论分析了可从安全示范中学习的约束子集。

Comments Presented at the Workshop on the Algorithmic Foundations of Robotics (WAFR), 2018, Mérida, Mexico

详情
AI中文摘要

我们通过提供一种方法扩展了从示范中学习的范式,该方法利用任务的示范、成本函数以及系统动力学和控制约束来学习跨任务的未知约束。给定安全的示范,我们的方法使用hit-and-run采样来获得低成本但不安全的轨迹。安全和不安全的轨迹都被用来通过求解整数规划问题获得不安全集的一致表示。我们的方法能够跨系统动力学泛化,并学习保证的约束子集。我们还提供了理论分析,说明从安全示范中可以学习的约束子集。我们在线性和非线性系统动力学上展示了我们的方法,并证明它可以修改以适应次优示范,并且也可以用于特征空间中学习约束。

英文摘要

We extend the learning from demonstration paradigm by providing a method for learning unknown constraints shared across tasks, using demonstrations of the tasks, their cost functions, and knowledge of the system dynamics and control constraints. Given safe demonstrations, our method uses hit-and-run sampling to obtain lower cost, and thus unsafe, trajectories. Both safe and unsafe trajectories are used to obtain a consistent representation of the unsafe set via solving an integer program. Our method generalizes across system dynamics and learns a guaranteed subset of the constraint. We also provide theoretical analysis on what subset of the constraint can be learnable from safe demonstrations. We demonstrate our method on linear and nonlinear system dynamics, show that it can be modified to work with suboptimal demonstrations, and that it can also be used to learn constraints in a feature space.

1902.06133 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Fleet of Miniature Cars for Experiments in Cooperative Driving

用于合作驾驶实验的微型汽车车队

Nicholas Hyldmar, Yijun He, Amanda Prorok

发表机构 * University of Cambridge(剑桥大学)

AI总结 本文介绍了一种由16辆微型Ackermann转向车辆组成的独特实验平台,旨在解决多车导航和轨迹规划研究与教育中低成本平台不足的问题,通过实验展示合作驾驶在多车道道路地形中的优势。

Comments Accepted to ICRA 2019

详情
AI中文摘要

我们介绍了一种独特的实验测试平台,由16辆微型Ackermann转向车辆组成。我们受多车导航和轨迹规划研究与教育中缺乏低成本平台的启发。本文详细介绍了我们的微型机器人汽车Cambridge Minicar的设计以及车队的控制架构。我们的实验平台允许我们实现最先进的驾驶员模型以及自主控制策略,并在真实的多车道物理环境中测试其有效性。通过在我们的微型高速公路上的实验,我们能够直观地展示合作驾驶在多车道道路地形中的优势。我们的设置为室内的大车队实验研究铺平了道路。

英文摘要

We introduce a unique experimental testbed that consists of a fleet of 16 miniature Ackermann-steering vehicles. We are motivated by a lack of available low-cost platforms to support research and education in multi-car navigation and trajectory planning. This article elaborates the design of our miniature robotic car, the Cambridge Minicar, as well as the fleet's control architecture. Our experimental testbed allows us to implement state-of-the-art driver models as well as autonomous control strategies, and test their validity in a real, physical multi-lane setup. Through experiments on our miniature highway, we are able to tangibly demonstrate the benefits of cooperative driving on multi-lane road topographies. Our setup paves the way for indoor large-fleet experimental research.

1902.05343 2026-06-04 cs.CV cs.RO cs.SY eess.SY 版本更新

Study of dynamical system based obstacle avoidance via manipulating orthogonal coordinates

基于操纵正交坐标的动态系统障碍避障研究

Weiya Ren

发表机构 * Artificial Intelligence Research Center of National Innovation Institute of Defense Technology(国家创新技术研究院人工智能研究中心) Tianjin Artificial Intelligence Innovation Center(天津人工智能创新中心)

AI总结 本文研究了基于动态系统的障碍避障问题,通过引入正交坐标开发了调制矩阵,使调制矩阵更加合理。新轨迹的方向可通过正交坐标的线性组合表示。提出了一种通过引入旋转矩阵来解决局部最小问题,并在三维或更高维空间中提供更合理运动的正交坐标操纵方法。该方法还为围绕凸形体巡逻提供了解决方案。实验结果表明所提出方法的有效性。

详情
AI中文摘要

在本文中,我们考虑了基于动态系统的障碍避障问题。通过引入正交坐标,开发了调制矩阵,使调制矩阵更加合理。新轨迹的方向可通过正交坐标的线性组合表示。通过引入旋转矩阵,提出了一种正交坐标操纵方法,以解决局部最小问题,并在三维或更高维度空间中提供更合理的运动。所提出的方法还为围绕凸形体巡逻提供了解决方案。在几个设计的动态系统上的实验结果展示了所提出方法的有效性。

英文摘要

In this paper, we consider the general problem of obstacle avoidance based on dynamical system. The modulation matrix is developed by introducing orthogonal coordinates, which makes the modulation matrix more reasonable. The new trajectory's direction can be represented by the linear combination of orthogonal coordinates. A orthogonal coordinates manipulating approach is proposed by introducing rotating matrix to solve the local minimal problem and provide more reasonable motions in 3-D or higher dimension space. The proposed method also provide a solution for patrolling around a convex shape. Experimental results on several designed dynamical systems demonstrate the effectiveness of the proposed approach.

1809.04539 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Frequency-Aware Model Predictive Control

频率感知模型预测控制

Ruben Grandia, Farbod Farshidian, Alexey Dosovitskiy, René Ranftl, Marco Hutter

发表机构 * Robotic Systems Lab, ETH Zurich(机器人系统实验室,苏黎世联邦理工学院) Intel Labs, Munich, Germany(英特尔实验室,德国慕尼黑)

AI总结 本文提出频率形状成本函数,用于在腿足机器人最优控制中实现鲁棒解决方案,通过仿真和硬件实验展示了运动计划与执行器带宽限制的兼容性,并在未建模合规性地形上实现了稳健行走。

详情
Journal ref
IEEE Robotics and Automation Letters 2019
AI中文摘要

将轨迹优化得到的解决方案转移到机器人硬件仍是一个具有挑战性的问题。当优化充分利用提供的模型执行动态任务时,未建模的动力学会使运动在现实系统中不可行。模型误差可能是由于模型简化,也自然出现在在无结构和非确定性环境中部署机器人时。主要的是,顺应性接触和执行器动力学导致带宽限制。虽然经典控制方法提供了合成对一类模型误差鲁棒的控制器的工具,但现代轨迹优化中缺少这种概念,该问题是在时域中解决的。我们提出频率形状成本函数,以在腿足机器人的最优控制中实现鲁棒解决方案。通过仿真和硬件实验,我们展示了运动计划可以与由执行器和接触动力学设定的带宽限制相兼容。模型预测解决方案的平滑度可以连续调节而不影响问题的可行性。与由高度顺应性串联弹性执行器驱动的四足机器人ANYmal的实验显示,计划的运动、扭矩和力轨迹的跟踪性能显著提高,并使机器在具有未建模顺应性的地形上稳健行走。

英文摘要

Transferring solutions found by trajectory optimization to robotic hardware remains a challenging task. When the optimization fully exploits the provided model to perform dynamic tasks, the presence of unmodeled dynamics renders the motion infeasible on the real system. Model errors can be a result of model simplifications, but also naturally arise when deploying the robot in unstructured and nondeterministic environments. Predominantly, compliant contacts and actuator dynamics lead to bandwidth limitations. While classical control methods provide tools to synthesize controllers that are robust to a class of model errors, such a notion is missing in modern trajectory optimization, which is solved in the time domain. We propose frequency-shaped cost functions to achieve robust solutions in the context of optimal control for legged robots. Through simulation and hardware experiments we show that motion plans can be made compatible with bandwidth limits set by actuators and contact dynamics. The smoothness of the model predictive solutions can be continuously tuned without compromising the feasibility of the problem. Experiments with the quadrupedal robot ANYmal, which is driven by highly-compliant series elastic actuators, showed significantly improved tracking performance of the planned motion, torque, and force trajectories and enabled the machine to walk robustly on terrain with unmodeled compliance.

1812.06325 2026-06-04 eess.SY cs.LG cs.RO cs.SY 版本更新

Data-efficient Auto-tuning with Bayesian Optimization: An Industrial Control Study

数据高效自动调优与贝叶斯优化:一项工业控制研究

Matthias Neumann-Brosig, Alonso Marco, Dieter Schwarzmann, Sebastian Trimpe

发表机构 * IAV GmbH(IAV集团) Max Planck Society(马克斯·普朗克学会) Cyber Valley initiative(Cyber Valley倡议) Max Planck Institute for Intelligent Systems(智能系统研究所)

AI总结 本文提出利用贝叶斯优化自动学习最优控制器参数,通过概率模型(高斯过程)建模控制器参数到用户定义成本的未知函数,并通过实验数据迭代优化,以高效找到全局最优参数,实验表明其在 throttle valve 控制中优于手动校准。

Comments 11 pages, 7 figures and 4 tables. To appear in IEEE Transactions on Control Systems Technology

详情
AI中文摘要

贝叶斯优化被提出用于从实验数据自动学习最优控制器参数。通过概率描述(高斯过程)建模控制器参数到用户定义成本的未知函数。概率模型通过在物理系统上测试一组参数并评估成本来更新。为加快学习速度,贝叶斯优化算法系统地选择下一步评估的参数,例如通过最大化关于最优解的信息增益。因此,该算法通过少量实验迭代找到全局最优参数。以节流阀控制为例,所提出的自动调优方法在低实验次数下 consistently 实现更好的性能,优于手动校准。所提出的自动调优框架具有灵活性,可处理不同的控制结构和目标。

英文摘要

Bayesian optimization is proposed for automatic learning of optimal controller parameters from experimental data. A probabilistic description (a Gaussian process) is used to model the unknown function from controller parameters to a user-defined cost. The probabilistic model is updated with data, which is obtained by testing a set of parameters on the physical system and evaluating the cost. In order to learn fast, the Bayesian optimization algorithm selects the next parameters to evaluate in a systematic way, for example, by maximizing information gain about the optimum. The algorithm thus iteratively finds the globally optimal parameters with only few experiments. Taking throttle valve control as a representative industrial control example, the proposed auto-tuning method is shown to outperform manual calibration: it consistently achieves better performance with a low number of experiments. The proposed auto-tuning framework is flexible and can handle different control structures and objectives.

1309.7666 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Dynamic Sliding Mode Control based on Fractional calculus subject to uncertain delay based chaotic pneumatic robot

基于不确定延迟混沌气动机器人的分数 calculus 动态滑模控制

Sara Gholipour P., Heydar Toosian Shandiz, Mobin Alizadeh, Sara Minagar, Seyed Javad Kazemitabar

发表机构 * Farabina Noshirvani’s smart Co.(法拉比亚·诺希拉维智能公司) Robotic Research Lab.(机器人研究实验室) Faculty of Electrical and Robotic Engineering, Shahrood University of Technology(沙霍尔德大学电气与机器人工程学院)

AI总结 本文针对机器人操作臂中由于延迟引起混沌的滑模控制颤振问题,提出了一种基于分数 calculus 的动态滑模控制方法,通过动态本质消除颤振,并在主从配置的混沌系统中实现控制,利用李雅普诺夫稳定性理论保证闭环系统稳定性,同时对延迟机器人运动进行定性与定量研究。

Comments 8 pages, 9 figures, will be submitted in journal

详情
AI中文摘要

本文考虑了机器人操作臂中由于延迟引起混沌的滑模控制颤振问题。分数 calculus 作为一种强大的定理,用于产生一种新的滑模控制,其动态本质用于消除颤振。为实现主从配置中一类混沌系统的控制,本文提出并检验了这种新的分数动态滑模控制方案,在关节和工作空间中的延迟混沌机器人上进行测试。此外,通过李雅普诺夫稳定性理论保证闭环系统的稳定性。此外,延迟的机器人运动被用于定性和定量研究。最后,数值模拟示例展示了所提出控制方法的可行性。

英文摘要

This paper considers the chattering problem of sliding mode control while delay in robot manipulator caused chaos in such electromechanical systems. Fractional calculus as a powerful theorem to produce a novel sliding mode; which has a dynamic essence is used for chattering elimination. To realize the control of a class of chaotic systems in master-slave configuration this novel fractional dynamic sliding mode control scheme is presented and examined on delay based chaotic robot in joint and work space. Also the stability of the closed-loop system is guaranteed by Lyapunov stability theory. Beside these, delayed robot motions are sorted out for qualitative and quantification study. Finally, numerical simulation example illustrates the feasibility of proposed control method.

1708.05004 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

RodFIter: Attitude Reconstruction from Inertial Measurement by Functional Iteration

RodFIter:通过函数迭代从惯性测量中重建姿态

Yuanxin Wu

发表机构 * Shanghai Jiao Tong University(上海交通大学)

AI总结 本文提出了一种基于罗德里格斯向量的函数迭代方法(RodFIter),用于从陀螺仪测量中精确重建姿态,该方法在理论上能够准确重构增量姿态,并在姿态摆动运动中表现出优于主流姿态算法的精度。

Comments IEEE TAES, 2018

详情
AI中文摘要

刚体运动的计算或估计是许多领域中的基石。通过整合陀螺仪测量的角速度可以实现姿态计算,其准确性对惯性导航中的死 reckoning至关重要。目前的姿态算法通常依赖于旋转向量的简化微分方程来获得姿态。本文提出了一种名为RodFIter的方法,该方法通过罗德里格斯向量进行函数迭代,以解析地从陀螺仪测量中重建姿态。RodFIter方法只要角速度是准确的,就能在理论上准确地重构增量姿态。值得注意的是,罗德里格斯向量可以解析地获得,并可用于在考虑的时间区间内更新姿态。所提出的方法产生了一种终极姿态算法方案,可以自然地扩展到一般的刚体运动计算。该方法在姿态摆动运动中进行了广泛评估,并在精度上优于主流姿态算法。这项工作被认为已经消除了从惯性测量中进行精确运动积分的长期理论障碍。

英文摘要

Rigid motion computation or estimation is a cornerstone in numerous fields. Attitude computation can be achieved by integrating the angular velocity measured by gyroscopes, the accuracy of which is crucially important for the dead-reckoning inertial navigation. The state-of-the-art attitude algorithms have unexceptionally relied on the simplified differential equation of the rotation vector to obtain the attitude. This paper proposes a Functional Iteration technique with the Rodrigues vector (named the RodFIter method) to analytically reconstruct the attitude from gyroscope measurements. The RodFIter method is provably exact in reconstructing the incremental attitude as long as the angular velocity is exact. Notably, the Rodrigues vector is analytically obtained and can be used to update the attitude over the considered time interval. The proposed method gives birth to an ultimate attitude algorithm scheme that can be naturally extended to the general rigid motion computation. It is extensively evaluated under the attitude coning motion and compares favorably in accuracy with the mainstream attitude algorithms. This work is believed having eliminated the long-standing theoretical barrier in exact motion integration from inertial measurements.

1712.03913 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

A Non-Cooperative Game Approach to Autonomous Racing

一种非合作博弈方法用于自动驾驶赛车

Alexander Liniger, John Lygeros

AI总结 本文提出了一种非合作非零和博弈方法来解决自动驾驶赛车决策问题,通过设计三种不同的博弈模型,考虑静态赛道约束和车辆间碰撞避免,探讨了博弈策略对赛车行为建模的影响。

Comments 14 pages, 5 figuers

详情
AI中文摘要

我们考虑两辆车的自动驾驶赛车问题,并提出了一种将赛车决策建模为非合作非零和博弈的方法。我们设计了三种不同的博弈,玩家旨在满足静态赛道约束并避免相互碰撞;后者约束依赖于两个玩家的联合行动。三种博弈的区别在于碰撞约束和支付函数。在第一个博弈中,仅由跟随者考虑碰撞避免,每个玩家最大化自身向终点线的进度。我们证明,由于该博弈的顺序结构,可以通过高效的顺序最大化方法计算均衡。进一步,我们证明这些动作,如果可行,也是第二个博弈中纯策略的Stackelberg和Nash均衡,其中两个玩家都考虑碰撞约束。第三个博弈的支付函数被设计为促进阻塞,通过在时间 horizon 结束时奖励车辆保持领先。我们证明这会改变Stackelberg均衡,但对Nash均衡影响较小。对于在线实现,我们提出以移动时间窗的方式进行博弈,并讨论了保证由此产生的耦合重复博弈可行性的两种方法。最后,我们研究了所提出方法在模拟中的性能,以复制在苏黎世联邦理工学院自动控制实验室测试的微型赛车设置。模拟研究显示,所提出的博弈能够成功建模不同的赛车行为并生成有趣的赛车情境。

英文摘要

We consider autonomous racing of two cars and present an approach to formulate racing decisions as a non-cooperative non-zero-sum game. We design three different games where the players aim to fulfill static track constraints as well as avoid collision with each other; the latter constraint depends on the combined actions of the two players. The difference between the games are the collision constraints and the payoff. In the first game collision avoidance is only considered by the follower, and each player maximizes their own progress towards the finish line. We show that, thanks to the sequential structure of this game, equilibria can be computed through an efficient sequential maximization approach. Further, we show these actions, if feasible, are also a Stackelberg and Nash equilibrium in pure strategies of our second game where both players consider the collision constraints. The payoff of our third game is designed to promote blocking, by additionally rewarding the cars for staying ahead at the end of the horizon. We show that this changes the Stackelberg equilibrium, but has a minor influence on the Nash equilibria. For online implementation, we propose to play the games in a moving horizon fashion, and discuss two methods for guaranteeing feasibility of the resulting coupled repeated games. Finally, we study the performance of the proposed approaches in simulation for a set-up that replicates the miniature race car tested at the Automatic Control Laboratory of ETH Zurich. The simulation study shows that the presented games can successfully model different racing behaviors and generate interesting racing situations.

1710.11088 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Robust Cooperative Manipulation without Force/Torque Measurements: Control Design and Experiments

无需力/扭矩测量的鲁棒协同操作:控制设计与实验

Christos K. Verginis, Matteo Mastellaro, Dimos V. Dimarogonas

AI总结 本文提出两种新型控制方法,用于由N个机器人代理协同操作物体,通过四元数反馈避免表示奇异,同时保证物体轨迹的暂态和稳态性能,且具有去中心化和抗扰动特性,无需力/扭矩测量,并通过仿真和实验验证理论结果。

详情
AI中文摘要

本文提出了两种新颖的控制方法,用于由N个机器人代理协同操作物体。首先,我们设计了一种自适应控制协议,利用四元数反馈来避免潜在的表示奇点。其次,我们提出了一种保证物体轨迹预定义暂态和稳态性能的控制协议。两种方法都是去中心化的,因为代理可以自行计算信号而不进行相互通信,同时对外部扰动和模型不确定性具有鲁棒性。此外,我们考虑抓取点是刚性的,从而避免了对力/扭矩测量的需求。通过抓握矩阵伪逆来考虑代理之间可能的功率能力差异,以实现负载分布。最后,通过两个机器人臂的仿真和实验结果验证了理论发现。

英文摘要

This paper presents two novel control methodologies for the cooperative manipulation of an object by N robotic agents. Firstly, we design an adaptive control protocol which employs quaternion feedback for the object orientation to avoid potential representation singularities. Secondly, we propose a control protocol that guarantees predefined transient and steady-state performance for the object trajectory. Both methodologies are decentralized, since the agents calculate their own signals without communicating with each other, as well as robust to external disturbances and model uncertainties. Moreover, we consider that the grasping points are rigid, and avoid the need for force/torque measurements. Load distribution is also included via a grasp matrix pseudo-inverse to account for potential differences in the agents' power capabilities. Finally, simulation and experimental results with two robotic arms verify the theoretical findings.

1707.09718 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Adaptive Second-order Sliding Mode Control of UAVs for Civil Applications

无人机民用应用中的自适应二阶滑模控制

Van Truong Hoang, Ansu Man Singh, Manh Duong Phung, Quang Ha

发表机构 * Faculty of Engineering and Information Technology(工程与信息技术学院) University of Technology Sydney, Australia(悉尼技术大学,澳大利亚)

AI总结 本文提出了一种自适应二阶准连续滑模控制方案,用于无人机在基础设施监测中的鲁棒姿态控制,通过数学建模和稳定性分析,实现了对噪声和扰动的鲁棒性,并在仿真中验证了其优于实际无人机任务的跟踪性能。

Comments in Proceeding of The 34th International Symposium on Automation and Robotics in Construction (ISARC), pp. 823-829, Taipei, Taiwan, 2017

详情
AI中文摘要

四旋翼作为一种无人驾驶航空器(UAV),在测绘、建筑监控和基础设施状况评估等民用应用中具有巨大潜力。然而,四旋翼对噪声和扰动较为敏感,因此在控制不足、系统不确定性和/或外部扰动的情况下,其性能可能迅速下降。在本研究中,我们通过提出一种名为自适应二阶准连续滑模控制(自适应2-QCSM)的鲁棒方案,来处理四旋翼的低层控制。最终目标是实现无人机在基础设施监测和检查中的鲁棒姿态控制。首先,考虑非线性、强耦合、不确定动态和外部扰动,推导了四旋翼的数学模型。控制设计包括滑动面的选择和开发具有自适应增益的准连续二阶滑模控制器。通过使用全局李雅普诺夫函数分析整个控制系统的稳定性,以保证滑动动态和自适应方案的收敛性。进行了大量的仿真以进行评估。结果表明,所提出的控制器能够对扰动或参数变化具有鲁棒性,并在与实际无人机实时监控任务的实验响应相比,具有更好的跟踪性能。

英文摘要

Quadcopters, as unmanned aerial vehicles (UAVs), have great potential in civil applications such as surveying, building monitoring, and infrastructure condition assessment. Quadcopters, however, are relatively sensitive to noises and disturbances so that their performance may be quickly downgraded in the case of inadequate control, system uncertainties and/or external disturbances. In this study, we deal with the quadrotor low-level control by proposing a robust scheme named the adaptive second-order quasi-continuous sliding mode control (adaptive 2-QCSM). The ultimate objective is for robust attitude control of the UAV in monitoring and inspection of built infrastructure. First, the mathematical model of the quadcopter is derived considering nonlinearity, strong coupling, uncertain dynamics and external disturbances. The control design includes the selection of the sliding manifold and the development of quasi-continuous second-order sliding mode controller with an adaptive gain. Stability of the overall control system is analysed by using a global Lyapunov function for convergence of both the sliding dynamics and adaptation scheme. Extensive simulations have been carried out for evaluation. Results show that the proposed controller can achieve robustness against disturbances or parameter variations and has better tracking performance in comparison with experimental responses of a UAV in a real-time monitoring task.

1707.09715 2026-06-04 eess.SY cs.CV cs.RO cs.SY 版本更新

Automatic Crack Detection in Built Infrastructure Using Unmanned Aerial Vehicles

使用无人机自动检测建筑基础设施裂缝

Manh Duong Phung, Van Truong Hoang, Tran Hiep Dinh, Quang Ha

发表机构 * School of Electrical Mechanical and Mechatronic Systems, University of Technology Sydney, Australia(电气机械与机电系统学院,悉尼技术大学,澳大利亚)

AI总结 本文提出了一种利用无人机采集数据并结合直方图分析进行建筑基础设施裂缝检测的方法,通过自动化流程提高检测效率并降低安全隐患。

Comments In proceeding of The 34th International Symposium on Automation and Robotics in Construction (ISARC), pp. 823-829, Taipei, Taiwan, 2017

详情
AI中文摘要

本文针对建筑基础设施健康监测中至关重要的裂缝检测问题,提出了一种包含两个阶段的方法:使用无人机(UAV)进行数据采集和利用直方图分析进行裂缝检测。首先,利用激光扫描仪创建结构的3D模型,然后提取几何属性以生成用于导航无人机拍摄结构图像的路径点。接着,将从重叠视野中获取的图像拼接在一起,通过直方图分析和峰值检测进行聚类,最后利用局部自适应阈值识别潜在裂缝。整个过程自动化进行,从而显著提高了检查时间并最小化了安全风险。已开发出原型系统进行评估,并包含实验结果。

英文摘要

This paper addresses the problem of crack detection which is essential for health monitoring of built infrastructure. Our approach includes two stages, data collection using unmanned aerial vehicles (UAVs) and crack detection using histogram analysis. For the data collection, a 3D model of the structure is first created by using laser scanners. Based on the model, geometric properties are extracted to generate way points necessary for navigating the UAV to take images of the structure. Then, our next step is to stick together those obtained images from the overlapped field of view. The resulting image is then clustered by histogram analysis and peak detection. Potential cracks are finally identified by using locally adaptive thresholds. The whole process is automatically carried out so that the inspection time is significantly improved while safety hazards can be minimised. A prototypical system has been developed for evaluation and experimental results are included.

1812.07879 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Modelling and Fast Terminal Sliding Mode Control for Mirror-based Pointing Systems

基于镜面的指向系统建模与快速终端滑模控制

Ansu Man Singh, Manh Duong Phung, Quang Ha

发表机构 * University of Technology Sydney(悉尼技术大学)

AI总结 本文提出了一种新的离散时间快速终端滑模控制器,用于基于镜面的指向系统,通过非线性最小二乘识别方法估计参数,并基于推导的模型设计了连续域的滑模面,利用欧拉离散化合成离散时间控制器,并通过添加线性项改进滑模面的暂态动态,最后基于Sarpturk到达条件证明了控制器的稳定性。

Comments In Proceedings of the 15th International Conference on Control, Automation, Robotics and Vision (ICARCV 2018)

详情
AI中文摘要

在本文中,我们提出了一种新的离散时间快速终端滑模(FTSM)控制器,用于基于镜面的指向系统。我们首先推导了这些系统的解耦模型,然后使用非线性最小二乘识别方法估计参数。基于推导的模型,我们设计了一个连续域的FTSM滑模面。然后,我们利用设计的FTSM滑模面进行欧拉离散化,以合成离散时间控制器。进一步地,我们通过添加一个线性项来改进滑模面的暂态动态。最后,我们基于Sarpturk到达条件证明了所提出控制器的稳定性。进行了大量仿真,并与终端滑模(TSM)和模型预测控制(MPC)进行了比较,以评估所提出方法的有效性。还进行了与实时实验数据的比较研究。结果表明,所提出的方法优于其他技术。

英文摘要

In this paper, we present a new discrete-time Fast Terminal Sliding Mode (FTSM) controller for mirror-based pointing systems. We first derive the decoupled model of those systems and then estimate the parameters using a nonlinear least-square identification method. Based on the derived model, we design a FTSM sliding manifold in the continuous domain. We then exploit the Euler discretization on the designed FTSM sliding surfaces to synthesize a discrete-time controller. Furthermore, we improve the transient dynamics of the sliding surface by adding a linear term. Finally, we prove the stability of the proposed controller based on the Sarpturk reaching condition. Extensive simulations, followed by comparisons with the Terminal Sliding Mode (TSM) and Model Predictive Control (MPC) have been carried out to evaluate the effectiveness of the proposed approach. A comparative study with data obtained from a real-time experiment was also conducted. The results indicate the advantage of the proposed method over the other techniques.

1812.11707 2026-06-04 cs.RO cs.SY eess.SY 版本更新

UAV Control in Close Proximities - Ceiling Effect on Battery Lifetime

无人机近距离控制 - 顶效应对电池寿命的影响

Basaran Bahadir Kocer, Volkan Kumtepeli, Tegoeh Tjahjowidodo, Mahardhika Pratama, Anshuman Tripathi, Gerald Seet Gim Lee, Youyi Wang

发表机构 * 1 School of Mechanical

AI总结 本文研究了无人机在近距离飞行时利用顶效应减少电池消耗对电池寿命的影响,通过实测数据发现顶效应可降低控制器努力,并首次采用全等效循环计数法分析其对电池寿命的降解作用,结果表明可减少15.77%的电池降解。

Comments ICoIAS 2019

详情
AI中文摘要

随着无人机技术的最新发展,预计它们将与周围物体、其他机器人和人互动和协作,以智能地计划和执行特定任务。尽管这些交互操作相比自由飞行任务更具挑战性,但它们可能带来多样的优势。其中之一是在近距离飞行中的基本空气动力学交互,这可以减少控制器的努力。在本研究中,通过收集实时数据,我们观察到在非常接近周围环境飞行时,借助顶效应可以减少电池的电流消耗。首次采用简单的全等效循环计数法,从电池寿命降解的角度分析这一现象。结果表明,如果无人机能够利用顶效应,循环相关的电池降解效应可以减少15.77%。

英文摘要

With the recent developments in the unmanned aerial vehicles (UAV), it is expected them to interact and collaborate with their surrounding objects, other robots and people in order to wisely plan and execute particular tasks. Although these interaction operations are inherently challenging as compared to free-flight missions, they might bring diverse advantages. One of them is their basic aerodynamic interaction during the flight in close proximities which can result in a reduction of the controller effort. In this study, by collecting real-time data, we have observed that the current drawn by the battery can be decreased while flying very close to the surroundings with the help of the ceiling effect. For the first time, this phenomenon is analyzed in terms of battery lifetime degradation by using a simple full equivalent cycle counting method. Results show that cycling related effect on battery degradation can be reduced by a 15.77% if the UAV can utilize ceiling effect.

1812.11315 2026-06-04 cs.RO cs.SY eess.SY 版本更新

On Infusing Reachability-Based Safety Assurance within Probabilistic Planning Frameworks for Human-Robot Vehicle Interactions

在人机车辆交互中将基于可达性的安全性保证融入概率规划框架

Karen Leung, Edward Schmerling, Mo Chen, John Talbot, J. Christian Gerdes, Marco Pavone

发表机构 * Department of Aeronautics and Astronautics(航空与航天系) Department of Mechanical Engineering(机械工程系) School of Computing Science(计算科学学院)

AI总结 本文提出一种最小干预的安全控制器,用于确保自动驾驶车辆与外部控制车辆的碰撞自由交互,通过实时控制器实现轨迹跟踪和安全保证,展示了在交通交织实验中自动驾驶车辆在对手车辆违反规划预期时仍能避免碰撞。

Comments Presented at the International Symposium on Experimental Robotics, Buenos Aires, Argentina, 2018

详情
AI中文摘要

动作预判、意图预测和主动行为都是交互场景中自动驾驶策略的 desirable 特性。然而,确保道路安全是关键挑战,其中一个重要问题是,在不影响规划器性能的情况下,必须考虑人类驾驶员行为的不确定性。本文介绍了一种最小干预的安全控制器,该控制器在自动驾驶车辆控制栈中运行,其作用是确保与外部控制(例如人工驾驶)的对手车辆的碰撞自由交互。我们利用可达性分析来构建一个实时(100Hz)的控制器,该控制器具有双重作用:(1)使用模型预测控制跟踪来自更高层规划算法的输入轨迹;(2)通过维持碰撞自由逃脱 maneuver 的可用性作为持续约束来确保安全,无论对方车辆未来采取何种行动。我们使用全规模的线控平台进行交通交织实验,其中两辆车最初并排,必须在有限的时间和距离内交换车道,模拟车辆汇入/汇出高速公路。我们证明,通过我们的控制栈,自动驾驶车辆能够在对手车辆违反规划预期并采取危险行动(无论是粗心还是有意碰撞)时避免碰撞,并且在必要时仅轻微偏离计划轨迹以维持安全。

英文摘要

Action anticipation, intent prediction, and proactive behavior are all desirable characteristics for autonomous driving policies in interactive scenarios. Paramount, however, is ensuring safety on the road --- a key challenge in doing so is accounting for uncertainty in human driver actions without unduly impacting planner performance. This paper introduces a minimally-interventional safety controller operating within an autonomous vehicle control stack with the role of ensuring collision-free interaction with an externally controlled (e.g., human-driven) counterpart. We leverage reachability analysis to construct a real-time (100Hz) controller that serves the dual role of (1) tracking an input trajectory from a higher-level planning algorithm using model predictive control, and (2) assuring safety through maintaining the availability of a collision-free escape maneuver as a persistent constraint regardless of whatever future actions the other car takes. A full-scale steer-by-wire platform is used to conduct traffic weaving experiments wherein the two cars, initially side-by-side, must swap lanes in a limited amount of time and distance, emulating cars merging onto/off of a highway. We demonstrate that, with our control stack, the autonomous vehicle is able to avoid collision even when the other car defies the planner's expectations and takes dangerous actions, either carelessly or with the intent to collide, and otherwise deviates minimally from the planned trajectory to the extent required to maintain safety.

1812.01532 2026-06-04 quant-ph cond-mat.dis-nn cs.MA cs.RO cs.SY eess.SY 版本更新

Control of automated guided vehicles without collision by quantum annealer and digital devices

通过量子退火器和数字设备控制无碰撞的自动导引车

Masayuki Ohzeki, Akira Miki, Masamichi J. Miyama, Masayoshi Terabe

发表机构 * Graduate School of Information Sciences, Tohoku University(东北大学信息科学研究生院) Institute of Innovative Research, Tokyo Institute of Technology(东京技术大学创新研究所) Electronics R & I Division, DENSO CORPORATION(DENSO公司电子研究与开发部)

AI总结 本文提出了一种优化问题,用于在无碰撞的情况下控制大量自动导引车,通过量子退火器和数字设备解决该问题,验证了其在车辆控制中的有效性。

Comments 12 pages, 4 figures, some typos are fixed

详情
AI中文摘要

我们提出了一个优化问题,用于在无碰撞的情况下控制大量自动导引车。该问题由二进制变量组成。一个二次成本函数使得我们可以利用数字计算机上的某些求解器以及最近开发的专用硬件,如D-Wave 2000Q和富士通数字退火器。在本研究中,我们考虑了日本的一个实际工厂,其中车辆在运行,并通过几种求解器评估了我们的方法在优化车辆方面的效率。我们确认,与传统方法相比,我们的方法可以有效地实现平滑控制,同时避免车辆之间的碰撞。此外,使用几种求解器进行的比较实验表明,D-Wave 2000Q可以作为一种快速求解器,在短时间内生成控制车辆的计划,尽管它只能处理少量车辆,而数字计算机可以快速解决相应的优化问题,即使有大量二进制变量。

英文摘要

We formulate an optimization problem to control a large number of automated guided vehicles in a plant without collision. The formulation consists of binary variables. A quadratic cost function over these variables enables us to utilize certain solvers on digital computers and recently developed purpose-specific hardware such as D-Wave 2000Q and the Fujitsu digital annealer. In the present study, we consider an actual plant in Japan, in which vehicles run, and assess efficiency of our formulation for optimizing the vehicles via several solvers. We confirm that our formulation can be a powerful approach for performing smooth control while avoiding collisions between vehicles, as compared to a conventional method. In addition, comparative experiments performed using several solvers reveal that D-Wave 2000Q can be useful as a rapid solver for generating a plan for controlling the vehicles in a short time although it deals only with a small number of vehicles, while a digital computer can rapidly solve the corresponding optimization problem even with a large number of binary variables.

1805.11706 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY stat.ML 版本更新

Supervised Policy Update for Deep Reinforcement Learning

深度强化学习中的监督策略更新

Quan Vuong, Yiming Zhang, Keith W. Ross

发表机构 * University of California, San Diego(加州大学圣地亚哥分校) New York University(纽约大学)

AI总结 本文提出了一种新的样本效率高的方法,称为监督策略更新(SPU),用于深度强化学习。该方法通过当前策略生成的数据,在非参数化的近端策略空间中构建并求解一个约束优化问题,然后利用监督回归将最优的非参数化策略转换为参数化策略,从而生成新的样本。该方法适用于离散和连续动作空间,并能处理多种接近约束。本文展示了如何通过该方法解决自然策略梯度和信任区域策略优化(NPG/TRPO)以及近端策略优化(PPO)问题。SPU的实现比TRPO更简单,在样本效率方面,实验表明SPU在Mujoco模拟机器人任务中优于TRPO,在Atari视频游戏任务中优于PPO。

Comments Accepted as a conference paper at ICLR 2019

详情
AI中文摘要

我们提出了一种新的样本效率高的方法,称为监督策略更新(SPU),用于深度强化学习。从当前策略生成的数据开始,SPU在非参数化的近端策略空间中构建并求解一个约束优化问题。利用监督回归,它将最优的非参数化策略转换为参数化策略,从而生成新的样本。该方法具有通用性,适用于离散和连续动作空间,并能处理多种接近约束。我们展示了如何通过该方法解决自然策略梯度和信任区域策略优化(NPG/TRPO)以及近端策略优化(PPO)问题。SPU的实现比TRPO更简单。在样本效率方面,我们的广泛实验表明,SPU在Mujoco模拟机器人任务中优于TRPO,在Atari视频游戏任务中优于PPO。

英文摘要

We propose a new sample-efficient methodology, called Supervised Policy Update (SPU), for deep reinforcement learning. Starting with data generated by the current policy, SPU formulates and solves a constrained optimization problem in the non-parameterized proximal policy space. Using supervised regression, it then converts the optimal non-parameterized policy to a parameterized policy, from which it draws new samples. The methodology is general in that it applies to both discrete and continuous action spaces, and can handle a wide variety of proximity constraints for the non-parameterized optimization problem. We show how the Natural Policy Gradient and Trust Region Policy Optimization (NPG/TRPO) problems, and the Proximal Policy Optimization (PPO) problem can be addressed by this methodology. The SPU implementation is much simpler than TRPO. In terms of sample efficiency, our extensive experiments show SPU outperforms TRPO in Mujoco simulated robotic tasks and outperforms PPO in Atari video game tasks.

1812.08280 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Extrinisic Calibration of a Camera-Arm System Through Rotation Identification

通过旋转识别进行相机-机械臂系统的外校准

Steve McGuire, Christoffer Heckman, Daniel Szafir, Simon Julier, Nisar Ahmed

发表机构 * Department of Aerospace Engineering Sciences, University of Colorado at Boulder(科罗拉多大学波尔德分校航空航天工程科学系)

AI总结 本文提出了一种基于观察臂的结构运动来恢复外校准参数的方法,结合已知的机械臂运动学和图像平面中的二次曲线观测,通过最大似然估计计算校准外参数,该方法在仿真中得到验证并在现实模型中测试,结果与尺子估计一致。

详情
AI中文摘要

确定外校准参数对于任何由执行器和相机组成的机器人系统都是必要的。一旦系统离开实验室环境,参数必须在不依赖外部校准目标的情况下确定。我们提出了一种方法,该方法依赖于观察臂的结构运动来恢复外校准参数。我们的方法结合已知的机械臂运动学和图像平面中二次曲线的观测,以计算校准外参数的最大似然估计。该方法在仿真中得到验证,并在现实模型中测试,结果与基于尺子的估计一致。我们的方法在不需繁琐测量或外部目标的情况下,展示了估计相机相对于连杆末端执行器姿态的潜力。

英文摘要

Determining extrinsic calibration parameters is a necessity in any robotic system composed of actuators and cameras. Once a system is outside the lab environment, parameters must be determined without relying on outside artifacts such as calibration targets. We propose a method that relies on structured motion of an observed arm to recover extrinsic calibration parameters. Our method combines known arm kinematics with observations of conics in the image plane to calculate maximum-likelihood estimates for calibration extrinsics. This method is validated in simulation and tested against a real-world model, yielding results consistent with ruler-based estimates. Our method shows promise for estimating the pose of a camera relative to an articulated arm's end effector without requiring tedious measurements or external artifacts. Index Terms: robotics, hand-eye problem, self-calibration, structure from motion

1812.06132 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Bernstein approximation of optimal control problems

伯恩斯坦逼近在最优控制问题中的应用

Venanzio Cichella, Isaac Kaminer, Claire Walton, Naira Hovakimyan, Antonio Pascoal

发表机构 * Department of Mechanical Engineering, University of Iowa(伊利诺伊大学厄巴纳-香槟分校机械科学与工程系) Department of Mechanical and Aerospace Engineering, Naval Postgraduate School(海军研究生院机械与航空航天工程系) Institute for Systems and Robotics (ISR), Instituto Superior Tecnico (IST), Univ. Lisbon, Portugal(葡萄牙里斯本大学系统与机器人研究所)

AI总结 本文提出了一种基于伯恩斯坦多项式逼近的直接方法,用于解决具有混合输入和状态约束的非线性最优控制问题,并展示了该方法在连续时间最优控制问题中的一致性以及在最优控制问题共轭变量估计中的应用,从而推导出伯恩斯坦多项式逼近的共向量映射定理。

详情
AI中文摘要

伯恩斯坦多项式对连续函数的逼近收敛速度比其他逼近方法慢。

英文摘要

Bernstein polynomial approximation to a continuous function has a slower rate of convergence as compared to other approximation methods. "The fact seems to have precluded any numerical application of Bernstein polynomials from having been made. Perhaps they will find application when the properties of the approximant in the large are of more importance than the closeness of the approximation." -- has remarked P.J. Davis in his 1963 book Interpolation and Approximation. This paper presents a direct approximation method for nonlinear optimal control problems with mixed input and state constraints based on Bernstein polynomial approximation. We provide a rigorous analysis showing that the proposed method yields consistent approximations of time continuous optimal control problems. Furthermore, we demonstrate that the proposed method can also be used for costate estimation of the optimal control problems. This latter result leads to the formulation of the Covector Mapping Theorem for Bernstein polynomial approximation. Finally, we explore the numerical and geometric properties of Bernstein polynomials, and illustrate the advantages of the proposed approximation method through several numerical examples.

1802.08678 2026-06-04 eess.SY cs.LG cs.RO cs.SY stat.ML 版本更新

Verifying Controllers Against Adversarial Examples with Bayesian Optimization

通过贝叶斯优化验证控制器对抗示例

Shromona Ghosh, Felix Berkenkamp, Gireeja Ranade, Shaz Qadeer, Ashish Kapoor

发表机构 * Microsoft Research, Redmond(微软研究院(红mond))

AI总结 本文提出基于贝叶斯优化的主动测试框架,用于验证控制器的安全性,通过逻辑定义安全约束并高效搜索行为空间以发现违反安全规范的对抗示例。

Comments Proc. of the IEEE International Conference on Robotics and Automation, 2018

详情
AI中文摘要

最近强化学习的成功促使开发了用于现实世界机器人的复杂控制器。由于这些机器人被部署在安全关键应用中并与人类交互,确保安全性以避免造成伤害变得至关重要。为此方向的一个初步步骤是测试控制器在仿真中的表现。为了做到这一点,我们需要明确安全的定义,然后高效地搜索所有行为空间以确定其安全性。在本文中,我们提出了一种基于贝叶斯优化的主动测试框架。我们使用逻辑指定安全约束,并利用问题中的结构来测试系统,以发现违反安全规范的对抗示例。这些规范被定义为轨迹上的光滑函数的复杂布尔组合,与强化学习中的奖励函数不同,它们是表达性强且对系统施加硬约束。在我们的框架中,我们利用单个函数的正则性假设,形式化为高斯过程(GP)先验。我们结合这些内容到一个连贯的优化框架中,利用问题结构。所得到的算法能够证明验证复杂的安全规范或找到对抗示例。实验结果表明,所提出的方法能够快速发现对抗示例。

英文摘要

Recent successes in reinforcement learning have lead to the development of complex controllers for real-world robots. As these robots are deployed in safety-critical applications and interact with humans, it becomes critical to ensure safety in order to avoid causing harm. A first step in this direction is to test the controllers in simulation. To be able to do this, we need to capture what we mean by safety and then efficiently search the space of all behaviors to see if they are safe. In this paper, we present an active-testing framework based on Bayesian Optimization. We specify safety constraints using logic and exploit structure in the problem in order to test the system for adversarial counter examples that violate the safety specifications. These specifications are defined as complex boolean combinations of smooth functions on the trajectories and, unlike reward functions in reinforcement learning, are expressive and impose hard constraints on the system. In our framework, we exploit regularity assumptions on individual functions in form of a Gaussian Process (GP) prior. We combine these into a coherent optimization framework using problem structure. The resulting algorithm is able to provably verify complex safety specifications or alternatively find counter examples. Experimental results show that the proposed method is able to find adversarial examples quickly.

1812.03216 2026-06-04 cs.LG cs.RO cs.SY eess.SY 版本更新

Zero-shot Deep Reinforcement Learning Driving Policy Transfer for Autonomous Vehicles based on Robust Control

基于鲁棒控制的零样本深度强化学习驾驶策略迁移用于自动驾驶车辆

Zhuo Xu, Chen Tang, Masayoshi Tomizuka

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 本文提出了一种基于鲁棒控制的零样本深度强化学习驾驶策略迁移方法,通过转移具体的运动学量来解决自动驾驶中源域与目标域之间的建模差距问题,采用可转移的分层强化学习轨迹规划器和基于扰动观测器的鲁棒跟踪控制器,验证了该方法在多个驾驶场景中的零样本迁移能力。

Comments Published at IEEE ITSC 2018

详情
AI中文摘要

尽管深度强化学习(深度RL)方法在应用于自动驾驶时具有诸多优势,但真实应用却受到源域(训练)与目标域(部署)之间建模差距的限制。与当前的策略迁移方法不同,本文提出转移具体的自动驾驶运动学量。所提出的基于鲁棒控制的(RC)通用迁移架构,称为RL-RC,结合了可转移的分层强化学习轨迹规划器和基于扰动观测器(DOB)的鲁棒跟踪控制器。通过训练已知的名义动力学模型的深度RL策略直接转移到目标域,DOB基于的鲁棒跟踪控制用于处理建模差距,包括车辆动力学误差和外部扰动如侧向力。我们提供了模拟验证所提出方法在多个驾驶场景如车道保持、车道变更和障碍物避让中的迁移能力。

英文摘要

Although deep reinforcement learning (deep RL) methods have lots of strengths that are favorable if applied to autonomous driving, real deep RL applications in autonomous driving have been slowed down by the modeling gap between the source (training) domain and the target (deployment) domain. Unlike current policy transfer approaches, which generally limit to the usage of uninterpretable neural network representations as the transferred features, we propose to transfer concrete kinematic quantities in autonomous driving. The proposed robust-control-based (RC) generic transfer architecture, which we call RL-RC, incorporates a transferable hierarchical RL trajectory planner and a robust tracking controller based on disturbance observer (DOB). The deep RL policies trained with known nominal dynamics model are transfered directly to the target domain, DOB-based robust tracking control is applied to tackle the modeling gap including the vehicle dynamics errors and the external disturbances such as side forces. We provide simulations validating the capability of the proposed method to achieve zero-shot transfer across multiple driving scenarios such as lane keeping, lane changing and obstacle avoidance.

1804.01013 2026-06-04 math.OC cs.RO cs.SY eess.SY stat.ML 版本更新

Resilient Non-Submodular Maximization over Matroid Constraints

基于Matroid约束的抗扰非子模最大化

Vasileios Tzoumas, Ali Jadbabaie, George J. Pappas

AI总结 本文研究了在传感器和执行器失效情况下,基于Matroid约束的控制与传感问题,提出了一种具有系统级鲁棒性、可扩展性和可证明近似界的新算法。

Comments arXiv admin note: substantial text overlap with arXiv:1803.07954. Correction on problem statement (Problem 1), and change in authors' info

详情
AI中文摘要

大规模系统的控制和传感导致了不仅在传感器和执行器布置上,而且在调度或可观测性/可控性上都出现组合问题。此类系统设计和实现中的组合约束可以利用一种称为Matroid的结构来捕捉。特别是,Matroid的代数结构可以被用来开发可扩展的算法用于传感器和执行器选择,以及具有可量化近似界。然而,在大规模系统中,传感器和执行器可能失效或可能被(网络-)攻击。本文的目标是关注在存在传感器和执行器失效情况下的Matroid约束问题。一般来说,鲁棒Matroid约束问题在计算上是困难的。与非鲁棒情况(无故障)相反,尽管它们通常涉及单调或子模目标函数,但尚无已知的可扩展近似算法。在本文中,我们提供了第一个算法,其具有以下特性:首先,它实现了系统级鲁棒性,即该算法适用于任何数量的拒绝服务攻击或故障。其次,它是可扩展的,因为我们的算法终止时的运行时间与最先进的非鲁棒Matroid约束优化算法相同。第三,它提供了对系统性能的可证明近似界,因为对于单调目标函数,我们的算法保证了接近最优的解。我们使用单调(不一定子模)集合函数的曲率概念来量化我们的算法的近似性能。最后,我们通过考虑一个控制感知的传感器选择场景,即受传感约束的机器人导航,来支持我们的理论分析。

英文摘要

The control and sensing of large-scale systems results in combinatorial problems not only for sensor and actuator placement but also for scheduling or observability/controllability. Such combinatorial constraints in system design and implementation can be captured using a structure known as matroids. In particular, the algebraic structure of matroids can be exploited to develop scalable algorithms for sensor and actuator selection, along with quantifiable approximation bounds. However, in large-scale systems, sensors and actuators may fail or may be (cyber-)attacked. The objective of this paper is to focus on resilient matroid-constrained problems arising in control and sensing but in the presence of sensor and actuator failures. In general, resilient matroid-constrained problems are computationally hard. Contrary to the non-resilient case (with no failures), even though they often involve objective functions that are monotone or submodular, no scalable approximation algorithms are known for their solution. In this paper, we provide the first algorithm, that also has the following properties: First, it achieves system-wide resiliency, i.e., the algorithm is valid for any number of denial-of-service attacks or failures. Second, it is scalable, as our algorithm terminates with the same running time as state-of-the-art algorithms for (non-resilient) matroid-constrained optimization. Third, it provides provable approximation bounds on the system performance, since for monotone objective functions our algorithm guarantees a solution close to the optimal. We quantify our algorithm's approximation performance using a notion of curvature for monotone (not necessarily submodular) set functions. Finally, we support our theoretical analyses with numerical experiments, by considering a control-aware sensor selection scenario, namely, sensing-constrained robot navigation.

1803.00444 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY stat.ML 版本更新

Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling

通过非参数时空子目标建模实现逆强化学习

Adrian Šošić, Elmar Rueckert, Jan Peters, Abdelhak M. Zoubir, Heinz Koeppl

发表机构 * Signal Processing Group(信号处理组) Institute for Robotics and Cognitive Systems(机器人与认知系统研究所) Autonomous Systems Labs(自主系统实验室) Bioinspired Communication Systems(生物启发通信系统)

AI总结 本文提出了一种基于非参数时空子目标建模的逆强化学习方法,通过局部上下文更高效地解释单条轨迹,实现更紧凑的行为表示,并构建隐式意图模型以预测未观察到的情况,从而在处理意图变化和主动学习场景中表现出色。

Comments 45 pages, 14 figures; ### Version 3 ### published in the Journal of Machine Learning Research

详情
AI中文摘要

逆强化学习(IRL)领域的发展导致了更复杂的推理框架,这些框架放宽了原始建模假设,即观察到的代理行为仅反映单一意图。相反于学习全局行为模型,最近的IRL方法将演示数据分成部分,以考虑不同轨迹可能对应不同意图,例如因为它们由不同领域专家生成。在本工作中,我们进一步采用子目标的直观概念,建立一个前提:即使单条轨迹在特定上下文中局部解释也比全局更高效,从而实现更紧凑的行为表示。基于这一假设,我们构建了代理目标的隐式意图模型,以预测未观察到的情况。结果是一种集成的贝叶斯预测框架,显著优于现有IRL解决方案,并提供与专家计划一致的平滑策略估计。最值得注意的是,我们的框架自然处理代理意图随时间变化的情况,而经典IRL算法失败。此外,由于其概率性质,该模型可以轻松应用于主动学习场景,以指导专家的演示过程。

英文摘要

Advances in the field of inverse reinforcement learning (IRL) have led to sophisticated inference frameworks that relax the original modeling assumption of observing an agent behavior that reflects only a single intention. Instead of learning a global behavioral model, recent IRL methods divide the demonstration data into parts, to account for the fact that different trajectories may correspond to different intentions, e.g., because they were generated by different domain experts. In this work, we go one step further: using the intuitive concept of subgoals, we build upon the premise that even a single trajectory can be explained more efficiently locally within a certain context than globally, enabling a more compact representation of the observed behavior. Based on this assumption, we build an implicit intentional model of the agent's goals to forecast its behavior in unobserved situations. The result is an integrated Bayesian prediction framework that significantly outperforms existing IRL solutions and provides smooth policy estimates consistent with the expert's plan. Most notably, our framework naturally handles situations where the intentions of the agent change over time and classical IRL algorithms fail. In addition, due to its probabilistic nature, the model can be straightforwardly applied in active learning scenarios to guide the demonstration process of the expert.

1811.11573 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Distributed Impedance Control of Latency-Prone Robotic Systems with Series Elastic Actuation

具有串联弹性执行器的延迟敏感机器人系统的分布式阻抗控制

Ye Zhao, Luis Sentis

AI总结 本文研究了具有串联弹性执行器的延迟敏感机器人系统的分布式阻抗控制问题,提出了一种关键阻尼增益设计方法,用于优化SEA级联控制架构的阻抗控制器设计,并通过频率域方法分析了时间延迟、滤波和负载惯量对SEA阻抗性能的影响。

Comments 24 pages, 16 figures. arXiv admin note: text overlap with arXiv:1501.02854

详情
AI中文摘要

机器人系统越来越多地依赖分布式反馈控制器来解决复杂且延迟敏感的传感和决策问题。这些需求带来了计算负担的增加,从而导致更大的控制器延迟。为了最大化对机械扰动的鲁棒性和实现高性能控制,我们强调需要在靠近控制对象的位置执行阻尼反馈,并在延迟敏感的集中式控制过程中分配刚度反馈。此外,串联弹性执行器(SEAs)近年来在力控机器人中变得越来越普遍,以实现与环境和人类的顺应性交互。然而,设计最优的阻抗控制器和表征具有时间延迟和滤波的SEAs的阻抗性能仍然是未充分研究的问题。本文通过设计一种关键阻尼增益设计方法,针对一类SEA级联控制架构(由外阻抗和内扭矩反馈环组成)解决最优控制器设计问题。通过所提出的控制器设计准则,我们采用频域方法对时间延迟、滤波和负载惯量对SEA阻抗性能的影响进行了深入分析。这些结果通过在高性能执行器和 omnidirectional 移动基座上的分析、仿真和实验测试进一步验证。

英文摘要

Robotic systems are increasingly relying on distributed feedback controllers to tackle complex and latency-prone sensing and decision problems. These demands come at the cost of a growing computational burden and, as a result, larger controller latencies. To maximize robustness to mechanical disturbances and achieve high control performance, we emphasize the necessity for executing damping feedback in close proximity to the control plant while allocating stiffness feedback in a latency-prone centralized control process. Additionally, series elastic actuators (SEAs) are becoming prevalent in torque-controlled robots during recent years to achieve compliant interactions with environments and humans. However, designing optimal impedance controllers and characterizing impedance performance for SEAs with time delays and filtering are still under-explored problems. The presented study addresses the optimal controller design problem by devising a critically-damped gain design method for a class of SEA cascaded control architectures, which is composed of outer-impedance and inner-torque feedback loops. Via the proposed controller design criterion, we adopt frequency-domain methods to thoroughly analyze the effects of time delays, filtering and load inertia on SEA impedance performance. These results are further validated through the analysis, simulation, and experimental testing on high-performance actuators and on an omnidirectional mobile base.

1809.07916 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Decentralized Optimal Merging Control for Connected and Automated Vehicles

去中心化最优合并控制用于联网自动化车辆

Wei Xiao, Christos G. Cassandras

发表机构 * Boston University(波士顿大学)

AI总结 本文研究了联网自动化车辆在合并点的最优控制问题,旨在共同最小化每辆车的行驶时间和能耗,同时保证速度相关的安全约束在合并点及控制区内的持续满足。通过分析无主动约束的情况,证明在特定条件下安全约束保持非活跃,从而简化了显式去中心化解的确定。当这些条件不适用时,仍能获得包含安全约束活跃区间的显式解。分析结果有助于研究行驶时间和控制区内的能耗之间的权衡。

Comments 16 pages, 2nd version, 20 figures

详情
AI中文摘要

本文针对联网自动化车辆(CAVs)从两条道路汇入合并点的最优控制问题,目标是共同最小化每辆CAV的行驶时间和能耗。解决方案保证在合并点及控制区内的速度相关安全约束始终满足。我们首先分析无主动约束的情况,证明在特定条件下安全约束保持非活跃,从而显著简化了显式去中心化解的确定。当这些条件不适用时,仍能获得包含安全约束活跃区间的显式解。我们的分析使我们能够研究行驶时间和控制区内的能耗之间的权衡。仿真示例用于比较最优控制器与由人工驾驶车辆组成的基线的性能,结果显示在两个指标上均有改进。

英文摘要

This paper addresses the optimal control of Connected and Automated Vehicles (CAVs) arriving from two roads at a merging point where the objective is to jointly minimize the travel time and energy consumption of each CAV. The solution guarantees that a speed-dependent safety constraint is always satisfied, both at the merging point and everywhere within a control zone which precedes it. We first analyze the case of no active constraints and prove that under certain conditions the safety constraint remains inactive, thus significantly simplifying the determination of an explicit decentralized solution. When these conditions do not apply, an explicit solution is still obtained that includes intervals over which the safety constraint is active. Our analysis allows us to study the tradeoff between the two objective function components (travel time and energy within the control zone). Simulation examples are included to compare the performance of the optimal controller to a baseline with human-driven vehicles with results showing improvements in both metrics.

1809.00367 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Momentum Model-based Minimal Parameter Identification of a Space Robot

基于动量模型的太空机器人最小参数识别

B. Naveen, Suril V. Shah, Arun K. Misra

发表机构 * Indian Institute of Technology, Jodhpur, Rajasthan, India(印度理工学院,朱达普尔,拉贾斯坦邦,印度) McGill University, Montreal, Quebec H3A 0C3, Canada(麦吉尔大学,蒙特利尔,魁北克 H3A 0C3,加拿大)

AI总结 本文提出了一种基于动量模型的最小参数识别方法,用于在轨识别太空机器人的最小参数,这些参数能够唯一确定动量和动力学模型,从而支持卫星及其安装的机械臂的运动规划和控制。

Comments Accepted for publication in AIAA Journal of Guidance, Control, and Dynamics

详情
AI中文摘要

准确的惯性参数信息对空间机器人的运动规划和控制至关重要。在发射前,仅能通过实验和计算机辅助设计(CAD)模型获得惯性参数的粗略估计。在发射后,轨道操作会显著改变惯性参数的值。本文提出了一种新的基于动量模型的方法,在轨识别太空机器人的最小参数。最小参数是链接的惯性参数的组合,并唯一定义动量和动力学模型。因此,它们对于卫星及其安装的机械臂的运动规划和控制是足够的。所提出框架的关键在于将动量模型以最小参数的线性形式进行唯一建模。进一步,为了估计最小参数,我们提出了一种基于关节速度方向组合的新型关节轨迹规划和优化技术。该识别框架的有效性在具有12个自由度的空间双臂机器人上得到验证。该方法适用于树型空间机器人,仅需要姿态和扭转数据,并且随着关节数量的增加而可扩展。

英文摘要

Accurate information of inertial parameters is critical to motion planning and control of space robots. Before the launch, only a rudimentary estimate of the inertial parameters is available from experiments and computer-aided design (CAD) models. After the launch, on-orbit operations substantially alter the value of inertial parameters. In this work, we propose a new momentum model-based method for identifying the minimal parameters of a space robot while on orbit. Minimal parameters are combinations of the inertial parameters of the links and uniquely define the momentum and dynamic models. Consequently, they are sufficient for motion planning and control of both the satellite and robotic arms mounted on it. The key to the proposed framework is the unique formulation of momentum model in the linear form of minimal parameters. Further, to estimate the minimal parameters, we propose a novel joint trajectory planning and optimization technique based on direction combinations of joints' velocity. The efficacy of the identification framework is demonstrated on a 12 degrees-of-freedom, spatial, dual-arm space robot. The methodology is developed for tree-type space robots, requires just the pose and twist data, and scalable with increasing number of joints.

1811.09914 2026-06-04 eess.SY cs.AI cs.MA cs.RO cs.SY 版本更新

RADMPC: A Fast Decentralized Approach for Chance-Constrained Multi-Vehicle Path-Planning

RADMPC:一种用于机会约束多车辆路径规划的快速去中心化方法

Aaron Huang, Benjamin J. Ayton, Brian C. Williams

发表机构 * Computer Science and Artificial Intelligence Laboratory(计算机科学与人工智能实验室) Massachusetts Institute of Technology(麻省理工学院)

AI总结 本文提出了一种基于去中心化路径规划方法RADMPC的快速机会约束多车辆路径规划方法,通过评估车辆交互来确定需要耦合规划的车辆集,并利用IRA在较小的车辆集上快速规划安全路径,从而显著提高计算效率。

详情
AI中文摘要

鲁棒的多车辆路径规划对于确保运输、搜索救援和机器人探索等应用中的多车辆系统安全性至关重要。迭代风险分配(IRA)等机会约束方法已被开发用于环境扰动无界的场景。然而,多车辆情况下的机会约束方法通常采用集中策略,其中所有车辆对之间存在耦合关系。随着车队规模的增加,这种策略变得不可行,因为计算时间与规划的车辆数呈指数增长,由于车辆对之间的耦合约束呈多项式增长。我们提出了一种更快的机会约束多车辆路径规划方法,该方法依赖于一种称为风险意识去中心化模型预测控制(RADMPC)的去中心化路径规划方法,以快速近似集中IRA方法。RADMPC近似通过评估车辆交互来确定应耦合规划的车辆集。将IRA应用于由RADMPC近似确定的较小车辆集上,能够快速为整个车队规划安全路径。蒙特卡洛模拟分析证明了我们方法的正确性,并与集中IRA方法相比显示出显著的计算时间改进。

英文摘要

Robust multi-vehicle path-planning is important for ensuring the safety of multi-vehicle systems in applications like transportation, search and rescue, and robotic exploration. Chance-constrained methods like Iterative Risk Allocation (IRA)\cite{IRA} have been developed for situations where environmental disturbances are unbounded. However, chance-constrained methods for the multi-vehicle case generally use centralized strategies where the vehicle set is planned with couplings between all vehicle pairs. This approach is intractable as fleet size increases because computation time is exponential with respect to the number of vehicles being planned over due to a polynomial increase in coupling constraints between vehicle pairs. We present a faster approach for chance-constrained multi-vehicle path-planning that relies upon a decentralized path-planning method called Risk-Aware Decentralized Model Predictive Control (RADMPC) to rapidly approximate a centralized IRA approach. The RADMPC approximation is evaluated for vehicle interactions to determine the vehicle sets that should be planned in a coupled manner. Applying IRA to the smaller vehicle sets determined from the RADMPC approximation rapidly plans safe paths for the entire fleet. A Monte Carlo simulation analysis demonstrates the correctness of our approach and a significant improvement in computation time compared to a centralized IRA approach.

1811.06350 2026-06-04 eess.SY cs.MA cs.RO cs.SC cs.SY math.OC 版本更新

Temporal viability regulation for control affine systems with applications to mobile vehicle coordination under time-varying motion constraints

时间可行性调节用于控制仿射系统及其在时间变化运动约束下移动车辆协调中的应用

Marcus Greiff, Zhiyong Sun, Anders Robertsson, Rolf Johansson

发表机构 * LCCC Linnaeus Center(LCCC 林纳厄中心) ELLIIT Excellence Center(ELLIIT 卓越中心) Lund University(Lund 大学)

AI总结 本文提出了一种时间可行性调节理论,用于一般动态控制系统,特别是控制仿射系统,通过时间变化约束函数参数化时间变化可行集,以确保动态控制系统在时间变化可行集中不变,从而执行时间依赖状态约束。同时,本文还给出了控制仿射系统可行控制输入存在的充分条件,并将该理论应用于移动车辆协调。

Comments 7 pages, 3 figures. Submitted to a conference for publication

详情
AI中文摘要

受控不变集和动态控制系统的可行性调节在许多控制和协调应用中发挥了重要作用。本文开发了一种针对一般动态控制系统的时序可行性调节理论,特别是针对控制仿射系统。时间变化可行集由时间变化约束函数参数化,旨在将动态控制系统调节为在时间变化可行集中不变,以执行时间依赖的状态约束。我们考虑了在定义时间可行集时的时间变化等式和不等式约束。我们还提出了控制仿射系统可行控制输入存在的充分条件。所开发的时序可行性调节理论应用于移动车辆协调。

英文摘要

Controlled invariant set and viability regulation of dynamical control systems have played important roles in many control and coordination applications. In this paper we develop a temporal viability regulation theory for general dynamical control systems, and in particular for control affine systems. The time-varying viable set is parameterized by time-varying constraint functions, with the aim to regulate a dynamical control system to be invariant in the time-varying viable set so that temporal state-dependent constraints are enforced. We consider both time-varying equality and inequality constraints in defining a temporal viable set. We also present sufficient conditions for the existence of feasible control input for the control affine systems. The developed temporal viability regulation theory is applied to mobile vehicle coordination.

1808.00113 2026-06-04 eess.SY cs.LG cs.RO cs.SY math.OC 版本更新

Learning Stabilizable Dynamical Systems via Control Contraction Metrics

通过控制收缩度量学习可稳定化的动态系统

Sumeet Singh, Vikas Sindhwani, Jean-Jacques E. Slotine, Marco Pavone

发表机构 * Dept. of Aeronautics and Astronautics, Stanford University(航空航天系,斯坦福大学) Google Brain Robotics, New York(谷歌大脑机器人,纽约) Dept. of Mechanical Engineering, Massachusetts Institute of Technology(机械工程系,麻省理工学院)

AI总结 本文提出了一种新的框架,用于学习可稳定化的非线性动态系统,以实现机器人连续控制任务。核心方法是开发一种基于稳定性的控制理论正则化器,以确保学习到的系统可以配备一个稳健的控制器,能够稳定任何系统生成的开环轨迹。通过利用收缩理论、统计学习和凸优化工具,我们提供了一个通用且可操作的半监督算法来学习可稳定化的动态系统,可以应用于复杂的欠驱动系统。在模拟平面四旋翼系统上验证了所提算法,并观察到与传统回归技术学习的模型相比,使用控制理论正则化模型在轨迹生成和跟踪性能上有显著改进,尤其是在使用少量示范示例时。结果展示了将标准基于模型的强化学习算法与非线性控制理论概念结合的必要性,以提高可靠性。

Comments To appear at WAFR 2018. v2: re-structured Sections 3 & 4 to improve clarity; expanded discussion on limitations & future work in Section 5; added details on training & validation, significantly expanded experiments

详情
AI中文摘要

我们提出了一种新的框架,用于学习可稳定化的非线性动态系统,以实现机器人连续控制任务。关键思想是开发一种基于稳定性的控制理论正则化器,用于动态拟合,该正则化器保证所学习的系统可以配备一个稳健的控制器,能够稳定任何系统可能生成的开环轨迹。通过利用收缩理论、统计学习和凸优化工具,我们提供了一个通用且可操作的半监督算法来学习可稳定化的动态系统,可以应用于复杂的欠驱动系统。我们在模拟平面四旋翼系统上验证了所提算法,并观察到与传统回归技术学习的模型相比,使用控制理论正则化模型在轨迹生成和跟踪性能上有显著改进,尤其是在使用少量示范示例时。所呈现的结果展示了将标准基于模型的强化学习算法与非线性控制理论概念结合的必要性,以提高可靠性。

英文摘要

We propose a novel framework for learning stabilizable nonlinear dynamical systems for continuous control tasks in robotics. The key idea is to develop a new control-theoretic regularizer for dynamics fitting rooted in the notion of stabilizability, which guarantees that the learned system can be accompanied by a robust controller capable of stabilizing any open-loop trajectory that the system may generate. By leveraging tools from contraction theory, statistical learning, and convex optimization, we provide a general and tractable semi-supervised algorithm to learn stabilizable dynamics, which can be applied to complex underactuated systems. We validated the proposed algorithm on a simulated planar quadrotor system and observed notably improved trajectory generation and tracking performance with the control-theoretic regularized model over models learned using traditional regression techniques, especially when using a small number of demonstration examples. The results presented illustrate the need to infuse standard model-based reinforcement learning algorithms with concepts drawn from nonlinear control theory for improved reliability.

1803.08287 2026-06-04 eess.SY cs.AI cs.LG cs.RO cs.SY 版本更新

Learning-based Model Predictive Control for Safe Exploration

基于学习的模型预测控制用于安全探索

Torsten Koller, Felix Berkenkamp, Matteo Turchetta, Andreas Krause

发表机构 * Vector Institute(向量研究所) Max Planck ETH Center for Learning Systems(马克斯·普朗克-ETH学习系统中心)

AI总结 本文提出了一种基于学习的模型预测控制方法,通过高斯过程先验假设构建可证明准确的轨迹置信区间,从而提供可证明的高概率安全保证,用于动态系统的安全高效探索和学习。

Comments Proc. of the Conference on Decision and Control, 2018

详情
AI中文摘要

基于学习的方法在没有显著系统先验知识的情况下成功解决了复杂控制任务。然而,这些方法通常不提供任何安全保证,这限制了它们在安全关键的现实应用中的使用。在本文中,我们提出了一种基于学习的模型预测控制方案,可以提供可证明的高概率安全保证。为此,我们利用高斯过程先验对动态特性进行正则性假设,以构建可证明准确的预测轨迹置信区间。与以往的方法不同,我们不假设模型不确定性是独立的。基于这些预测,我们保证轨迹满足安全约束。此外,我们使用终端集约束递归地保证在每个迭代中都存在安全的控制动作。在我们的实验中,我们展示了所提出算法可以安全且高效地探索和学习动态系统。

英文摘要

Learning-based methods have been successful in solving complex control tasks without significant prior knowledge about the system. However, these methods typically do not provide any safety guarantees, which prevents their use in safety-critical, real-world applications. In this paper, we present a learning-based model predictive control scheme that can provide provable high-probability safety guarantees. To this end, we exploit regularity assumptions on the dynamics in terms of a Gaussian process prior to construct provably accurate confidence intervals on predicted trajectories. Unlike previous approaches, we do not assume that model uncertainties are independent. Based on these predictions, we guarantee that trajectories satisfy safety constraints. Moreover, we use a terminal set constraint to recursively guarantee the existence of safe control actions at every iteration. In our experiments, we show that the resulting algorithm can be used to safely and efficiently explore and learn about dynamic systems.

1811.01774 2026-06-04 cs.SE cs.RO cs.SY eess.SY 版本更新

SCAV'18: Report of the 2nd International Workshop on Safe Control of Autonomous Vehicles

SCAV'18: 第二届安全控制自主车辆国际研讨会报告

Mario Gleirscher, Sven Linker, Stefan Kugele

发表机构 * University of York, UK(英国约克大学) Technical University of Munich, Germany(德国慕尼黑技术大学) University of Liverpool, UK(英国利兹大学)

AI总结 本文总结了第二届SCAV研讨会的讨论、开放问题、关键信息和结论,探讨了自主车辆安全控制的核心挑战与未来方向。

Comments 3 pages, 1 table

详情
AI中文摘要

本报告总结了第二届SCAV研讨会的讨论、开放问题、关键信息和结论。

英文摘要

This report summarizes the discussions, open issues, take-away messages, and conclusions of the 2nd SCAV workshop.

1806.06498 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Conditional Affordance Learning for Driving in Urban Environments

面向城市环境的条件性 affordance 学习

Axel Sauer, Nikolay Savinov, Andreas Geiger

发表机构 * Chair of Robotics Science and System Intelligence, Technical University of Munich(慕尼黑技术大学机器人科学与系统智能系)

AI总结 本文提出了一种直接感知方法,通过将视频输入映射到适合复杂城市环境自主导航的中间表示,结合高层方向输入,实现了比现有强化学习和条件模仿学习方法更高的目标导向导航性能,并首次通过图像级标签处理交通灯和速度标志,显著减少模拟中的交通事故。

Comments Accepted for Conference on Robot Learning (CoRL) 2018

详情
AI中文摘要

大多数现有的自动驾驶方法分为两类:模块化流水线,通过构建环境的详尽模型,以及模仿学习方法,直接将图像映射到控制输出。最近提出的一种第三范式,直接感知,旨在通过神经网络学习适当的低维中间表示来结合两者的优点。然而,现有的直接感知方法仅限于简单的高速公路场景,缺乏在交叉路口导航、在交通灯前停止或遵守速度限制的能力。在本文中,我们提出了一种直接感知方法,将视频输入映射到适合复杂城市环境自主导航的中间表示,给定高层方向输入。与最先进的强化学习和条件模仿学习方法相比,在具有挑战性的CARLA模拟基准上,我们实现了高达68%的目标导向导航改进。此外,我们的方法是首次通过仅使用图像级标签来处理交通灯和速度标志,从而在模拟中显著减少交通事故。

英文摘要

Most existing approaches to autonomous driving fall into one of two categories: modular pipelines, that build an extensive model of the environment, and imitation learning approaches, that map images directly to control outputs. A recently proposed third paradigm, direct perception, aims to combine the advantages of both by using a neural network to learn appropriate low-dimensional intermediate representations. However, existing direct perception approaches are restricted to simple highway situations, lacking the ability to navigate intersections, stop at traffic lights or respect speed limits. In this work, we propose a direct perception approach which maps video input to intermediate representations suitable for autonomous navigation in complex urban environments given high-level directional inputs. Compared to state-of-the-art reinforcement and conditional imitation learning approaches, we achieve an improvement of up to 68 % in goal-directed navigation on the challenging CARLA simulation benchmark. In addition, our approach is the first to handle traffic lights and speed signs by using image-level labels only, as well as smooth car-following, resulting in a significant reduction of traffic accidents in simulation.

1811.00426 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Improving the Modularity of AUV Control Systems using Behaviour Trees

使用行为树提高水下机器人控制系统的模块化程度

Christopher Iliffe Sprague, Özer Özkahraman, Andrea Munafo, Rachel Marlow, Alexander Phillips, Petter Ögren

发表机构 * Robotics, Perception and Learning Lab(机器人、感知与学习实验室) Royal Institute of Technology(皇家理工学院) National Oceanography Centre(国家海洋学研究中心)

AI总结 本文展示如何利用行为树设计模块化、多功能且稳健的控制架构,用于关键任务系统,特别针对自主水下机器人。研究强调了系统安全的稳健性、执行多种任务的多功能性以及模块化在结合稳健性和多功能性中的重要性。

Comments Submitted to 2018 IEEE OES Autonomous Underwater Vehicle Symposium

详情
AI中文摘要

在本文中,我们展示了行为树(BTs)如何用于设计模块化、多功能且稳健的控制架构,用于关键任务系统。特别是,我们在此背景下展示了自主水下机器人(AUVs)的应用。在系统安全方面,稳健性很重要,因为手动恢复AUVs往往非常困难。此外,多功能性对于执行多种不同任务至关重要。最后,模块化是实现稳健性和多功能性结合所必需的,因为多功能系统的复杂性需要封装在模块中,以便创建一个简单的整体结构,从而实现稳健性分析。所提出的设计通过典型的AUV任务进行了说明。

英文摘要

In this paper, we show how behaviour trees (BTs) can be used to design modular, versatile, and robust control architectures for mission-critical systems. In particular, we show this in the context of autonomous underwater vehicles (AUVs). Robustness, in terms of system safety, is important since manual recovery of AUVs is often extremely difficult. Further more, versatility is important to be able to execute many different kinds of missions. Finally, modularity is needed to achieve a combination of robustness and versatility, as the complexity of a versatile systems needs to be encapsulated in modules, in order to create a simple overall structure enabling robustness analysis. The proposed design is illustrated using a typical AUV mission.

1810.13087 2026-06-04 cs.RO cs.FL cs.SY eess.SY math.OC 版本更新

Multirobot Coordination with Counting Temporal Logics

多机器人协调与计数时序逻辑

Yunus Emre Sahin, Petter Nilsson, Necmiye Ozay

发表机构 * Department of Electrical Engineering and Computer Science(电气工程与计算机科学系) University of Michigan(密歇根大学) Department of Mechanical and Civil Engineering(机械与土木工程系) California Institute of Technology(加州理工学院)

AI总结 本文提出了一种基于计数时序逻辑的多机器人协调方法,通过优化算法生成轨迹以保证同步执行时满足给定的逻辑公式,并展示了在机器人动力学相同的情况下,使用计数线性时序逻辑(cLTL)能更高效地解决规划问题,同时讨论了异步执行下保持逻辑规范性的方法及鲁棒轨迹生成。

Comments Under submission for a journal

详情
AI中文摘要

在许多多机器人应用中,规划轨迹以确保机器人集体行为满足某种高层规范至关重要。受此问题启发,我们引入了计数时序逻辑——一种能够简洁表达多机器人任务规范的正式语言,适用于可能无限的时域。我们首先介绍了一种通用逻辑,称为计数线性时序逻辑加(cLTL+),并提出了一种基于优化的方法,生成个体轨迹,使得在这些轨迹同步执行时满足给定的cLTL+公式。我们随后介绍cLTL+的一个片段,称为计数线性时序逻辑(cLTL),并展示当所有机器人具有相同动力学时,使用cLTL约束的规划问题的解决方案可以更高效地获得。在论文的第二部分,我们放松同步假设,讨论如何生成可以在异步执行下保持所需cLTL+规范性的轨迹。特别是,我们证明当机器人之间的异步性受限制时,本文提出的方法可以修改以生成鲁棒轨迹。我们通过实验演示这些想法,并提供数值结果,展示该方法的可扩展性。

英文摘要

In many multirobot applications, planning trajectories in a way to guarantee that the collective behavior of the robots satisfies a certain high-level specification is crucial. Motivated by this problem, we introduce counting temporal logics---formal languages that enable concise expression of multirobot task specifications over possibly infinite horizons. We first introduce a general logic called counting linear temporal logic plus (cLTL+), and propose an optimization-based method that generates individual trajectories such that satisfaction of a given cLTL+ formula is guaranteed when these trajectories are synchronously executed. We then introduce a fragment of cLTL+, called counting linear temporal logic (cLTL), and show that a solution to planning problem with cLTL constraints can be obtained more efficiently if all robots have identical dynamics. In the second part of the paper, we relax the synchrony assumption and discuss how to generate trajectories that can be asynchronously executed, while preserving the satisfaction of the desired cLTL+ specification. In particular, we show that when the asynchrony between robots is bounded, the method presented in this paper can be modified to generate robust trajectories. We demonstrate these ideas with an experiment and provide numerical results that showcase the scalability of the method.

1810.13072 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Formal Verification of Neural Network Controlled Autonomous Systems

神经网络控制自主系统的形式验证

Xiaowu Sun, Haitham Khedr, Yasser Shoukry

发表机构 * Department of Electrical Computer Engineering University of Maryland, College Park

AI总结 本文研究了如何形式验证配备神经网络控制器的自主机器人在LiDAR图像处理中安全性的核心问题,通过构建有限状态抽象并利用可达性分析计算安全的初始条件,提出了一种多项式时间算法来分区工作空间并计算对应的仿射成像函数,同时利用SMC编码分析神经网络行为,通过数值模拟验证了算法的效率。

详情
AI中文摘要

在本文中,我们考虑了正式验证配备神经网络(NN)控制器的自主机器人在处理LiDAR图像以产生控制动作时的安全性问题。给定一个由一组多边形障碍物特征化的工作空间,我们的目标是计算一组安全的初始条件,使得从这些初始条件出发的机器人轨迹能够保证避开障碍物。我们的方法是构建系统的有限状态抽象,并利用标准的可达性分析在有限状态抽象上计算安全的初始状态集。计算有限状态抽象的第一个技术问题是数学建模将机器人位置映射到LiDAR图像的成像函数。为此,我们引入了成像适应集的概念,作为工作空间的分区,在这些分区中,成像函数被保证为仿射的。我们开发了一种多项式时间算法,用于将工作空间划分为成像适应集并计算相应的仿射成像函数。给定这种工作空间分区,机器人的离散时间线性动力学以及一个预训练的具有修正线性单元(ReLU)非线性的神经网络控制器,第二个技术挑战是分析神经网络的行为。为此,我们利用满足模凸(SMC)编码来枚举所有可能的ReLU段落。SMC求解器随后使用布尔可满足性求解器和凸优化求解器,将问题分解为更小的子问题。为了加速这个过程,我们开发了一种预处理算法,可以快速修剪可行的ReLU段落。最后,我们通过数值模拟验证了所提出算法的效率,模拟中神经网络控制器的复杂性逐渐增加。

英文摘要

In this paper, we consider the problem of formally verifying the safety of an autonomous robot equipped with a Neural Network (NN) controller that processes LiDAR images to produce control actions. Given a workspace that is characterized by a set of polytopic obstacles, our objective is to compute the set of safe initial conditions such that a robot trajectory starting from these initial conditions is guaranteed to avoid the obstacles. Our approach is to construct a finite state abstraction of the system and use standard reachability analysis over the finite state abstraction to compute the set of the safe initial states. The first technical problem in computing the finite state abstraction is to mathematically model the imaging function that maps the robot position to the LiDAR image. To that end, we introduce the notion of imaging-adapted sets as partitions of the workspace in which the imaging function is guaranteed to be affine. We develop a polynomial-time algorithm to partition the workspace into imaging-adapted sets along with computing the corresponding affine imaging functions. Given this workspace partitioning, a discrete-time linear dynamics of the robot, and a pre-trained NN controller with Rectified Linear Unit (ReLU) nonlinearity, the second technical challenge is to analyze the behavior of the neural network. To that end, we utilize a Satisfiability Modulo Convex (SMC) encoding to enumerate all the possible segments of different ReLUs. SMC solvers then use a Boolean satisfiability solver and a convex programming solver and decompose the problem into smaller subproblems. To accelerate this process, we develop a pre-processing algorithm that could rapidly prune the space feasible ReLU segments. Finally, we demonstrate the efficiency of the proposed algorithms using numerical simulations with increasing complexity of the neural network controller.

1706.08932 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Iterative Sequential Action Control for Stable, Model-Based Control of Nonlinear Systems

迭代序列动作控制用于非线性系统的稳定模型控制

Emmanouil Tzorakoleftherakis, Todd Murphey

发表机构 * Neuroscience and Robotics Laboratory (N×R)(神经科学与机器人实验室)

AI总结 本文提出了一种迭代序列动作控制(iSAC)方法,用于非线性系统的控制,该方法通过在时间步之间迭代更新常数控制值来获得闭环渐近稳定性,并探讨了渐近衰减扰动对系统轨迹的影响。

详情
Journal ref
IEEE Transactions on Automatic Control, 2018
AI中文摘要

本文提出了迭代序列动作控制(iSAC),一种用于非线性系统的递推时间窗口控制方法。iSAC方法具有闭式开环解,通过在时间步之间迭代更新,引入常数控制值用于短时间应用。在成本上应用收缩约束被证明在温和假设下可以实现闭环渐近稳定性。还研究了渐近衰减扰动对系统轨迹的影响。为了展示iSAC在各种系统和条件下的适用性,我们采用了五个不同的系统,包括一个基于四元数的13维四旋翼。每个系统在不同的场景中进行测试,从可行和不可行的轨迹跟踪到设定点稳定化,有或没有外部扰动的存在。最后讨论了该工作的局限性。

英文摘要

This paper presents iterative Sequential Action Control (iSAC), a receding horizon approach for control of nonlinear systems. The iSAC method has a closed-form open-loop solution, which is iteratively updated between time steps by introducing constant control values applied for short duration. Application of a contractive constraint on the cost is shown to lead to closed-loop asymptotic stability under mild assumptions. The effect of asymptotically decaying disturbances on system trajectories is also examined. To demonstrate the applicability of iSAC to a variety of systems and conditions, we employ five different systems, including a 13-dimensional quaternion-based quadrotor. Each system is tested in different scenarios, ranging from feasible and infeasible trajectory tracking, to setpoint stabilization, with or without the presence of external disturbances. Finally, limitations of this work are discussed.

1802.00285 2026-06-04 cs.CV cs.RO cs.SY eess.SY 版本更新

Virtual-to-Real: Learning to Control in Visual Semantic Segmentation

虚拟到现实:学习在视觉语义分割中的控制

Zhang-Wei Hong, Chen Yu-Ming, Shih-Yang Su, Tzu-Yun Shann, Yi-Hsiang Chang, Hsuan-Kung Yang, Brian Hsi-Lin Ho, Chih-Chieh Tu, Yueh-Chuan Chang, Tsu-Ching Hsiao, Hsin-Wei Hsiao, Sih-Pin Lai, Chun-Yi Lee

发表机构 * Elsa Lab(Elsa实验室) Department of Computer Science(计算机科学系) National Tsing Hua University(国立清华大学)

AI总结 本文提出了一种模块化架构,通过将感知模块和控制策略模块结合,利用语义图像分割作为元表示,解决虚拟到现实的迁移问题,并在障碍避让和目标跟随任务中展示了优越的性能。

Comments 7 pages, accepted by IJCAI-18

详情
AI中文摘要

从物理世界收集训练数据通常是耗时且甚至对脆弱机器人来说是危险的,因此最近的机器人学习进展倡导使用模拟器作为训练平台。不幸的是,合成与真实视觉数据之间的现实差距阻止了在虚拟世界中训练的模型直接迁移到现实世界。本文提出了一种模块化架构来解决虚拟到现实的问题。所提出的架构将学习模型分为感知模块和控制策略模块,并使用语义图像分割作为这些模块之间关联的元表示。感知模块将感知的RGB图像转换为语义图像分割。控制策略模块实现为一个深度强化学习代理,根据转换后的图像分割执行动作。我们的架构在避障任务和目标跟随任务中进行了评估。实验结果表明,我们的架构在虚拟和现实环境中均显著优于所有基线方法,并且比它们具有更快的学习曲线。我们还对各种变体配置进行了详细分析,并验证了我们模块化架构的可迁移性。

英文摘要

Collecting training data from the physical world is usually time-consuming and even dangerous for fragile robots, and thus, recent advances in robot learning advocate the use of simulators as the training platform. Unfortunately, the reality gap between synthetic and real visual data prohibits direct migration of the models trained in virtual worlds to the real world. This paper proposes a modular architecture for tackling the virtual-to-real problem. The proposed architecture separates the learning model into a perception module and a control policy module, and uses semantic image segmentation as the meta representation for relating these two modules. The perception module translates the perceived RGB image to semantic image segmentation. The control policy module is implemented as a deep reinforcement learning agent, which performs actions based on the translated image segmentation. Our architecture is evaluated in an obstacle avoidance task and a target following task. Experimental results show that our architecture significantly outperforms all of the baseline methods in both virtual and real environments, and demonstrates a faster learning curve than them. We also present a detailed analysis for a variety of variant configurations, and validate the transferability of our modular architecture.

1808.03143 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Hybrid Dynamic-regenerative Damping Scheme for Energy Regeneration in Variable Impedance Actuators

一种混合动态再生阻尼方案用于变量阻抗执行器的能量再生

Fan Wu, Matthew Howard

AI总结 本文提出了一种新的可变阻尼模块设计,通过利用直流电机的再生制动效应,实现变量阻抗执行器的能量再生,通过仿真和实验验证了该方案在任务性能和能量效率之间的最优平衡。

Comments Accepted to IEEE International Conference on Robotics and Automation (ICRA), 2018

详情
AI中文摘要

越来越多的研究致力于通过减少能耗来提高变量阻抗执行器(VIAs)的能量效率。然而,此类系统中耗散能量的回收仍处于探索阶段。本研究提出了一种新的可变阻尼模块设计,通过利用直流电机的再生制动效应,实现VIAs中的能量再生。所提出的阻尼模块使用四个开关结合再生和动态制动,采用混合方法,在不减少可实现阻尼范围的情况下实现能量再生。数值仿真和物理实验表明,所提出的模块在任务性能和能量效率之间实现了最佳平衡。

英文摘要

Increasing research efforts have been made to improve the energy efficiency of variable impedance actuators (VIAs) through reduction of energy consumption. However, the harvesting of dissipated energy in such systems remains underexplored. This study proposes a novel variable damping module design enabling energy regeneration in VIAs by exploiting the regenerative braking effect of DC motors. The proposed damping module uses four switches to combine regenerative and dynamic braking, in a hybrid approach that enables energy regeneration without reduction in the range of damping achievable. Numerical simulations and a physical experiment are presented in which the proposed module shows an optimal trade-off between task performance and energy efficiency.

1810.09929 2026-06-04 eess.SP cs.RO cs.SY eess.SY 版本更新

Teleoperated Robotic Arm Movement Using EMG Signal With Wearable MYO Armband

使用可穿戴MYO臂带的电信号实现远程操控机械臂运动

Hussein F. Hassan, Sadiq J. Abou-Loukh, Ibraheem Kasim Ibraheem

发表机构 * University of Baghdad, College of Engineering, Department of Electrical Engineering(巴格达大学,工程学院,电气工程系)

AI总结 本研究通过分析表面肌电信号,利用可穿戴MYO臂带区分七种手部运动,采用模式识别系统实现机械臂的实时控制,其中SVM分类器在准确率上达到96.57%。

详情
AI中文摘要

本研究的主要目的是基于通过无线Myo手势臂带获取的表面肌电信号(sEMG)实时控制五自由度机械臂,以区分七种手部运动。sEMG信号是生物电信号,用于估计和记录肌肉收缩和放松过程中产生的电信号,代表神经肌肉活动。因此,通过人体手臂肌肉利用sEMG信号来控制机械臂被视为一种重要的方法。无线Myo手势臂带用于从前臂记录sEMG信号。为了分析这些信号,采用了模式识别系统,该系统由三个主要部分组成:分段、特征提取和分类。重叠技术用于分段信号。从每个分段中提取六个时域特征(MAV、WL、RMS、AR、ZC和SSC)。采用分类器(SVM、LDA和KNN)以比较它们,以获得系统的最佳准确率。结果表明,SVM在准确率上达到96.57%,优于LDA的96.01%和KNN的92.67%。

英文摘要

The main purpose of this research is to move the robotic arm (5DoF) in real-time, based on the surface Electromyography (sEMG) signals, as obtained from the wireless Myo gesture armband to distinguish seven hand movements. The sEMG signals are biomedical signals that estimate and record the electrical signals produced in muscles through their contraction and relaxation, representing neuromuscular activities. Therefore, controlling the robotic arm via the muscles of the human arm using sEMG signals is considered to be one of the most significant methods. The wireless Myo gesture armband is used to record sEMG signals from the forearm. In order to analyze these signals, the pattern recognition system is employed, which consists of three main parts: segmentation, feature extraction, and classification. Overlap technique is chosen for segmenting part of the signal. Six time domain features (MAV, WL, RMS, AR, ZC, and SSC) are extracted from each segment. The classifiers (SVM, LDA, and KNN) are employed to enable comparison between them in order to obtain optimum accuracy of the system. The results show that the SVM achieves higher system accuracy at 96.57 %, compared to LDA reaching 96.01 %, and 92.67 % accuracy achieved by KNN.

1810.09729 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Design Challenges of Multi-UAV Systems in Cyber-Physical Applications: A Comprehensive Survey, and Future Directions

多无人机系统在网络物理应用中的设计挑战:综述与未来方向

Reza Shakeri, Mohammed Ali Al-Garadi, Ahmed Badawy, Amr Mohamed, Tamer Khattab, Abdulla Al-Ali, Khaled A. Harras, Mohsen Guizani

发表机构 * Carnegie Mellon University Qatar Campus(卡塔尔分校卡内基梅隆大学)

AI总结 本文综述了多无人机系统在网络物理应用中的关键设计挑战,探讨了目标和基础设施对象的覆盖与跟踪、能量高效导航以及基于机器学习的图像分析等核心方法,并提出了面向细粒度网络物理应用的先进算法和未来研究方向。

详情
AI中文摘要

无人驾驶飞行器(UAVs)近年来迅速发展,为一系列创新应用提供了支持,这些应用有可能从根本上改变网络物理系统(CPSs)的设计方式。CPSs 是一种现代系统,具有计算和物理潜力的协同作用,能够通过多种新机制与人类交互。使用 UAVs 在 CPS 应用中的主要优势在于其卓越的特性,包括机动性、动态性、易于部署、适应高度、敏捷性、可调节性和随时在任何地方有效评估现实功能的能力。此外,从技术角度来看,UAVs 被预测将成为高级 CPSs 发展的重要元素。因此,在本次综述中,我们旨在确定多 UAV 系统在 CPS 应用中最基本和重要的设计挑战。我们强调了关键且多方面的内容,涵盖目标和基础设施对象的覆盖与跟踪、能量高效的导航以及使用机器学习进行图像分析以支持细粒度的 CPS 应用。此外,还研究了关键原型和测试平台,以展示这些实用技术如何促进 CPS 应用。我们提出了面向设计挑战的最先进算法,结合定量和定性方法,并将这些挑战与重要的 CPS 应用映射,以得出关于每个应用挑战的深入结论。最后,我们总结了可能的新方向和想法,这些可能会影响这些领域的未来研究。

英文摘要

Unmanned Aerial Vehicles (UAVs) have recently rapidly grown to facilitate a wide range of innovative applications that can fundamentally change the way cyber-physical systems (CPSs) are designed. CPSs are a modern generation of systems with synergic cooperation between computational and physical potentials that can interact with humans through several new mechanisms. The main advantages of using UAVs in CPS application is their exceptional features, including their mobility, dynamism, effortless deployment, adaptive altitude, agility, adjustability, and effective appraisal of real-world functions anytime and anywhere. Furthermore, from the technology perspective, UAVs are predicted to be a vital element of the development of advanced CPSs. Therefore, in this survey, we aim to pinpoint the most fundamental and important design challenges of multi-UAV systems for CPS applications. We highlight key and versatile aspects that span the coverage and tracking of targets and infrastructure objects, energy-efficient navigation, and image analysis using machine learning for fine-grained CPS applications. Key prototypes and testbeds are also investigated to show how these practical technologies can facilitate CPS applications. We present and propose state-of-the-art algorithms to address design challenges with both quantitative and qualitative methods and map these challenges with important CPS applications to draw insightful conclusions on the challenges of each application. Finally, we summarize potential new directions and ideas that could shape future research in these areas.

1807.03475 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

On Controller Design for Systems on Manifolds in Euclidean Space

在欧几里得空间中系统控制器设计方法

Dong Eui Chang

发表机构 * Electrical Engineering(电气工程) Korea Advanced Institute of Science(韩国科学技术院)

AI总结 本文提出了一种在欧几里得空间中为定义在流形上的系统设计控制器的新方法,通过将状态空间流形嵌入到欧几里得空间中,并在该空间中扩展系统以增加最终动态中的横截稳定性,从而设计出适用于原始系统的控制器。

Comments International Journal of Robust and Nonlinear Control (Accepted July 2018

详情
Journal ref
International J of Robust and Nonlinear Control, 28(16), 4981--4998, 2018
AI中文摘要

本文提出了一种在欧几里得空间中为定义在流形上的系统设计控制器的新方法。该方法的思路是将给定控制系统的状态空间流形M嵌入到某个欧几里得空间R^n中,将系统从M扩展到环境空间R^n,并在该空间中修改系统以在R^n中的最终动态中增加M的横截稳定性。在环境空间R^n中为最终系统设计控制器,然后将其限制到M上,从而得到原始系统在M上的控制器。该方法的优点是仅使用一个单一的全局笛卡尔坐标系在环境空间R^n中进行控制器合成,并且任何在R^n中的控制器设计方法,如线性化方法,都可以全局应用于控制器合成。所提出的方法成功应用于以下两个基准系统的跟踪问题:完全驱动的刚体系统和四旋翼无人机系统。

英文摘要

A new method is developed to design controllers in Euclidean space for systems defined on manifolds. The idea is to embed the state-space manifold $M$ of a given control system into some Euclidean space $\mathbb R^n$, extend the system from $M$ to the ambient space $\mathbb R^n$, and modify it outside $M$ to add transversal stability to $M$ in the final dynamics in $\mathbb R^n$. Controllers are designed for the final system in the ambient space $\mathbb R^n$. Then, their restriction to $M$ produces controllers for the original system on $M$. This method has the merit that only one single global Cartesian coordinate system in the ambient space $\mathbb R^n$ is used for controller synthesis, and any controller design method in $\mathbb R^n$, such as the linearization method, can be globally applied for the controller synthesis. The proposed method is successfully applied to the tracking problem for the following two benchmark systems: the fully actuated rigid body system and the quadcopter drone system.

1706.05104 2026-06-04 cs.RO cs.ET cs.SY eess.SY 版本更新

Personal Food Computer: A new device for controlled-environment agriculture

个人食物计算机:一种用于受控环境农业的新设备

Eduardo Castelló Ferrer, Jake Rye, Gordon Brander, Tim Savas, Douglas Chambers, Hildreth England, Caleb Harper

发表机构 * MIT Media Lab(麻省理工学院媒体实验室)

AI总结 本文提出了一种低成本的桌面平台OpenAg Personal Food Computer(PFC),旨在为植物物候研究、爱好者、制作者和K-12教师提供工具,支持集体数据共享和植物生长分析。

Comments 9 pages, 11 figures, Accepted at the 2017 Future Technologies Conference (FTC)

详情
AI中文摘要

由于其跨学科性质,受控环境农业设备有可能成为研究植物物候和在各种学科中创建课程的理想工具。受控环境设备正在增加其功能并改进其可访问性。传统上,从头开始建造这些设备需要机械工程、数字电子、编程和能源管理方面的知识。然而,为个人使用设计的有效受控环境设备带来了新的约束和挑战。本文提出了OpenAg Personal Food Computer(PFC);一种低成本的桌面平台,不仅针对植物物候研究人员,还针对爱好者、制作者和K-12级别的教师。PFC完全开源,并旨在成为可用于集体数据共享和植物生长分析的工具。得益于其模块化设计,PFC可以用于广泛活动。

英文摘要

Due to their interdisciplinary nature, devices for controlled-environment agriculture have the possibility to turn into ideal tools not only to conduct research on plant phenology but also to create curricula in a wide range of disciplines. Controlled-environment devices are increasing their functionalities as well as improving their accessibility. Traditionally, building one of these devices from scratch implies knowledge in fields such as mechanical engineering, digital electronics, programming, and energy management. However, the requirements of an effective controlled environment device for personal use brings new constraints and challenges. This paper presents the OpenAg Personal Food Computer (PFC); a low cost desktop size platform, which not only targets plant phenology researchers but also hobbyists, makers, and teachers from elementary to high-school levels (K-12). The PFC is completely open-source and it is intended to become a tool that can be used for collective data sharing and plant growth analysis. Thanks to its modular design, the PFC can be used in a large spectrum of activities.

1810.09365 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Coupled Longitudinal and Lateral Control of a Vehicle using Deep Learning

使用深度学习进行车辆纵向和横向控制的耦合控制

Guillaume Devineau, Philip Polack, Florent Altché, Fabien Moutarde

发表机构 * Center for Robotics, MINES ParisTech(机器人中心,巴黎综合理工学院) PSL Research University(巴黎综合理工大学)

AI总结 本文研究了深度神经网络在捕捉车辆动力学关键特性及执行耦合纵向和横向控制方面的潜力,通过高保真车辆动力学模拟数据集训练两种不同的人工神经网络,评估多层感知机和卷积神经网络在复杂测试赛道上的性能,与传统解耦控制器进行比较。

Comments Published in the IEEE 2018 International Conference on Intelligent Transportation Systems (ITSC 2018)

详情
AI中文摘要

本文探讨了深度神经网络在捕捉车辆动力学关键特性及执行耦合纵向和横向控制方面的潜力。为此,两种不同的人工神经网络被训练以计算对应参考轨迹的车辆控制输入,使用基于高保真车辆动力学模拟的数据集。在本研究中,控制输入被选择为前轮转向角和每个车轮施加的扭矩。两种模型,即多层感知机(MLP)和卷积神经网络(CNN),基于其在复杂测试赛道上驾驶车辆的能力进行评估,该赛道在长直线和紧弯之间切换。还提供了与传统解耦控制器在相同赛道上的比较。

英文摘要

This paper explores the capability of deep neural networks to capture key characteristics of vehicle dynamics, and their ability to perform coupled longitudinal and lateral control of a vehicle. To this extent, two different artificial neural networks are trained to compute vehicle controls corresponding to a reference trajectory, using a dataset based on high-fidelity simulations of vehicle dynamics. In this study, control inputs are chosen as the steering angle of the front wheels, and the applied torque on each wheel. The performance of both models, namely a Multi-Layer Perceptron (MLP) and a Convolutional Neural Network (CNN), is evaluated based on their ability to drive the vehicle on a challenging test track, shifting between long straight lines and tight curves. A comparison to conventional decoupled controllers on the same track is also provided.

1810.09000 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Safe Adaptive Cruise Control with Road Grade Preview and V2V Communication

安全自适应巡航控制系统与道路坡度预览及车对车通信

Roya Firoozi, Shima Nazari, Jacopo Guanetti, Ryan O'Gorman, Francesco Borrelli

发表机构 * University of Michigan(密歇根大学)

AI总结 本文提出了一种安全自适应巡航控制系统,利用道路坡度和前车运动预览,通过模型预测控制框架优化舒适性、安全性和能耗,采用新颖的方法计算鲁棒不变终端集以确保车辆间安全距离,仿真结果验证了该控制算法的有效性。

详情
AI中文摘要

我们提出了一种安全的自适应巡航控制系统(ACC),该系统利用道路坡度和前车运动预览。ACC控制器通过模型预测控制(MPC)框架设计,以优化舒适性、安全性和能耗以及速度跟踪精度。安全通过计算鲁棒不变终端集来实现。本文提出了一种新颖的方法来计算此类集合,该方法比现有方法更少保守。所提出的控制器在道路坡度变化和前车运动预测不确定性的情况下,始终确保车辆间安全距离。仿真结果将所提控制器与不考虑先前坡度知识的控制器在车-following和自动驾驶交叉场景中进行比较。结果证明了所提控制算法的有效性。

英文摘要

We present the design of a safe Adaptive Cruise Control (ACC) which uses road grade and lead vehicle motion preview. The ACC controller is designed by using a Model Predictive Control (MPC) framework to optimize comfort, safety, energy-efficiency and speed tracking accuracy. Safety is achieved by computing a robust invariant terminal set. The paper presents a novel approach to compute such set which is less conservative than existing methods. The proposed controller ensures safe inter-vehicle spacing at all times despite changes in the road grade and uncertainty in the predicted motion of the lead vehicle. Simulation results compare the proposed controller with a controller that does not incorporate prior grade knowledge on two scenarios including car-following and autonomous intersection crossing. The results demonstrate the effectiveness of the proposed control algorithm.

1803.11247 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Scalable Integrated Task and Motion Planning from Signal Temporal Logic Specifications

可扩展的信号时序逻辑规范下的集成任务和运动规划

Rafael Rodrigues da Silva, Hai Lin

AI总结 本文提出了一种可扩展且可证明完整的算法,直接合成连续轨迹以满足非凸的信号时序逻辑规范,通过分离离散任务规划和连续运动规划,并利用高效的求解器找到高维系统中满足非凸STL规范的动态可行轨迹。

Comments 13 pages, report

详情
AI中文摘要

许多安全关键系统必须实现高阶任务规范,并保证安全性和正确性。为实现这一目标,最近通过从信号时序逻辑(STL)规范中合成控制器取得了许多进展。然而,现有方法要么考虑了状态空间的先验离散化,要么只处理STL的凸片段,或无法证明是完整的。我们提出了一种可扩展、可证明完整的算法,直接合成连续轨迹以满足非凸STL规范。我们在线分离离散任务规划和连续运动规划,并利用高效的可满足性模理论(SMT)和线性规划(LP)求解器,为高维系统找到满足非凸STL规范的动态可行轨迹。所提出的设计算法已被证明是正确且完整的,仿真结果展示了我们方法的可扩展性。

英文摘要

Many safety-critical systems must achieve high-level task specifications with guaranteed safety and correctness. Much recent progress towards this goal has been made through controller synthesis from signal temporal logic (STL) specifications. Existing approaches, however, either consider some a priori discretization of the state-space, deal only with a convex fragment of STL, or are not provably complete. We propose a scalable, provably complete algorithm that directly synthesizes continuous trajectories to satisfy non-convex STL specifications. We separate discrete task planning and continuous motion planning on the fly and harness highly efficient satisfiability modulo theories (SMT) and linear programming (LP) solvers to find dynamically feasible trajectories for high dimensional systems that satisfies non-convex STL specifications. The proposed design algorithms are proved sound and complete, and simulation results demonstrate the scalability of our approach.

1506.02438 2026-06-04 cs.LG cs.RO cs.SY eess.SY 版本更新

High-Dimensional Continuous Control Using Generalized Advantage Estimation

利用广义优势估计进行高维连续控制

John Schulman, Philipp Moritz, Sergey Levine, Michael Jordan, Pieter Abbeel

发表机构 * Department of Electrical Engineering and Computer Science(电气工程与计算机科学系) University of California, Berkeley(加州大学伯克利分校)

AI总结 本文提出了一种基于广义优势估计的方法,通过减少策略梯度估计的方差来解决高维连续控制中的样本需求问题,并通过信任区域优化提高稳定性和收敛性,从而在复杂的3D运动任务中实现了高效的政策学习。

详情
AI中文摘要

策略梯度方法在强化学习中受到青睐,因为它们直接优化累积奖励,并且可以方便地与非线性函数近似器如神经网络结合使用。主要挑战是通常需要大量的样本,以及在输入数据非平稳性下获得稳定和持续改进的难度。我们通过使用价值函数来显著减少策略梯度估计的方差(尽管引入了偏差),并利用类似于TD(λ)的指数加权优势函数估计来解决第一个挑战。我们通过使用信任区域优化过程来解决第二个挑战,该过程用于策略和价值函数,它们由神经网络表示。我们的方法在高度具有挑战性的3D运动任务中表现出强大的经验结果,包括学习双足和四足仿真实体的行走姿态,以及学习使双足机器人从地面平躺状态站立的策略。与使用手工制定政策表示的先前工作相比,我们的神经网络策略直接从原始运动学映射到关节扭矩。我们的算法是完全模型无关的,并且在3D双足机器人上的学习任务所需的模拟经验时间相当于1-2周的真实时间。

英文摘要

Policy gradient methods are an appealing approach in reinforcement learning because they directly optimize the cumulative reward and can straightforwardly be used with nonlinear function approximators such as neural networks. The two main challenges are the large number of samples typically required, and the difficulty of obtaining stable and steady improvement despite the nonstationarity of the incoming data. We address the first challenge by using value functions to substantially reduce the variance of policy gradient estimates at the cost of some bias, with an exponentially-weighted estimator of the advantage function that is analogous to TD(lambda). We address the second challenge by using trust region optimization procedure for both the policy and the value function, which are represented by neural networks. Our approach yields strong empirical results on highly challenging 3D locomotion tasks, learning running gaits for bipedal and quadrupedal simulated robots, and learning a policy for getting the biped to stand up from starting out lying on the ground. In contrast to a body of prior work that uses hand-crafted policy representations, our neural network policies map directly from raw kinematics to joint torques. Our algorithm is fully model-free, and the amount of simulated experience required for the learning tasks on 3D bipeds corresponds to 1-2 weeks of real time.

1810.05683 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Long-Duration Autonomy for Small Rotorcraft UAS including Recharging

小型旋翼机无人机的长期自主性包括充电

Christian Brommer, Danylo Malyuta, Daniel Hentzen, Roland Brockers

发表机构 * Autonomous Controls Laboratory, University of Washington(华盛顿大学自主控制实验室) Jet Propulsion Laboratory, California Institute of Technology(加州理工学院喷气推进实验室)

AI总结 本研究提出了一种完全自主的小型旋翼机无人机,能够在无人干预的情况下执行长期观测任务,通过全平台自主性和基于视觉的精确着陆技术实现自动能源补充,实验结果展示了其在室内和室外环境中的11小时自主操作能力。

Comments 7 pages

详情
AI中文摘要

许多无人 aerial vehicle 监控和监测应用需要在精确位置上进行长时间的观测,理想情况下持续数天或数周(例如生态系统监测),这在以往由于有限的续航能力和需要有人参与操作而难以实现。为克服这些限制,我们提出了一种完全自主的小型旋翼机无人机,能够执行多次飞行任务以完成长期观测任务而无需任何人为干预。我们解决了两个关键技术,对于此类系统至关重要:全平台自主性,包括紧急响应以使任务能够独立于人类操作员执行,以及基于视觉的精确着陆能力,用于自动补充能量。实验结果展示了在室内和室外环境中长达11小时的完全自主操作能力。

英文摘要

Many unmanned aerial vehicle surveillance and monitoring applications require observations at precise locations over long periods of time, ideally days or weeks at a time (e.g. ecosystem monitoring), which has been impractical due to limited endurance and the requirement of humans in the loop for operation. To overcome these limitations, we propose a fully autonomous small rotorcraft UAS that is capable of performing repeated sorties for long-term observation missions without any human intervention. We address two key technologies that are critical for such a system: full platform autonomy including emergency response to enable mission execution independently from human operators, and the ability of vision-based precision landing on a recharging station for automated energy replenishment. Experimental results of up to 11 hours of fully autonomous operation in indoor and outdoor environments illustrate the capability of our system.

1807.09904 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

A Data-Efficient Approach to Precise and Controlled Pushing

一种数据高效且精确可控的推动作方法

Maria Bauza, Francois R. Hogan, Alberto Rodriguez

发表机构 * Department of Mechanical Engineering — Massachusetts Institute of Technology(机械工程系——麻省理工学院)

AI总结 本文提出了一种数据高效的方法,通过学习动态模型来控制复杂机械系统,仅需10个数据点即可完成复杂的推动作轨迹。

Comments Maria Bauza and Francois R. Hogan contributed equally to this work. 10 pages, 5 figures

详情
Journal ref
CoRL 2018
AI中文摘要

几十年来,控制理论的研究表明,简单的控制器在获得及时反馈的情况下,能够控制复杂的系统。推动作是复杂机械系统的一个例子,由于摩擦系数和压力分布等未知系统参数,难以准确建模。本文探讨了控制而非建模所需的数据复杂性。结果表明,一种基于模型的控制方法,其中动态模型从数据中学习,能够使用极少量的训练数据(10个数据点)完成复杂的推动作轨迹。推动作的动态特性通过高斯过程(GP)建模,并在一种模型预测控制方法中利用,该方法线性化GP并施加执行器和任务约束,以完成平面操作任务。

英文摘要

Decades of research in control theory have shown that simple controllers, when provided with timely feedback, can control complex systems. Pushing is an example of a complex mechanical system that is difficult to model accurately due to unknown system parameters such as coefficients of friction and pressure distributions. In this paper, we explore the data-complexity required for controlling, rather than modeling, such a system. Results show that a model-based control approach, where the dynamical model is learned from data, is capable of performing complex pushing trajectories with a minimal amount of training data (10 data points). The dynamics of pushing interactions are modeled using a Gaussian process (GP) and are leveraged within a model predictive control approach that linearizes the GP and imposes actuator and task constraints for a planar manipulation task.

1803.01940 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Tactile Regrasp: Grasp Adjustments via Simulated Tactile Transformations

触觉重抓:通过模拟触觉变换进行抓取调整

Francois R. Hogan, Maria Bauza, Oleguer Canal, Elliott Donlon, Alberto Rodriguez

发表机构 * Massachusetts Institute of Technology(麻省理工学院)

AI总结 本文提出了一种新的重抓控制策略,利用触觉传感进行局部抓取调整。该方法通过虚拟搜索局部触觉测量变换来提高抓取质量。首先,使用深度卷积神经网络构建基于触觉的抓取质量度量,该网络在超过2800次抓取上进行训练。每个抓取的质量是一个介于0和1之间的连续值,通过实验测量其对外部扰动的抵抗性来确定。其次,通过刚体变换模拟机器人运动相对于初始抓取的触觉印记,新生成的触觉印记与学习的抓取质量网络进行评估,选择最大化抓取质量的重抓动作。结果表明,抓取质量网络在已知物体上的平均准确率为85%,在12个物体的交叉验证集上的准确率为75%。重抓控制策略在8个物体的测试集上将抓取动作的成功率提高了70%。

Comments Francois R. Hogan and Maria Bauza contributed equally to this work. 8 pages, 7 figures

详情
Journal ref
IROS 2018
AI中文摘要

本文提出了一种新颖的重抓控制策略,利用触觉传感进行局部抓取调整。我们的方法通过虚拟搜索局部触觉测量变换来确定重抓动作。首先,我们使用深度卷积神经网络构建基于触觉的抓取质量度量,该网络在超过2800次抓取上进行训练。每个抓取的质量是一个介于0和1之间的连续值,通过实验测量其对外部扰动的抵抗性来确定。其次,我们通过执行刚体变换,模拟机器人运动相对于初始抓取的触觉印记。新生成的触觉印记与学习的抓取质量网络进行评估,重抓动作被选择以最大化抓取质量。结果表明,抓取质量网络在已知物体上的平均准确率为85%,在12个物体的交叉验证集上的准确率为75%。重抓控制策略在8个物体的测试集上将抓取动作的成功率提高了70%。

英文摘要

This paper presents a novel regrasp control policy that makes use of tactile sensing to plan local grasp adjustments. Our approach determines regrasp actions by virtually searching for local transformations of tactile measurements that improve the quality of the grasp. First, we construct a tactile-based grasp quality metric using a deep convolutional neural network trained on over 2800 grasps. The quality of each grasp, a continuous value between 0 and 1, is determined experimentally by measuring its resistance to external perturbations. Second, we simulate the tactile imprints associated with robot motions relative to the initial grasp by performing rigid-body transformations of the given tactile measurements. The newly generated tactile imprints are evaluated with the learned grasp quality network and the regrasp action is chosen to maximize the grasp quality. Results show that the grasp quality network can predict the outcome of grasps with an average accuracy of 85% on known objects and 75% on a cross validation set of 12 objects. The regrasp control policy improves the success rate of grasp actions by an average relative increase of 70% on a test set of 8 objects.

1802.04205 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Efficient Hierarchical Robot Motion Planning Under Uncertainty and Hybrid Dynamics

在不确定性和混合动力学下的高效机器人运动规划

Ajinkya Jain, Scott Niekum

发表机构 * Department of Mechanical Engineering(机械工程系) Department of Computer Science(计算机科学系) University of Texas at Austin, USA(得克萨斯大学奥斯汀分校)

AI总结 本文提出了一种分层POMDP规划器,用于在存在不确定性的情况下为混合动力学模型生成成本优化的运动计划,通过将非线性动力学分解为离散的局部动力学模型,从而有效减少状态不确定性。

Comments 2nd Conference on Robot Learning (CoRL 2018), Zurich, Switzerland

详情
AI中文摘要

嘈杂的观测与非线性动力学是机器人运动规划中最大的挑战之一。通过将非线性动力学分解为一组离散的局部动力学模型,混合动力学提供了一种自然的方式来建模非线性动力学,尤其是在由于接触等因素导致动力学突然不连续的系统中。我们提出了一种分层POMDP规划器,该规划器为混合动力学模型开发成本优化的运动计划。分层规划器首先开发一个高层运动计划,以确定要访问的局部动力学模型的顺序,然后将其转换为详细的连续状态计划。这种分层规划方法将POMDP规划问题分解为更小的子部分,这些子部分可以以显著降低的计算成本解决。能够按顺序访问局部动力学模型的能力也提供了一种强大的方法,利用混合动力学来减少状态不确定性。我们在模拟领域导航任务和具有机械臂的装配任务上评估了所提出的规划器,证明了我们的方法可以有效解决具有高观测噪声和非线性动力学的任务,且计算成本显著低于直接规划方法。

英文摘要

Noisy observations coupled with nonlinear dynamics pose one of the biggest challenges in robot motion planning. By decomposing nonlinear dynamics into a discrete set of local dynamics models, hybrid dynamics provide a natural way to model nonlinear dynamics, especially in systems with sudden discontinuities in dynamics due to factors such as contacts. We propose a hierarchical POMDP planner that develops cost-optimized motion plans for hybrid dynamics models. The hierarchical planner first develops a high-level motion plan to sequence the local dynamics models to be visited and then converts it into a detailed continuous state plan. This hierarchical planning approach results in a decomposition of the POMDP planning problem into smaller sub-parts that can be solved with significantly lower computational costs. The ability to sequence the visitation of local dynamics models also provides a powerful way to leverage the hybrid dynamics to reduce state uncertainty. We evaluate the proposed planner on a navigation task in the simulated domain and on an assembly task with a robotic manipulator, showing that our approach can solve tasks having high observation noise and nonlinear dynamics effectively with significantly lower computational costs compared to direct planning approaches.

1810.03074 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Hierarchical Optimization for Whole-Body Control of Wheeled Inverted Pendulum Humanoids

为轮式倒立摆人形机器人的全身控制设计分层优化

Munzir Zafar, Seth Hutchinson, Evangelos A. Theodorou

发表机构 * Institute of Robotics and Intelligent Machines(机器人与智能机构研究所)

AI总结 本文提出了一种用于轮式倒立摆人形机器人的全身控制框架,通过分层优化方法实现多任务的同时执行,同时考虑关节角度和扭矩限制,以提高整体性能。

详情
AI中文摘要

本文提出了一种用于轮式倒立摆人形机器人的全身控制框架。轮式倒立摆人形机器人是一种具有多个自由度的冗余操作臂,能够动态地在轮子上保持平衡。这些机器人能够同时执行多种任务,如平衡、维持身体姿态、控制视线、举重物或维持末端执行器在操作空间中的配置。全身控制问题旨在在指定优先级下,通过最优利用所有自由度的同时执行这些任务。控制还必须遵守每个关节的角度和扭矩限制。所提出的方法是分层的,包括一个低级控制器用于身体关节的操作,以及一个高级控制器定义用于低级控制器的质心(CoM)目标,以控制系统的零动力学,从而驱动轮子。低级控制器在考虑系统更完整的动力学时计划较短的规划范围,而高级控制器则基于对机器人近似模型的规划来提高计算效率,以规划更长的规划范围。

英文摘要

In this paper, we present a whole-body control framework for Wheeled Inverted Pendulum (WIP) Humanoids. WIP Humanoids are redundant manipulators dynamically balancing themselves on wheels. Characterized by several degrees of freedom, they have the ability to perform several tasks simultaneously, such as balancing, maintaining a body pose, controlling the gaze, lifting a load or maintaining end-effector configuration in operation space. The problem of whole-body control is to enable simultaneous performance of these tasks with optimal participation of all degrees of freedom at specified priorities for each objective. The control also has to obey constraint of angle and torque limits on each joint. The proposed approach is hierarchical with a low level controller for body joints manipulation and a high-level controller that defines center of mass (CoM) targets for the low-level controller to control zero dynamics of the system driving the wheels. The low-level controller plans for shorter horizons while considering more complete dynamics of the system, while the high-level controller plans for longer horizon based on an approximate model of the robot for computational efficiency.

1810.01577 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Moment-Sum-Of-Squares Approach For Fast Risk Estimation In Uncertain Environments

基于矩-平方和方法的不确定环境中的快速风险估计

Ashkan Jasour, Andreas Hofmann, Brian C. Williams

发表机构 * MIT(麻省理工学院) Computer Science and Artificial Intelligence Laboratory(计算机科学与人工智能实验室)

AI总结 本文提出了一种基于矩-平方和的方法,用于在存在有界不确定性的环境中快速估计机器人安全约束违反的概率。该方法利用多项式水平集描述不安全集,通过求解平方和优化问题获得单变量切比雪夫多项式的系数,从而在实时条件下利用有限的矩估计风险。

Comments 57th IEEE Conference on Decision and Control 2018

详情
AI中文摘要

在本文中,我们解决了风险估计问题,即在存在有界不确定性且概率分布任意的情况下,估计机器人安全约束违反的概率。在此问题中,不安全集由多项式的水平集描述,通常是非凸集。不确定性来源于不安全集的随机参数和机器人的随机状态。为了解决这个问题,我们使用基于矩的概率分布表示。我们用线性加权矩的和来描述风险的上界和下界。权重是通过求解平方和优化问题得到的单变量切比雪夫多项式的系数。因此,给定概率分布的有限矩,可以在实时条件下估计风险。我们通过解决概率碰撞检查问题来展示所提供方法的性能,其中目标是在机器人位置和障碍物大小、位置和几何形状存在概率不确定性的条件下,找到机器人与非凸障碍物碰撞的概率。

英文摘要

In this paper, we address the risk estimation problem where one aims at estimating the probability of violation of safety constraints for a robot in the presence of bounded uncertainties with arbitrary probability distributions. In this problem, an unsafe set is described by level sets of polynomials that is, in general, a non-convex set. Uncertainty arises due to the probabilistic parameters of the unsafe set and probabilistic states of the robot. To solve this problem, we use a moment-based representation of probability distributions. We describe upper and lower bounds of the risk in terms of a linear weighted sum of the moments. Weights are coefficients of a univariate Chebyshev polynomial obtained by solving a sum-of-squares optimization problem in the offline step. Hence, given a finite number of moments of probability distributions, risk can be estimated in real-time. We demonstrate the performance of the provided approach by solving probabilistic collision checking problems where we aim to find the probability of collision of a robot with a non-convex obstacle in the presence of probabilistic uncertainties in the location of the robot and size, location, and geometry of the obstacle.

1810.00527 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Safe Adaptive Switching among Dynamical Movement Primitives: Application to 3D Limit-Cycle Walkers

安全的动态运动基元切换:应用于3D极限环步行机器人

Sushant Veer, Ioannis Poulakakis

发表机构 * Department of Mechanical Engineering, University of Delaware(德克萨斯大学达勒姆分校机械工程系)

AI总结 本文提出了一种安全的动态运动基元切换方法,用于生成机器人运动计划,通过在存在外部扰动的情况下确保执行的安全性,应用于3D极限环步行机器人以适应持续的外部力。

详情
AI中文摘要

机器人复杂运动通常通过切换多个独立运动基元来生成。我们采用这种方法,将机器人运动计划表示为一系列按顺序执行的基元序列。在处理动态运动基元时,除了完成高层目标外,规划器还必须考虑计划执行对平台安全的影响。在存在扰动(如外部力)的情况下,这一任务变得更加困难。为了解决这一问题,我们提出了一种框架,利用严谨的控制理论工具,为受外部激励的机器人系统生成安全可执行的运动计划。该框架在一种3D极限环周期步态双足机器人上得到示例,该机器人能够适应持续的外部力。

英文摘要

Complex motions for robots are frequently generated by switching among a collection of individual movement primitives. We use this approach to formulate robot motion plans as sequences of primitives to be executed one after the other. When dealing with dynamical movement primitives, besides accomplishing the high-level objective, planners must also reason about the effect of the plan's execution on the safety of the platform. This task becomes more daunting in the presence of disturbances, such as external forces. To alleviate this issue, we present a framework that builds on rigorous control-theoretic tools to generate safely-executable motion plans for externally excited robotic systems. Our framework is illustrated on a 3D limit-cycle gait bipedal robot that adapts its walking pattern to persistent external forcing.

1810.00046 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Estimation-Based Model Predictive Control for Automatic Crosswind Stabilization of Hybrid Aerial Vehicles

基于估计的模型预测控制用于混合空中车辆的自动侧风稳定

Mohamed K. Helwa, Adrian Esser, Angela P. Schoellig

发表机构 * Dynamic Systems Lab, Institute for Aerospace Studies, University of Toronto, Canada(动态系统实验室,航空航天研究 institute,多伦多大学,加拿大)

AI总结 本文研究了一种新型浮力辅助空中运输车辆的自动侧风稳定控制系统设计,通过设计风扭矩估计器和模型预测控制器(MPC)来提高响应速度,实验表明该方法比传统PID控制器快80-90%。

Comments 23 pages, 13 figures, preprint submitted to Elsevier Mechatronics

详情
AI中文摘要

本文研究了一种新型浮力辅助空中运输车辆的自动侧风稳定控制系统设计。该车辆相比其他飞机具有在极短距离内起降且无需道路或跑道的优势。然而,其较大的机翼表面积使其更易受风影响,导致不希望的滚动角度运动。自动侧风稳定系统的作用是检测滚动角度偏差,并利用机翼尖端的电机来抵消风的影响。然而,由于机翼相对于小型无人机较大的惯性和额外的输入时间延迟,基于传统控制算法(如比例-积分-微分(PID)控制器)的自动侧风稳定系统响应时间过慢。另一个挑战是缺乏能够安装在车辆机翼上的高精度风传感器。因此,我们首先设计了一个依赖惯性测量的风扭矩估计器,并利用前馈补偿直接校正风扭矩,从而显著提高响应速度。其次,我们将所提出的估计器与模型预测控制器(MPC)结合,并对所考虑的应用进行约束MPC与无约束MPC的比较。实验结果表明,与标准PID控制器相比,所提出的基于估计的MPC策略将系统响应时间减少了约80-90%,无需添加风传感器或更改稳定系统的硬件。

英文摘要

In this paper, we study the control design of an automatic crosswind stabilization system for a novel, buoyantly-assisted aerial transportation vehicle. This vehicle has several advantages over other aircraft including the ability to take-off and land in very short distances and without the need for roads or runways. Despite these advantages, the large surface area of the vehicle's wing makes it more susceptible to wind, which introduces undesirable roll angle motions. The role of the automatic crosswind stabilization system is to detect the roll angle deviation, and then use motors at the wingtips to counteract the wind effect. However, due to the relatively large inertia of the wing compared to small-size unmanned aerial vehicles and additional input time delays, an automatic crosswind stabilization system based on traditional control algorithms such as the proportional-integral-derivative (PID) controller results in a response time that is too slow. Another challenge is the lack of high-accuracy wind sensors that can be mounted on the vehicle's wing. Therefore, we first design a wind torque estimator that relies on inertial measurements, and then use feed-forward compensation to directly correct for the wind torque, resulting in a significantly faster response. We second combine the proposed estimator with a model predictive controller (MPC), and compare constrained MPC with unconstrained MPC for the considered application. Experimental results show that our proposed estimation-based MPC strategy reduces the response time of the system by around 80-90% compared to a standard PID controller, without the need for adding wind sensors or changing the hardware of the stabilization system.

1808.00924 2026-06-04 eess.SY cs.LG cs.RO cs.SY 版本更新

The Lyapunov Neural Network: Adaptive Stability Certification for Safe Learning of Dynamical Systems

Lyapunov神经网络:用于安全学习动力系统的自适应稳定性认证

Spencer M. Richards, Felix Berkenkamp, Andreas Krause

发表机构 * Department of Mechanical and Process Engineering(机械与过程工程系) Department of Computer Science(计算机科学系)

AI总结 本文提出了一种学习准确安全证书的方法,用于非线性闭环动力系统,通过构造Lyapunov函数神经网络和适应最大安全区域形状的训练算法,以确保安全学习。

Comments Proc. of the 2nd Conference on Robot Learning (CoRL 2018)

详情
AI中文摘要

学习算法在模拟中表现出色,使机器人能够适应不确定环境并提高性能。然而,这些算法很少在安全关键系统中实际应用,因为学习的策略通常不提供任何安全保证。也就是说,所需的探索可能会对机器人或其环境造成物理伤害。在本文中,我们提出了一种方法,用于学习非线性闭环动力系统的准确安全证书。具体而言,我们构建了一个Lyapunov函数神经网络和一个训练算法,该算法能够适应状态空间中最大的安全区域的形状。该算法仅依赖于动力学的输入和输出知识,而不是任何特定的模型结构。我们通过学习模拟倒立摆的安全吸引区域来展示我们的方法。此外,我们讨论了我们的方法如何与动态系统的统计模型结合,用于安全学习算法。

英文摘要

Learning algorithms have shown considerable prowess in simulation by allowing robots to adapt to uncertain environments and improve their performance. However, such algorithms are rarely used in practice on safety-critical systems, since the learned policy typically does not yield any safety guarantees. That is, the required exploration may cause physical harm to the robot or its environment. In this paper, we present a method to learn accurate safety certificates for nonlinear, closed-loop dynamical systems. Specifically, we construct a neural network Lyapunov function and a training algorithm that adapts it to the shape of the largest safe region in the state space. The algorithm relies only on knowledge of inputs and outputs of the dynamics, rather than on any specific model structure. We demonstrate our method by learning the safe region of attraction for a simulated inverted pendulum. Furthermore, we discuss how our method can be used in safe learning algorithms together with statistical models of dynamical systems.

1806.02459 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Fault Tolerant Control for Networked Mobile Robots

网络化移动机器人故障容错控制

Pietro Pierpaoli, Dominique Sauter, Magnus Egerstedt

AI总结 本文提出了一种两阶段技术,用于网络化移动机器人中偏置测量代理的识别与补偿,通过单个观测者代理部署的故障识别滤波器估计网络中的单个故障,并在检测到故障后启动最优领导者基于的补偿策略。

Comments 7 pages, 7 figures, conference

详情
AI中文摘要

网络化自主代理已被用于许多应用,如移动传感器网络和智能交通系统。然而,在此类系统中,一个或多个子系统的故障或误差效应可能迅速扩散到整个网络,快速降低系统整体性能。在以共识驱动的动力学中,故障效应尤为重要,因为系统传递函数中存在无约束刚性模式。本文提出了一种两阶段技术,用于网络化移动机器人中偏置测量代理的识别与补偿,在时间不变的交互拓扑下进行。我们假设这些交互仅以相对位置测量的形式发生。一个部署在单个观测者代理上的故障识别滤波器用于估计网络中任何位置发生的单个故障。一旦检测到故障,就会启动最优领导者基于的补偿策略。结果通过数值模拟和机器人实验呈现。

英文摘要

Teams of networked autonomous agents have been used in a number of applications, such as mobile sensor networks and intelligent transportation systems. However, in such systems, the effect of faults and errors in one or more of the sub-systems can easily spread throughout the network, quickly degrading the performance of the entire system. In consensus-driven dynamics, the effects of faults are particularly relevant because of the presence of unconstrained rigid modes in the transfer function of the system. Here, we propose a two-stage technique for the identification and accommodation of a biased-measurements agent, in a network of mobile robots with time invariant interaction topology. We assume these interactions to only take place in the form of relative position measurements. A fault identification filter deployed on a single observer agent is used to estimate a single fault occurring anywhere in the network. Once the fault is detected, an optimal leader-based accommodation strategy is initiated. Results are presented by means of numerical simulations and robot experiments.

1809.10012 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Using Neural Networks to Generate Information Maps for Mobile Sensors

用神经网络为移动传感器生成信息图

Louis Dressel, Mykel J. Kochenderfer

AI总结 本文提出利用卷积神经网络实时生成移动传感器的信息图,以提高轨迹生成的效率和准确性。

Comments Accepted to the 2018 IEEE Conference on Decision and Control (CDC)

详情
AI中文摘要

目标定位是移动传感器的关键任务,具有多种应用。然而,为这些传感器生成信息丰富的轨迹是一个具有挑战性的问题。一种常用方法是使用信息图来估计在传感器状态空间中的任意点进行测量的价值。这些信息图用于生成轨迹;例如,轨迹可能被设计成其测量分布与信息图的分布匹配。无论轨迹生成方法如何,生成信息图作为新观察结果出现是至关重要的。然而,在实时计算这些图可能会有挑战。我们提出使用卷积神经网络从目标估计和传感器模型中实时生成信息图。模拟显示,生成的图准确且计算时间减少了多个数量级。

英文摘要

Target localization is a critical task for mobile sensors and has many applications. However, generating informative trajectories for these sensors is a challenging research problem. A common method uses information maps that estimate the value of taking measurements from any point in the sensor state space. These information maps are used to generate trajectories; for example, a trajectory might be designed so its distribution of measurements matches the distribution of the information map. Regardless of the trajectory generation method, generating information maps as new observations are made is critical. However, it can be challenging to compute these maps in real-time. We propose using convolutional neural networks to generate information maps from a target estimate and sensor model in real-time. Simulations show that maps are accurately rendered while offering orders of magnitude reduction in computation time.

1809.09716 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Sampling-based Polytopic Trees for Approximate Optimal Control of Piecewise Affine Systems

基于采样的多面体树用于分段仿射系统的近似最优控制

Sadra Sadraddini, Russ Tedrake

AI总结 本文提出了一种基于采样的多面体树方法,用于分段仿射系统的近似最优控制,通过混合整数凸规划将轨迹优化与反馈控制结合,以满足硬约束条件。

Comments Under Review

详情
AI中文摘要

分段仿射(PWA)系统广泛用于建模高度非线性行为,如机器人运动和操作中的接触动力学。现有PWA系统的控制技术在离线设计和在线实现中都有计算上的缺点。本文介绍了一种方法,用于获得反馈控制策略和对应的可接受初始条件,使得所有闭环轨迹都能达到目标多面体,同时优化成本函数。该方法的概念类似于LQR-trees [1],包括三个步骤:(1)开环轨迹优化,(2)反馈控制用于计算轨迹周围的“漏斗”状态,以及(3)以树状方式重复(1)和(2),使漏斗从目标向后生长并尽可能填充状态空间。我们展示了PWA动力学可以被利用将步骤(1)和(2)结合为一个步骤,该步骤通过混合整数凸规划解决,使该方法适用于处理硬约束。接触动力学的示例进行了说明。

英文摘要

Piecewise affine (PWA) systems are widely used to model highly nonlinear behaviors such as contact dynamics in robot locomotion and manipulation. Existing control techniques for PWA systems have computational drawbacks, both in offline design and online implementation. In this paper, we introduce a method to obtain feedback control policies and a corresponding set of admissible initial conditions for discrete-time PWA systems such that all the closed-loop trajectories reach a goal polytope, while a cost function is optimized. The idea is conceptually similar to LQR-trees \cite{tedrake2010lqr}, which consists of 3 steps: (1) open-loop trajectory optimization, (2) feedback control for computation of "funnels" of states around trajectories, and (3) repeating (1) and (2) in a way that the funnels are grown backward from the goal in a tree fashion and fill the state-space as much as possible. We show PWA dynamics can be exploited to combine step (1) and (2) into a single step that is tackled using mixed-integer convex programming, which makes the method suitable for dealing with hard constraints. Illustrative examples on contact-based dynamics are presented.

1809.08819 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Oscillation Damping Control of Pendulum-like Manipulation Platform using Moving Masses

使用移动质量抑制摆动式操作平台的振动

Min Jun Kim, Jianjie Lin, Konstantin Kondak, Dongheui Lee, Christian Ott

发表机构 * Institute of Robotics and Mechatronics, German Aerospace Center (DLR), Wessling, Germany(机器人与机电研究所,德国航空航天中心(DLR),德国韦斯林) Chair of Automatic Control Engineering, Technical University of Munich (TUM), Munich, Germany(自动控制工程系,慕尼黑技术大学(TUM),德国慕尼黑) Fortiss Institute, Munich, Germany(Fortiss研究所,德国慕尼黑)

AI总结 本文提出了一种通过在平台上安装移动质量来抑制机器人操作臂悬挂平台振动的方法,通过合理设计移动质量的参考加速度实现平台的渐近稳定性,同时克服了欠驱动带来的挑战。

Comments IFAC Symposium on Robot Control (SYROCO) 2018

详情
AI中文摘要

本文提出了一种方法,用于抑制安装在机器人操作臂上的摆动式悬挂平台的振荡运动。为此,在平台上安装了移动质量。本文通过合理设计移动质量的参考加速度,实现了平台的渐近稳定性(即振动阻尼)。该工作的主要特点是不仅实现了平台的渐近稳定性,还实现了移动质量的渐近稳定性,这可能由于欠驱动特性而具有挑战性。所提出的方法通过仿真研究进行了验证。

英文摘要

This paper presents an approach to damp out the oscillatory motion of the pendulum-like hanging platform on which a robotic manipulator is mounted. To this end, moving masses were installed on top of the platform. In this paper, asymptotic stability of the platform (which implies oscillation damping) is achieved by designing reference acceleration of the moving masses properly. A main feature of this work is that we can achieve asymptotic stability of not only the platform, but also the moving masses, which may be challenging due to the under-actuation nature. The proposed scheme is validated by the simulation studies.

1602.01891 2026-06-04 cs.RO cs.MA cs.SY eess.SY math.OC 版本更新

Distributed Estimation of State and Parameters in Multi-Agent Cooperative Load Manipulation

多智能体协同负载操作中状态与参数的分布式估计

Antonio Franchi, Antonio Petitti, Alessandro Rizzo

发表机构 * CNRS, LAAS(法国国家科学研究中心(CNRS)、拉瓦尔大学(LAAS))

AI总结 本文提出两种分布式方法,用于估计未知平面体的运动学参数、动力学参数和运动学状态,利用刚体运动学和动力学、非线性观测理论和一致性算法,通过智能体对负载施加二维力矩、测量接触点速度以及通信图连通性来实现,理论分析和收敛证明均提供,第一种方法假设参数恒定,第二种方法可处理时变参数并可并行应用于任何任务导向的控制律,对于无控制律的情况,提出了一种分布式且安全的控制策略以满足可观测性条件,通过现实的蒙特卡洛模拟展示了估计策略的有效性和鲁棒性。

Comments Accepted for publication to the IEEE Transactions on Control of Network Systems

详情
AI中文摘要

我们提出了两种分布式方法,用于估计未知平面体的运动学参数、动力学参数和运动学状态,这些方法依赖于刚体运动学和动力学、非线性观测理论和一致性算法。唯一三个要求是每个智能体可以对负载施加二维力矩,可以测量其接触点的速度,并且通信图是连通的。提供了理论非线性可观测性分析和收敛证明。第一种方法假设参数恒定,而第二种方法可以处理时变参数,并且可以并行应用于任何任务导向的控制律。对于没有提供控制律的情况,我们提出了一种分布式且安全的控制策略,以满足可观测性条件。通过现实的蒙特卡洛模拟展示了估计策略的有效性和鲁棒性。

英文摘要

We present two distributed methods for the estimation of the kinematic parameters, the dynamic parameters, and the kinematic state of an unknown planar body manipulated by a decentralized multi-agent system. The proposed approaches rely on the rigid body kinematics and dynamics, on nonlinear observation theory, and on consensus algorithms. The only three requirements are that each agent can exert a 2D wrench on the load, it can measure the velocity of its contact point, and that the communication graph is connected. Both theoretical nonlinear observability analysis and convergence proofs are provided. The first method assumes constant parameters while the second one can deal with time-varying parameters and can be applied in parallel to any task-oriented control law. For the cases in which a control law is not provided, we propose a distributed and safe control strategy satisfying the observability condition. The effectiveness and robustness of the estimation strategy is showcased by means of realistic MonteCarlo simulations.

1809.08022 2026-06-04 cs.RO cs.SY eess.SY 版本更新

The Urban Last Mile Problem: Autonomous Drone Delivery to Your Balcony

城市最后一公里问题:自主无人机送货到您的阳台

Gino Brunner, Bence Szebedy, Simon Tanner, Roger Wattenhofer

发表机构 * Computer Engineering and Networks Laboratory(计算机工程与网络实验室)

AI总结 本文提出了一种基于商用无人机的城市最后一公里自主送货方法,通过GPS定位和视觉导航实现对阳台或门廊等非集中地点的精准配送,并开源代码以促进未来研究。

详情
AI中文摘要

无人机送货在过去几年中已成为行业热点。然而,现有方法要么专注于农村地区,要么依赖于集中式配送点来执行最后一公里配送。在本文中,我们使用商用无人机解决城市环境中自主最后一公里配送的问题。我们构建了一个原型系统,该系统能够利用GPS飞向近似配送位置,然后使用视觉导航找到精确的配送位置。配送位置可能例如在阳台或门廊上,并且只需在墙上或窗户上用视觉标记指示即可。我们测试了我们的系统组件在模拟环境中的表现,包括视觉导航和避障。最后,我们在现实环境中部署了我们的无人机,并展示了它如何在阳台上找到配送点。为了促进该主题的未来研究,我们开源了我们的代码。

英文摘要

Drone delivery has been a hot topic in the industry in the past few years. However, existing approaches either focus on rural areas or rely on centralized drop-off locations from where the last mile delivery is performed. In this paper we tackle the problem of autonomous last mile delivery in urban environments using an off-the-shelf drone. We build a prototype system that is able to fly to the approximate delivery location using GPS and then find the exact drop-off location using visual navigation. The drop-off location could, e.g., be on a balcony or porch, and simply needs to be indicated by a visual marker on the wall or window. We test our system components in simulated environments, including the visual navigation and collision avoidance. Finally, we deploy our drone in a real-world environment and show how it can find the drop-off point on a balcony. To stimulate future research in this topic we open source our code.

1803.02238 2026-06-04 cs.RO cs.CL cs.SY eess.SY 版本更新

Precise but Natural Specification for Robot Tasks

机器人任务的精确但自然的规范

Ivan Gavran, Brendon Boldt, Eva Darulova, Rupak Majumdar

发表机构 * Max Planck Institute for Software Systems, Germany(德国马克斯·普朗克软件研究所)

AI总结 Flipper通过自然语言接口实现机器人高阶任务规范,结合形式化核心语言与语义解析器,提供可视化反馈并支持自然语言扩展,提升任务描述效率。

详情
AI中文摘要

我们提出了Flipper,一种自然语言接口,用于描述机器人高阶任务规范并编译为机器人动作。Flipper始于形式化核心语言,允许表达丰富的时序规范,并通过语义解析器提供自然语言接口。Flipper通过在图形用户界面中执行自动构建的计划提供即时视觉反馈,允许用户解决潜在的歧义解释。Flipper通过自然化扩展自身:用户可以添加 utterances 的定义,Flipper 由此诱导新规则并将其添加到核心语言中,逐渐形成更加自然的任务规范语言。Flipper通过泛化用户提供的定义来改进自然化。与其他任务规范系统不同,Flipper在保持编程语言的表达力和形式精确性的同时,实现了自然语言交互。我们通过初始用户研究证明,自然语言交互和泛化可以显著简化任务描述。此外,随着时间推移,用户会使用更多超出初始核心语言的概念。这些扩展可供Flipper社区使用,用户可以使用其他人定义的概念。

英文摘要

We present Flipper, a natural language interface for describing high-level task specifications for robots that are compiled into robot actions. Flipper starts with a formal core language for task planning that allows expressing rich temporal specifications and uses a semantic parser to provide a natural language interface. Flipper provides immediate visual feedback by executing an automatically constructed plan of the task in a graphical user interface. This allows the user to resolve potentially ambiguous interpretations. Flipper extends itself via naturalization: its users can add definitions for utterances, from which Flipper induces new rules and adds them to the core language, gradually growing a more and more natural task specification language. Flipper improves the naturalization by generalizing the definition provided by users. Unlike other task-specification systems, Flipper enables natural language interactions while maintaining the expressive power and formal precision of a programming language. We show through an initial user study that natural language interactions and generalization can considerably ease the description of tasks. Moreover, over time, users employ more and more concepts outside of the initial core language. Such extensions are available to the Flipper community, and users can use concepts that others have defined.

1710.06537 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Sim-to-Real Transfer of Robotic Control with Dynamics Randomization

机器人控制的仿真到现实转移与动力学随机化

Xue Bin Peng, Marcin Andrychowicz, Wojciech Zaremba, Pieter Abbeel

发表机构 * OpenAI

AI总结 本文提出一种简单方法通过随机化仿真动力学来弥合现实与仿真的差距,使策略能适应不同动态,从而在无需真实系统训练的情况下实现现实世界泛化。

详情
AI中文摘要

仿真环境为训练智能体提供了丰富的数据源,并在训练过程中减少了某些安全方面的担忧。但智能体在仿真中开发的行为往往特定于模拟器的特性。由于建模误差,仿真中表现良好的策略可能无法转移到现实世界。本文提出了一种简单的方法来弥合这一“现实差距”。通过在训练过程中随机化模拟器的动力学,我们能够开发出能够适应非常不同动力学的策略,包括那些与策略训练所基于的动力学有显著差异的动力学。这种适应性使策略能够在没有对物理系统进行训练的情况下泛化到现实世界的动力学。我们的方法在使用机械臂的物体推动任务上进行了演示。尽管策略仅在仿真中进行训练,但部署在真实机器人上时,其性能仍能保持相似水平,能够可靠地将物体从随机初始配置移动到目标位置。我们探讨了各种设计决策的影响,并展示了所得到的策略对显著校准误差具有鲁棒性。

英文摘要

Simulations are attractive environments for training agents as they provide an abundant source of data and alleviate certain safety concerns during the training process. But the behaviours developed by agents in simulation are often specific to the characteristics of the simulator. Due to modeling error, strategies that are successful in simulation may not transfer to their real world counterparts. In this paper, we demonstrate a simple method to bridge this "reality gap". By randomizing the dynamics of the simulator during training, we are able to develop policies that are capable of adapting to very different dynamics, including ones that differ significantly from the dynamics on which the policies were trained. This adaptivity enables the policies to generalize to the dynamics of the real world without any training on the physical system. Our approach is demonstrated on an object pushing task using a robotic arm. Despite being trained exclusively in simulation, our policies are able to maintain a similar level of performance when deployed on a real robot, reliably moving an object to a desired location from random initial configurations. We explore the impact of various design decisions and show that the resulting policies are robust to significant calibration error.

1809.06179 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Learning of Multi-Context Models for Autonomous Underwater Vehicles

多情境模型学习用于自主水下车辆

Bilal Wehbe, Octavio Arriaga, Mario Michael Krell, Frank Kirchner

发表机构 * DFKI - Robotic Innovation Center(DFKI机器人创新中心) Robotics Research Group(机器人研究组)

AI总结 本文提出利用LSTM网络学习自主水下车辆的多情境模型,通过实验数据构建仿真模型,生成不同情境并提高分类准确性,展现对噪声的鲁棒性和大数据集的扩展能力。

Comments 6 pages, 7 figures, AUV 2018 author copy

详情
AI中文摘要

多情境模型学习对于海洋机器人至关重要,因为多个因素可能干扰系统的动力学。本文解决了识别自主水下车辆(AUV)模型多种情境的问题。我们从实验数据构建了机器人的仿真模型,并利用该模型填补缺失数据并生成不同的模型情境。我们实现了一种基于长短期记忆(LSTM)网络的架构,直接从数据中学习不同的情境。我们证明LSTM网络在与基线方法相比时能够实现较高的分类准确性,显示出对噪声的鲁棒性,并能有效扩展到大规模数据集上。

英文摘要

Multi-context model learning is crucial for marine robotics where several factors can cause disturbances to the system's dynamics. This work addresses the problem of identifying multiple contexts of an AUV model. We build a simulation model of the robot from experimental data, and use it to fill in the missing data and generate different model contexts. We implement an architecture based on long-short-term-memory (LSTM) networks to learn the different contexts directly from the data. We show that the LSTM network can achieve high classification accuracy compared to baseline methods, showing robustness against noise and scaling efficiently on large datasets.

1806.06161 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning

BaRC:机器人强化学习中的逆向可达性课程

Boris Ivanovic, James Harrison, Apoorva Sharma, Mo Chen, Marco Pavone

发表机构 * Department of Mechanical Engineering, Stanford University(斯坦福大学机械工程系) School of Computing Science, Simon Fraser University(西蒙弗雷泽大学计算机科学学院)

AI总结 本文提出BaRC方法,利用物理先验知识设计课程方案,通过逆向可达性策略加速连续控制MDP中模型无关RL算法的训练,提升性能并减少探索需求。

详情
AI中文摘要

模型无关强化学习(RL)为高维系统学习控制策略提供了有吸引力的方法,但其相对差的样本复杂性通常迫使在模拟环境中进行训练。即使在模拟中,具有稀疏自然奖励函数的目标导向任务仍难以被最先进的模型无关算法处理。这些任务的瓶颈在于从系统初始状态获取学习信号所需的大量探索。本文利用物理先验知识(以近似系统动力学模型的形式)设计了一种课程方案,用于模型无关策略优化算法。我们的逆向可达性课程(BaRC)从需要少量动作完成任务的状态开始策略训练,并在策略优化算法表现出足够性能后,以动态一致的方式扩展初始状态分布。BaRC具有通用性,可以加速任何模型无关RL算法在广泛目标导向连续控制MDP上的训练。其课程策略具有物理直观性、易于调节,并允许将物理先验整合到训练中,而不会影响模型无关RL算法的性能、灵活性和适用性。我们在两个代表性的动态机器人学习问题上评估了我们的方法,并发现相对于先前的课程生成技术和朴素探索策略,有显著的性能提升。

英文摘要

Model-free Reinforcement Learning (RL) offers an attractive approach to learn control policies for high-dimensional systems, but its relatively poor sample complexity often forces training in simulated environments. Even in simulation, goal-directed tasks whose natural reward function is sparse remain intractable for state-of-the-art model-free algorithms for continuous control. The bottleneck in these tasks is the prohibitive amount of exploration required to obtain a learning signal from the initial state of the system. In this work, we leverage physical priors in the form of an approximate system dynamics model to design a curriculum scheme for a model-free policy optimization algorithm. Our Backward Reachability Curriculum (BaRC) begins policy training from states that require a small number of actions to accomplish the task, and expands the initial state distribution backwards in a dynamically-consistent manner once the policy optimization algorithm demonstrates sufficient performance. BaRC is general, in that it can accelerate training of any model-free RL algorithm on a broad class of goal-directed continuous control MDPs. Its curriculum strategy is physically intuitive, easy-to-tune, and allows incorporating physical priors to accelerate training without hindering the performance, flexibility, and applicability of the model-free RL algorithm. We evaluate our approach on two representative dynamic robotic learning problems and find substantial performance improvement relative to previous curriculum generation techniques and naive exploration strategies.

1701.03913 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Modeling and control of a cable-driven series elastic actuator

缆驱动系列弹性执行器的建模与控制

Wulin Zou, Ningbo Yu

发表机构 * Institute of Robotics and Automatic Information Systems, Nankai University(机器人与自动化信息系统研究所,南开大学) Tianjin Key Laboratory of Intelligent Robotics, Nankai University(天津智能机器人重点实验室,南开大学)

AI总结 本文研究缆驱动系列弹性执行器的建模与控制,采用双自由度控制方法提升鲁棒性,验证了其在人机交互中的应用潜力。

详情
AI中文摘要

系列弹性执行器(SEA)在物理人机交互领域正发挥越来越重要的作用。本文聚焦于缆驱动SEA的建模与控制。首先提出了缆驱动SEA的方案,并采用速度控制直流电机作为其动力源。基于此建立了缆驱动SEA的模型。进一步,采用双自由度(2-DOF)控制方法来控制输出扭矩。仿真结果表明,2-DOF方法在鲁棒性方面优于PD方法。

英文摘要

Series elastic actuators (SEA) are playing an increasingly important role in the fields of physical human-robot interaction. This paper focuses on the modeling and control of a cable-driven SEA. First, the scheme of the cable-driven SEA has been proposed, and a velocity controlled DC motor has been used as its power source. Based on this, the model of the cable-driven SEA has been built up. Further, a two degrees of freedom (2-DOF) control approach has been employed to control the output torque. Simulation results have shown that the 2-DOF method has achieved better robust performance than the PD method.

1804.01031 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Provably Robust Learning-Based Approach for High-Accuracy Tracking Control of Lagrangian Systems

具有证明鲁棒性的基于学习的方法用于拉格朗日系统高精度跟踪控制

Mohamed K. Helwa, Adam Heins, Angela P. Schoellig

发表机构 * Dynamic Systems Lab(动态系统实验室) Institute for Aerospace Studies(航空航天研究院) University of Toronto(多伦多大学)

AI总结 本文提出基于高斯过程的新型学习控制方法,确保系统稳定性与高精度跟踪,通过不确定性界保证鲁棒性,并在仿真和实验中验证有效性。

Comments 8 pages, 4 figures, 2 tables, submitted to IEEE Robotics and Automation Letters (RA-L) and the 2019 International Conference on Robotics and Automation (ICRA) (created: March 2018; updated: September 2018)

详情
AI中文摘要

拉格朗日系统涵盖了多种机器人系统,包括机械臂、轮式和腿部机器人以及四旋翼。通常使用逆动力学控制和前馈线性化技术将复杂非线性动力学转换为解耦的二阶积分器,然后使用标准外环控制器计算线性化系统的期望加速度。然而,这些方法通常依赖于非常准确的系统模型,这在实践中往往不可用。尽管文献中使用了不同的学习方法来解决这一挑战,但大多数方法在学习控制系统稳定性方面缺乏安全保证。本文提出了一种基于高斯过程(GPs)的新学习控制方法,确保闭环系统的稳定性和高精度跟踪。我们使用GPs近似命令加速度与系统实际加速度之间的误差,并利用GP预测的均值和方差计算线性化模型不确定性的上界。此不确定性界随后用于鲁棒的外环控制器以确保整个系统的稳定性。此外,我们证明跟踪误差收敛到一个半径可任意小的球体。进一步,我们通过在2自由度平面机械臂上的仿真和6自由度工业机械臂上的实验验证了我们方法的有效性。

英文摘要

Lagrangian systems represent a wide range of robotic systems, including manipulators, wheeled and legged robots, and quadrotors. Inverse dynamics control and feedforward linearization techniques are typically used to convert the complex nonlinear dynamics of Lagrangian systems to a set of decoupled double integrators, and then a standard, outer-loop controller can be used to calculate the commanded acceleration for the linearized system. However, these methods typically depend on having a very accurate system model, which is often not available in practice. While this challenge has been addressed in the literature using different learning approaches, most of these approaches do not provide safety guarantees in terms of stability of the learning-based control system. In this paper, we provide a novel, learning-based control approach based on Gaussian processes (GPs) that ensures both stability of the closed-loop system and high-accuracy tracking. We use GPs to approximate the error between the commanded acceleration and the actual acceleration of the system, and then use the predicted mean and variance of the GP to calculate an upper bound on the uncertainty of the linearized model. This uncertainty bound is then used in a robust, outer-loop controller to ensure stability of the overall system. Moreover, we show that the tracking error converges to a ball with a radius that can be made arbitrarily small. Furthermore, we verify the effectiveness of our approach via simulations on a 2 degree-of-freedom (DOF) planar manipulator and experimentally on a 6 DOF industrial manipulator.

1809.03826 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Real-time force control of an SEA-based body weight support unit with the 2-DOF control structure

基于SEA的体位支撑装置的实时力控研究:采用2自由度控制结构

Yubo Sun, Yuqi Lei, Wulin Zou, Jianmin Li, Ningbo Yu

AI总结 本文提出一种基于SEA的体位支撑装置,采用2自由度控制结构实现实时力控,通过仿真和实验验证了其在康复中的有效性。

Comments In proceedings of the IEEE International Conference on Real-time Computing and Robotics 2018

详情
AI中文摘要

体位支撑(BWS)是康复领域的重要技术。随着康复科学与工程的快速发展,BWS正迅速发展并受到广泛关注。本文构建了一种新型重力卸载系统,允许患者在三维笛卡尔空间内自由移动并受到重力支撑。因此,对于患有神经损伤(如中风或脊髓损伤)的患者,可以将残余运动控制能力专注于平衡和行走的治疗训练。实时力控性能对BWS单元提供合适支撑并避免干扰至关重要。在本文中,我们重新设计了BWS单元,采用一系列弹性执行机构结构以提高人机交互性能。进一步,采用了2自由度(2-DOF)控制方法以实现精确且鲁棒的BWS力控。仿真和实验结果验证了BWS设计和实时控制方法的有效性。

英文摘要

Body weight support (BWS) is a fundamental technique in rehabilitation. Along with the dramatic progressing of rehabilitation science and engineering, BWS is quickly evolving with new initiatives and has attracted deep research effort in recent years. We have built up a novel gravity offloading system, in which the patient is allowed to move freely in the three-dimensional Cartesian space and receives support against gravity. Thus, the patients, especially for those that suffer from neurological injury such as stroke or spinal cord injury, can focus their residual motor control capabilities on essential therapeutic trainings of balance and gait. The real-time force control performance is critical for the BWS unit to provide suitable support and avoid disturbance. In this work, we have re-designed our BWS unit with a series elastic actuation structure to improve the human-robot interaction performance. Further, the 2 degrees of freedom (2-DOF) control approach was taken for accurate and robust BWS force control. Both simulation and experimental results have validated the efficacy of the BWS design and real-time control methods.

1809.02867 2026-06-04 eess.SY cs.IT cs.RO cs.SY math.IT 版本更新

A Review on Cooperative Adaptive Cruise Control (CACC) Systems: Architectures, Controls, and Applications

协同自适应巡航控制系统(CACC)综述:架构、控制与应用

Ziran Wang, Guoyuan Wu, Matthew Barth

发表机构 * Department of Mechanical Engineering(机械工程系)

AI总结 本文综述了全球范围内关于CACC系统不同方面的研究进展,涵盖架构、控制方法及应用,分析了当前机遇与挑战,并展望了未来发展方向。

详情
AI中文摘要

连接和自动化车辆(CAVs)有潜力解决当前交通系统在安全、出行和可持续性方面的问题。协同自适应巡航控制(CACC)是一种有前景的技术,使CAVs能够以协作方式驾驶,并带来系统层面的益处。本文综述了全球研究人员在CACC系统不同方面的研究成果。回顾了CACC系统架构的文献,解释了该系统从高层如何运作。回顾了不同的控制方法及其相关问题,从底层介绍了CACC系统。通过详细文献展示了CACC技术的应用,绘制了CACC的整体图景,指出了当前的机会与挑战,并预估了其未来的发展。

英文摘要

Connected and automated vehicles (CAVs) have the potential to address the safety, mobility and sustainability issues of our current transportation systems. Cooperative adaptive cruise control (CACC), for example, is one promising technology to allow CAVs to be driven in a cooperative manner and introduces system-wide benefits. In this paper, we review the progress achieved by researchers worldwide regarding different aspects of CACC systems. Literature of CACC system architectures are reviewed, which explain how this system works from a higher level. Different control methodologies and their related issues are reviewed to introduce CACC systems from a lower level. Applications of CACC technology are demonstrated with detailed literature, which draw an overall landscape of CACC, point out current opportunities and challenges, and anticipate its development in the near future.

1709.06196 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Online algorithms for POMDPs with continuous state, action, and observation spaces

在线算法用于具有连续状态、动作和观察空间的POMDPs

Zachary Sunberg, Mykel Kochenderfer

发表机构 * Aeronautics and Astronautics Dept. Stanford University(航空航天系 斯坦福大学)

AI总结 本文提出POMCPOW和PFT-DPW算法,通过加权粒子过滤解决连续状态空间POMDPs的求解问题,验证了改进方法的有效性。

Comments Added Multilane section

详情
Journal ref
Short version published in 2018 proceedings of the International Conference on Automated Planning and Scheduling (ICAPS)
AI中文摘要

在线求解部分可观测马尔可夫决策过程的算法已被应用于具有大离散状态空间的问题,但连续状态、动作和观察空间仍具挑战性。本文首先探讨双级渐进扩展(DPW)作为解决方案,但证明该修改单独不足,因为搜索树中的信念表示坍缩为单个粒子,导致算法收敛到次优策略。本文提出并评估了两种新算法,POMCPOW和PFT-DPW,通过加权粒子过滤克服这一缺陷。仿真结果表明,这些改进使算法在先前方法失败的场景中取得成功。

英文摘要

Online solvers for partially observable Markov decision processes have been applied to problems with large discrete state spaces, but continuous state, action, and observation spaces remain a challenge. This paper begins by investigating double progressive widening (DPW) as a solution to this challenge. However, we prove that this modification alone is not sufficient because the belief representations in the search tree collapse to a single particle causing the algorithm to converge to a policy that is suboptimal regardless of the computation time. This paper proposes and evaluates two new algorithms, POMCPOW and PFT-DPW, that overcome this deficiency by using weighted particle filtering. Simulation results show that these modifications allow the algorithms to be successful where previous approaches fail.

1804.11278 2026-06-04 eess.SY cs.RO cs.SY 版本更新

On the Interaction between Autonomous Mobility-on-Demand and Public Transportation Systems

自动驾驶出行即服务与公共交通系统的交互

Mauro Salazar, Federico Rossi, Maximilian Schiffer, Christopher H. Onder, Marco Pavone

发表机构 * Stanford University(斯坦福大学) Technical University of Munich(慕尼黑技术大学)

AI总结 本文研究了自动驾驶出行即服务与公共交通系统的耦合模型及协调策略,通过网络流模型最大化社会福利,并设计定价与收费方案实现社会最优,以纽约市为例验证了协同效应。

Comments 9 pages, 8 figures, ITSC 2018

详情
AI中文摘要

在本文中,我们研究了自动驾驶出行即服务(AMoD)与公共交通系统的模型和协调策略,其中一辆自动驾驶车辆车队与公共交通系统联合提供按需出行服务。具体而言,我们首先提出一个网络流模型用于多模式AMoD,其中捕捉了AMoD与公共交通的耦合关系,并旨在最大化社会福利。其次,利用该模型,我们设计了一种定价和收费方案,使在完全市场假设下,自私代理能够实现社会最优。最后,我们展示了纽约市的真实案例研究。我们的结果表明,AMoD车队与公共交通的协调可以比孤立运行的AMoD系统产生显著效益。

英文摘要

In this paper we study models and coordination policies for intermodal Autonomous Mobility-on-Demand (AMoD), wherein a fleet of self-driving vehicles provides on-demand mobility jointly with public transit. Specifically, we first present a network flow model for intermodal AMoD, where we capture the coupling between AMoD and public transit and the goal is to maximize social welfare. Second, leveraging such a model, we design a pricing and tolling scheme that allows to achieve the social optimum under the assumption of a perfect market with selfish agents. Finally, we present a real-world case study for New York City. Our results show that the coordination between AMoD fleets and public transit can yield significant benefits compared to an AMoD system operating in isolation.

1809.00037 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Estimation for Quadrotors

四旋翼估计

Stefanie Tellex, Andy Brown, Sergei Lupashin

发表机构 * Brown University(布朗大学) Udacity, Inc.(Udacity公司) Fotokite(Fotokite公司)

AI总结 本文基于四旋翼模型,推导了扩展卡尔曼滤波器的推导过程,提供EKF、贝叶斯滤波和无迹卡尔曼滤波的伪代码,旨在解决四旋翼状态估计中的噪声和计算限制问题。

详情
AI中文摘要

本文描述了四旋翼滤波和估计的标准方法,适用于Udacity飞行汽车课程。本文假设具备概率知识和一些线性代数知识,不假设卡尔曼滤波或贝叶斯滤波的先前知识。本文推导了不同无人机模型在1D、2D和3D中的EKF。本文使用Thrun等人[13]定义的EKF和符号,并提供了贝叶斯滤波、EKF和无迹卡尔曼滤波[14]的伪代码。本文的动机是缺乏提供四旋翼直升机推导的逐步EKF教程。估计的目标是从传感器值和控制输入推断无人机的状态(姿态、速度、加速度和偏差)。这个问题具有挑战性,因为传感器噪声很大。此外,由于重量和成本问题,许多无人机具有有限的机载计算能力,因此希望快速估计这些值。标准方法是扩展卡尔曼滤波,它是卡尔曼滤波的非线性扩展,通过在当前状态附近线性化非线性转换和测量模型。然而,无迹卡尔曼滤波在几乎所有方面都更好:更容易实现,估计更准确,运行时间相当。

英文摘要

This document describes standard approaches for filtering and estimation for quadrotors, created for the Udacity Flying Cars course. We assume previous knowledge of probability and some knowledge of linear algebra. We do not assume previous knowledge of Kalman filters or Bayes filters. This document derives an EKF for various models of drones in 1D, 2D, and 3D. We use the EKF and notation as defined in Thrun et al. [13]. We also give pseudocode for the Bayes filter, the EKF, and the Unscented Kalman filter [14]. The motivation behind this document is the lack of a step-by-step EKF tutorial that provides the derivations for a quadrotor helicopter. The goal of estimation is to infer the drone's state (pose, velocity, acceleration, and biases) from its sensor values and control inputs. This problem is challenging because sensors are noisy. Additionally, because of weight and cost issues, many drones have limited on-board computation so we want to estimate these values as quickly as possible. The standard method for performing this method is the Extended Kalman filter, a nonlinear extension of the Kalman filter which linearizes a nonlinear transition and measurement model around the current state. However the Unscented Kalman filter is better in almost every respect: simpler to implement, more accurate to estimate, and comparable runtimes.

1808.06900 2026-06-04 cs.NI cs.DS cs.RO cs.SY eess.SY 版本更新

Defending against Intrusion of Malicious UAVs with Networked UAV Defense Swarms

用网络化无人机防御群抵御恶意无人机入侵

Matthias R. Brust, Grégoire Danoy, Pascal Bouvry, Dren Gashi, Himadri Pathak, Mike P. Gonçalves

发表机构 * Interdisciplinary Centre for Security, Reliability and Trust (SnT), University of Luxembourg, Luxembourg(安全、可靠与信任 interdisciplinary 中心(SnT),卢森堡大学,卢森堡) Faculty of Science, Technology and Communication (FSTC), University of Luxembourg, Luxembourg(科学、技术与通信学院(FSTC),卢森堡大学,卢森堡)

AI总结 本文提出了一种无人机防御系统,通过自组织防御编队拦截并护送恶意无人机至飞行区外,采用模块化设计和自平衡聚类算法,实现通信损失下的鲁棒性。

Comments IEEE Conference on Local Computer Networks (LCN), 2017

详情
AI中文摘要

如今,亚马逊、阿里巴巴等公司正推动使用无人机(UAVs)提供服务,如包裹和食品配送。随着政府希望利用无人机带来的巨大经济利益,城市规划者正在将所谓的无人机飞行区和无人机高速公路纳入智能城市设计。然而,无人机的高速移动和行为动态需要监控以检测并处理入侵者、 rogue drones 和恶意无人机。本文提出了一种无人机防御系统,旨在拦截并护送恶意无人机至飞行区外。所提出的无人机防御系统由一个能够自行组织防御编队的防御无人机群组成,并在检测到入侵者时形成拦截和捕获编队。我们采用了模块化设计原则,开发了一种创新的自平衡聚类过程来实现拦截和捕获编队。结果表明,所得到的网络化防御无人机群对通信损失具有鲁棒性。最后,实现了一个原型无人机模拟器。通过广泛的模拟,我们展示了该方法的可行性和性能。

英文摘要

Nowadays, companies such as Amazon, Alibaba, and even pizza chains are pushing forward to use drones, also called UAVs (Unmanned Aerial Vehicles), for service provision, such as package and food delivery. As governments intend to use these immense economic benefits that UAVs have to offer, urban planners are moving forward to incorporate so-called UAV flight zones and UAV highways in their smart city designs. However, the high-speed mobility and behavior dynamics of UAVs need to be monitored to detect and, subsequently, to deal with intruders, rogue drones, and UAVs with a malicious intent. This paper proposes a UAV defense system for the purpose of intercepting and escorting a malicious UAV outside the flight zone. The proposed UAV defense system consists of a defense UAV swarm, which is capable to self-organize its defense formation in the event of intruder detection, and chase the malicious UAV as a networked swarm. Modular design principles have been used for our fully localized approach. We developed an innovative auto-balanced clustering process to realize the intercept- and capture-formation. As it turned out, the resulting networked defense UAV swarm is resilient against communication losses. Finally, a prototype UAV simulator has been implemented. Through extensive simulations, we show the feasibility and performance of our approach.

1806.05220 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Decentralized Ergodic Control: Distribution-Driven Sensing and Exploration for Multi-Agent Systems

去中心化恒定控制:面向多智能体系统的分布驱动感知与探索

Ian Abraham, Todd D. Murphey

发表机构 * Neuroscience and Robotics Laboratory(神经科学与机器人实验室)

AI总结 本文提出一种去中心化恒定控制策略,用于解决多智能体非线性动态系统的时间变化区域覆盖问题,通过共识实现完全去中心化的多智能体控制政策,并展示了其在多智能体地形映射和目标定位中的应用。

Comments 8 pages, Accepted for publication in IEEE Robotics and Automation Letters

详情
Journal ref
IEEE Robotics and Automation Letters, vol. 3, no. 4, pp. 2377-3766, 2018
AI中文摘要

我们提出了一种去中心化恒定控制策略,用于解决多智能体在时间变化区域覆盖问题中的非线性动态。恒定控制允许我们将分布作为非线性机器人系统的区域覆盖问题的目标,作为一种闭式控制器。我们推导出一种恒定控制策略的变体,可用于共识,以实现完全去中心化的多智能体控制策略。通过示例展示了我们的方法在多智能体地形映射以及目标定位中的适用性。还提供了对恒定策略作为纳什均衡的分析,用于博弈论应用。

英文摘要

We present a decentralized ergodic control policy for time-varying area coverage problems for multiple agents with nonlinear dynamics. Ergodic control allows us to specify distributions as objectives for area coverage problems for nonlinear robotic systems as a closed-form controller. We derive a variation to the ergodic control policy that can be used with consensus to enable a fully decentralized multi-agent control policy. Examples are presented to illustrate the applicability of our method for multi-agent terrain mapping as well as target localization. An analysis on ergodic policies as a Nash equilibrium is provided for game theoretic applications.

1808.08309 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Trajectory Tracking Control of a Flexible Spine Robot, With and Without a Reference Input

具有和不具有参考输入的柔性脊柱机器人类轨迹跟踪控制

Andrew P. Sabelhaus, Shirley Huajing Zhao, Mallory C. Daly, Ellande Tang, Edward Zhu, Abishek K. Akella, Zeerek A. Ahmad, Vytas SunSpiral, Alice M. Agogino

发表机构 * NASA Ames Intelligent Robotics Group(美国航空航天局阿姆斯研究中心智能机器人组) Dynamic Tensegrity Robotics Lab(动态张力机器人实验室) Levant Power Corp.(Levant Power公司) Velo3D Inc.(Velo3D公司) SGT Inc.(SGT公司)

AI总结 本文提出两种控制器,一种不使用参考输入但包含平滑约束,另一种使用参考输入但无平滑,用于柔性脊柱机器人轨迹跟踪控制。

详情
Journal ref
2017 NASA/ESA Conference on Adaptive Hardware and Systems - Workshop on Structurally Adaptive Tensegrity Robots
AI中文摘要

ULTRA Spine项目旨在为四足机器人开发柔性驱动脊柱。本文使用模型预测控制在机器人状态空间中跟踪轨迹。所用状态轨迹对应脊柱弯曲运动,包括移动椎骨的平移和旋转。本文提出了两种控制器:一种不使用参考输入但包含平滑约束,另一种使用参考输入但无平滑。对于无参考输入的平滑控制器,误差收敛到零,而使用输入参考的简单调节器显示小误差但未完全收敛。预计随着进一步改进,该控制器将收敛。

英文摘要

The Underactuated Lightweight Tensegrity Robotic Assistive Spine (ULTRA Spine) project is an ongoing effort to develop a flexible, actuated backbone for quadruped robots. In this work, model-predictive control is used to track a trajectory in the robot's state space, in simulation. The state trajectory used here corresponds to a bending motion of the spine, with translations and rotations of the moving vertebrae. Two different controllers are presented in this work: one that does not use a reference input but includes smoothing constrants, and a second one that uses a reference input without smoothing. For the smoothing controller, without reference inputs, the error converges to zero, while the simpler-to-tune controller with an input reference shows small errors but not complete convergence. It is expected that this controller will converge as it is improved further.

1808.06652 2026-06-04 eess.SY cs.RO cs.SY 版本更新

On the Optimality of Ergodic Trajectories for Information Gathering Tasks

关于信息采集任务中ergodic轨迹的最优性

Louis Dressel, Mykel J. Kochenderfer

AI总结 本文研究了在重复测量同一状态下信息衰减线性假设条件下,最优信息采集轨迹为ergodic轨迹的问题,通过实验验证了ergodic性与最优信息采集及submodularity之间的联系。

Comments Presented at 2018 American Control Conference (ACC)

详情
AI中文摘要

近年来,ergodic控制被提出用于引导移动传感器进行信息采集任务。在ergodic控制中,移动传感器遵循一个与某些信息密度分布相关的ergodic轨迹。轨迹是ergodic的,如果在状态空间区域中花费的时间与该区域的信息密度成比例。尽管ergodic控制在实验中表现出色,但对其为何有效以及何时最优的理解仍有限。本文研究了一类问题,其中最优信息采集轨迹是ergodic的。该类问题基于从同一状态重复测量的submodularity假设。假设区域中可用的信息随停留时间线性衰减。这一假设指导了ergodic轨迹生成中时间范围的选择。我们通过一系列实验支持我们的结论,展示了ergodic性、最优信息采集和submodularity之间的联系。

英文摘要

Recently, ergodic control has been suggested as a means to guide mobile sensors for information gathering tasks. In ergodic control, a mobile sensor follows a trajectory that is ergodic with respect to some information density distribution. A trajectory is ergodic if time spent in a state space region is proportional to the information density of the region. Although ergodic control has shown promising experimental results, there is little understanding of why it works or when it is optimal. In this paper, we study a problem class under which optimal information gathering trajectories are ergodic. This class relies on a submodularity assumption for repeated measurements from the same state. It is assumed that information available in a region decays linearly with time spent there. This assumption informs selection of the horizon used in ergodic trajectory generation. We support our claims with a set of experiments that demonstrate the link between ergodicity, optimal information gathering, and submodularity.

1802.08215 2026-06-04 cs.RO cs.SY eess.SY 版本更新

ArduSoar: an Open-Source Thermalling Controller for Resource-Constrained Autopilots

ArduSoar:一种为资源受限自动驾驶仪设计的开源热气球控制器

Samuel Tabor, Iain Guilliard, Andrey Kolobov

发表机构 * Glasgow, Scotland(格拉斯哥,苏格兰) Australian National University(澳大利亚国立大学) Microsoft Research(微软研究院)

AI总结 本文提出ArduSoar,首个集成于主流小型无人机自动驾驶软件中的热气球控制器,通过算法设计、与ArduPlane的集成及实飞测试验证其在非理想大气条件下的鲁棒性。

详情
AI中文摘要

自主热气球能力有潜力显著增加固定翼无人机的飞行时间。本文介绍ArduSoar,首个集成于主流小型无人机自动驾驶软件套件中的热气球控制器。我们从算法角度描述ArduSoar,概述其与ArduPlane自动驾驶仪的集成,讨论其参数调节,并在真实小型无人机上进行一系列飞行测试,证明ArduSoar在高度非理想大气条件下仍具鲁棒性。

英文摘要

Autonomous soaring capability has the potential to significantly increase time aloft for fixed-wing UAVs. In this paper, we introduce ArduSoar, the first soaring controller integrated into a major autopilot software suite for small UAVs. We describe ArduSoar from the algorithmic standpoint, outline its integration with the ArduPlane autopilot, discuss parameter tuning for it, and conduct a series of flight tests on real sUAVs that show ArduSoar's robustness even in highly non-ideal atmospheric conditions.

1808.06018 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Optimized Path Planning for Inspection by Unmanned Aerial Vehicles Swarm with Energy Constraints

基于能量约束的无人机群巡检路径优化

Momena Monwar, Omid Semiari, Walid Saad

发表机构 * Georgia Southern University(佐治亚南方大学) Virginia Tech(弗吉尼亚理工大学)

AI总结 本文提出一种考虑无人机能量限制的高效路径规划算法,通过优化飞行、悬停和数据传输能耗,降低巡检总时间和能量消耗,为自主巡检系统设计提供指导。

Comments IEEE Global Communications Conference (GLOBECOM), Ad Hoc and Sensor Networks Symposium

详情
AI中文摘要

自主巡检大规模地理区域是未来智能城市等物理信息系统中高效危险检测和灾害管理的核心要求。本文提出一种新的路径规划算法,用于在严格能量可用性约束下实现高效的巡检。所开发的框架考虑了无人机群在巡检操作中的所有能耗方面,包括飞行、悬停和数据传输所需的能量。证明所提出的算法可以在多项式时间内高效解决路径规划问题。仿真结果表明,该算法在减少总体巡检时间和能量消耗方面有显著优势。此外,结果还为设计自主巡检系统提供了确定所需无人机数量和能量水平的指导。

英文摘要

Autonomous inspection of large geographical areas is a central requirement for efficient hazard detection and disaster management in future cyber-physical systems such as smart cities. In this regard, exploiting unmanned aerial vehicle (UAV) swarms is a promising solution to inspect vast areas efficiently and with low cost. In fact, UAVs can easily fly and reach inspection points, record surveillance data, and send this information to a wireless base station (BS). Nonetheless, in many cases, such as operations at remote areas, the UAVs cannot be guided directly by the BS in real-time to find their path. Moreover, another key challenge of inspection by UAVs is the limited battery capacity. Thus, realizing the vision of autonomous inspection via UAVs requires energy-efficient path planning that takes into account the energy constraint of each individual UAV. In this paper, a novel path planning algorithm is proposed for performing energy-efficient inspection, under stringent energy availability constraints for each UAV. The developed framework takes into account all aspects of energy consumption for a UAV swarm during the inspection operations, including energy required for flying, hovering, and data transmission. It is shown that the proposed algorithm can address the path planning problem efficiently in polynomial time. Simulation results show that the proposed algorithm can yield substantial performance gains in terms of minimizing the overall inspection time and energy. Moreover, the results provide guidelines to determine parameters such as the number of required UAVs and amount of energy, while designing an autonomous inspection system.

1808.03983 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Robot Safe Interaction System for Intelligent Industrial Co-Robots

智能工业协作机器人安全交互系统

Changliu Liu, Masayoshi Tomizuka

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 本文提出一种安全交互系统,通过并行规划与控制架构提升协作机器人在动态不确定环境中的效率与安全性,实验验证了方法的有效性。

Comments 12 pages

详情
AI中文摘要

人类-机器人交互被认为是未来工业协作机器人(协作机器人)的关键要素。不同于传统机器人在结构化和确定性环境中的工作方式,协作机器人需要在高度非结构化和随机环境中操作。为确保协作机器人在动态不确定环境中高效安全地运行,本文介绍了机器人安全交互系统。为解决人类-机器人交互中的不确定性,提出了一种独特的并行规划与控制架构,该架构包含一个长期全局规划器以确保机器人行为的效率,以及一个短期局部规划器以在不确定性下确保实时安全性。为使机器人能够立即响应环境变化,使用快速算法进行实时计算,即用于长期优化的凸可行性集算法和用于短期优化的安全集算法。介绍了几个测试平台,用于在部署初期对开发系统的安全性进行评估。通过与工业机器人机械臂的实验验证了所提方法的有效性和效率。

英文摘要

Human-robot interactions have been recognized to be a key element of future industrial collaborative robots (co-robots). Unlike traditional robots that work in structured and deterministic environments, co-robots need to operate in highly unstructured and stochastic environments. To ensure that co-robots operate efficiently and safely in dynamic uncertain environments, this paper introduces the robot safe interaction system. In order to address the uncertainties during human-robot interactions, a unique parallel planning and control architecture is proposed, which has a long term global planner to ensure efficiency of robot behavior, and a short term local planner to ensure real time safety under uncertainties. In order for the robot to respond immediately to environmental changes, fast algorithms are used for real-time computation, i.e., the convex feasible set algorithm for the long term optimization, and the safe set algorithm for the short term optimization. Several test platforms are introduced for safe evaluation of the developed system in the early phase of deployment. The effectiveness and the efficiency of the proposed method have been verified in experiment with an industrial robot manipulator.

1808.03037 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Passive Compliance Control of Aerial Manipulators

空载合规控制的空中机械臂

Min Jun Kim, Ribin Balachandran, Marco De Stefano, Konstantin Kondak, Christian Ott

发表机构 * German Aerospace Center (DLR)(德国航空航天中心)

AI总结 本文提出了一种空载合规控制方法,通过合理选择末端执行器坐标和时间域被动技术,确保空中机械臂在无动力驱动情况下实现稳定环境交互。

Comments IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018

详情
AI中文摘要

本文提出了一种用于空中机械臂的被动合规控制方法,以实现稳定的环境交互。主要挑战是空中车辆的机身平面方向上缺乏驱动能力,这可能在交互过程中需要保持被动性。本文提出的控制器通过合理选择末端执行器坐标来保证机械臂的被动性,而通过利用时间域被动技术保证车辆机身的被动性。仿真研究验证了所提出方法的有效性。

英文摘要

This paper presents a passive compliance control for aerial manipulators to achieve stable environmental interactions. The main challenge is the absence of actuation along body-planar directions of the aerial vehicle which might be required during the interaction to preserve passivity. The controller proposed in this paper guarantees passivity of the manipulator through a proper choice of end-effector coordinates, and that of vehicle fuselage is guaranteed by exploiting time domain passivity technique. Simulation studies validate the proposed approach.

1808.02393 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Control of Multi-Agent Systems with Finite Time Control Barrier Certificates and Temporal Logic

多智能体系统的有限时间控制屏障证书与时序逻辑控制

Mohit Srinivasan, Samuel Coogan, Magnus Egerstedt

发表机构 * Georgia Institute of Technology(佐治亚理工学院)

AI总结 本文提出利用有限时间收敛控制屏障函数和线性时序逻辑规范合成多智能体连续时间动态系统的控制器,确保系统在有限时间内收敛到目标集并保持前向不变性,解决连续时间可达性问题。

Comments To appear in the 57th IEEE Conference on Decision and Control, Miami Beach, FL, USA, 2018

详情
AI中文摘要

本文提出了一种方法,用于利用有限时间收敛控制屏障函数和线性时序逻辑规范合成连续时间多智能体动态系统的控制器。在存在合适有限时间收敛控制屏障函数的情况下,保证系统在状态空间中有限时间收敛到目标集。此外,这些屏障函数在系统收敛到目标集后也保证前向不变性。这使得我们能够建立一个理论框架,用于合成多智能体系统的控制器。这些性质还使我们能够通过多个有限时间收敛控制屏障函数的组合定理来解决连续时间的可达性问题。该方法比现有方法更灵活,并允许更广泛的可行控制律。线性时序逻辑用于指定多智能体系统需要满足的复杂任务规范。通过这种方法,可以合成满足给定时序逻辑任务规范的控制律。提供的机器人实验是在佐治亚理工学院的Robotarium多机器人测试平台上进行的。

英文摘要

In this paper, a method to synthesize controllers using finite time convergence control barrier functions guided by linear temporal logic specifications for continuous time multi-agent dynamical systems is proposed. Finite time convergence to a desired set in the state space is guaranteed under the existence of a suitable finite time convergence control barrier function. In addition, these barrier functions also guarantee forward invariance once the system converges to the desired set. This allows us to formulate a theoretical framework which synthesizes controllers for the multi-agent system. These properties also enable us to solve the reachability problem in continuous time by formulating a theorem on the composition of multiple finite time convergence control barrier functions. This approach is more flexible than existing methods and also allows for a greater set of feasible control laws. Linear temporal logic is used to specify complex task specifications that need to be satisfied by the multi-agent system. With this solution methodology, a control law is synthesized that satisfies the given temporal logic task specification. Robotic experiments are provided which were performed on the Robotarium multi-robot testbed at Georgia Tech.

1808.02000 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Bionic Reflex Control Strategy for Robotic Finger with Kinematic Constraints

仿生反射控制策略用于具有运动学约束的机械手指

Narkhede Kunal Sanjay, Shyamanta M. Hazarika

发表机构 * Department of Mechanical Engineering(机械工程系) Indian Institute of Technology Kharagpur(印度理工学院卡里格普分校) Indian Institute of Technology Guwahati(印度理工学院古瓦哈蒂)

AI总结 本文提出了一种用于具有运动学约束的机械手指的仿生反射控制策略,通过力跟踪阻抗控制策略实现仿生反射,减少手指的动力学模型并讨论了允许精确力跟踪的阻抗控制策略,展示了单指在平面上支撑矩形物体的仿真结果,反射响应时间在毫秒级。

Comments 5 pages, 7 figures

详情
AI中文摘要

本文提出了一种用于具有运动学约束的机械手指的仿生反射控制策略。此处,仿生反射是通过力跟踪阻抗控制策略实现的。手指的动力学模型在受到运动学约束时被简化。随后,讨论了允许精确跟踪力的阻抗控制策略。展示了单指在平面上支撑矩形物体的仿真结果。仿生反射响应时间在毫秒级。

英文摘要

This paper presents a bionic reflex control strategy for a kinematically constrained robotic finger. Here, the bionic reflex is achieved through a force tracking impedance control strategy. The dynamic model of the finger is reduced subject to kinematic constraints. Thereafter, an impedance control strategy that allows exact tracking of forces is discussed. Simulation results for a single finger holding a rectangular object against a flat surface are presented. Bionic reflex response time is of the order of milliseconds.

1712.07249 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Probabilistic Learning of Torque Controllers from Kinematic and Force Constraints

基于概率学习的扭矩控制器从运动学和力约束中学习

João Silvério, Yanlong Huang, Leonel Rozo, Sylvain Calinon, Darwin G. Caldwell

发表机构 * Department of Advanced Robotics, Istituto Italiano di Tecnologia(意大利先进机器人研究所机器人部) Idiap Research Institute(Idiap研究 institute)

AI总结 本文提出一种概率方法,同时学习和合成扭矩控制命令,考虑任务空间、关节空间和力约束,通过概率学习不同扭矩控制器的相关性,结合高斯分布特性生成满足任务特征的新扭矩命令。

Comments Accepted for publication at 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

详情
AI中文摘要

在从示范中学习技能时,通常需要提前考虑适当的任务表示(通常在操作空间或配置空间中)。本文提出了一种概率方法,同时学习和合成扭矩控制命令,考虑任务空间、关节空间和力约束。我们通过考虑作用于机器人上的不同扭矩控制器,其相关性从示范中概率性地学习。利用高斯分布的性质,将这些控制器结合起来,生成满足任务重要特征的新扭矩命令。我们在两个实验场景中使用7自由度扭矩控制机械臂进行验证,任务需要考虑不同控制器以正确执行。

英文摘要

When learning skills from demonstrations, one is often required to think in advance about the appropriate task representation (usually in either operational or configuration space). We here propose a probabilistic approach for simultaneously learning and synthesizing torque control commands which take into account task space, joint space and force constraints. We treat the problem by considering different torque controllers acting on the robot, whose relevance is learned probabilistically from demonstrations. This information is used to combine the controllers by exploiting the properties of Gaussian distributions, generating new torque commands that satisfy the important features of the task. We validate the approach in two experimental scenarios using 7-DoF torquecontrolled manipulators, with tasks that require the consideration of different controllers to be properly executed.

1808.00869 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Developing Robot Driver Etiquette Based on Naturalistic Human Driving Behavior

基于自然人类驾驶行为发展机器人驾驶礼仪

Xianan Huang, Songan Zhang, Huei Peng

AI总结 本文研究机器人驾驶礼仪问题,通过分析自然驾驶数据库提取人类驾驶行为关键参数,为未来高自动化车辆算法设计和仿真中的人类驾驶行为建模提供指导。

详情
AI中文摘要

自动化车辆可通过提升安全、出行和燃油效率改变社会。然而,由于成本较高和商业模式变化,未来几十年内,高度自动化车辆很可能继续与人类驾驶车辆交互。过去,高度自动化(机器人)车辆的控制/设计主要考虑安全和效率,但未能解决周围人类驾驶车辆的“驾驶文化”问题。因此,机器人车辆可能表现出与其他车辆截然不同的行为。本文研究这一“驾驶礼仪”问题。作为第一步,我们报告了从大规模自然驾驶数据库中提取的人类驾驶车辆的关键行为参数。这些结果可用于指导未来高自动化车辆的算法设计,或在仿真中开发现实的人类驾驶车辆行为模型。

英文摘要

Automated vehicles can change the society by improved safety, mobility and fuel efficiency. However, due to the higher cost and change in business model, over the coming decades, the highly automated vehicles likely will continue to interact with many human-driven vehicles. In the past, the control/design of the highly automated (robotic) vehicles mainly considers safety and efficiency but failed to address the "driving culture" of surrounding human-driven vehicles. Thus, the robotic vehicles may demonstrate behaviors very different from other vehicles. We study this "driving etiquette" problem in this paper. As the first step, we report the key behavior parameters of human driven vehicles derived from a large naturalistic driving database. The results can be used to guide future algorithm design of highly automated vehicles or to develop realistic human-driven vehicle behavior model in simulations.

1807.05290 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Adaptive Model Predictive Control for High-Accuracy Trajectory Tracking in Changing Conditions

自适应模型预测控制在变化条件下高精度轨迹跟踪中的应用

Karime Pereida, Angela Schoellig

发表机构 * Dynamic Systems Lab(动态系统实验室) University of Toronto Institute for Aerospace Studies(多伦多大学航空航天研究 institute)

AI总结 本文提出一种结合模型预测控制与L1自适应控制器的自适应模型预测控制器,用于在未知和变化的扰动环境下提高系统轨迹跟踪性能。通过实验验证,该方法在四旋翼无人机上表现出更低的轨迹跟踪误差。

详情
AI中文摘要

机器人和自动化系统越来越多地被引入未知和动态环境,这些环境要求它们能够处理扰动、未建模动力学和参数不确定性。为在这些动态环境中实现高性能,需要鲁棒和自适应的控制策略。本文提出了一种新颖的自适应模型预测控制器,结合模型预测控制(MPC)与底层的L1自适应控制器,以提高受未知和变化扰动影响的系统轨迹跟踪性能。L1自适应控制器迫使系统以参考模型指定的方式运行。更高层的模型预测控制器则基于成本函数计算最优参考输入,同时考虑输入和状态约束。我们专注于所提出方法的实验验证,并在四旋翼无人机上展示了其有效性。我们表明,所提出的方法在外部风扰动下,其轨迹跟踪误差比非预测性自适应方法和预测性非自适应方法更低。

英文摘要

Robots and automated systems are increasingly being introduced to unknown and dynamic environments where they are required to handle disturbances, unmodeled dynamics, and parametric uncertainties. Robust and adaptive control strategies are required to achieve high performance in these dynamic environments. In this paper, we propose a novel adaptive model predictive controller that combines model predictive control (MPC) with an underlying $\mathcal{L}_1$ adaptive controller to improve trajectory tracking of a system subject to unknown and changing disturbances. The $\mathcal{L}_1$ adaptive controller forces the system to behave in a predefined way, as specified by a reference model. A higher-level model predictive controller then uses this reference model to calculate the optimal reference input based on a cost function, while taking into account input and state constraints. We focus on the experimental validation of the proposed approach and demonstrate its effectiveness in experiments on a quadrotor. We show that the proposed approach has a lower trajectory tracking error compared to non-predictive, adaptive approaches and a predictive, non-adaptive approach, even when external wind disturbances are applied.

1807.11578 2026-06-04 eess.SP cs.RO cs.SY eess.SY 版本更新

Trajectory Optimization for Cooperative Dual-band UAV Swarms

协同双频无人机编队轨迹优化

Hakim Ghazzai, Mahdi Ben Ghorbel, Andreas Kassler, Md. Jahangir Hossain

发表机构 * Stevens Institute of Technology(史蒂文斯理工学院) The University of British Columbia(不列颠哥伦比亚大学) Karlstad University(卡尔斯塔德大学)

AI总结 本文针对带宽需求大且容忍延迟的应用,提出双频无人机轨迹优化框架,通过混合非线性规划问题优化无人机路径和停靠点,以最小化总服务时间。

Comments 8 pages, 5 figures, conference Globecom 2018

详情
AI中文摘要

无人驾驶航空器(UAVs)在多样无线通信领域中获得了广泛 popularity。由于其能够提供视距链路的能力,它们可以作为高空飞行中继器,支持地面节点之间的通信。随着物联网的蓬勃发展,多种新型应用不断涌现。本文聚焦于带宽需求大且容忍延迟的应用,其中多个收发器对需要无人机的支持来完成其传输。为此,无人机有可能使用两种不同的频段,即典型的微波频段和高速毫米波频段。本文开发了一个通用框架,将无人机分配给支持的收发器并优化其轨迹,以最小化总服务时间的加权函数。考虑到中继消息所需的时间和无人机飞行时间,一个混合非线性规划问题被提出,旨在找到无人机停靠点以转发数据给接收器。然后开发了一种迭代方法来解决该问题。首先,通过混合线性规划问题最优解决来确定每架可用无人机的路径。然后执行分层迭代搜索以提高无人机停靠点位置并减少服务时间。所提框架的行为和优势在选定场景中进行了展示。

英文摘要

Unmanned aerial vehicles (UAVs) have gained a lot of popularity in diverse wireless communication fields. They can act as high-altitude flying relays to support communications between ground nodes due to their ability to provide line-of-sight links. With the flourishing Internet of Things, several types of new applications are emerging. In this paper, we focus on bandwidth hungry and delay-tolerant applications where multiple pairs of transceivers require the support of UAVs to complete their transmissions. To do so, the UAVs have the possibility to employ two different bands namely the typical microwave and the high-rate millimeter wave bands. In this paper, we develop a generic framework to assign UAVs to supported transceivers and optimize their trajectories such that a weighted function of the total service time is minimized. Taking into account both the communication time needed to relay the message and the flying time of the UAVs, a mixed non-linear programming problem aiming at finding the stops at which the UAVs hover to forward the data to the receivers is formulated. An iterative approach is then developed to solve the problem. First, a mixed linear programming problem is optimally solved to determine the path of each available UAV. Then, a hierarchical iterative search is executed to enhance the UAV stops' locations and reduce the service time. The behavior of the UAVs and the benefits of the proposed framework are showcased for selected scenarios.

1807.11553 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Reach-Avoid Problems via Sum-of-Squares Optimization and Dynamic Programming

通过平方和优化与动态规划解决可达-回避问题

Benoit Landry, Mo Chen, Scott Hemley, Marco Pavone

AI总结 本文结合平方和优化与动态规划,提出一种保守解法,保证系统在可达与回避之间的切换,适用于多项式动力学和一般问题设置,并在计算效率上优于传统方法。

Comments International Conference on Intelligent Robots & Systems (IROS), 2018

详情
AI中文摘要

可达-回避问题涉及将系统驱动到期望配置集的同时避免不利配置。提供数学保证具有挑战性,但有广泛应用。由于挑战,分析可达-回避问题需要在系统动力学的通用性、问题设置的通用性、解的最优性和计算复杂性之间做出权衡。本文结合平方和优化与动态规划,提出一种保守解法,保持可达性和回避保证。我们的方法适用于多项式系统动力学和一般问题设置,并比以前的相关方法更具计算可扩展性。通过涉及两个单积分器的数值示例,我们验证了所提理论,并将其与哈密顿-雅可比可达性进行比较。在验证理论后,我们通过计算涉及两个运动学汽车系统的可达-回避集,展示了我们方法的计算可扩展性。

英文摘要

Reach-avoid problems involve driving a system to a set of desirable configurations while keeping it away from undesirable ones. Providing mathematical guarantees for such scenarios is challenging but have numerous potential practical applications. Due to the challenges, analysis of reach-avoid problems involves making trade-offs between generality of system dynamics, generality of problem setups, optimality of solutions, and computational complexity. In this paper, we combine sum-of-squares optimization and dynamic programming to address the reach-avoid problem, and provide a conservative solution that maintains reaching and avoidance guarantees. Our method is applicable to polynomial system dynamics and to general problem setups, and is more computationally scalable than previous related methods. Through a numerical example involving two single integrators, we validate our proposed theory and compare our method to Hamilton-Jacobi reachability. Having validated our theory, we demonstrate the computational scalability of our method by computing the reach-avoid set of a system involving two kinematic cars.

1807.03515 2026-06-04 eess.SY cs.NI cs.RO cs.SY 版本更新

A Reinforcement Learning Approach to Jointly Adapt Vehicular Communications and Planning for Optimized Driving

一种联合适应车载通信与规划的强化学习方法

Mayank K. Pal, Rupali Bhati, Anil Sharma, Sanjit K. Kaul, Saket Anand, P. B. Sujit

发表机构 * IIIT-Delhi(印度德里印度理工学院)

AI总结 本文提出一种强化学习方法,用于联合优化自动驾驶车辆的通信与运动规划,通过模拟验证了该方法在提升驾驶效益方面的有效性。

Comments 7 pages, 7 figures; Accepted as a conference paper at IEEE ITSC 2018

详情
AI中文摘要

我们的前提是自动驾驶车辆必须联合优化通信和运动规划。具体而言,车辆必须在考虑通信速率相关约束的情况下调整其运动计划,并在考虑道路环境可能施加的运动规划限制的情况下调整通信使用。为此,我们提出了一个强化学习问题,其中自动驾驶车辆同时选择(a)在道路上执行的运动规划动作和(b)查询基础设施感知信息的通信动作。目标是优化自动驾驶车辆的驾驶效益。我们应用Q学习算法使车辆学习最优策略,该策略在任何给定时间都能做出最优的规划和通信动作选择。我们通过模拟验证了最优策略在智能适应通信和规划动作方面的能力,同时实现了较高的驾驶效益。

英文摘要

Our premise is that autonomous vehicles must optimize communications and motion planning jointly. Specifically, a vehicle must adapt its motion plan staying cognizant of communications rate related constraints and adapt the use of communications while being cognizant of motion planning related restrictions that may be imposed by the on-road environment. To this end, we formulate a reinforcement learning problem wherein an autonomous vehicle jointly chooses (a) a motion planning action that executes on-road and (b) a communications action of querying sensed information from the infrastructure. The goal is to optimize the driving utility of the autonomous vehicle. We apply the Q-learning algorithm to make the vehicle learn the optimal policy, which makes the optimal choice of planning and communications actions at any given time. We demonstrate the ability of the optimal policy to smartly adapt communications and planning actions, while achieving large driving utilities, using simulations.

1807.09905 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Toward Efficient and Robust Biped Walking Optimization

迈向高效且稳健的双足行走优化

Nihar Talele, Katie Byl

AI总结 本文研究双足机器人行走的高效与稳健优化,探讨步态的能耗与鲁棒性量化,以及运动轨迹与机器人参数的联合优化,通过五连杆平面行走模型验证效率与稳健性的平衡。

详情
AI中文摘要

实用的双足机器人运动需要同时具备能量效率和对变化与不确定性的鲁棒性。本文基于最近的轨迹优化研究,有两个主要目标:首先,展示考虑并量化步态的能耗与鲁棒性的重要性,以及优化名义运动轨迹和机器人设计参数及反馈控制策略的重要性;其次,通过五连杆平面行走模型的优化研究,提供关于提高效率与鲁棒性之间权衡和一般趋势的初步数据。在解决重要的开放性挑战时,特别讨论了在选择始终是近似优化的指标以及在结构和调节反馈控制时的选择影响。

英文摘要

Practical bipedal robot locomotion needs to be both energy efficient and robust to variability and uncertainty. In this paper, we build upon recent works in trajectory optimization for robot locomotion with two primary goals. First, we wish to demonstrate the importance of (a) considering and quantifying not only energy efficiency but also robustness of gaits, and (b) optimization not only of nominal motion trajectories but also of robot design parameters and feedback control policies. As a second, complementary focus, we present results from optimization studies on a 5-link planar walking model, to provide preliminary data on particular trade-offs and general trends in improving efficiency versus robustness. In addressing important, open challenges, we focus in particular on discussions of the effects of choices made (a) in formulating what is always, necessarily only an approximate optimization, in choosing metrics for performance, and (b) in structuring and tuning feedback control.

1803.09022 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Controller Synthesis for Discrete-Time Polynomial Systems via Occupation Measures

通过占用测度方法为离散时间多项式系统设计控制器

Weiqiao Han, Russ Tedrake

发表机构 * Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology(计算机科学与人工智能实验室,麻省理工学院)

AI总结 本文通过占用测度方法为离散时间多项式动力系统设计非线性状态反馈控制器,将控制器合成问题转化为无限维线性规划问题,并通过松弛为有限维半正定规划问题,从而提取非线性控制器,该方法具有计算复杂度低且可扩展性强的优势。

详情
AI中文摘要

本文中,我们通过占用测度方法为离散时间多项式动力系统设计非线性状态反馈控制器。我们提出了离散时间受控李雅普诺夫方程,并利用该方程将控制器合成问题转化为关于测度的无限维线性规划问题,随后将其松弛为关于测度矩及其对偶的有限维半正定规划问题。非线性控制器可以从松弛问题的解中提取。占用测度方法的优势在于我们解决的是凸问题而非通常非凸问题,且计算复杂度与状态和输入维度呈多项式关系,因此该方法更具可扩展性。此外,我们还展示了该方法可以应用于近似离散时间自主多项式系统的后向可达集以及在已知状态反馈控制律下的离散时间多项式系统的可控集。我们还在多个动力系统上展示了我们的方法。

英文摘要

In this paper, we design nonlinear state feedback controllers for discrete-time polynomial dynamical systems via the occupation measure approach. We propose the discrete-time controlled Liouville equation, and use it to formulate the controller synthesis problem as an infinite-dimensional linear programming problem on measures, which is then relaxed as finite-dimensional semidefinite programming problems on moments of measures and their duals on sums-of-squares polynomials. Nonlinear controllers can be extracted from the solutions to the relaxed problems. The advantage of the occupation measure approach is that we solve convex problems instead of generally non-convex problems, and the computational complexity is polynomial in the state and input dimensions, and hence the approach is more scalable. In addition, we show that the approach can be applied to over-approximating the backward reachable set of discrete-time autonomous polynomial systems and the controllable set of discrete-time polynomial systems under known state feedback control laws. We illustrate our approach on several dynamical systems.

1803.07725 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Semidefinite Outer Approximation of the Backward Reachable Set of Discrete-time Autonomous Polynomial Systems

半正定外近似法用于离散时间自治多项式系统的逆向可达集

Weiqiao Han, Russ Tedrake

AI总结 本文提出利用半正定规划对离散时间自治多项式系统的逆向可达集进行半正定外近似,通过构造测度优化问题并求解得到逼近结果。

Comments merged with the paper arXiv:1803.09022

详情
AI中文摘要

我们使用最近发展的occupation measure方法来近似离散时间自治多项式系统的逆向可达集。我们将问题形式化为一个关于测度的无限维线性规划问题及其对偶的连续函数问题。然后通过构造测度的矩序列的半正定规划层次来近似该线性规划问题。最后通过求解这些半正定规划问题,得到逆向可达集的外近似序列。我们通过三个动态系统展示了我们的方法。作为特殊情况,我们还展示了如何在多项式映射下近似紧致半代数集的预像。

英文摘要

We approximate the backward reachable set of discrete-time autonomous polynomial systems using the recently developed occupation measure approach. We formulate the problem as an infinite-dimensional linear programming (LP) problem on measures and its dual on continuous functions. Then we approximate the LP by a hierarchy of finite-dimensional semidefinite programming (SDP) programs on moments of measures and their duals on sums-of-squares polynomials. Finally we solve the SDP's and obtain a sequence of outer approximations of the backward reachable set. We demonstrate our approach on three dynamical systems. As a special case, we also show how to approximate the preimage of a compact semi-algebraic set under a polynomial map.

1807.08855 2026-06-04 stat.ML cs.LG cs.RO cs.SY eess.SP eess.SY 版本更新

Weak in the NEES?: Auto-tuning Kalman Filters with Bayesian Optimization

在NEES中薄弱:基于贝叶斯优化的自动调节卡尔曼滤波器

Zhaozhong Chen, Christoffer Heckman, Simon Julier, Nisar Ahmed

发表机构 * Department of Computer Science(计算机科学系) University of Colorado Boulder(科罗拉多大学博尔德分校) University College London(伦敦大学学院) Smead Aerospace Engineering Sciences(Smead航空航天工程科学系)

AI总结 本文提出一种基于贝叶斯优化的自动调节卡尔曼滤波器方法,通过智能采样参数空间,利用非参数高斯过程代理函数,高效识别多个局部极小值并提供结果不确定性量化。

Comments Final version presented at FUSION 2018 Conference, Cambridge, UK, July 2018 (submitted June 1, 2018)

详情
AI中文摘要

卡尔曼滤波器被广泛用于数据融合应用,包括导航、跟踪和同时定位与建图问题。然而,调整各种卡尔曼滤波器模型参数需要大量时间和努力,例如过程噪声协方差、非白噪声预白化滤波器模型等。传统优化技术在调整时容易陷入较差的局部极小值,并且使用真实传感器数据实施成本较高。为了解决这些问题,本文开发了一种新的“黑箱”贝叶斯优化策略,用于自动调节卡尔曼滤波器。在该方法中,性能由两种随机目标函数之一来表征:当可用真实状态模型时为归一化估计误差平方(NEES),当只有传感器数据可用时为归一化创新误差平方(NIS)。通过智能采样参数空间,学习和利用非参数高斯过程代理函数,贝叶斯优化可以高效地识别多个局部极小值,并对其结果提供不确定性量化。

英文摘要

Kalman filters are routinely used for many data fusion applications including navigation, tracking, and simultaneous localization and mapping problems. However, significant time and effort is frequently required to tune various Kalman filter model parameters, e.g. process noise covariance, pre-whitening filter models for non-white noise, etc. Conventional optimization techniques for tuning can get stuck in poor local minima and can be expensive to implement with real sensor data. To address these issues, a new "black box" Bayesian optimization strategy is developed for automatically tuning Kalman filters. In this approach, performance is characterized by one of two stochastic objective functions: normalized estimation error squared (NEES) when ground truth state models are available, or the normalized innovation error squared (NIS) when only sensor data is available. By intelligently sampling the parameter space to both learn and exploit a nonparametric Gaussian process surrogate function for the NEES/NIS costs, Bayesian optimization can efficiently identify multiple local minima and provide uncertainty quantification on its results.

1807.08604 2026-06-04 eess.SY cs.RO cs.SY eess.SP math.OC 版本更新

A Frequency-Domain Characterization of Optimal Error Covariance for the Kalman-Bucy Filter

卡尔曼-布西滤波器最优误差协方差的频域特性

Song Fang, Hideaki Ishii, Jie Chen, Karl Henrik Johansson

AI总结 本文通过频域积分特性,揭示卡尔曼-布西滤波器最优输出估计误差协方差与噪声协方差的比值迹可由系统动态和噪声统计表达。采用解析函数理论分析代数里卡蒂方程,并与布德积分相关联。

详情
AI中文摘要

本文发现,卡尔曼-布西滤波器所获得的最优输出估计误差协方差与噪声协方差的比值迹可通过系统动态和噪声统计在频域中明确表达。为此,我们利用解析函数理论分析与卡尔曼-布西滤波相关的代数里卡蒂方程,并将其与布德积分相关联。我们的方法提供了一种分析代数里卡蒂方程的替代频域框架,并减少到各种现有相关结果。

英文摘要

In this paper, we discover that the trace of the division of the optimal output estimation error covariance over the noise covariance attained by the Kalman-Bucy filter can be explicitly expressed in terms of the plant dynamics and noise statistics in a frequency-domain integral characterization. Towards this end, we examine the algebraic Riccati equation associated with Kalman-Bucy filtering using analytic function theory and relate it to the Bode integral. Our approach features an alternative, frequency-domain framework for analyzing algebraic Riccati equations and reduces to various existing related results.

1807.08048 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Baidu Apollo EM Motion Planner

百度 Apollo EM 运动规划器

Haoyang Fan, Fan Zhu, Changchun Liu, Liangliang Zhang, Li Zhuang, Dong Li, Weicheng Zhu, Jiangtao Hu, Hongye Li, Qi Kong

发表机构 * Baidu USA LLC(百度美国有限公司)

AI总结 本文提出基于百度 Apollo 开源自动驾驶平台的实时运动规划系统,解决工业级4级运动规划问题,兼顾安全性、舒适性和可扩展性,通过分层结构实现多车道和单车道自动驾驶。

详情
AI中文摘要

本文介绍了一种基于百度 Apollo(开源)自动驾驶平台的实时运动规划系统。该系统旨在解决工业级4级运动规划问题,同时考虑安全性、舒适性和可扩展性。系统采用分层结构处理多车道和单车道自动驾驶:(1)系统顶层为多车道策略,通过并行计算的车道级轨迹进行比较以处理变道场景。(2)在车道级轨迹生成器中,基于弗伦兹框架迭代求解路径和速度优化。(3)对于路径和速度优化,提出结合动态规划和基于样条的二次规划的方法,构建可扩展且易于调节的框架,同时处理交通规则、障碍物决策和平滑性。该规划器可扩展至高速公路和低速城市驾驶场景。我们通过场景示例和道路测试结果展示了该算法。本文描述的系统自2017年9月Apollo v1.5发布以来已部署到数十辆百度Apollo自动驾驶车辆。截至2018年5月16日,该系统已在各种城市场景下进行了3,380小时和约68,000公里(42,253英里)的闭环自动驾驶测试。本文描述的算法可在https://github.com/ApolloAuto/apollo/tree/master/modules/planning上获得。

英文摘要

In this manuscript, we introduce a real-time motion planning system based on the Baidu Apollo (open source) autonomous driving platform. The developed system aims to address the industrial level-4 motion planning problem while considering safety, comfort and scalability. The system covers multilane and single-lane autonomous driving in a hierarchical manner: (1) The top layer of the system is a multilane strategy that handles lane-change scenarios by comparing lane-level trajectories computed in parallel. (2) Inside the lane-level trajectory generator, it iteratively solves path and speed optimization based on a Frenet frame. (3) For path and speed optimization, a combination of dynamic programming and spline-based quadratic programming is proposed to construct a scalable and easy-to-tune framework to handle traffic rules, obstacle decisions and smoothness simultaneously. The planner is scalable to both highway and lower-speed city driving scenarios. We also demonstrate the algorithm through scenario illustrations and on-road test results. The system described in this manuscript has been deployed to dozens of Baidu Apollo autonomous driving vehicles since Apollo v1.5 was announced in September 2017. As of May 16th, 2018, the system has been tested under 3,380 hours and approximately 68,000 kilometers (42,253 miles) of closed-loop autonomous driving under various urban scenarios. The algorithm described in this manuscript is available at https://github.com/ApolloAuto/apollo/tree/master/modules/planning.

1807.05289 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Transfer Learning for High-Precision Trajectory Tracking Through $\mathcal{L}_1$ Adaptive Feedback and Iterative Learning

通过L1自适应反馈和迭代学习实现高精度轨迹跟踪的迁移学习

Karime Pereida, Dave Kooijman, Rikky R. P. R. Duivenvoorden, Angela P. Schoellig

发表机构 * Institute for Aerospace Studies, University of Toronto, North York, ON M3H 5T6, Canada(多伦多大学航空航天研究 institute, 北York, ON M3H 5T6, 加拿大)

AI总结 本文提出结合L1自适应控制与迭代学习控制的框架,用于在未知动态环境中实现高精度轨迹跟踪,通过迁移学习实现不同系统间的经验传递。

详情
AI中文摘要

当机器人或自动化系统被引入未知和动态环境时,需要鲁棒且适应性的控制策略以应对干扰、未建模动力学和参数不确定性。本文展示了一种结合L1自适应控制与迭代学习控制(ILC)的框架,用于在存在未知和变化的干扰时实现高精度轨迹跟踪。L1自适应控制器使系统接近参考模型,但无法保证完美轨迹跟踪,而ILC则通过以前的迭代改进轨迹跟踪性能。本文的综合框架使用L1自适应控制作为底层控制器,实现鲁棒且可重复的行为,而ILC则作为高层适应方案,主要补偿系统跟踪误差。我们证明了该框架能够在动态不同的系统间实现迁移学习,其中一个系统的学习经验可对另一个系统有益。两种不同四旋翼的实验结果表明,与使用PID控制器的ILC方法相比,该综合L1-ILC框架具有优越性能。结果表明,当初始输入基于自适应控制器的参考模型生成时,我们的L1-ILC框架能够实现精确的轨迹跟踪,即使在存在未知和变化的干扰时,也能实现系统间的学习经验迁移。

英文摘要

Robust and adaptive control strategies are needed when robots or automated systems are introduced to unknown and dynamic environments where they are required to cope with disturbances, unmodeled dynamics, and parametric uncertainties. In this paper, we demonstrate the capabilities of a combined $\mathcal{L}_1$ adaptive control and iterative learning control (ILC) framework to achieve high-precision trajectory tracking in the presence of unknown and changing disturbances. The $\mathcal{L}_1$ adaptive controller makes the system behave close to a reference model; however, it does not guarantee that perfect trajectory tracking is achieved, while ILC improves trajectory tracking performance based on previous iterations. The combined framework in this paper uses $\mathcal{L}_1$ adaptive control as an underlying controller that achieves a robust and repeatable behavior, while the ILC acts as a high-level adaptation scheme that mainly compensates for systematic tracking errors. We illustrate that this framework enables transfer learning between dynamically different systems, where learned experience of one system can be shown to be beneficial for another different system. Experimental results with two different quadrotors show the superior performance of the combined $\mathcal{L}_1$-ILC framework compared with approaches using ILC with an underlying proportional-derivative controller or proportional-integral-derivative controller. Results highlight that our $\mathcal{L}_1$-ILC framework can achieve high-precision trajectory tracking when unknown and changing disturbances are present and can achieve transfer of learned experience between dynamically different systems. Moreover, our approach is able to achieve precise trajectory tracking in the first attempt when the initial input is generated based on the reference model of the adaptive controller.

1806.00627 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Fast Rigid 3D Registration Solution: A Simple Method Free of SVD and Eigen-Decomposition

快速刚性三维配准解决方案:一种无需SVD和特征分解的简单方法

Jin Wu, Ming Liu, Zebo Zhou, Rui Li

发表机构 * School of Aeronautics and Astronautics, University of Electronic Science and Technology of China, Chengdu, 611731, China(电子科技大学航空宇航学院)

AI总结 本文提出一种无需SVD和特征分解的快速刚性三维配准方法,通过点交叉协方差矩阵的最优特征向量计算实现高效配准,验证了其在噪声点云中的鲁棒性和速度优势。

详情
AI中文摘要

本文提出了一种新的解决方案来解决刚性三维配准问题,受之前基于特征分解的方法启发。与现有求解器不同,所提算法不需要复杂的矩阵运算,如奇异值分解或特征值分解。相反,点交叉协方差矩阵的最优特征向量可以在几次迭代中计算出来。此外,证明了在不需要四元数的情况下可以直接计算最优旋转矩阵。该简单框架提供了在嵌入式平台上整数实现的非常简便方法。对噪声污染点云的仿真验证了所提方法的鲁棒性和计算速度。最终结果表明,所提算法准确、鲁棒,并且比代表方法计算时间减少了60%至80%。它还已应用于实际世界,以实现更快的相对机器人导航。

英文摘要

A novel solution is obtained to solve the rigid 3D registration problem, motivated by previous eigen-decomposition approaches. Different from existing solvers, the proposed algorithm does not require sophisticated matrix operations e.g. singular value decomposition or eigenvalue decomposition. Instead, the optimal eigenvector of the point cross-covariance matrix can be computed within several iterations. It is also proven that the optimal rotation matrix can be directly computed for cases without need of quaternion. The simple framework provides very easy approach of integer-implementation on embedded platforms. Simulations on noise-corrupted point clouds have verified the robustness and computation speed of the proposed method. The final results indicate that the proposed algorithm is accurate, robust and owns over $60\% \sim 80\%$ less computation time than representatives. It has also been applied to real-world applications for faster relative robotic navigation.

1806.09849 2026-06-04 eess.SY cs.RO cs.SY 版本更新

SENSE: Abstraction-Based Synthesis of Networked Control Systems

SENSE:基于抽象的网络控制系统合成

Mahmoud Khaled, Matthias Rungger, Majid Zamani

发表机构 * Technical University of Munich(慕尼黑技术大学)

AI总结 SENSE通过符号模型构建和自动控制器合成,解决网络控制系统复杂规格的满足问题,支持VHDL/Verilog或C/C++代码生成。

Comments In Proceedings MeTRiD 2018, arXiv:1806.09330

详情
Journal ref
EPTCS 272, 2018, pp. 65-78
AI中文摘要

SENSE通过符号模型构建和自动控制器合成,解决网络控制系统复杂规格的满足问题,支持VHDL/Verilog或C/C++代码生成。

英文摘要

While many studies and tools target the basic stabilizability problem of networked control systems (NCS), nowadays modern systems require more sophisticated objectives such as those expressed as formulae in linear temporal logic or as automata on infinite strings. One general technique to achieve this is based on so-called symbolic models, where complex systems are approximated by finite abstractions, and then, correct-by-construction controllers are automatically synthesized for them. We present tool SENSE for the construction of finite abstractions for NCS and the automated synthesis of controllers. Constructed controllers enforce complex specifications over plants in NCS by taking into account several non-idealities of the communication channels. Given a symbolic model of the plant and network parameters, SENSE can efficiently construct a symbolic model of the NCS, by employing operations on binary decision diagrams (BDDs). Then, it synthesizes symbolic controllers satisfying a class of specifications. It has interfaces for the simulation and the visualization of the resulting closed-loop systems using OMNETPP and MATLAB. Additionally, SENSE can generate ready-to-implement VHDL/Verilog or C/C++ codes from the synthesized controllers.

1806.08810 2026-06-04 cs.LO cs.RO cs.SY eess.SY 版本更新

Self-Driving Vehicle Verification Towards a Benchmark

自动驾驶车辆验证:一个基准

Nima Roohi, Ramneet Kaur, James Weimer, Oleg Sokolsky, Insup Lee

发表机构 * University of Pennsylvania(宾夕法尼亚大学)

AI总结 本文提出一个简单的形式化模型用于自动驾驶车辆,尽管经过简化后其安全性已手动证明,但目前尚无自动形式化验证工具支持其动态特性,旨在为形式化验证工具提供挑战。

Comments 7 pages

详情
AI中文摘要

工业蜂窝物理系统是具有严格安全要求的混合系统。尽管没有形式语义,大多数这些系统主要由于两个原因被建模为Stateflow/Simulink:(1) 使用这些工具建模、测试和模拟更方便,(2)这些系统的动态特性不被大多数其他工具支持。此外,随着蜂窝物理系统复杂性的不断增加,自动形式化验证工具所能建模的范围与工业蜂窝物理系统的模型之间的差距也在扩大。在本文中,我们提出了一个简单的形式化模型用于自动驾驶车辆。尽管经过一些简化,该系统的安全性已经手动证明,据我们所知,目前没有自动形式化验证工具支持其动态特性。我们希望这能为针对工业应用的形式化验证工具提供一个挑战问题。

英文摘要

Industrial cyber-physical systems are hybrid systems with strict safety requirements. Despite not having a formal semantics, most of these systems are modeled using Stateflow/Simulink for mainly two reasons: (1) it is easier to model, test, and simulate using these tools, and (2) dynamics of these systems are not supported by most other tools. Furthermore, with the ever growing complexity of cyber-physical systems, grows the gap between what can be modeled using an automatic formal verification tool and models of industrial cyber-physical systems. In this paper, we present a simple formal model for self-deriving cars. While after some simplification, safety of this system has already been proven manually, to the best of our knowledge, no automatic formal verification tool supports its dynamics. We hope this serves as a challenge problem for formal verification tools targeting industrial applications.

1709.04940 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Robust Model Predictive Control Approach for Autonomous Underwater Vehicles Operating in a Constrained workspace

一种用于在受限工作空间中运行的自主水下机器人鲁棒模型预测控制方法

Shahab Heshmati-alamdari, George C. Karras, Panos Marantos, Kostas J. Kyriakopoulos

AI总结 本文提出了一种新型非线性模型预测控制方案,用于在存在静态障碍物的受限工作空间中引导水下机器人到达特定路径点,通过考虑障碍物、工作空间边界、推进器饱和度和预定义的车辆速度上限等约束条件,提高控制性能。

Comments IEEE International Conference on Robotics and Automation (ICRA-2018), Accepted

详情
AI中文摘要

本文提出了一种新型非线性模型预测控制(NMPC)方案,用于在存在静态障碍物的受限工作空间中运行的水下机器人。控制器的目标是引导车辆朝着特定路径点前进。各种限制,如障碍物、工作空间边界、推进器饱和度和预定义的车辆速度上限,被作为状态和输入约束,并在控制设计中得到保证。所提出的方案结合了车辆的全部动力学特性,包括海洋 currents。因此,所提出方案计算出的控制输入以一种方式制定,使车辆能够利用有利的海洋 currents,从而减少推进器的能耗。所提出控制策略的性能通过在带有障碍物的受限测试水槽中使用4自由度水下机器人进行实验验证。

英文摘要

This paper presents a novel Nonlinear Model Predictive Control (NMPC) scheme for underwater robotic vehicles operating in a constrained workspace including static obstacles. The purpose of the controller is to guide the vehicle towards specific way points. Various limitations such as: obstacles, workspace boundary, thruster saturation and predefined desired upper bound of the vehicle velocity are captured as state and input constraints and are guaranteed during the control design. The proposed scheme incorporates the full dynamics of the vehicle in which the ocean currents are also involved. Hence, the control inputs calculated by the proposed scheme are formulated in a way that the vehicle will exploit the ocean currents, when these are in favor of the way-point tracking mission which results in reduced energy consumption by the thrusters. The performance of the proposed control strategy is experimentally verified using a $4$ Degrees of Freedom (DoF) underwater robotic vehicle inside a constrained test tank with obstacles.

1804.01926 2026-06-04 cs.RO cs.SY eess.SY stat.ML 版本更新

Scalable Magnetic Field SLAM in 3D Using Gaussian Process Maps

基于高斯过程地图的可扩展三维磁场SLAM

Manon Kok, Arno Solin

发表机构 * Delft University of Technology(代尔夫特理工大学) Aalto University(阿尔托大学) University of Cambridge(剑桥大学)

AI总结 本文提出一种利用磁场局部异常进行三维磁场SLAM的方法,采用高斯过程模型和六边形分块映射,结合降维高斯过程回归与 Rao-Blackwellised 粒子滤波,实现高效计算和存储的SLAM算法。

Comments 11 pages, 5 figures

详情
AI中文摘要

我们提出了一种利用磁场局部异常作为位置信息源的可扩展且完全三维的磁场同时定位与建图(SLAM)方法。这些异常是由于建筑物结构和家具等物体中存在铁磁材料引起的。我们使用高斯过程模型表示磁场地图,并考虑磁场的已知物理性质。我们使用三维六边形分块进行局部地图构建。为了使我们的方法计算可行,我们结合降维高斯过程回归与 Rao-Blackwellised 粒子滤波。我们展示了使用智能手机测量可以得到准确的位置和姿态估计,并证明我们的方法在计算复杂度和地图存储方面都实现了可扩展的磁场SLAM算法。

英文摘要

We present a method for scalable and fully 3D magnetic field simultaneous localisation and mapping (SLAM) using local anomalies in the magnetic field as a source of position information. These anomalies are due to the presence of ferromagnetic material in the structure of buildings and in objects such as furniture. We represent the magnetic field map using a Gaussian process model and take well-known physical properties of the magnetic field into account. We build local maps using three-dimensional hexagonal block tiling. To make our approach computationally tractable we use reduced-rank Gaussian process regression in combination with a Rao-Blackwellised particle filter. We show that it is possible to obtain accurate position and orientation estimates using measurements from a smartphone, and that our approach provides a scalable magnetic field SLAM algorithm in terms of both computational complexity and map storage.

1711.03449 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Optimization-Based Collision Avoidance

基于优化的避障方法

Xiaojing Zhang, Alexander Liniger, Francesco Borrelli

发表机构 * Model Predictive Control Laboratory, Department of Mechanical Engineering, University of California, Berkeley, USA(加州大学伯克利分校机械工程系模型预测控制实验室) Automatic Control Laboratory, Department of Information Technology and Electrical Engineering, ETH Zurich, Switzerland(苏黎世联邦理工学院信息科技与电气工程系自动控制实验室)

AI总结 本文提出一种将非光滑避障约束转化为光滑非线性约束的方法,利用凸优化的强对偶性。该方法适用于在n维空间中移动的受控物体,能处理一般障碍物和可表示为有限凸集并集的受控物体,并结合了传统轨迹生成算法中常用的有号距离概念。

Comments 27 pages, 9 figures, 2 tables

详情
AI中文摘要

本文提出了一种将非光滑避障约束转化为光滑非线性约束的方法,利用凸优化的强对偶性。我们关注一个受控物体在n维空间中移动时避免障碍物的目标。所提出的改写方法不引入近似,并适用于一般障碍物和可在n维空间中表示为有限凸集并集的受控物体。此外,我们结果与传统轨迹生成算法中常用的有号距离概念相连接。我们的方法可以用于通用的导航和轨迹规划任务,光滑性属性允许使用通用的梯度和Hessian基于优化算法。最后,在无法避免碰撞的情况下,我们的框架允许找到

英文摘要

This paper presents a novel method for reformulating non-differentiable collision avoidance constraints into smooth nonlinear constraints using strong duality of convex optimization. We focus on a controlled object whose goal is to avoid obstacles while moving in an n-dimensional space. The proposed reformulation does not introduce approximations, and applies to general obstacles and controlled objects that can be represented in an n-dimensional space as the finite union of convex sets. Furthermore, we connect our results with the notion of signed distance, which is widely used in traditional trajectory generation algorithms. Our method can be used in generic navigation and trajectory planning tasks, and the smoothness property allows the use of general-purpose gradient- and Hessian-based optimization algorithms. Finally, in case a collision cannot be avoided, our framework allows us to find "least-intrusive" trajectories, measured in terms of penetration. We demonstrate the efficacy of our framework on a quadcopter navigation and automated parking problem, and our numerical experiments suggest that the proposed methods enable real-time optimization-based trajectory planning problems in tight environments. Source code of our implementation is provided at https://github.com/XiaojingGeorgeZhang/OBCA.

1704.06053 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Using Inertial Sensors for Position and Orientation Estimation

利用惯性传感器进行位置和姿态估计

Manon Kok, Jeroen D. Hol, Thomas B. Schön

发表机构 * Delft Center for Systems and Control, Delft University of Technology, the Netherlands(荷兰代尔夫特理工大学系统与控制中心) Xsens Technologies B.V., Enschede, the Netherlands(荷兰恩schede市Xsens技术公司) Department of Information Technology, Uppsala University, Sweden(瑞典乌普萨拉大学信息科技系)

AI总结 本文探讨了惯性传感器在位置和姿态估计中的信号处理方法,分析了不同建模选择和关键算法,如优化平滑滤波、扩展卡尔曼滤波和互补滤波,并通过实验和模拟数据验证其性能。

Comments 90 pages, 38 figures

详情
Journal ref
Foundations and Trends in Signal Processing, Vol. 11: No. 1-2, Pages 1-153, 2017
AI中文摘要

近年来,由于体积小、成本低,MEMS惯性传感器(3D加速度计和3D陀螺仪)已广泛可用。惯性传感器以高采样率获取数据,可通过积分获得位置和姿态信息。这些估计在短时间尺度上准确,但长时间尺度上会因积分漂移而产生误差。为克服此问题,惯性传感器通常与其它传感器和模型结合。本文教程聚焦于惯性传感器用于位置和姿态估计的信号处理方面,讨论了不同的建模选择和若干重要的算法。这些算法包括基于优化的平滑和滤波方法,以及计算成本更低的扩展卡尔曼滤波和互补滤波实现。通过实验和模拟数据展示了这些算法的估计质量。

英文摘要

In recent years, MEMS inertial sensors (3D accelerometers and 3D gyroscopes) have become widely available due to their small size and low cost. Inertial sensor measurements are obtained at high sampling rates and can be integrated to obtain position and orientation information. These estimates are accurate on a short time scale, but suffer from integration drift over longer time scales. To overcome this issue, inertial sensors are typically combined with additional sensors and models. In this tutorial we focus on the signal processing aspects of position and orientation estimation using inertial sensors. We discuss different modeling choices and a selected number of important algorithms. The algorithms include optimization-based smoothing and filtering as well as computationally cheaper extended Kalman filter and complementary filter implementations. The quality of their estimates is illustrated using both experimental and simulated data.

1806.01777 2026-06-04 eess.SP cs.RO cs.SY eess.SY 版本更新

Safe Driving Capacity of Autonomous Vehicles

自动驾驶车辆的安全驾驶能力

Yuan-Ying Wang, Hung-Yu Wei

发表机构 * Department of Electrical Engineering(电气工程系) National Taiwan University(国立台湾大学)

AI总结 本文通过线性时序逻辑定义道路和车辆的安全状态,提出安全驾驶吞吐量和容量概念,分析不同因素对安全驾驶吞吐量的影响,并比较基于感知和协作车辆的道路安全驾驶容量差异。

Comments 5 pages, VTC 2018

详情
AI中文摘要

本文通过线性时序逻辑定义道路和车辆的安全状态,提出安全驾驶吞吐量和容量概念,分析不同因素对安全驾驶吞吐量的影响,并比较基于感知和协作车辆的道路安全驾驶容量差异。

英文摘要

An excellent self-driving car is expected to take its passengers safely and efficiently from one place to another. However, different ways of defining safety and efficiency may significantly affect the conclusion we make. In this paper, we give formal definitions to the safe state of a road and safe state of a vehicle using the syntax of linear temporal logic (LTL). We then propose the concept of safe driving throughput (SDT) and safe driving capacity (SDC) which measure the amount of vehicles in the safe state on a road. We analyze how SDT is affected by different factors. We show the analytic difference of SDC between the road with perception-based vehicles (PBV) and the road with cooperative-based vehicles (CBV). We claim that through proper design, the SDC of the road filled with PBVs will be upper-bounded by the SDC of the road filled with CBVs.

1806.01623 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Adaptive twisting sliding mode control for quadrotor unmanned aerial vehicles

自适应扭曲滑模控制用于四旋翼无人机

V. T. Hoang, M. D. Phung, Q. P. Ha

AI总结 本文提出自适应扭曲滑模控制算法,用于四旋翼无人机在非线性、扰动和参数变化下的姿态控制,通过改进滑模律和增益自适应方案提升控制性能。

Comments 2017 11th Asian Control Conference (ASCC)

详情
AI中文摘要

本文解决四旋翼无人机的鲁棒姿态控制问题。首先推导考虑非线性、外部扰动、不确定动力学和强耦合因素的数学模型。然后开发了自适应扭曲滑模控制算法,旨在在各种条件下控制四旋翼跟踪期望姿态。为此,扭曲滑模控制律被修改,采用所提出的增益自适应方案以改善控制瞬态和跟踪性能。对Solo四旋翼进行了广泛的仿真研究和与实验数据的比较。结果表明,所提出的控制方案在抗扰动方面具有强鲁棒性,并且能适应参数变化。

英文摘要

This work addresses the problem of robust attitude control of quadcopters. First, the mathematical model of the quadcopter is derived considering factors such as nonlinearity, external disturbances, uncertain dynamics and strong coupling. An adaptive twisting sliding mode control algorithm is then developed with the objective of controlling the quadcopter to track desired attitudes under various conditions. For this, the twisting sliding mode control law is modified with a proposed gain adaptation scheme to improve the control transient and tracking performance. Extensive simulation studies and comparisons with experimental data have been carried out for a Solo quadcopter. The results show that the proposed control scheme can achieve strong robustness against disturbances while is adaptable to parametric variations.

1806.00727 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Closed-loop Bayesian Semantic Data Fusion for Collaborative Human-Autonomy Target Search

闭环贝叶斯语义数据融合用于协同人机目标搜索

Luke Burks, Ian Loefgren, Luke Barbier, Jeremy Muesing, Jamison McGinley, Sousheel Vunnam, Nisar Ahmed

AI总结 本文提出一种闭环贝叶斯语义数据融合方法,通过CPOMDP规划生成最优轨迹,结合不完美传感器数据和人类提供的语义观察,提升动态目标搜索效率。

Comments Final version accepted and submitted to 2018 FUSION Conference (Cambridge, UK, July 2018)

详情
AI中文摘要

在搜索应用中,自主无人车辆必须能够高效重新获取和定位长时间可能处于视线外的大空间中移动目标。为此,本文开发并验证了一种新的协同人机感知解决方案。我们的方法利用连续部分可观测马尔可夫决策过程(CPOMDP)规划,生成最优利用不完美传感器数据和可请求的语义自然语言观察的车辆轨迹。关键创新是可扩展的层次高斯混合模型形式,用于在连续动态状态空间中高效求解包含语义观察的CPOMDPs。该方法在定制测试平台上通过真实的人机团队在动态室内目标搜索和捕捉场景中进行了演示和验证。

英文摘要

In search applications, autonomous unmanned vehicles must be able to efficiently reacquire and localize mobile targets that can remain out of view for long periods of time in large spaces. As such, all available information sources must be actively leveraged -- including imprecise but readily available semantic observations provided by humans. To achieve this, this work develops and validates a novel collaborative human-machine sensing solution for dynamic target search. Our approach uses continuous partially observable Markov decision process (CPOMDP) planning to generate vehicle trajectories that optimally exploit imperfect detection data from onboard sensors, as well as semantic natural language observations that can be specifically requested from human sensors. The key innovation is a scalable hierarchical Gaussian mixture model formulation for efficiently solving CPOMDPs with semantic observations in continuous dynamic state spaces. The approach is demonstrated and validated with a real human-robot team engaged in dynamic indoor target search and capture scenarios on a custom testbed.

1806.00678 2026-06-04 cs.RO cs.SY eess.SY 版本更新

AutoRally An open platform for aggressive autonomous driving

AutoRally:一个用于激进自动驾驶的开放平台

Brian Goldfain, Paul Drews, Changxi You, Matthew Barulic, Orlin Velev, Panagiotis Tsiotras, James M. Rehg

发表机构 * Georgia Tech Autonomous Racing Facility(佐治亚理工学院自动驾驶赛车中心)

AI总结 本文介绍了一个1:5比例的机器人测试平台AutoRally,旨在提供稳健、易用和可重复的自动驾驶研究环境,使非专业人员也能收集真实世界的数据。

详情
AI中文摘要

本文介绍了一个1:5比例的机器人测试平台AutoRally,旨在提供稳健、易用和可重复的自动驾驶研究环境,使非专业人员也能收集真实世界的数据。

英文摘要

This article presents AutoRally, a 1$:$5 scale robotics testbed for autonomous vehicle research. AutoRally is designed for robustness, ease of use, and reproducibility, so that a team of two people with limited knowledge of mechanical engineering, electrical engineering, and computer science can construct and then operate the testbed to collect real world autonomous driving data in whatever domain they wish to study. Complete documentation to construct and operate the platform is available online along with tutorials, example controllers, and a driving dataset collected at the Georgia Tech Autonomous Racing Facility. Offline estimation algorithms are used to determine parameters for physics-based dynamics models using an adaptive limited memory joint state unscented Kalman filter. Online vehicle state estimation using a factor graph optimization scheme and a convolutional neural network for semantic segmentation of drivable surface are presented. All algorithms are tested with real world data from the fleet of six AutoRally robots at the Georgia Tech Autonomous Racing Facility tracks, and serve as a demonstration of the robot$'$s capabilities.

1805.12170 2026-06-04 eess.SY cs.RO cs.SY 版本更新

An Improved Active Disturbance Rejection Control for a Differential Drive Mobile Robot with Mismatched Disturbances and Uncertainties

一种改进的主动扰动拒绝控制用于有差驱动移动机器人以应对不匹配扰动和不确定性

Ibraheem Kasim Ibraheem, Wameedh Riyadh Abdul-Adeem

发表机构 * Electrical Engineering Department(电气工程系) College of Engineering, Baghdad University(巴格达大学工程学院)

AI总结 本文提出了一种基于扰动和不确定性估计与抑制技术的改进主动扰动拒绝控制方法,用于非线性运动学模型的有差驱动移动机器人,通过消除扰动和不确定性提高动态性能。

详情
AI中文摘要

本文提出了一种基于扰动和不确定性估计与抑制技术的改进主动扰动拒绝控制方法,用于非线性运动学模型的有差驱动移动机器人。所提出的方法是J.Han建议的主动扰动拒绝控制(ADRC)的改进版本。ADRC用于主动抵消由未知外源信号和系统模型的匹配不确定性引起的扰动,这些扰动被合并为总扰动。在本工作中,假设系统为仿射系统,总扰动和输入被认为是不同通道。为处理不匹配扰动和不确定性,总扰动已被转换为匹配扰动。然后基于改进的ADRC(IADRC),通过估计总扰动并将其从系统中抵消,提高了DDMR的动态性能。通过数字仿真,应用了不同的性能指标,所有结果都表明所提出的IADRC的有效性,几乎消除了抖振现象,并在闭环系统中提供了对扭矩扰动的高鲁棒性。

英文摘要

In this paper a new strategy based on disturbance and uncertainty (DU) estimation and attenuation technique is proposed and tested on the nonlinear kinematic model of the differential drive mobile robot (DDMR). The proposed technique is an improved version of the Active Disturbance Rejection Control (ADRC) strategy suggested by J. Han. The ADRC is used to actively reject disturbances caused by the unknown exogenous signals and the matched uncertainties of the system model, which are lumped all together and attributed as a total disturbance. In this work, the considered system is assumed to be affine and the total disturbance and the input are considered to be on different channels. To deal with the mismatched disturbances and uncertainties, the total disturbance has been converted into a matched one. Then, based on the improved ADRC (IADRC), the dynamic performance of the DDMR has been enhanced by estimating the total disturbance and canceling it from the system. Through digital simulations, different performance measures are applied, and they all indicate the effectiveness of the proposed IADRC by almost removing the chattering phenomenon and providing a high immunity in the closed-loop system against torque disturbance.

1709.05746 2026-06-04 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY 版本更新

Adversarial Discriminative Sim-to-real Transfer of Visuo-motor Policies

对抗性判别仿真到现实的视觉-运动策略转移

Fangyi Zhang, Jürgen Leitner, Zongyuan Ge, Michael Milford, Peter Corke

发表机构 * Australian Centre for Robotic Vision (ACRV)(澳大利亚机器人视觉中心) Queensland University of Technology (QUT)(昆士兰技术大学) Monash University(墨尔本大学)

AI总结 本文提出对抗性判别仿真到现实转移方法,减少现实数据标注成本,在桌面上物体抓取任务中,通过视觉观测控制7自由度机械臂在障碍物中抓取蓝色立方体,仅需93个标注和186个未标注图像即可实现97.8%的成功率和1.8厘米的控制精度。

Comments Under review for the International Journal of Robotics Research

详情
AI中文摘要

各种方法已被提出以学习用于现实世界机器人应用的视觉-运动策略。一种解决方案是首先在仿真中学习然后转移到现实世界。在转移过程中,大多数现有方法需要带有标签的真实图像。然而,在许多机器人应用中,标注过程往往昂贵甚至不实际。在本文中,我们提出了一种对抗性判别仿真到现实转移方法,以减少标注真实数据的成本。通过模块化网络在桌面物体抓取任务中验证了该方法的有效性,其中7自由度的机械臂以速度模式控制在障碍物中抓取蓝色立方体。对抗性转移方法将标注真实数据的需求减少了50%。策略可以仅使用93个标注和186个未标注的真实图像转移到现实环境。转移的视觉-运动策略对训练中未见过的物体和移动目标具有鲁棒性,实现了97.8%的成功率和1.8厘米的控制精度。

英文摘要

Various approaches have been proposed to learn visuo-motor policies for real-world robotic applications. One solution is first learning in simulation then transferring to the real world. In the transfer, most existing approaches need real-world images with labels. However, the labelling process is often expensive or even impractical in many robotic applications. In this paper, we propose an adversarial discriminative sim-to-real transfer approach to reduce the cost of labelling real data. The effectiveness of the approach is demonstrated with modular networks in a table-top object reaching task where a 7 DoF arm is controlled in velocity mode to reach a blue cuboid in clutter through visual observations. The adversarial transfer approach reduced the labelled real data requirement by 50%. Policies can be transferred to real environments with only 93 labelled and 186 unlabelled real images. The transferred visuo-motor policies are robust to novel (not seen in training) objects in clutter and even a moving target, achieving a 97.8% success rate and 1.8 cm control accuracy.

1805.09875 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Autonomous Thermalling as a Partially Observable Markov Decision Process (Extended Version)

自主热力上升作为部分可观测马尔可夫决策过程(扩展版本)

Iain Guilliard, Richard Rogahn, Jim Piavis, Andrey Kolobov

发表机构 * Australian National University(澳大利亚国立大学) Microsoft Research(微软研究院)

AI总结 本文提出将自主热力上升建模为POMDP,并设计基于此的递推地平线控制器,通过在ArduPlane中实现并对比现有方法,验证了其在多架sUAV同时热力上升时的显著优势。

详情
AI中文摘要

小型无人空中车辆(sUAVs)通常依赖主动推进保持飞行,这限制了飞行时间和范围。为解决此问题,自主热力上升试图利用大气中的上升气流(热力)。然而,低空热力的不规则性使得现有方法难以有效利用。本文将自主热力上升建模为POMDP,并基于此提出递推地平线控制器。该控制器被实现于流行的开源自动驾驶系统ArduPlane中,并通过一系列涉及两架同时热力上升的sUAV的实飞测试,与现有方法进行比较,结果表明基于POMDP的控制器具有显著优势。

英文摘要

Small uninhabited aerial vehicles (sUAVs) commonly rely on active propulsion to stay airborne, which limits flight time and range. To address this, autonomous soaring seeks to utilize free atmospheric energy in the form of updrafts (thermals). However, their irregular nature at low altitudes makes them hard to exploit for existing methods. We model autonomous thermalling as a POMDP and present a receding-horizon controller based on it. We implement it as part of ArduPlane, a popular open-source autopilot, and compare it to an existing alternative in a series of live flight tests involving two sUAVs thermalling simultaneously, with our POMDP-based controller showing a significant advantage.

1805.09613 2026-06-04 stat.ML cs.AI cs.LG cs.RO cs.SY eess.SY 版本更新

A0C: Alpha Zero in Continuous Action Space

A0C:在连续动作空间中的Alpha Zero

Thomas M. Moerland, Joost Broekens, Aske Plaat, Catholijn M. Jonker

发表机构 * Dep. of Computer Science, Delft University of Technology, The Netherlands(代尔夫特理工大学计算机科学系,荷兰) Dep. of Computer Science, Leiden University, The Netherlands(莱顿大学计算机科学系,荷兰)

AI总结 本文提出将Alpha Zero扩展到连续动作空间的理论方法,并在倒摆任务中验证了其可行性,为连续动作空间中的迭代搜索与学习应用奠定了基础。

详情
AI中文摘要

Alpha Zero的核心创新在于树搜索与深度学习的结合,这在国际象棋、国际跳棋和围棋等离散动作空间的游戏中证明非常成功。然而,许多现实世界的强化学习领域具有连续动作空间,例如机器人控制、导航和自动驾驶汽车。本文提出了将Alpha Zero扩展到连续动作空间所需的理论扩展。我们还提供了一些在倒摆摆起任务中的初步实验,实证地展示了我们方法的可行性。因此,这项工作为在连续动作空间领域中应用迭代搜索与学习奠定了基础。

英文摘要

A core novelty of Alpha Zero is the interleaving of tree search and deep learning, which has proven very successful in board games like Chess, Shogi and Go. These games have a discrete action space. However, many real-world reinforcement learning domains have continuous action spaces, for example in robotic control, navigation and self-driving cars. This paper presents the necessary theoretical extensions of Alpha Zero to deal with continuous action space. We also provide some preliminary experiments on the Pendulum swing-up task, empirically showing the feasibility of our approach. Thereby, this work provides a first step towards the application of iterated search and learning in domains with a continuous action space.

1805.08551 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robust Model Predictive Control for Autonomous Vehicles/Self Driving Cars

自主车辆/自动驾驶汽车的鲁棒模型预测控制

Che Kun Law, Darshit Dalal, Stephen Shearrow

AI总结 本文提出一种用于自主车辆前轮控制的鲁棒模型预测控制方法,通过权重调整和在线线性化技术提升控制鲁棒性,并讨论各方法在精度和计算负载上的有效性。

Comments 12 pages,9 figures

详情
AI中文摘要

本文提出了一种用于自主车辆前轮控制的鲁棒模型预测控制(MPC)方法。我们通过权重调整、非线性车辆模型的连续在线线性化以跟踪位置误差以及连续在线线性化以跟踪速度误差,提出了多种增强模型预测控制鲁棒性的方法。讨论了每种方法在精度和计算负载方面的有效性。

英文摘要

A robust Model Predictive Control (MPC) approach for controlling front steering of an autonomous vehicle is presented in this paper. We present various approaches to increase the robustness of model predictive control by using weight tuning, a successive on-line linearization of a nonlinear vehicle model to track position error and successive on-line linearization to track velocity error. Results of the effectiveness of each method in terms of accuracy and computational load are discussed.

1510.07380 2026-06-04 cs.RO cs.SY eess.SY 版本更新

SLAP: Simultaneous Localization and Planning Under Uncertainty for Physical Mobile Robots via Dynamic Replanning in Belief Space: Extended version

SLAP:通过信念空间中的动态重新规划实现物理移动机器人的同时定位与规划(在不确定性下):扩展版

Ali-akbar Agha-mohammadi, Saurav Agarwal, Sung-Kyun Kim, Suman Chakravorty, Nancy M. Amato

发表机构 * NASA-JPL, Caltech(NASA-喷气推进中心,加州理工学院) Dept. of Aerospace Eng. and Amato is with the Dept. of Computer Science(航空航天工程系和计算机科学系) Dept. of Computer Science(计算机科学系)

AI总结 本文提出一种在不确定性环境下通过信念空间动态重新规划实现物理移动机器人同时定位与规划的方法,通过在线重新规划循环改进离线策略,有效应对环境变化和大定位误差,优于FIRM方法。

Comments 20 pages, updated figures, extended theory and simulation results

详情
AI中文摘要

同时定位与规划(SLAP)是自主机器人在不确定性环境下至关重要的能力。在最一般的形式下,SLAP诱导出一个连续的POMDP(部分可观测马尔可夫决策过程),需要在线不断求解。本文针对此问题提出一种在信念空间中的动态重新规划方案。该连续的POMDP在状态、动作和观测空间中通过采样方法进行离线近似,但通过在线重新规划循环实现局部改进。这种构造使所提方法能够应对环境变化和大定位误差,即使环境变化改变了最优轨迹的同调类。此外,本文方法优于当前最先进的FIRM(反馈信息路标)方法,通过消除不必要的稳定步骤。将信念空间规划应用于物理系统带来了诸多挑战。本文的重点是将所提规划器应用于物理机器人,并展示在不确定性、变化环境和存在大干扰(如被绑架机器人情况)下的SLAP解决方案性能。

英文摘要

Simultaneous localization and Planning (SLAP) is a crucial ability for an autonomous robot operating under uncertainty. In its most general form, SLAP induces a continuous POMDP (partially-observable Markov decision process), which needs to be repeatedly solved online. This paper addresses this problem and proposes a dynamic replanning scheme in belief space. The underlying POMDP, which is continuous in state, action, and observation space, is approximated offline via sampling-based methods, but operates in a replanning loop online to admit local improvements to the coarse offline policy. This construct enables the proposed method to combat changing environments and large localization errors, even when the change alters the homotopy class of the optimal trajectory. It further outperforms the state-of-the-art FIRM (Feedback-based Information RoadMap) method by eliminating unnecessary stabilization steps. Applying belief space planning to physical systems brings with it a plethora of challenges. A key focus of this paper is to implement the proposed planner on a physical robot and show the SLAP solution performance under uncertainty, in changing environments and in the presence of large disturbances, such as a kidnapped robot situation.

1805.04201 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Learning to Grasp Without Seeing

无需视觉的抓取学习

Adithyavairavan Murali, Yin Li, Dhiraj Gandhi, Abhinav Gupta

发表机构 * The Robotics Institute, Carnegie Mellon University(卡内基梅隆大学机器人研究所)

AI总结 本文提出基于触觉感知的抓取方法,通过触觉信号表征和迭代重抓取提升抓取稳定性,实验表明在无视觉信息下可有效抓取新型物体。

详情
AI中文摘要

能否在不看到物体的情况下让机器人抓取未知物体?本文提出了一种基于触觉感知的解决方案,结合触觉信号定位与触觉反馈重抓取。我们创建了一个大规模抓取数据集,包含超过30帧RGB图像和280万条触觉样本。提出了一种无监督自编码方案,显著提升了触觉感知任务的性能。系统分为两个步骤:首先,触觉定位模型通过粒子滤波聚合目标信息,输出物体位置估计以建立初始抓取;其次,重抓取模型基于学习特征逐步改进抓取,估计抓取稳定性并预测下一步调整。最终通过大量实验验证了在无视觉信息下抓取新型物体的有效性,并在视觉策略基础上提升了整体准确率10.6%。

英文摘要

Can a robot grasp an unknown object without seeing it? In this paper, we present a tactile-sensing based approach to this challenging problem of grasping novel objects without prior knowledge of their location or physical properties. Our key idea is to combine touch based object localization with tactile based re-grasping. To train our learning models, we created a large-scale grasping dataset, including more than 30 RGB frames and over 2.8 million tactile samples from 7800 grasp interactions of 52 objects. To learn a representation of tactile signals, we propose an unsupervised auto-encoding scheme, which shows a significant improvement of 4-9% over prior methods on a variety of tactile perception tasks. Our system consists of two steps. First, our touch localization model sequentially 'touch-scans' the workspace and uses a particle filter to aggregate beliefs from multiple hits of the target. It outputs an estimate of the object's location, from which an initial grasp is established. Next, our re-grasping model learns to progressively improve grasps with tactile feedback based on the learned features. This network learns to estimate grasp stability and predict adjustment for the next grasp. Re-grasping thus is performed iteratively until our model identifies a stable grasp. Finally, we demonstrate extensive experimental results on grasping a large set of novel objects using tactile sensing alone. Furthermore, when applied on top of a vision-based policy, our re-grasping model significantly boosts the overall accuracy by 10.6%. We believe this is the first attempt at learning to grasp with only tactile sensing and without any prior object knowledge.

1805.03358 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Collision-Free Path Planning Algorithm for Unmanned Aerial Vehicle Delivery

一种用于无人机送货的无碰撞路径规划算法

Ziji Shi, Wee Keong Ng

AI总结 本文提出一种基于A*算法的无碰撞路径规划方法,通过引入考虑等待时间的启发函数,解决动态环境中无人机送货的碰撞问题,并通过仿真验证了算法的有效性。

Comments Accepted by 2018 International Conference on Unmanned Aircraft Systems (ICUAS)

详情
AI中文摘要

路径规划对于无人机的自主性至关重要,特别是在调度无人机送货时。然而,无人机的操作环境通常是不确定和动态的。没有适当的规划,多个无人机拥堵时可能发生碰撞。此外,当局可能设置临时禁飞区,使空域无法使用。因此,需要适当的预出发规划以避免这些地方。本文将该问题转化为约束满足问题,以在动态图中找到无碰撞的最短路径。我们提出了一种基于A*算法的无碰撞路径规划算法。主要创新点是发明了一个同时考虑等待时间的启发函数。我们后来证明,添加等待惩罚后,所提出算法是最优的,因为启发函数是可接受的。该算法的实现使用了新加坡的空域结构进行无人机送货仿真。我们的仿真展示了良好的运行时间性能。使用所提出的算法,碰撞自由路线的百分比随着单位面积请求数量的增加而减少,并在边界值处显著下降。我们的实证分析可以辅助禁飞区政策和无人机送货基础设施的决策。

英文摘要

Path planning is important for the autonomy of Unmanned Aerial Vehicle (UAV), especially for scheduling UAV delivery. However, the operating environment of UAVs is usually uncertain and dynamic. Without proper planning, collisions may happen where multiple UAVs are congested. Besides, there may also be temporary no-fly zone setup by authorities that makes airspace unusable. Thus, proper pre-departure planning that avoids such places is needed. In this paper, we formulate this problem into a Constraint Satisfaction Problem to find a collision-free shortest path on a dynamic graph. We propose a collision-free path planning algorithm that is based on A* algorithm. The main novelty is that we invent a heuristic function that also considers waiting time. We later show that, with added waiting penalty, the proposed algorithm is optimal because the heuristic is admissible. Implementation of this algorithm simulates UAV delivery using Singapore's airspace structure. Our simulation exhibits desirable runtime performance. Using the proposed algorithm, the percentage of collision-free routes decreases as number of requests per unit area increases, and this percentage drops significantly at boundary value. Our empirical analysis could aid the decision-making of no-fly zone policy and infrastructure of UAV delivery.

1805.02508 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Generic Self-Evolving Neuro-Fuzzy Controller based High-performance Hexacopter Altitude Control System

一种基于通用自进化神经模糊控制器的高性能六旋翼无人机高度控制系统

Md Meftahul Ferdaus, Mahardhika Pratama, Sreenatha G. Anavatti, Matthew A. Garratt

AI总结 本文提出一种无需领域知识的通用自进化神经模糊控制器,用于六旋翼无人机的高度控制,通过自适应规则调整和滑模控制理论保证系统稳定性。

Comments submitted in the 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC2018)

详情
AI中文摘要

如今,全自主系统如旋翼无人机(UAV)的应用迅速增加。由于非线性动态复杂,开发基于学习机器的智能自组织进化控制器成为研究热点。本文提出名为通用控制器(G-controller)的进化控制器,用于控制六旋翼无人机的高度。该控制器可与极少的领域知识一起工作。其进化架构基于先进的增量学习算法,即通用进化神经模糊推理系统(GENEFIS)。控制器不需要离线训练,因为它从空规则集开始运行,然后根据需求添加或删除规则。后件参数的适应律来源于滑模控制(SMC)理论。Lyapunov理论用于保证所提控制器的稳定性。此外,还实施了辅助鲁棒化控制项,以获得跟踪误差趋于零的均匀渐近收敛。最后,通过六旋翼无人机在各种轨迹上的高度跟踪来评估G控制器的性能。

英文摘要

Nowadays, the application of fully autonomous system like rotary wing unmanned air vehicles (UAVs) is increasing sharply. Due to the complex nonlinear dynamics, a huge research interest is witnessed in developing learning machine based intelligent, self-organizing evolving controller for these vehicles notably to address the system's dynamic characteristics. In this work, such an evolving controller namely Generic-controller (G-controller) is proposed to control the altitude of a rotary wing UAV namely hexacopter. This controller can work with very minor expert domain knowledge. The evolving architecture of this controller is based on an advanced incremental learning algorithm namely Generic Evolving Neuro-Fuzzy Inference System (GENEFIS). The controller does not require any offline training, since it starts operating from scratch with an empty set of fuzzy rules, and then add or delete rules on demand. The adaptation laws for the consequent parameters are derived from the sliding mode control (SMC) theory. The Lyapunov theory is used to guarantee the stability of the proposed controller. In addition, an auxiliary robustifying control term is implemented to obtain a uniform asymptotic convergence of tracking error to zero. Finally, the G-controller's performance evaluation is observed through the altitude tracking of a UAV namely hexacopter for various trajectories.

1801.08995 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Trajectory Generation using Sharpness Continuous Dubins-like Paths with Applications in Control of Heavy Duty Vehicles

利用连续尖锐度的Dubins-like路径生成轨迹,应用于重载车辆控制

Rui Oliveira, Pedro F. Lima, Marcello Cirillo, Jonas Mårtensson, Bo Wahlberg

发表机构 * Scania, Autonomous Transport Solutions(斯堪尼亚,自主运输解决方案)

AI总结 本文提出了一种考虑转向执行器速率和扭矩限制的轨迹生成框架,通过连续尖锐度曲线提升重载车辆的自主驾驶性能。

Comments 18 pages, 7 figures, accepted for publication at ECC 2018 - European Control Conference

详情
AI中文摘要

我们提出了一种用于控制轮式车辆的轨迹生成框架,考虑转向执行器的速率和扭矩限制。关键思想是直接考虑转向执行器的速率和扭矩限制,而传统方法仅考虑曲率速率限制。我们提出新的尖锐度连续曲线概念,结合三次和Sigmoid曲率轨迹以及圆弧来引导车辆。所获得的轨迹具有平滑且连续可微的转向角剖面。这些轨迹为低层控制器提供更易跟踪的参考信号,从而提高性能。所获得的转向剖面的平滑性也提高了乘客舒适度。该方法具有快速的计算时间,可通过简单预计算进一步加速。我们详细讨论了该方法的路径规划应用,并通过仿真展示了其优势和实时能力。

英文摘要

We present a trajectory generation framework for control of wheeled vehicles under steering actuator constraints. The motivation is smooth autonomous driving of heavy vehicles. The key idea is to take into account rate, and additionally, torque limitations of the steering actuator directly. Previous methods only take into account curvature rate limitations, which deal indirectly with steering rate limitations. We propose the new concept of Sharpness Continuous curves, which uses cubic and sigmoid curvature trajectories together with circular arcs to steer the vehicle. The obtained trajectories are characterized by a smooth and continuously differentiable steering angle profile. These trajectories provide low-level controllers with reference signals which are easier to track, resulting in improved performance. The smoothness of the obtained steering profiles also results in increased passenger comfort. The method is characterized by a fast computation time, which can be further speeded up through the use of simple pre-computations. We detail possible path planning applications of the method, and conduct simulations that show its advantages and real time capabilities.

1804.08676 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

Gesture based Human-Swarm Interactions for Formation Control using interpreters

基于手势的人群-蜂群交互的编队控制使用解释器

Aamodh Suresh, Sonia Martinez

发表机构 * Department of Mechanical and Aerospace Engineering, University of California at San Diego, La Jolla, CA 92093, USA(机械与航空航天工程系,加州大学圣地亚哥分校,拉古拉,CA 92093,美国)

AI总结 本文提出了一种新颖的人群-蜂群交互框架,通过手势控制蜂群形状和编队。该框架利用可穿戴臂带记录手势,通过解释器将手势转化为蜂群控制指令,结合机器学习和最优控制技术实现编队控制。

详情
AI中文摘要

我们提出了一种新颖的人群-蜂群交互(HSI)框架,使用户能够通过简单的手臂手势和动作控制蜂群的形状和编队。用户通过可穿戴的臂带记录手势,该框架引入了一种新颖的解释器系统,作为用户和蜂群之间的中介,简化用户的交互角色。解释器接收用户通过手势绘制的高层次输入,并将其转化为低层次的蜂群控制指令。该解释器利用机器学习、卡尔曼滤波和最优控制技术将用户输入转化为蜂群控制参数。引入了人类可解释的动力学概念,用于解释器的规划以及向用户提供反馈。蜂群的动力学通过基于分布式线性迭代和动态平均一致的新型去中心化编队控制器进行控制。该框架在二维环境中理论和实验上均得到了验证,展示了人类实时控制模拟机器人蜂群的能力。

英文摘要

We propose a novel Human-Swarm Interaction (HSI) framework which enables the user to control a swarm shape and formation. The user commands the swarm utilizing just arm gestures and motions which are recorded by an off-the-shelf wearable armband. We propose a novel interpreter system, which acts as an intermediary between the user and the swarm to simplify the user's role in the interaction. The interpreter takes in a high level input drawn using gestures by the user, and translates it into low level swarm control commands. This interpreter employs machine learning, Kalman filtering and optimal control techniques to translate the user input into swarm control parameters. A notion of Human Interpretable dynamics is introduced, which is used by the interpreter for planning as well as to provide feedback to the user. The dynamics of the swarm are controlled using a novel decentralized formation controller based on distributed linear iterations and dynamic average consensus. The framework is demonstrated theoretically as well as experimentally in a 2D environment, with a human controlling a swarm of simulated robots in real time.

1804.06586 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Composite Adaptive Control for Bilateral Teleoperation Systems without Persistency of Excitation

双向往复操作系统无持续激励的复合自适应控制

Yuling Li, Yixin Yin, Sen Zhang, Jie Dong, Rolf Johansson

发表机构 * School of Automation and Electrical Engineering, University of Science and Technology Beijing(自动化与电气工程学院,北京科技大学) Department of Automatic Control, Lund University, P.O. Box 118, 22100 Lund, Sweden(自动控制系,卢德大学)

AI总结 本文提出一种无需持续激励条件的复合自适应控制方法,用于解决非线性双向往复操作系统中参数收敛问题,通过线性矩阵不等式给出闭环系统稳定性准则,并通过仿真验证了方法的有效性。

Comments 21 pages, 9 figures, submitted to Journal of The Franklin Institute

详情
AI中文摘要

复合自适应控制方案利用系统跟踪误差和预测误差驱动更新律,已成为提高系统性能的常用方法。然而,为保证参数收敛需满足强持续激励(PE)条件。本文提出一种新颖的复合自适应控制,用于非线性双向往复操作系统,无需PE条件即可实现参数收敛。闭环双向往复系统稳定性准则以线性矩阵不等式形式给出。新的跟踪性能指标用于评估主从之间位置跟踪性能。仿真研究展示了所提方法的有效性。

英文摘要

Composite adaptive control schemes, which use both the system tracking errors and the prediction error to drive the update laws, have become widespread in achieving an improvement of system performance. However, a strong persistent-excitation (PE) condition should be satisfied to guarantee the parameter convergence. This paper proposes a novel composite adaptive control to guarantee parameter convergence without PE condition for nonlinear teleoperation systems with dynamic uncertainties and time-varying communication delays. The stability criteria of the closed-loop teleoperation system are given in terms of linear matrix inequalities. New tracking performance measures are proposed to evaluate the position tracking between the master and the slave. Simulation studies are given to show the effectiveness of the proposed method.

1804.04696 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Efficient Model Identification for Tensegrity Locomotion

高效 tensegrity 机器人运动的模型识别

Shaojun Zhu, David Surovik, Kostas E. Bekris, Abdeslam Boularias

发表机构 * Department of Computer Science, Rutgers University(计算机科学系,罗格斯大学)

AI总结 本文提出一种高效方法,利用物理引擎和贝叶斯优化框架,用于识别高维顺应性tensegrity机器人中的未知机械参数,提升运动控制精度。

详情
AI中文摘要

本文旨在以实用方式识别未知物理参数,如驱动机器人连杆的机械模型,这些参数在动态机器人任务中至关重要。关键特征包括使用现成的物理引擎和贝叶斯优化框架。所考虑的任务是高维、顺应性tensegrity机器人的运动。关键见解在于将模型识别挑战投影到适当的低维空间以提高效率。与替代方法的比较表明,所提出的方法可以在给定的时间预算内更准确地识别参数,从而实现更精确的运动控制。

英文摘要

This paper aims to identify in a practical manner unknown physical parameters, such as mechanical models of actuated robot links, which are critical in dynamical robotic tasks. Key features include the use of an off-the-shelf physics engine and the Bayesian optimization framework. The task being considered is locomotion with a high-dimensional, compliant Tensegrity robot. A key insight, in this case, is the need to project the model identification challenge into an appropriate lower dimensional space for efficiency. Comparisons with alternatives indicate that the proposed method can identify the parameters more accurately within the given time budget, which also results in more precise locomotion control.

1804.04349 2026-06-04 eess.SY cs.RO cs.SE cs.SY 版本更新

On the Application of ISO 26262 in Control Design for Automated Vehicles

在自动驾驶车辆控制设计中应用ISO 26262

Georg Schildbach

发表机构 * Institute for Electrical Engineering in Medicine, University of Luebeck(医学电气工程研究所,吕贝克大学)

AI总结 本文探讨了ISO 26262标准在自动驾驶车辆控制设计中的应用,分析了其在高自动化车辆中的适用性与争议,并总结了该标准的安全设计步骤。

Comments In Proceedings SCAV 2018, arXiv:1804.03406

详情
Journal ref
EPTCS 269, 2018, pp. 74-82
AI中文摘要

过去十年间,自动驾驶车辆的研究经历了爆炸性增长。然而,其实际应用的主要障碍是一个令人信服的安全概念。随着算法日益复杂和车辆自动化水平提高,这一问题愈发重要。功能安全领域提供了一种系统方法,用于识别潜在风险源并提高车辆安全性。该领域基于航空航天、过程和其他行业数十年的实践经验。这些经验汇总成汽车领域的功能安全标准ISO 26262,已被广泛采用。然而,其在高自动化车辆中的适用性和相关性却存在争议。本文对这一讨论进行了批判性分析,并总结了ISO 26262的主要步骤,以实现自动驾驶车辆的安全控制设计。

英文摘要

Research on automated vehicles has experienced an explosive growth over the past decade. A main obstacle to their practical realization, however, is a convincing safety concept. This question becomes ever more important as more sophisticated algorithms are used and the vehicle automation level increases. The field of functional safety offers a systematic approach to identify possible sources of risk and to improve the safety of a vehicle. It is based on practical experience across the aerospace, process and other industries over multiple decades. This experience is compiled in the functional safety standard for the automotive domain, ISO 26262, which is widely adopted throughout the automotive industry. However, its applicability and relevance for highly automated vehicles is subject to a controversial debate. This paper takes a critical look at the discussion and summarizes the main steps of ISO 26262 for a safe control design for automated vehicles.

1804.04347 2026-06-04 cs.RO cs.SE cs.SY eess.SY 版本更新

The CAT Vehicle Testbed: A Simulator with Hardware in the Loop for Autonomous Vehicle Applications

CAT车辆测试平台:用于自动驾驶应用的具有闭环硬件的模拟器

Rahul Kumar Bhadani, Jonathan Sprinkle, Matthew Bunting

发表机构 * Department of Electrical and Computer Engineering(电气与计算机工程系) University of Arizona(亚利桑那大学) Tucson, USA(美国图森市)

AI总结 本文提出CAT车辆测试平台,通过闭环硬件模拟验证仿真结果,支持自动驾驶技术研究。平台基于ROS和物理车辆模型,支持多车交互和实时数据回放,可快速验证算法性能。

Comments In Proceedings SCAV 2018, arXiv:1804.03406

详情
Journal ref
EPTCS 269, 2018, pp. 32-47
AI中文摘要

本文介绍了CAT车辆(认知与自主测试车辆)测试平台:一个由分布式仿真为基础的自动驾驶车辆组成的研发测试平台,能够轻松过渡到闭环硬件测试和执行,以支持自动驾驶技术的研究。自动驾驶技术从主动安全功能和高级驾驶辅助系统发展到完全传感器引导的自动驾驶,需要测试所有可能的场景。然而,研究人员若没有自己的机器人平台,想要在物理平台上展示新成果将面临困难。因此,需要一个研究测试平台,使基于仿真的结果能够通过闭环仿真快速验证,以便在物理平台上测试软件。CAT车辆测试平台提供了这样的测试平台,可以在仿真中模拟真实车辆的动力学,然后无缝过渡到使用案例的硬件再现。该模拟器使用机器人操作系统(ROS)和基于物理的车辆模型,包括具有可配置参数的模拟传感器和执行器。该测试平台允许多车仿真以支持车辆间交互。我们的测试平台还支持实时数据记录和捕获,可以回放以检查特定场景或使用案例,并用于回归测试。作为可行性演示的一部分,我们介绍了CAT车辆挑战,全球各地的学生研究人员能够在少于2天的物理平台接口时间内重现他们的仿真结果。

英文摘要

This paper presents the CAT Vehicle (Cognitive and Autonomous Test Vehicle) Testbed: a research testbed comprised of a distributed simulation-based autonomous vehicle, with straightforward transition to hardware in the loop testing and execution, to support research in autonomous driving technology. The evolution of autonomous driving technology from active safety features and advanced driving assistance systems to full sensor-guided autonomous driving requires testing of every possible scenario. However, researchers who want to demonstrate new results on a physical platform face difficult challenges, if they do not have access to a robotic platform in their own labs. Thus, there is a need for a research testbed where simulation-based results can be rapidly validated through hardware in the loop simulation, in order to test the software on board the physical platform. The CAT Vehicle Testbed offers such a testbed that can mimic dynamics of a real vehicle in simulation and then seamlessly transition to reproduction of use cases with hardware. The simulator utilizes the Robot Operating System (ROS) with a physics-based vehicle model, including simulated sensors and actuators with configurable parameters. The testbed allows multi-vehicle simulation to support vehicle to vehicle interaction. Our testbed also facilitates logging and capturing of the data in the real time that can be played back to examine particular scenarios or use cases, and for regression testing. As part of the demonstration of feasibility, we present a brief description of the CAT Vehicle Challenge, in which student researchers from all over the globe were able to reproduce their simulation results with fewer than 2 days of interfacing with the physical platform.

1804.03036 2026-06-04 eess.IV cs.RO cs.SY eess.SY 版本更新

Image Moment Models for Extended Object Tracking

图像矩模型用于扩展目标跟踪

Gang Yao, Ashwin Dani

发表机构 * Department of Electrical and Computer Engineering, University of Connecticut Storrs(电气与计算机工程系,康涅狄格大学斯托尔斯分校)

AI总结 本文提出基于图像矩的新型模型,用于估计和跟踪复杂轨迹下的目标形状。通过无迹卡尔曼滤波-交互多模型算法估计目标形状及位置速度,结合IOU和RMSE指标进行验证。

详情
Journal ref
IEEE Transactions on Aerospace and Electronic Systems, 2018
AI中文摘要

本文提出了一种基于图像矩的新型模型,用于估计和跟踪具有复杂轨迹的物体形状。假设相机静止注视移动物体,内部点特征作为测量值。假设椭圆近似为基本形状,利用图像矩组合估计椭圆形状。推导了在恒速或协调转弯运动模型下图像矩的动态模型,作为物体形状估计的函数。应用无迹卡尔曼滤波-交互多模型(UKF-IMM)滤波算法估计目标形状(近似为椭圆)并跟踪其位置和速度。基于平均对数似然推导IMM滤波器的似然函数。展示了所提UKF-IMM算法与图像矩模型的仿真结果,显示了在复杂轨迹中移动物体的估计结果。通过交并比(IOU)和位置和速度均方根误差(RMSE)等指标与文献中的基准算法进行比较。还展示了从四旋翼无人机捕获的真实图像数据结果。

英文摘要

In this paper, a novel image moments based model for shape estimation and tracking of an object moving with a complex trajectory is presented. The camera is assumed to be stationary looking at a moving object. Point features inside the object are sampled as measurements. An ellipsoidal approximation of the shape is assumed as a primitive shape. The shape of an ellipse is estimated using a combination of image moments. Dynamic model of image moments when the object moves under the constant velocity or coordinated turn motion model is derived as a function for the shape estimation of the object. An Unscented Kalman Filter-Interacting Multiple Model (UKF-IMM) filter algorithm is applied to estimate the shape of the object (approximated as an ellipse) and track its position and velocity. A likelihood function based on average log-likelihood is derived for the IMM filter. Simulation results of the proposed UKF-IMM algorithm with the image moments based models are presented that show the estimations of the shape of the object moving in complex trajectories. Comparison results, using intersection over union (IOU), and position and velocity root mean square errors (RMSE) as metrics, with a benchmark algorithm from literature are presented. Results on real image data captured from the quadcopter are also presented.

1804.02814 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Nonlinear Unknown Input and State Estimation Algorithm in Mobile Robots

移动机器人中的非线性未知输入和状态估计算法

Pinyao Guo, Hunmin Kim, Nurali Virani, Jun Xu, Minghui Zhu, Peng Liu

发表机构 * College of Information Sciences(信息科学学院) School of Electrical Engineering(电气工程学院) GE Global Research(GE全球研究)

AI总结 本文提出一种适用于非线性动态模型和随机噪声的移动机器人未知输入和状态估计算法,通过传感器数据和控制指令检测并量化传感器和执行器的异常。

Comments arXiv admin note: text overlap with arXiv:1708.01834

详情
AI中文摘要

本技术报告描述并推导了一种新型非线性未知输入和状态估计算法(NUISE),用于移动机器人。该算法针对具有非线性动态模型且受传感器和执行器随机噪声影响的真实机器人设计。利用传感器读数和计划的控制指令,该算法能够检测并量化传感器和执行器的异常。随后,我们阐述了两种不同移动机器人的动态模型,以展示NUISE的应用。本报告作为[1]的补充文档。

英文摘要

This technical report provides the description and the derivation of a novel nonlinear unknown input and state estimation algorithm (NUISE) for mobile robots. The algorithm is designed for real-world robots with nonlinear dynamic models and subject to stochastic noises on sensing and actuation. Leveraging sensor readings and planned control commands, the algorithm detects and quantifies anomalies on both sensors and actuators. Later, we elaborate the dynamic models of two distinctive mobile robots for the purpose of demonstrating the application of NUISE. This report serves as a supplementary document for [1].

1612.07139 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

A Survey of Deep Network Solutions for Learning Control in Robotics: From Reinforcement to Imitation

深度网络在机器人学习控制中的应用综述:从强化到模仿

Lei Tai, Jingwei Zhang, Ming Liu, Joschka Boedecker, Wolfram Burgard

发表机构 * University of Freiburg(弗赖堡大学)

AI总结 本文综述了深度学习在机器人学习控制中的应用,探讨了深度强化学习和模仿学习两大主流方法,分析了其在导航、 manipulation 任务中的应用及现实差距挑战。

Comments 19 pages, 1 figures

详情
AI中文摘要

深度学习技术已广泛应用于各种研究领域,取得了最先进的成果。本文综述了针对机器人应用的学习控制策略的深度学习解决方案。我们讨论了深度学习在学习控制中的两大主要范式:深度强化学习和模仿学习。对于深度强化学习(DRL),我们从传统强化学习算法开始,展示了如何将其扩展到深度领域,并介绍了在机器人导航和 manipulation 任务中使用 DRL 的代表性工作。我们继续讨论了解决现实差距挑战的方法,即如何将仿真中训练的 DRL 策略转移到现实世界场景,并总结了用于 DRL 研究的机器人仿真平台。对于模仿学习,我们探讨了其三个主要类别:行为克隆、逆强化学习和生成对抗模仿学习,介绍了它们的公式及其在机器人应用中的对应情况。最后,我们讨论了开放挑战和研究前沿。

英文摘要

Deep learning techniques have been widely applied, achieving state-of-the-art results in various fields of study. This survey focuses on deep learning solutions that target learning control policies for robotics applications. We carry out our discussions on the two main paradigms for learning control with deep networks: deep reinforcement learning and imitation learning. For deep reinforcement learning (DRL), we begin from traditional reinforcement learning algorithms, showing how they are extended to the deep context and effective mechanisms that could be added on top of the DRL algorithms. We then introduce representative works that utilize DRL to solve navigation and manipulation tasks in robotics. We continue our discussion on methods addressing the challenge of the reality gap for transferring DRL policies trained in simulation to real-world scenarios, and summarize robotics simulation platforms for conducting DRL research. For imitation leaning, we go through its three main categories, behavior cloning, inverse reinforcement learning and generative adversarial imitation learning, by introducing their formulations and their corresponding robotics applications. Finally, we discuss the open challenges and research frontiers.

1804.00311 2026-06-04 eess.SY cs.RO cs.SY eess.SP 版本更新

Trajectory Optimization of Robots with Regenerative Drive Systems: Numerical and Experimental Results

具有再生驱动系统的机器人轨迹优化:数值和实验结果

Poya Khalaf, Hanz Richter

AI总结 本文研究了具有超级电容器再生驱动系统的机器人能量最优控制,通过数值和实验验证了轨迹优化方法,展示了13%的能耗降低效果。

详情
AI中文摘要

我们研究了具有基于超级电容器的再生驱动系统的机器人的能量最优控制。基于之前引入的框架,考虑了一个较为通用的机器人和驱动系统模型。为寻找最大化再生和存储在电容器中的能量的点对点轨迹,提出了一个优化控制问题。该优化问题、其数值解和实验评估使用PUMA 560机械臂进行演示。为评估功率流动和能量再生,准备了全面的实验设置。通过标准的鲁棒基于被动性控制方法强制机器人跟踪最优轨迹。实验结果表明,在遵循最优轨迹的情况下,可在研究条件下的能耗降低约13%。

英文摘要

We investigate energy-optimal control of robots with ultracapacitor based regenerative drive systems. Based on a previously introduced framework, a fairly generic model is considered for the robot and the drive system. An optimal control problem is formulated to find point-to point trajectories maximizing the amount of energy regenerated and stored in the capacitor. The optimization problem, its numerical solution and an experimental evaluation are demonstrated using a PUMA 560 manipulator. A comprehensive experimental setup was prepared to evaluate power flows and energy regeneration. Tracking of optimal trajectories was enforced on the robot using a standard robust passivity based control approach. Experimental results show that when following optimal trajectories, a reduction of about 13\% in energy consumption can be achieved for the conditions of the study.

1803.10371 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system

基于非操控操作的强化学习:从仿真到物理系统的迁移

Kendall Lowrey, Svetoslav Kolev, Jeremy Dao, Aravind Rajeswaran, Emanuel Todorov

发表机构 * University of Washington(华盛顿大学) Roboti LLC(Roboti公司)

AI总结 本文提出了一种基于仿真的强化学习方法,用于非操控操作任务,通过在仿真环境中训练策略,成功迁移到物理系统中,且在模型集合训练下提升了策略的鲁棒性。

Comments Accepted at IEEE SIMPAR 2018. Project page: https://sites.google.com/view/phantomsim2real

详情
AI中文摘要

强化学习已作为一种有前途的方法用于训练机器人控制器。然而,大多数结果受限于仿真,因为需要大量样本且缺乏自动且安全的数据收集方法。基于模型的强化学习方法提供了一种途径来克服这些挑战,但传统关注的是仿真与现实世界之间的不匹配。这里,我们展示在仿真中学习的控制策略可以成功迁移到由三个Phantom机器人推动物体到各种目标位置的物理系统中。我们使用修改的自然策略梯度算法进行学习,应用于精心识别的仿真模型。所得到的策略在仿真中完全训练后,在物理系统中无需额外训练即可有效工作。此外,我们还表明,使用模型集合训练使学习的策略对建模误差更鲁棒,从而补偿系统识别的困难。

英文摘要

Reinforcement learning has emerged as a promising methodology for training robot controllers. However, most results have been limited to simulation due to the need for a large number of samples and the lack of automated-yet-safe data collection methods. Model-based reinforcement learning methods provide an avenue to circumvent these challenges, but the traditional concern has been the mismatch between the simulator and the real world. Here, we show that control policies learned in simulation can successfully transfer to a physical system, composed of three Phantom robots pushing an object to various desired target positions. We use a modified form of the natural policy gradient algorithm for learning, applied to a carefully identified simulation model. The resulting policies, trained entirely in simulation, work well on the physical system without additional training. In addition, we show that training with an ensemble of models makes the learned policies more robust to modeling errors, thus compensating for difficulties in system identification.

1803.09792 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Min-Max Tours for Task Allocation to Heterogeneous Agents

为异构智能体分配任务的最优化路线

Amritha Prasad, Han-Lim Choi, Shreyas Sundaram

发表机构 * School of Electrical and Computer Engineering at Purdue University(普渡大学电气与计算机工程学院)

AI总结 研究如何为异构移动智能体分配任务,以最小化任何智能体完成任务并返回仓库的最大成本,提出了一种三阶段算法,提供5倍近似比。

详情
AI中文摘要

我们考虑一个场景,其中一组异构移动智能体位于仓库,一组任务分散在地理区域中。智能体分为不同类型,任务分为专用任务(只能由特定类型的智能体完成)和通用任务(可由任何智能体完成)。每对任务之间的距离已指定,并满足三角不等式。给定此场景,我们研究如何将这些任务分配给可用智能体(受类型兼容性约束),以最小化任何智能体完成任务并返回仓库的最大成本。该问题属于NP难问题,我们提出了一种三阶段算法来解决该问题,无论总智能体数量和每种类型智能体数量如何,该算法提供5倍的近似比。我们还证明,在仅有一种类型智能体的情况下,该算法的近似因子为4。

英文摘要

We consider a scenario consisting of a set of heterogeneous mobile agents located at a depot, and a set of tasks dispersed over a geographic area. The agents are partitioned into different types. The tasks are partitioned into specialized tasks that can only be done by agents of a certain type, and generic tasks that can be done by any agent. The distances between each pair of tasks are specified, and satisfy the triangle inequality. Given this scenario, we address the problem of allocating these tasks among the available agents (subject to type compatibility constraints) while minimizing the maximum cost to tour the allocation by any agent and return to the depot. This problem is NP-hard, and we give a three phase algorithm to solve this problem that provides 5-factor approximation, regardless of the total number of agents and the number of agents of each type. We also show that in the special case where there is only one agent of each type, the algorithm has an approximation factor of 4.

1802.04929 2026-06-04 eess.SY cs.LG cs.RO cs.SY 版本更新

Context-Specific Validation of Data-Driven Models

基于情境的驱动模型验证

Somil Bansal, Shromona Ghosh, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia, Claire J. Tomlin

AI总结 本文提出了一种基于情境的驱动模型验证框架,通过计算闭环实际系统与学习模型之间的距离来评估模型质量,并采用主动采样方案高效计算距离上界,用于验证实际系统控制器设计。

详情
AI中文摘要

随着数据驱动模型在机器人系统控制中的广泛应用,开发一种在部署前验证此类模型的方法变得至关重要。具体而言,必须确保为学习模型设计的控制器在实际物理系统上能按预期运行。本文提出了一种基于情境的验证框架,通过闭环实际系统与学习模型之间的距离度量来量化学习模型的质量。随后,我们提出了一种主动采样方案,以在样本高效的方式下计算该距离的概率上界。所提出的框架仅验证与模型预期用途相关的行为,且不需要任何先验系统动力学知识。几个模拟示例展示了该框架在验证真实系统模型及控制器合成中的实用性。

英文摘要

With an increasing use of data-driven models to control robotic systems, it has become important to develop a methodology for validating such models before they can be deployed to design a controller for the actual system. Specifically, it must be ensured that the controller designed for a learned model would perform as expected on the actual physical system. We propose a context-specific validation framework to quantify the quality of a learned model based on a distance measure between the closed-loop actual system and the learned model. We then propose an active sampling scheme to compute a probabilistic upper bound on this distance in a sample-efficient manner. The proposed framework validates the learned model against only those behaviors of the system that are relevant for the purpose for which we intend to use this model, and does not require any a priori knowledge of the system dynamics. Several simulations illustrate the practicality of the proposed framework for validating the models of real-world systems, and consequently, for controller synthesis.

1709.05363 2026-06-04 cs.RO cs.GT cs.SY eess.SY 版本更新

Synthesis of surveillance strategies via belief abstraction

通过信念抽象合成监视策略

Suda Bharadwaj, Rayna Dimitrova, Ufuk Topcu

发表机构 * University of Leicester(莱斯特大学)

AI总结 本文研究了合成具有监视目标的机器人控制器问题,通过将问题建模为单侧部分信息游戏,并利用抽象技术减少状态空间爆炸,从而实现监视策略的合成。

详情
AI中文摘要

我们研究了合成具有监视目标的机器人控制器问题,即机器人需维持对移动目标位置的了解。我们将该问题建模为单侧部分信息游戏,其中代理的胜利条件由时序逻辑公式指定。该规范形式化了用户提供的监视需求,包括额外的非监视任务。为了合成满足规范的监视策略,我们将部分信息游戏转换为完美信息游戏,利用抽象技术缓解此类转换通常导致的指数级状态空间爆炸。这使得可以使用现成的工具进行反应合成。我们使用反例引导的细化技术自动实现足够的抽象精度以合成监视策略。我们在两个案例研究中评估了所提出的方法,展示了其在大规模状态空间和多样化需求中的适用性。

英文摘要

We study the problem of synthesizing a controller for a robot with a surveillance objective, that is, the robot is required to maintain knowledge of the location of a moving, possibly adversarial target. We formulate this problem as a one-sided partial-information game in which the winning condition for the agent is specified as a temporal logic formula. The specification formalizes the surveillance requirement given by the user, including additional non-surveillance tasks. In order to synthesize a surveillance strategy that meets the specification, we transform the partial-information game into a perfect-information one, using abstraction to mitigate the exponential blow-up typically incurred by such transformations. This enables the use of off-the-shelf tools for reactive synthesis. We use counterexample-guided refinement to automatically achieve abstraction precision that is sufficient to synthesize a surveillance strategy. We evaluate the proposed method on two case-studies, demonstrating its applicability to large state-spaces and diverse requirements.

1703.02660 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY 版本更新

Towards Generalization and Simplicity in Continuous Control

连续控制中的泛化与简洁性

Aravind Rajeswaran, Kendall Lowrey, Emanuel Todorov, Sham Kakade

发表机构 * University of Washington(华盛顿大学)

AI总结 本文展示简单线性与RBF参数化策略可解决多种连续控制任务,性能可与更复杂网络相媲美,且多样初始化提升泛化能力。

Comments NIPS 2017, Project page: https://sites.google.com/view/simple-pol

详情
AI中文摘要

本文表明,简单线性及RBF参数化策略可训练解决多种连续控制任务,包括OpenAI Gym基准。这些策略性能可与更复杂参数化方法相媲美。现有训练测试场景受限且易过拟合,导致仅轨迹中心策略。多样初始化产生更具全局性的策略,允许系统在大扰动下恢复,如补充视频所示。

英文摘要

This work shows that policies with simple linear and RBF parameterizations can be trained to solve a variety of continuous control tasks, including the OpenAI gym benchmarks. The performance of these trained policies are competitive with state of the art results, obtained with more elaborate parameterizations such as fully connected neural networks. Furthermore, existing training and testing scenarios are shown to be very limited and prone to over-fitting, thus giving rise to only trajectory-centric policies. Training with a diverse initial state distribution is shown to produce more global policies with better generalization. This allows for interactive control scenarios where the system recovers from large on-line perturbations; as shown in the supplementary video.

1709.04407 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

An Inversion-Based Learning Approach for Improving Impromptu Trajectory Tracking of Robots with Non-Minimum Phase Dynamics

基于逆向学习的方法用于改进具有非最小相位动态的机器人即兴轨迹跟踪

Siqi Zhou, Mohamed K. Helwa, Angela P. Schoellig

发表机构 * Dynamic Systems Lab(动态系统实验室) Institute for Aerospace Studies(航空航天研究 institute) University of Toronto(多伦多大学) Cairo University(开罗大学)

AI总结 本文提出一种基于学习的方法,用于改进非最小相位系统的即兴轨迹跟踪,通过直接从输入输出数据学习稳定近似逆向,验证了方法的稳定性与高精度跟踪效果。

Comments Accepted for publication in the IEEE Robotics and Automation Letters (RA-L), July 2018

详情
AI中文摘要

本文提出了一种基于学习的方法,用于非最小相位系统的即兴轨迹跟踪。逆向前馈方法常用于提高跟踪性能,但无法直接应用于非最小相位系统,因为其固有不稳定。为解决此问题,现有方法假设系统模型已知,并使用预动作或逆向近似技术。本文提出了一种从输入输出数据直接学习稳定近似逆向的方法。通过理论讨论、模拟和两种不同平台的实验,展示了所提方法的稳定性及其在高精度即兴跟踪中的有效性。此外,本文还表明,在训练中包含更多信息,尽管通常被认为有用,但未必能提高性能,反而可能引发不稳定性并影响整体方法的效果。

英文摘要

This paper presents a learning-based approach for impromptu trajectory tracking for non-minimum phase systems, i.e., systems with unstable inverse dynamics. Inversion-based feedforward approaches are commonly used for improving tracking performance; however, these approaches are not directly applicable to non-minimum phase systems due to their inherent instability. In order to resolve the instability issue, existing methods have assumed that the system model is known and used pre-actuation or inverse approximation techniques. In this work, we propose an approach for learning a stable, approximate inverse of a non-minimum phase baseline system directly from its input-output data. Through theoretical discussions, simulations, and experiments on two different platforms, we show the stability of our proposed approach and its effectiveness for high-accuracy, impromptu tracking. Our approach also shows that including more information in the training, as is commonly assumed to be useful, does not lead to better performance but may trigger instability and impact the effectiveness of the overall approach.

1803.01579 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Motion and Cooperative Transportation Planning for Multi-Agent Systems under Temporal Logic Formulas

多智能体系统在时序逻辑公式下的运动与协作运输规划

Christos K. Verginis, Dimos V. Dimarogonas

发表机构 * KTH Center for Autonomous Systems(KTH 自主系统中心)

AI总结 本文提出一种混合控制框架,用于在高阶目标表达为线性时序逻辑(LTL)公式的情况下,多智能体系统的运动规划。设计控制协议以实现智能体在预定义兴趣区域间的过渡及协作运输物体。通过抽象智能体与物体的耦合行为为有限状态转移系统,设计满足智能体和物体规格的高层多智能体计划。

Comments Submitted to IEEE Transactions on Automation Science and Engineering. arXiv admin note: text overlap with arXiv:1611.05186

详情
AI中文摘要

本文提出了一种混合控制框架,用于多智能体系统(包括N个机器人和M个物体)在高阶目标表达为线性时序逻辑(LTL)公式的情况下进行运动规划。特别地,我们设计了控制协议,使智能体能够在预定义的兴趣区域之间过渡,并通过智能体协作运输物体。这允许将智能体和物体的耦合行为抽象为有限状态转移系统,并设计一个满足智能体和物体规格的高层多智能体计划,这些规格以时序逻辑公式给出。仿真结果验证了所提框架。

英文摘要

This paper presents a hybrid control framework for the motion planning of a multi-agent system including N robotic agents and M objects, under high level goals expressed as Linear Temporal Logic (LTL) formulas. In particular, we design control protocols that allow the transition of the agents as well as the cooperative transportation of the objects by the agents, among predefined regions of interest in the workspace. This allows to abstract the coupled behavior of the agents and the objects as a finite transition system and to design a high-level multi-agent plan that satisfies the agents' and the objects' specifications, given as temporal logic formulas. Simulation results verify the proposed framework.

1802.08327 2026-06-04 cs.SE cs.RO cs.SY eess.SY 版本更新

From Hazard Analysis to Hazard Mitigation Planning: The Automated Driving Case

从危险分析到危险缓解规划:自动驾驶案例

Mario Gleirscher, Stefan Kugele

AI总结 本文提出一个框架,用于分析和设计能够实时识别和缓解危险的规划器,同时介绍构建规划模型的增量算法及基于给定控制系统架构设计故障容错控制器的示例应用。

详情
Journal ref
Barrett C., Davies M., Kahsai T. (eds) NASA Formal Methods. LNCS, vol 10227. Springer
AI中文摘要

车辆安全依赖于识别的危险范围和缓解这些危险的运营情况,以降低风险。随着自动化程度的提高,风险责任可能增加,因此高自动化车辆必须配备经过验证的控制器,能够可靠地在所有可能的运营情况下识别和缓解危险。为此,可用的方法必须通过危险分析和缓解的模型来支持自动化车辆控制器的设计和验证。本文描述了(1)一个用于分析和设计规划器(即高层控制器)的框架,该规划器能够实时识别和缓解危险;(2)一个增量算法,用于从危险分析中构建规划模型;以及(3)一个基于给定控制系统架构设计故障容错控制器的示例应用。我们的方法使安全工程师能够(2a)详细阐述危险场景并(2b)设计缓解此类场景的操作策略。

英文摘要

Vehicle safety depends on (a) the range of identified hazards and (b) the operational situations for which mitigations of these hazards are acceptably decreasing risk. Moreover, with an increasing degree of autonomy, risk ownership is likely to increase for vendors towards regulatory certification. Hence, highly automated vehicles have to be equipped with verified controllers capable of reliably identifying and mitigating hazards in all possible operational situations. To this end, available methods for the design and verification of automated vehicle controllers have to be supported by models for hazard analysis and mitigation. In this paper, we describe (1) a framework for the analysis and design of planners (i.e., high-level controllers) capable of run-time hazard identification and mitigation, (2) an incremental algorithm for constructing planning models from hazard analysis, and (3) an exemplary application to the design of a fail-operational controller based on a given control system architecture. Our approach equips the safety engineer with concepts and steps to (2a) elaborate scenarios of endangerment and (2b) design operational strategies for mitigating such scenarios.

1802.07346 2026-06-04 cs.RO cs.SY eess.SP eess.SY stat.AP 版本更新

Cooperative Robot Localization Using Event-triggered Estimation

基于事件触发估计的协作机器人定位

Michael Ouimet, David Iglesias, Nisar Ahmed, Sonia Martinez

发表机构 * SPAWAR Systems Center Pacific(SPAWAR太平洋系统中心) University of Colorado Boulder(科罗拉多大学博尔德分校) University of California San Diego(加州大学圣地亚哥分校)

AI总结 本文提出一种低通信开销的协作定位算法,通过事件触发机制仅在状态估计创新度高时发送测量,结合协方差交叠机制实现网络状态同步,实验验证了其在多种动态模型下的高效定位性能。

Comments Revised submission in review with AIAA Journal of Aerospace Information Systems (JAIS), submitted February 17, 2018

详情
AI中文摘要

本文描述了一种新颖的低通信开销协作定位算法,适用于移动无人机器人团队。利用基于事件的估计范式,机器人仅在状态估计创新度较高时向邻居发送测量。由于代理已知触发测量的条件,缺少的测量信息也被融合到状态估计中。机器人使用协方差交叠(CI)机制偶尔同步其对完整网络状态的局部估计。此外,启发式平衡动态确保在大直径网络中,局部误差协方差始终保持在预期范围内。线性和非线性动态/测量模型的仿真表明,事件触发方法在广泛的操作条件下实现了接近最优的状态估计性能,即使仅使用传统全数据共享所需通信开销的小部分。还检验了所提方法对丢包通信的鲁棒性以及网络拓扑与基于CI的同步需求之间的关系。

英文摘要

This paper describes a novel communication-spare cooperative localization algorithm for a team of mobile unmanned robotic vehicles. Exploiting an event-based estimation paradigm, robots only send measurements to neighbors when the expected innovation for state estimation is high. Since agents know the event-triggering condition for measurements to be sent, the lack of a measurement is thus also informative and fused into state estimates. The robots use a Covariance Intersection (CI) mechanism to occasionally synchronize their local estimates of the full network state. In addition, heuristic balancing dynamics on the robots' CI-triggering thresholds ensure that, in large diameter networks, the local error covariances remains below desired bounds across the network. Simulations on both linear and nonlinear dynamics/measurement models show that the event-triggering approach achieves nearly optimal state estimation performance in a wide range of operating conditions, even when using only a fraction of the communication cost required by conventional full data sharing. The robustness of the proposed approach to lossy communications, as well as the relationship between network topology and CI-based synchronization requirements, are also examined.

1802.07199 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Parametric MPC Approach to Balancing the Cost of Abstraction for Differential-Drive Mobile Robots

一种参数MPC方法用于平衡抽象成本的差分驱动移动机器人

Paul Glotfelter, Magnus Egerstedt

发表机构 * Georgia Institute of Technology(佐治亚理工学院)

AI总结 本文提出了一种参数MPC方法,用于优化差分驱动移动机器人的抽象成本,通过分析精度和机动性成本,改进了单积分器模型的控制策略。

Comments ICRA 2018

详情
AI中文摘要

在为差分驱动移动机器人设计控制策略时,一种标准工具是考虑一条垂直于车轮轴线的固定距离的点,而不是车辆的完整姿态。这种抽象支持将非holonomic、三状态unicycle模型替换为更简单的两状态单积分器模型(即速度控制点)。然而,这种转换会带来性能代价,通过机器人的精度和机动性。本文推导了这些精度和机动性成本的表达式,以转换参数为变量。此外,这些成本表明,在应用过程中仅一次选择参数可能导致精度损失。模型预测控制(MPC)代表一种缓解此状况的方法。然而,MPC通常产生控制信号而不是参数,因此本文还提出了一种参数模型预测控制(PMPC)方法,用于参数和采样时间窗优化。实验结果展示了参数化对实际差分驱动移动机器人上开发的单积分器模型算法部署的影响。

英文摘要

When designing control strategies for differential-drive mobile robots, one standard tool is the consideration of a point at a fixed distance along a line orthogonal to the wheel axis instead of the full pose of the vehicle. This abstraction supports replacing the non-holonomic, three-state unicycle model with a much simpler two-state single-integrator model (i.e., a velocity-controlled point). Yet this transformation comes at a performance cost, through the robot's precision and maneuverability. This work contains derivations for expressions of these precision and maneuverability costs in terms of the transformation's parameters. Furthermore, these costs show that only selecting the parameter once over the course of an application may cause an undue loss of precision. Model Predictive Control (MPC) represents one such method to ameliorate this condition. However, MPC typically realizes a control signal, rather than a parameter, so this work also proposes a Parametric Model Predictive Control (PMPC) method for parameter and sampling horizon optimization. Experimental results are presented that demonstrate the effects of the parameterization on the deployment of algorithms developed for the single-integrator model on actual differential-drive mobile robots.

1702.03258 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Adaptive and Resilient Soft Tensegrity Robots

自适应且具有韧性的软 tensegrity 机器人

John Rieffel, Jean-Baptiste Mouret

发表机构 * Union College(联合学院) Inria Nancy Grand - Est CNRS, Loria, UMR 7503(法国国家科学研究中心(CNRS)、洛里亚实验室(Loria)、UMR 7503)

AI总结 本文提出一种易于组装的基于 tensegrity 的软机器人,能产生高动态运动步态,并在物理损伤下表现出结构和行为韧性,通过机器学习算法实现有效步态发现。

Comments video: https://youtu.be/SuLQDhrk9tQ

详情
AI中文摘要

生物体结合软(如肌肉)和硬(如骨骼)材料,赋予其内在灵活性和韧性,这在传统刚性机器人中往往缺失。新兴的软机器人领域试图利用这些特性创造韧性机器。然而,软材料的性质给设计、制造和控制带来了重大挑战,迄今为止,大多数软机器人的步态都是通过经验试错法手动设计的。本文描述了一种易于组装的基于 tensegrity 的软机器人,能够产生高度动态的运动步态,并在面对物理损伤时表现出结构和行为的韧性。使这一成果成为可能的是使用一种机器学习算法,能够以最少的物理试验发现有效的步态。这些结果进一步支持了软机器人方法,旨在利用复杂材料动力学的相互作用,以生成丰富的动态行为。

英文摘要

Living organisms intertwine soft (e.g., muscle) and hard (e.g., bones) materials, giving them an intrinsic flexibility and resiliency often lacking in conventional rigid robots. The emerging field of soft robotics seeks to harness these same properties in order to create resilient machines. The nature of soft materials, however, presents considerable challenges to aspects of design, construction, and control -- and up until now, the vast majority of gaits for soft robots have been hand-designed through empirical trial-and-error. This manuscript describes an easy-to-assemble tensegrity-based soft robot capable of highly dynamic locomotive gaits and demonstrating structural and behavioral resilience in the face of physical damage. Enabling this is the use of a machine learning algorithm able to discover effective gaits with a minimal number of physical trials. These results lend further credence to soft-robotic approaches that seek to harness the interaction of complex material dynamics in order to generate a wealth of dynamical behaviors.

1802.06314 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Autonomous Vehicle Speed Control for Safe Navigation of Occluded Pedestrian Crosswalk

自动驾驶车辆速度控制:安全通过遮挡人行横道

Sarah Thornton

发表机构 * Dynamic Design Lab(动态设计实验室)

AI总结 本文提出基于部分可观测马尔可夫决策过程的速度控制方法,用于安全通过遮挡人行横道,通过动态规划计算控制策略以应对感知限制。

Comments 6 pages, 9 figures

详情
AI中文摘要

人类和自动驾驶车辆传感器的感知能力有限。当这些限制与涉及易受伤害道路使用者的场景重合时,必须在运动规划器中考虑这些限制。在遮挡人行横道的场景中,接近车辆的速度应是道路上不确定性量的函数。在本工作中,纵向控制器被建模为部分可观测马尔可夫决策过程,并使用动态规划计算控制策略。该控制策略将速度剖面传递给模型预测转向控制器。

英文摘要

Both humans and the sensors on an autonomous vehicle have limited sensing capabilities. When these limitations coincide with scenarios involving vulnerable road users, it becomes important to account for these limitations in the motion planner. For the scenario of an occluded pedestrian crosswalk, the speed of the approaching vehicle should be a function of the amount of uncertainty on the roadway. In this work, the longitudinal controller is formulated as a partially observable Markov decision process and dynamic programming is used to compute the control policy. The control policy scales the speed profile to be used by a model predictive steering controller.

1802.05730 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Pedestrian-Robot Interaction Experiments in an Exit Corridor

行人-机器人在出口走廊中的交互实验

Zhuo Chen, Chao Jiang, Yi Guo

AI总结 本文通过实验研究机器人与行人交互对群体流动的影响,发现机器人运动减缓行人速度,为未来设计机器人辅助疏散算法提供指导。

Comments Submitted to the 15th International Conference on Ubiquitous Robots, Honolulu, 2018

详情
AI中文摘要

人类-机器人交互(HRI)的研究因机器人在人群中的导航问题而受到越来越多的关注。本文提出了在单向出口走廊中进行行人-机器人交互的实证研究。我们部署了一个沿行人流动方向垂直移动的移动机器人,并安装了行人运动跟踪系统以记录集体运动。我们分析了行人个体和集体运动,并测量了机器人运动对整体行人流动的影响。实验结果表明,被动HRI的效果是,当有机器人存在时,行人整体速度减慢,机器人运动越快,平均行人速度越低。实验结果显示出集体HRI效果与之前报告的模拟结果在定性上一致。本研究可用于指导未来机器人辅助行人疏散算法的设计。

英文摘要

The study of human-robot interaction (HRI) has received increasing research attention for robot navigation in pedestrian crowds. In this paper, we present empirical study of pedestrian-robot interaction in an uni-directional exit corridor. We deploy a mobile robot moving in a direction perpendicular to that of the pedestrian flow, and install a pedestrian motion tracking system to record the collective motion. We analyze both individual and collective motion of pedestrians, and measure the effect of the robot motion on the overall pedestrian flow. The experimental results show the effect of passive HRI, where the pedestrians' overall speed is slowed down in the presence of the robot, and the faster the robot moves, the lower the average pedestrian velocity becomes. Experiment results show qualitative consistency of the collective HRI effect with simulation results that was previously reported. The study can be used to guide future design of robot-assisted pedestrian evacuation algorithms.

1705.01292 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems

一种用于不确定机器人系统中基于学习的控制的通用安全框架

Jaime F. Fisac, Anayo K. Akametalu, Melanie N. Zeilinger, Shahab Kaynama, Jeremy Gillula, Claire J. Tomlin

发表机构 * Department of Mechanical and Process Engineering, ETH Zurich(瑞士苏黎世联邦理工学院机械与过程工程系) Clearpath Robotics(Clearpath机器人公司) Electronic Frontier Foundation(电子前沿基金会)

AI总结 本文提出一种通用安全框架,结合Hamilton-Jacobi可达性方法和任意学习算法,通过近似系统动力学知识确保约束满足,同时减少对学习过程的干扰,并引入贝叶斯机制提升安全分析。

Comments Accepted for publication in IEEE Transactions on Automatic Control. Video with experiments: https://youtu.be/WAAxyeSk2bw

详情
AI中文摘要

基于学习的控制方案在物理世界中应用的有效性强烈推动了其在机器人系统中的使用。然而,保证学习过程中的正确操作目前仍是一个未解决的问题,这对安全关键系统至关重要。我们提出了一种基于Hamilton-Jacobi可达性方法的通用安全框架,可以与任意学习算法协同工作。该方法利用对系统动力学的近似知识来保证约束满足,同时尽可能减少对学习过程的干扰。我们进一步引入了贝叶斯机制,通过系统获取的新证据来细化安全分析,从而在适当的时候减少初始保守性,同时通过实时验证加强保证。结果是一种最不具限制性的、安全保持的控制律,仅在(a)计算的安全保证要求时或(b)由于新观察到的置信度下降时介入。我们通过概率和最坏情况分析相结合的理论方法证明了安全保证,并在四旋翼飞行器上进行了实验验证。尽管安全分析基于一个简单的质点模型,四旋翼通过策略梯度强化学习成功到达了合适的控制器,从未发生碰撞,并在飞行过程中安全地远离强外部干扰。

英文摘要

The proven efficacy of learning-based control schemes strongly motivates their application to robotic systems operating in the physical world. However, guaranteeing correct operation during the learning process is currently an unresolved issue, which is of vital importance in safety-critical systems. We propose a general safety framework based on Hamilton-Jacobi reachability methods that can work in conjunction with an arbitrary learning algorithm. The method exploits approximate knowledge of the system dynamics to guarantee constraint satisfaction while minimally interfering with the learning process. We further introduce a Bayesian mechanism that refines the safety analysis as the system acquires new evidence, reducing initial conservativeness when appropriate while strengthening guarantees through real-time validation. The result is a least-restrictive, safety-preserving control law that intervenes only when (a) the computed safety guarantees require it, or (b) confidence in the computed guarantees decays in light of new observations. We prove theoretical safety guarantees combining probabilistic and worst-case analysis and demonstrate the proposed framework experimentally on a quadrotor vehicle. Even though safety analysis is based on a simple point-mass model, the quadrotor successfully arrives at a suitable controller by policy-gradient reinforcement learning without ever crashing, and safely retracts away from a strong external disturbance introduced during flight.

1802.00922 2026-06-04 cs.RO cs.NI cs.OS cs.SY eess.SY 版本更新

Realizing Uncertainty-Aware Timing Stack in Embedded Operating System

在嵌入式操作系统中实现不确定性感知的定时栈

Amr Alanwar, Fatima M. Anwar, Joao P Hespanha, Mani Srivastava

发表机构 * University of California(加州大学) University of California, Santa Barbara(加州大学圣巴巴拉分校) Los Angeles(洛杉矶)

AI总结 本文提出了一种基于卡尔曼滤波的时间同步协议,用于在嵌入式系统中处理时间不确定性,通过标准嵌入式Linux平台实现不确定性感知的时钟模型。

Comments In Proc. of the Embedded Operating Systems Workshop, 2016

详情
AI中文摘要

时间感知对于运行在商用平台和操作系统上的广泛新兴应用至关重要,包括网络物理系统和物联网。传统上,通过最佳努力的后台服务在设备间同步时间,其性能既不可观察也不可控制,从而消耗系统资源,而不管应用需求如何,同时不允许应用程序和操作系统服务适应系统时间中的不确定性变化。本文提倡重新思考系统堆栈中时间的管理方式。在本文中,我们提出了一种新的时钟模型,该模型真实地描述了各种时间不确定性的来源。然后,我们提出了一种基于卡尔曼滤波的时间同步协议,该协议能够适应由时钟模型暴露的不确定性。我们对不确定性感知时钟模型和同步协议的实现是基于标准嵌入式Linux平台的。

英文摘要

Time awareness is critical to a broad range of emerging applications -- in Cyber-Physical Systems and Internet of Things -- running on commodity platforms and operating systems. Traditionally, time is synchronized across devices through a best-effort background service whose performance is neither observable nor controllable, thus consuming system resources independently of application needs while not allowing the applications and OS services to adapt to changes in uncertainty in system time. We advocate for rethinking how time is managed in a system stack. In this paper, we propose a new clock model that characterizes various sources of timing uncertainties in true time. We then present a Kalman filter based time synchronization protocol that adapts to the uncertainties exposed by the clock model. Our realization of a uncertainty-aware clock model and synchronization protocol is based on a standard embedded Linux platform.

1801.09877 2026-06-04 cs.RO cs.SY eess.SY 版本更新

On the Use of the Observability Gramian for Partially Observed Robotic Path Planning Problems

关于可观测性格拉姆矩阵在部分观测机器人路径规划问题中的应用

Mohammadhussein Rafieisakhaei, Suman Chakravorty, P. R. Kumar

发表机构 * Department of Electrical and Computer Engineering(电气与计算机工程系)

AI总结 本文探讨了利用可观测性格拉姆矩阵作为估计性能代理进行优化的局限性,指出其可能产生误导性的路径规划结果。

Comments 6 pages, 9 figures. CDC 2017

详情
Journal ref
2017 IEEE 56th Annual Conference on Decision and Control (CDC)
AI中文摘要

优化可观测性格拉姆矩阵作为估计性能的代理可能在存在观测不确定性的情况下提供无关或误导性的轨迹,这对路径规划提出了挑战。

英文摘要

Optimizing measures of the observability Gramian as a surrogate for the estimation performance may provide irrelevant or misleading trajectories for planning under observation uncertainty.

1609.09861 2026-06-04 eess.SY cs.RO cs.SY 版本更新

An Efficient Optimal Planning and Control Framework For Quadrupedal Locomotion

一种高效的四足机器人运动规划与控制框架

Farbod Farshidian, Michael Neunert, Alexander W. Winkler, Gonzalo Rey, Jonas Buchli

AI总结 本文提出一种高效的动态规划框架,用于四足机器人的最优规划与控制,通过多级优化方法高效求解切换时间和连续控制输入,并采用连续时间约束LQR算法优化前馈与反馈控制器。

Comments 8 Pages

详情
AI中文摘要

本文提出了一种高效的动态规划框架,用于四足机器人最优规划与控制。首先将问题表述为切换系统的最优控制问题。然后提出多级优化方法,以求得最优切换时间和最优连续控制输入。通过此方案,分解优化可能比综合方法更高效。最后,我们提出一种连续时间约束LQR算法,同时优化前馈和反馈控制器,具有O(n)的时间复杂度。为了验证我们的方法,我们在四足机器人上展示了框架的性能。我们选择质心动力学和完整运动学公式作为切换系统模型,其中切换时间以及接触力和关节速度针对不同的运动任务(如跨越间隙、行走和小跑)进行优化。

英文摘要

In this paper, we present an efficient Dynamic Programing framework for optimal planning and control of legged robots. First we formulate this problem as an optimal control problem for switched systems. Then we propose a multi--level optimization approach to find the optimal switching times and the optimal continuous control inputs. Through this scheme, the decomposed optimization can potentially be done more efficiently than the combined approach. Finally, we present a continuous-time constrained LQR algorithm which simultaneously optimizes the feedforward and feedback controller with $O(n)$ time-complexity. In order to validate our approach, we show the performance of our framework on a quadrupedal robot. We choose the Center of Mass dynamics and the full kinematic formulation as the switched system model where the switching times as well as the contact forces and the joint velocities are optimized for different locomotion tasks such as gap crossing, walking and trotting.

1801.09361 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Safe and Efficient Intersection Control of Connected and Autonomous Intersection Traffic

连接与自动驾驶交叉口交通的安全高效交叉控制

Qiang Lu

发表机构 * Daniel Felix Ritchie School of Engineering and Computer Science(丹尼尔·费利克斯·里奇工程与计算机科学学院)

AI总结 本文提出DICA算法,用于协调自动驾驶车辆在交叉口的安全高效通行,同时通过Reactive DICA算法优化紧急车辆通行优先级,确保应急车辆快速通过而最小化其他车辆的行程影响。

Comments 104 pages, 23 figures, PhD comprehensive thesis

详情
AI中文摘要

本论文针对自动驾驶和连接地面交通在交叉口的安全高效通行问题,提出了一种名为离散时间占用轨迹基于交叉口交通协调算法(DICA)的算法。所有系统中的车辆均为连接和自动驾驶车辆(CAVs),并具备无线车与交叉口通信能力。在所提出的框架中,交叉口基于车辆提出的DTOTs协调其运动,以高效通过交叉口并避免碰撞。在车辆DTOTs发生碰撞时,交叉口修改冲突的DTOTs以避免碰撞,并请求CAVs根据修改后的DTOTs接近和通过交叉口。我们证明了基本DICA是无死锁和无饥饿的,并且在计算复杂度上具有保守性,通过几种计算方法进行了改进。接着,本文还解决了通过自动驾驶和连接交叉口交通快速疏散紧急车辆的问题。所提出的交叉口控制算法Reactive DICA旨在确定一个高效的车辆通行序列,使紧急车辆能够尽快通过交叉口,同时最小化其他车辆的行程时间。当没有紧急车辆在交叉口区域内时,车辆由DICA控制。当有紧急车辆进入通信范围时,我们通过优化车辆的排序优先处理紧急车辆。提出了一种遗传算法来解决优化问题,找到最优的车辆序列,使紧急车辆获得最高优先级。

英文摘要

In this dissertation, we address a problem of safe and efficient intersection crossing traffic management of autonomous and connected ground traffic. Toward this objective, an algorithm that is called the Discrete-time occupancies trajectory based Intersection traffic Coordination Algorithm (DICA) is proposed. All vehicles in the system are Connected and Autonomous Vehicles (CAVs) and capable of wireless Vehicle-to-Intersection communication. In the proposed framework, an intersection coordinates the motions of CAVs based on their proposed DTOTs to let them cross the intersection efficiently while avoiding collisions. In case when there is a collision between vehicles' DTOTs, the intersection modifies conflicting DTOTs to avoid the collision and requests CAVs to approach and cross the intersection according to the modified DTOTs. We then prove that the basic DICA is deadlock free and also starvation free. We also show that the basic DICA is conservative in computational complexity and improve it by several computational approaches. Next, we addressed the problem of evacuating emergency vehicles as quickly as possible through autonomous and connected intersection traffic in this dissertation. The proposed intersection control algorithm Reactive DICA aims to determine an efficient vehicle-passing sequence which allows the emergency vehicle to cross an intersection as soon as possible while the travel times of other vehicles are minimally affected. When there are no emergency vehicles within the intersection area, the vehicles are controlled by DICA. When there are emergency vehicles entering communication range, we prioritize emergency vehicles through optimal ordering of vehicles. A genetic algorithm is proposed to solve the optimization problem which finds the optimal vehicle sequence that gives the emergency vehicles the highest priority.

1801.07132 2026-06-04 cs.CR cs.RO cs.SY eess.SY 版本更新

SecSens: Secure State Estimation with Application to Localization and Time Synchronization

SecSens: 带有定位和时间同步应用的安全状态估计

Amr Alanwar, Bernhard Etzlinger, Henrique Ferraz, Joao Hespanha, Mani Srivastava

发表机构 * University of California, Los Angeles(加州大学洛杉矶分校) University of California, Santa Barbara(加州大学圣芭芭拉分校) Johannes Kepler University(约翰·凯撒大学)

AI总结 本文提出SecSens,一种用于对抗模型和测量噪声的新型安全非线性状态估计方法,通过SecEKF和SecOPT算法实现安全状态估计,无需专用硬件或加密技术,在强攻击下性能优于现有方案。

详情
AI中文摘要

针对网络物理系统中信息技术和物理传感耦合带来的安全漏洞,本文提出SecSens,一种基于扩展卡尔曼滤波和最大似然估计的新型安全非线性状态估计方法。SecSens包含SecEKF和SecOPT两个独立算法,通过整体方法引入安全意识,无需专用硬件或加密技术。本文将SecSens应用于网络化移动设备的安全定位和时间同步,实验表明其在强攻击下性能优于现有方案。

英文摘要

Research evidence in Cyber-Physical Systems (CPS) shows that the introduced tight coupling of information technology with physical sensing and actuation leads to more vulnerability and security weaknesses. But, the traditional security protection mechanisms of CPS focus on data encryption while neglecting the sensors which are vulnerable to attacks in the physical domain. Accordingly, researchers attach utmost importance to the problem of state estimation in the presence of sensor attacks. In this work, we present SecSens, a novel approach for secure nonlinear state estimation in the presence of modeling and measurement noise. SecSens consists of two independent algorithms, namely, SecEKF and SecOPT, which are based on Extended Kalman Filter and Maximum Likelihood Estimation, respectively. We adopt a holistic approach to introduce security awareness among state estimation algorithms without requiring specialized hardware, or cryptographic techniques. We apply SecSens to securely localize and time synchronize networked mobile devices. SecSens provides good performance at run-time several order of magnitude faster than the state of art solutions under the presence of powerful attacks. Our algorithms are evaluated on a testbed with static nodes and a mobile quadrotor all equipped with commercial ultra-wide band wireless devices.

1801.04816 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Localizability-Constrained Deployment of Mobile Robotic Networks with Noisy Range Measurements

具有噪声测距的移动机器人网络的局部化约束部署

Jerome Le Ny, Simon Chauvière

发表机构 * Polytechnique Montreal(蒙特利尔理工学院) GERAD

AI总结 本文研究了在噪声测距下如何部署移动机器人网络以实现高精度定位,通过局部化函数优化网络几何结构,结合梯度下降法进行分布式控制。

Comments 7 pages, 3 figures

详情
AI中文摘要

当移动网络中的节点使用相对于邻居的噪声测距来估计位置时,整体连通性和测量网络的几何结构对可实现的定位精度有关键影响。本文考虑了部署一个仅基于测距的协作定位移动机器人网络的问题,旨在维持有利于高精度估计机器人位置的网络几何结构。网络几何的质量通过一个称为“局部化性”的函数来衡量,该函数作为机器人运动规划的潜在场。该函数基于Cramér-Rao界,为给定的几何结构提供任何无偏位置估计器的协方差矩阵的下界。我们描述了基于梯度下降的机器人运动规划器,试图优化或约束网络局部化函数的不同变体,并讨论了以分布式方式实现这些控制器的方法。最后,本文还建立了统计观点与维护捕捉相对测距的图的加权刚性形式之间的正式联系。

英文摘要

When nodes in a mobile network use relative noisy measurements with respect to their neighbors to estimate their positions, the overall connectivity and geometry of the measurement network has a critical influence on the achievable localization accuracy. This paper considers the problem of deploying a mobile robotic network implementing a cooperative localization scheme based on range measurements only, while attempting to maintain a network geometry that is favorable to estimating the robots' positions with high accuracy. The quality of the network geometry is measured by a "localizability" function serving as potential field for robot motion planning. This function is built from the Cramér-Rao bound, which provides for a given geometry a lower bound on the covariance matrix achievable by any unbiased position estimator that the robots might implement using their relative measurements. We describe gradient descent-based motion planners for the robots that attempt to optimize or constrain different variations of the network's localizability function, and discuss ways of implementing these controllers in a distributed manner. Finally, the paper also establishes formal connections between our statistical point of view and maintaining a form of weighted rigidity for the graph capturing the relative range measurements.

1611.04706 2026-06-04 cs.RO cs.SY eess.SY 版本更新

High-Dimensional Stochastic Optimal Control using Continuous Tensor Decompositions

高维随机最优控制的连续张量分解应用

Alex A. Gorodetsky, Sertac Karaman, Youssef M. Marzouk

AI总结 本文提出基于连续张量分解的动态规划算法,解决高维状态空间中的随机最优控制问题,通过压缩表示实现多项式时间复杂度,显著提升计算效率。

Comments 32 pages, 20 figures

详情
AI中文摘要

运动规划和控制问题在几乎所有机器人应用中都是嵌入和关键的。这些问题通常被公式化为随机最优控制问题,并通过动态规划算法解决。不幸的是,大多数保证收敛到最优解的现有算法受到维度诅咒的影响:算法的运行时间随着系统状态空间维度的增长呈指数级增长。我们提出新的动态规划算法,以减轻在具有某些低秩结构的问题中的维度诅咒。所提出的算法基于最近由作者开发的连续张量分解。本质上,这些算法以压缩格式表示高维函数(例如价值函数),并在这种格式中直接执行动态规划计算(例如价值迭代、策略迭代)。在某些技术假设下,新算法保证能够以任意精度收敛到最优解。此外,新算法的运行时间与状态维度和价值函数的秩呈多项式增长。这种方法在

英文摘要

Motion planning and control problems are embedded and essential in almost all robotics applications. These problems are often formulated as stochastic optimal control problems and solved using dynamic programming algorithms. Unfortunately, most existing algorithms that guarantee convergence to optimal solutions suffer from the curse of dimensionality: the run time of the algorithm grows exponentially with the dimension of the state space of the system. We propose novel dynamic programming algorithms that alleviate the curse of dimensionality in problems that exhibit certain low-rank structure. The proposed algorithms are based on continuous tensor decompositions recently developed by the authors. Essentially, the algorithms represent high-dimensional functions (e.g., the value function) in a compressed format, and directly perform dynamic programming computations (e.g., value iteration, policy iteration) in this format. Under certain technical assumptions, the new algorithms guarantee convergence towards optimal solutions with arbitrary precision. Furthermore, the run times of the new algorithms scale polynomially with the state dimension and polynomially with the ranks of the value function. This approach realizes substantial computational savings in "compressible" problem instances, where value functions admit low-rank approximations. We demonstrate the new algorithms in a wide range of problems, including a simulated six-dimensional agile quadcopter maneuvering example and a seven-dimensional aircraft perching example. In some of these examples, we estimate computational savings of up to ten orders of magnitude over standard value iteration algorithms. We further demonstrate the algorithms running in real time on board a quadcopter during a flight experiment under motion capture.

1501.05151 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Recursive Bayesian Filtering in Circular State Spaces

循环状态空间中的递归贝叶斯滤波

Gerhard Kurz, Igor Gilitschenski, Uwe D. Hanebeck

发表机构 * Intelligent Sensor-Actuator-Systems Laboratory (ISAS)(智能传感器-执行器系统实验室) Institute for Anthropomatics and Robotics(人机学与机器人研究所) Karlsruhe Institute of Technology (KIT), Germany(卡尔斯鲁厄理工学院(KIT),德国)

AI总结 本文提出了一种基于循环统计的递归滤波框架,利用循环分布(如环绕正态分布和von Mises分布)估计循环状态,并通过高效确定性采样技术处理非线性系统和测量函数,引入了分布无关预测算法和改进的环绕正态密度乘法公式。

详情
AI中文摘要

针对基于循环统计的递归循环滤波,我们介绍了一个通用框架,用于基于不同循环分布(特别是环绕正态分布和von Mises分布)估计循环状态。我们提出了一种用于具有非线性系统和测量函数的循环系统的估计方法,通过依赖高效的确定性采样技术实现。此外,我们展示了在多种重要特殊情况下计算如何简化,例如具有加性噪声的系统以及身份系统或测量函数。我们引入了几个新的关键组件,特别是分布无关的预测算法、新的和更优的环绕正态密度乘法公式,以及处理非加性系统噪声的能力。所有提出的方法都经过彻底评估,并与几种最先进的解决方案进行了比较。

英文摘要

For recursive circular filtering based on circular statistics, we introduce a general framework for estimation of a circular state based on different circular distributions, specifically the wrapped normal distribution and the von Mises distribution. We propose an estimation method for circular systems with nonlinear system and measurement functions. This is achieved by relying on efficient deterministic sampling techniques. Furthermore, we show how the calculations can be simplified in a variety of important special cases, such as systems with additive noise as well as identity system or measurement functions. We introduce several novel key components, particularly a distribution-free prediction algorithm, a new and superior formula for the multiplication of wrapped normal densities, and the ability to deal with non-additive system noise. All proposed methods are thoroughly evaluated and compared to several state-of-the-art solutions.

1605.06645 2026-06-04 math.OC cs.RO cs.SY eess.SY math.DS 版本更新

Full-Pose Tracking Control for Aerial Robotic Systems with Laterally-Bounded Input Force

具有横向受限输入力的空中机器人系统的全姿态跟踪控制

Antonio Franchi, Ruggero Carli, Davide Bicego, Markus Ryll

发表机构 * Department of Information Engineering, University of Padova(意大利帕多瓦大学信息工程系)

AI总结 本文提出了一种新的几何控制策略,用于实现具有横向受限输入力的空中机器人系统在SE(3)中的全姿态轨迹跟踪,通过Lyapunov方法证明了可行全姿态参考轨迹的指数跟踪,并展示了该方法在欠驱动和全驱动平台中的应用效果。

详情
Journal ref
IEEE Trascation on Robotics, 2018
AI中文摘要

本文定义了一类通用的抽象空中机器人系统,称为横向受限力(LBF)车辆,其中大部分控制权沿主推力方向表达,而在横向方向上可能利用较小甚至为零的力来实现全姿态跟踪。此类系统能够很好地近似具有非共面/非共线旋翼的平台,这些平台可以使用倾斜螺旋桨略微改变总推力相对于机体框架的方向。对于此类广泛系统,我们引入了一种新的几何控制策略,以在力约束允许的情况下实现位置加姿态轨迹的独立跟踪。使用SE(3)中的Lyapunov技术证明了可行的全姿态参考轨迹的指数跟踪。该方法可以无缝处理欠驱动和全驱动的LBF平台。控制器在提供不可行的全姿态参考轨迹时,至少保证位置部分的跟踪。本文提供了几个实验测试,清晰展示了该方法的实用性以及与现有方法相比的显著改进。

英文摘要

In this paper, we define a general class of abstract aerial robotic systems named Laterally Bounded Force (LBF) vehicles, in which most of the control authority is expressed along a principal thrust direction, while in the lateral directions a (smaller and possibly null) force may be exploited to achieve full-pose tracking. This class approximates well platforms endowed with non-coplanar/non-collinear rotors that can use the tilted propellers to slightly change the orientation of the total thrust w.r.t. the body frame. For this broad class of systems, we introduce a new geometric control strategy in SE(3) to achieve, whenever made possible by the force constraints, the independent tracking of position-plus-orientation trajectories. The exponential tracking of a feasible full-pose reference trajectory is proven using a Lyapunov technique in SE(3). The method can deal seamlessly with both under- and fully-actuated LBF platforms. The controller guarantees the tracking of at least the positional part in the case that an unfeasible full-pose reference trajectory is provided. The paper provides several experimental tests clearly showing the practicability of the approach and the sharp improvement with respect to state of-the-art approaches.

1610.06781 2026-06-04 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY 版本更新

Modular Deep Q Networks for Sim-to-real Transfer of Visuo-motor Policies

模块化深度Q网络用于视觉-运动策略的仿真到现实迁移

Fangyi Zhang, Jürgen Leitner, Michael Milford, Peter Corke

发表机构 * Australian Centre for Robotic Vision (ACRV)(澳大利亚机器人视觉中心) Queensland University of Technology (QUT)(昆士兰理工大学)

AI总结 本文提出模块化深度强化学习方法,通过在感知与控制之间引入瓶颈,实现仿真到现实的迁移,提升机器人视觉-运动协调能力。

Comments Australasian Conference on Robotics and Automation (ACRA) 2017, Student Paper Award Finalist

详情
Journal ref
The proceedings of the Australasian Conference on Robotics and Automation (ACRA) 2017
AI中文摘要

尽管深度学习在计算机视觉中因大量视觉数据而取得显著成功,但为机器人学习收集足够大的现实世界数据集成本较高。为提高这些技术在真实机器人上的实用性,我们提出了一种模块化深度强化学习方法,能够将仿真训练的模型迁移到现实世界机器人任务中。我们引入了感知与控制之间的瓶颈,使网络能够独立训练,然后在端到端方式下合并和微调,以进一步提高视觉-运动协调性。在经典的平面视觉引导机器人抓取任务中,微调后的准确度达到1.6像素,显著优于直接迁移(17.5像素),显示出在更复杂和广泛的应用中的潜力。我们的方法提供了一种更高效学习和迁移视觉-运动策略的技术,无需完全依赖大规模现实世界机器人数据集。

英文摘要

While deep learning has had significant successes in computer vision thanks to the abundance of visual data, collecting sufficiently large real-world datasets for robot learning can be costly. To increase the practicality of these techniques on real robots, we propose a modular deep reinforcement learning method capable of transferring models trained in simulation to a real-world robotic task. We introduce a bottleneck between perception and control, enabling the networks to be trained independently, but then merged and fine-tuned in an end-to-end manner to further improve hand-eye coordination. On a canonical, planar visually-guided robot reaching task a fine-tuned accuracy of 1.6 pixels is achieved, a significant improvement over naive transfer (17.5 pixels), showing the potential for more complicated and broader applications. Our method provides a technique for more efficient learning and transfer of visuo-motor policies for real robotic systems without relying entirely on large real-world robot datasets.

1603.07567 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Dynamics, Control, and Estimation for Aerial Robots Tethered by Cables or Bars

缆绳或刚性杆连接的空中机器人动力学、控制与估计

Marco Tognon, Antonio Franchi

AI总结 研究缆绳或刚性杆连接的空中机器人动力学特性,提出基于加速度计和陀螺仪的非线性观测器和控制器,实现对系统状态的高精度估计与轨迹跟踪。

详情
Journal ref
IEEE Transaction on Robotics Volume: 33, Issue: 4, Aug. 2017
AI中文摘要

我们考虑了通过被动缆绳或被动刚性连接器连接到地面的空中机器人控制问题。我们详细阐述了该非线性机械系统的特性,包括微分平坦性、可控性和可观测性。我们证明该机器人系统相对于两个输出对微分平坦:连接器的高度和车辆的姿态;连接器的高度和纵向连接力(如缆绳张力或杆压缩)。我们展示了仅使用机载加速度计和陀螺仪设计的近全球收敛非线性观测器,用于估计完整状态。我们还设计了两个近全球收敛的非线性控制器,以跟踪任何足够平滑的时间变化轨迹。最后,我们通过数值测试在远离名义条件下的鲁棒性:非线性交叉耦合效应、参数偏差、测量噪声和非理想执行器。

英文摘要

We consider the problem of controlling an aerial robot connected to the ground by a passive cable or a passive rigid link. We provide a thorough characterization of this nonlinear dynamical robotic system in terms of fundamental properties such as differential flatness, controllability, and observability. We prove that the robotic system is differentially flat with respect to two output pairs: elevation of the link and attitude of the vehicle; elevation of the link and longitudinal link force (e.g., cable tension, or bar compression). We show the design of an almost globally convergent nonlinear observer of the full state that resorts only to an onboard accelerometer and a gyroscope. We also design two almost globally convergent nonlinear controllers to track any sufficiently smooth time-varying trajectory of the two output pairs. Finally we numerically test the robustness of the proposed method in several far-from-nominal conditions: nonlinear cross-coupling effects, parameter deviations, measurements noise and non ideal actuators.

1712.06021 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Bendable Cuboid Robot Path Planning with Collision Avoidance using Generalized $L_p$ Norms

可弯曲立方体机器人路径规划与碰撞避免使用广义L_p范数

Nak-seung P. Hyun, Patricio A. Vela, Erik I. Verriest

发表机构 * Georgia Institute of Technology(佐治亚理工学院)

AI总结 本文提出基于广义L_p范数的立方体机器人路径规划方法,通过隐式表面近似和双重优化问题解决碰撞避免问题,验证了方法的有效性。

Comments 12 pages, 6 figures

详情
AI中文摘要

本文考虑了刚性及可变形(可弯曲)立方体机器人的最优路径规划问题,通过提供基于广义L_p范数的解析安全约束来实现。对于常规立方体机器人,加权L_p范数的等高线生成其表面的隐式近似。对于可弯曲立方体机器人,极坐标中的加权L_p范数通过指定等高线隐式近似表面边界。环境中的障碍物体积被假设为加权L_p范数的子等高线近似。利用这些近似表面模型,最优安全路径规划问题被重新表述为一个两阶段优化问题,其中安全约束依赖于机器人上最接近障碍物的点。通过推导一组等式和不等式约束来替代最近点问题,从而为原始路径规划问题定义了额外的解析约束。将所有解析约束通过逻辑与操作结合,得到一个通用的最优安全路径规划问题。数值求解该问题涉及将其转换为非线性规划问题。对刚性和可弯曲立方体机器人进行了仿真验证。

英文摘要

Optimal path planning problems for rigid and deformable (bendable) cuboid robots are considered by providing an analytic safety constraint using generalized $L_p$ norms. For regular cuboid robots, level sets of weighted $L_p$ norms generate implicit approximations of their surfaces. For bendable cuboid robots a weighted $L_p$ norm in polar coordinates implicitly approximates the surface boundary through a specified level set. Obstacle volumes, in the environment to navigate within, are presumed to be approximately described as sub-level sets of weighted $L_p$ norms. Using these approximate surface models, the optimal safe path planning problem is reformulated as a two stage optimization problem, where the safety constraint depends on a point on the robot which is closest to the obstacle in the obstacle's distance metric. A set of equality and inequality constraints are derived to replace the closest point problem, which is then defines additional analytic constraints on the original path planning problem. Combining all the analytic constraints with logical AND operations leads to a general optimal safe path planning problem. Numerically solving the problem involve conversion to a nonlinear programing problem. Simulations for rigid and bendable cuboid robot verify the proposed method.

1711.11006 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

A Family of Iterative Gauss-Newton Shooting Methods for Nonlinear Optimal Control

非线性最优控制中的一种迭代高斯-牛顿射击方法族

Markus Giftthaler, Michael Neunert, Markus Stäuble, Jonas Buchli, Moritz Diehl

AI总结 本文提出了一种非线性最优控制的迭代算法族,扩展了经典的iLQR算法,结合了简单初始化和闭环前向积分的优势,具有线性复杂度,适用于高维欠驱动机器人等模拟场景。

Comments 8 pages

详情
AI中文摘要

本文介绍了一种用于无约束非线性最优控制的迭代算法族。我们扩展了经典的iLQR算法,将其推广到不同的多步射击变体,结合了诸如简单初始化和闭环前向积分等优点。所有算法具有相似的计算复杂度,即时间跨度的线性复杂度,并可以在同一计算框架下推导。我们比较了我们算法的全步变体,并展示了几个模拟示例,包括受接触开关影响的高维欠驱动机器人。仿真结果表明,我们的多步射击算法在收敛速度、局部收缩率和运行时间方面优于经典iLQR,使其成为非线性模型预测控制应用的更优选择。

英文摘要

This paper introduces a family of iterative algorithms for unconstrained nonlinear optimal control. We generalize the well-known iLQR algorithm to different multiple-shooting variants, combining advantages like straight-forward initialization and a closed-loop forward integration. All algorithms have similar computational complexity, i.e. linear complexity in the time horizon, and can be derived in the same computational framework. We compare the full-step variants of our algorithms and present several simulation examples, including a high-dimensional underactuated robot subject to contact switches. Simulation results show that our multiple-shooting algorithms can achieve faster convergence, better local contraction rates and much shorter runtimes than classical iLQR, which makes them a superior choice for nonlinear model predictive control applications.

1701.08735 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Real-Time Control for Autonomous Racing Based on Viability Theory

基于可行性理论的实时自主赛车控制

Alexander Liniger, John Lygeros

发表机构 * Automatic Control Laboratory, ETH Zurich(自动控制实验室,苏黎世联邦理工学院)

AI总结 本文基于可行性理论生成有限步前瞻轨迹,结合低层模型预测控制器实现实时自主赛车,通过游戏理论方法改进可行性核计算,提升安全性并减少 lap 时间。

Comments 26 pages, 11 figures

详情
AI中文摘要

本文研究了微型赛车的自主驾驶问题,利用可行性核生成递归可行的有限步前瞻轨迹,以最大化进度并保持在静态障碍物内。该方法结合低层模型预测控制器实现实时自主赛车。可行性核计算基于空间离散化,提出基于游戏理论的新型数值方案,特别是辨别核,以提高计算鲁棒性。结果显示,使用辨别核的保守近似方法在安全性上有所提升,但略微增加 lap 时间。

英文摘要

In this paper we consider autonomous driving of miniature race cars. The viability kernel is used to efficiently generate finite look-ahead trajectories that maximize progress while remaining recursively feasible with respect to static obstacles (e.g., stay inside the track). Together with a low-level model predictive controller, this method makes real-time autonomous racing possible. The viability kernel computation is based on space discretization. To make the calculation robust against discretization errors, we propose a novel numerical scheme based on game theoretical methods, in particular the discriminating kernel. We show that the resulting algorithm provides an inner approximation of the viability kernel and guarantees that, for all states in the cell surrounding a viable grid point, there exists a control that keeps the system within the kernel. The performance of the proposed control method is studied in simulation where we determine the effects of various design choices and parameters and in experiments on an autonomous racing set-up maintained at the Automatic Control Laboratory of ETH Zurich. Both simulation and experimental results suggest that the more conservative approximation using the discriminating kernel results in safer driving style at the cost of a small increase in lap time.

1712.02923 2026-06-04 cs.RO cs.SY eess.SY 版本更新

SPRK: A Low-Cost Stewart Platform For Motion Study In Surgical Robotics

SPRK:一种低成本的斯图尔特平台用于外科机器人中的运动研究

Vatsal Patel, Sanjay Krishnan, Aimee Goncalves, Ken Goldberg

发表机构 * UC Berkeley(加州大学伯克利分校)

AI总结 本文提出了一种低成本的斯图尔特平台,用于模拟呼吸、心跳和蠕动运动,其成本仅为250美元,比商用平台低两个数量级,具备精确的位移和旋转能力。

详情
AI中文摘要

为了模拟由于呼吸、心跳或蠕动运动导致的身体器官运动,我们设计了一种低成本、微型化的SPRK(斯图尔特平台研究套件),用于翻译和旋转假体组织。该平台尺寸为20厘米x20厘米x10厘米,可适配达芬奇研究套件(DVRK)外科机器人的工作空间,并成本仅为250美元,比商用斯图尔特平台低两个数量级。该平台在x、y和z方向上的位移范围为±1.27厘米,并具有正弦运动和呼吸启发式运动模式。还设计了模块化平台支架用于图案切割和去痂实验。该平台的定位控制器具有0.2秒的时间常数,其均方误差在x、y和z方向分别为1.22毫米、1.07毫米和0.20毫米。所有细节、CAD模型和控制软件均可在github.com/BerkeleyAutomation/sprk上找到。

英文摘要

To simulate body organ motion due to breathing, heart beats, or peristaltic movements, we designed a low-cost, miniaturized SPRK (Stewart Platform Research Kit) to translate and rotate phantom tissue. This platform is 20cm x 20cm x 10cm to fit in the workspace of a da Vinci Research Kit (DVRK) surgical robot and costs $250, two orders of magnitude less than a commercial Stewart platform. The platform has a range of motion of +/- 1.27 cm in translation along x, y, and z directions and has motion modes for sinusoidal motion and breathing-inspired motion. Modular platform mounts were also designed for pattern cutting and debridement experiments. The platform's positional controller has a time-constant of 0.2 seconds and the root-mean-square error is 1.22 mm, 1.07 mm, and 0.20 mm in x, y, and z directions respectively. All the details, CAD models, and control software for the platform is available at github.com/BerkeleyAutomation/sprk.

1712.02917 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Using Intermittent Synchronization to Compensate for Rhythmic Body Motion During Autonomous Surgical Cutting and Debridement

利用间歇同步来补偿自主手术切割和清创中的节律性身体运动

Vatsal Patel, Sanjay Krishnan, Aimee Goncalves, Carolyn Chen, Walter Doug Boyd, Ken Goldberg

AI总结 本文提出利用间歇同步技术来补偿手术中因呼吸、心跳和节律性运动导致的不稳定性,通过实验验证该方法在切割和清创任务中具有更高的鲁棒性和精度。

详情
AI中文摘要

手术过程中解剖结构由于呼吸、心跳和节律性运动而很少保持静止。受专家外科医生的启发,我们提出了一种与节律性运动极值(即最低速度窗口)进行间歇同步的方法。我们进行了两项实验:(1)模式切割,(2)清创。在(1)中,间歇同步方法虽然比跟踪运动慢1.8倍,但对噪声和控制延迟的鲁棒性显著提高,并将最大切割误差减少了2.6倍。在(2)中,无同步的基线方法每次移除的成功率为62%,而间歇同步方法达到80%。

英文摘要

Anatomical structures are rarely static during a surgical procedure due to breathing, heartbeats, and peristaltic movements. Inspired by observing an expert surgeon, we propose an intermittent synchronization with the extrema of the rhythmic motion (i.e., the lowest velocity windows). We performed 2 experiments: (1) pattern cutting, and (2) debridement. In (1), we found that the intermittent synchronization approach, while 1.8x slower than tracking motion, was significantly more robust to noise and control latency, and it reduced the max cutting error by 2.6x In (2), a baseline approach with no synchronization achieves 62% success rate for each removal, while intermittent synchronization achieves 80%.

1712.00634 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY math.OC 版本更新

PFAx: Predictable Feature Analysis to Perform Control

PFAx:可预测特征分析用于控制

Stefan Richthofer, Laurenz Wiskott

AI总结 PFAx通过整合补充信息提升预测性能,并透明展示补充信息对特征选择的影响,应用于强化学习环境中的智能体控制优化。

详情
AI中文摘要

可预测特征分析(PFA)(Richthofer, Wiskott, ICMLA 2015)是一种对高维输入信号进行降维的算法,提取最可预测的子信号。本文扩展了PFA,考虑补充信息以提高预测。补充信息不参与特征提取,特征仅从主输入中提取。PFAx透明地展示补充信息如何提升预测质量,并可生成补充信息以实现主信号的特定目标。该方法应用于强化学习环境,使智能体局部优化状态,接近目标。后续论文将扩展此方法以实现全局优化。

英文摘要

Predictable Feature Analysis (PFA) (Richthofer, Wiskott, ICMLA 2015) is an algorithm that performs dimensionality reduction on high dimensional input signal. It extracts those subsignals that are most predictable according to a certain prediction model. We refer to these extracted signals as predictable features. In this work we extend the notion of PFA to take supplementary information into account for improving its predictions. Such information can be a multidimensional signal like the main input to PFA, but is regarded external. That means it won't participate in the feature extraction - no features get extracted or composed of it. Features will be exclusively extracted from the main input such that they are most predictable based on themselves and the supplementary information. We refer to this enhanced PFA as PFAx (PFA extended). Even more important than improving prediction quality is to observe the effect of supplementary information on feature selection. PFAx transparently provides insight how the supplementary information adds to prediction quality and whether it is valuable at all. Finally we show how to invert that relation and can generate the supplementary information such that it would yield a certain desired outcome of the main signal. We apply this to a setting inspired by reinforcement learning and let the algorithm learn how to control an agent in an environment. With this method it is feasible to locally optimize the agent's state, i.e. reach a certain goal that is near enough. We are preparing a follow-up paper that extends this method such that also global optimization is feasible.

1712.00390 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Gain Scheduling LPV Control Scheme for the Autonomous Guidance Problem using a Dynamic Modelling Approach

基于动态建模方法的自主引导问题的增益调度LPV控制方案

Eugenio Alcalá, Vicenç Puig, Joseba Quevedo, Teresa Escobet

发表机构 * 1 Advanced Control Systems Group, Automatic Control Department, Universitat Polit\` e cnica de Catalunya (UPC), Rambla Sant Nebridi, 10, 08222, Terrassa, Spain 11pt 13.2pt This paper is a preprint of a paper submitted to IET Control Theory \& Applications. If accepted, copy of record will be available at the IET Digital Library.

AI总结 本文提出了一种用于城市自动驾驶车辆纵向和横向控制的增益调度LPV控制方法,通过动力学和运动学模型构建LPV表示,并采用级联控制方法同时控制车辆行为,通过解决两个LPV LMI-LQR问题实现性能优化。

详情
AI中文摘要

本文提出了一种用于城市自动驾驶车辆纵向和横向控制的增益调度LPV控制方法。利用运动学和动力学车辆模型,采用线性参数变化(LPV)表示,并提出了一种用于控制两种车辆行为的级联控制方法。特别是,在控制设计中,分别使用两种模型解决两个LPV LMI-LQR问题。此外,为了实现所需的性能水平,提出了一种基于运动学和动力学控制器级联设计的方法。该级联控制方案基于动态闭环行为比运动学闭环行为更快的设计理念。所获得的增益调度LPV控制方法,与轨迹生成模块结合,在模拟城市驾驶场景中展示了良好的效果。

英文摘要

This work proposes a solution for the longitudinal and lateral control problem of urban autonomous vehicles using a gain scheduling LPV control approach. Using the kinematic and dynamic vehicle models, a linear parameter varying (LPV) representation is adopted and a cascade control methodology is proposed for controlling both vehicle behaviours. In particular, for the control design, the use of both models separately lead to solve two LPV LMI-LQR problems. Furthermore, to achieve the desired levels of performance, an approach based on cascade design of the the kinematic and dynamic controllers has been proposed. This cascade control scheme is based on the idea that the dynamic closed loop behaviour is designed to be faster than the kinematic closed loop one. The obtained gain scheduling LPV control approach, jointly with a trajectory generation module, has presented suitable results in a simulated city driving scenario.

1711.11018 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

PDE-Based Optimization for Stochastic Mapping and Coverage Strategies using Robotic Ensembles

基于偏微分方程的机器人群体映射与覆盖策略优化

Karthik Elamvazhuthi, Hendrik Kuiper, Spring Berman

发表机构 * School for Engineering of Matter, Transport and Energy, Arizona State University(物质、传输与能量工程学院,亚利桑那州立大学) School of Mathematical and Statistical Sciences, Arizona State University(数学与统计科学学院,亚利桑那州立大学)

AI总结 本文提出基于偏微分方程的机器人群体控制框架,用于解决具有随机行为的机器人在有限感知与驱动能力下的映射与覆盖任务,通过凸优化和最优控制方法实现区域空间分布的重建与活动率调控。

详情
AI中文摘要

本文提出了一种基于偏微分方程(PDE)的新型框架,用于控制具有有限感知和驱动能力且表现出随机行为的机器人群体,以执行映射和覆盖任务。我们将机器人群体的动态建模为一个输运-扩散-反应PDE模型,并将映射和覆盖任务建模为该模型的识别与控制问题。在映射任务中,机器人被部署在封闭域内以收集无定位信息的数据,用于重建目标区域的未知空间分布。我们将该任务建模为一个凸优化问题,其解表示为PDE模型中的空间依赖系数。然后考虑覆盖问题,其中机器人必须以可编程的概率率执行预期活动以实现目标区域的活动分布。我们将该任务建模为一个最优控制问题,其中PDE模型被表示为双线性控制系统,机器人覆盖活动率和速度场定义为控制输入。我们通过在两个环境中进行联合映射与覆盖场景的模拟来验证我们的方法。

英文摘要

This paper presents a novel partial differential equation (PDE)-based framework for controlling an ensemble of robots, which have limited sensing and actuation capabilities and exhibit stochastic behaviors, to perform mapping and coverage tasks. We model the ensemble population dynamics as an advection-diffusion-reaction PDE model and formulate the mapping and coverage tasks as identification and control problems for this model. In the mapping task, robots are deployed over a closed domain to gather data, which is unlocalized and independent of robot identities, for reconstructing the unknown spatial distribution of a region of interest. We frame this task as a convex optimization problem whose solution represents the region as a spatially-dependent coefficient in the PDE model. We then consider a coverage problem in which the robots must perform a desired activity at a programmable probability rate to achieve a target spatial distribution of activity over the reconstructed region of interest. We formulate this task as an optimal control problem in which the PDE model is expressed as a bilinear control system, with the robots' coverage activity rate and velocity field defined as the control inputs. We validate our approach with simulations of a combined mapping and coverage scenario in two environments with three target coverage distributions.

1711.07300 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Optimization-Based Autonomous Racing of 1:43 Scale RC Cars

基于优化的1:43比例遥控赛车自主赛车

Alexander Liniger, Alexander Domahidi, Manfred Morari

AI总结 本文基于数学优化实现遥控赛车自主赛车,通过动态模型和递推控制策略,结合路径规划和非线性模型预测控制,解决赛道跟踪与对手避让问题,实验验证了优化方法在实时性和性能上的有效性。

详情
Journal ref
Optimal Control Applications and Methods, 36(5), 2015, pp.628-647
AI中文摘要

本文描述了基于数学优化的遥控赛车自主赛车方法。利用车辆动态模型,通过递推控制计算控制输入,目标是最大化赛道进展,同时确保车辆在赛道上并避开对手。提出了两种不同的控制方法:第一种采用两级结构,包括路径规划器和非线性模型预测控制器(NMPC)用于跟踪;第二种将两者结合为一个非线性优化问题(NLP),借鉴轮廓控制思想。通过线性化获得线性时变模型,构建局部控制NLP的凸二次规划(QP)形式。所得到的QP具有典型的MPC结构,可通过最新结构利用求解器在毫秒级内解决,是整体控制方案实时可行性的关键。通过动态规划的高阶走廊规划器实现障碍物避让,根据对手当前位置和赛道布局生成凸约束。通过实验使用1:43比例遥控赛车,在超过3 m/s的速度和饱和后轮力矩(漂移)操作区域进行测试。算法在嵌入式计算平台上以50 Hz采样率运行,验证了基于优化的自主赛车方法在实时性和性能上的有效性。

英文摘要

This paper describes autonomous racing of RC race cars based on mathematical optimization. Using a dynamical model of the vehicle, control inputs are computed by receding horizon based controllers, where the objective is to maximize progress on the track subject to the requirement of staying on the track and avoiding opponents. Two different control formulations are presented. The first controller employs a two-level structure, consisting of a path planner and a nonlinear model predictive controller (NMPC) for tracking. The second controller combines both tasks in one nonlinear optimization problem (NLP) following the ideas of contouring control. Linear time varying models obtained by linearization are used to build local approximations of the control NLPs in the form of convex quadratic programs (QPs) at each sampling time. The resulting QPs have a typical MPC structure and can be solved in the range of milliseconds by recent structure exploiting solvers, which is key to the real-time feasibility of the overall control scheme. Obstacle avoidance is incorporated by means of a high-level corridor planner based on dynamic programming, which generates convex constraints for the controllers according to the current position of opponents and the track layout. The control performance is investigated experimentally using 1:43 scale RC race cars, driven at speeds of more than 3 m/s and in operating regions with saturated rear tire forces (drifting). The algorithms run at 50 Hz sampling rate on embedded computing platforms, demonstrating the real-time feasibility and high performance of optimization-based approaches for autonomous racing.

1702.05116 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Collective Motion under Beacon-referenced Cyclic Pursuit

基于信标参考的循环追捕运动

Kevin S. Galloway, Biswadip Dey

AI总结 本文提出基于信标参考的循环追捕法则,通过引入静止信标作为额外参考,研究集体机器人在循环框架下的环形运动均衡及稳定性,推导了参数选择的必要条件,并探讨了保持集体纯形状的不变流形。

详情
AI中文摘要

循环追捕框架基于环形图中邻近代理之间的追捕交互,为自主机器人集体产生有用全局行为提供了有效方法。先前工作考虑了具有恒定方位(CB)追捕法则的循环追捕,并展示了相应动力学的环形均衡存在性。本文提出一种基于信标参考的CB追捕法则,其中静止信标为集体中的个体代理提供额外参考。在循环框架中实施时,我们显示所得到的动力学允许对应于围绕信标环形轨道的相对均衡,环形半径和代理沿轨道的分布由所提追捕法则的参数决定。我们还推导了环形均衡稳定性的必要条件,为参数选择提供了指导。最后,通过引入变量变换,我们证明了与围绕信标螺旋运动相关的不变流形家族的存在,这些流形保持集体的“纯形状”,并研究了代表性流形上的简化动力学。

英文摘要

Cyclic pursuit frameworks, which are built upon pursuit interactions between neighboring agents in a cycle graph, provide an efficient way to create useful global behaviors in a collective of autonomous robots. Previous work had considered cyclic pursuit with a constant bearing (CB) pursuit law, and demonstrated the existence of circling equilibria for the corresponding dynamics. In this work, we propose a beacon-referenced version of the CB pursuit law, wherein a stationary beacon provides an additional reference for the individual agents in a collective. When implemented in a cyclic framework, we show that the resulting dynamics admit relative equilibria corresponding to a circling orbit around the beacon, with the circling radius and the distribution of agents along the orbit determined by parameters of the proposed pursuit law. We also derive necessary conditions for stability of the circling equilibria, which provides a guide for parameter selection. Finally, by introducing a change of variables, we demonstrate the existence of a family of invariant manifolds related to spiraling motions around the beacon which preserve the "pure shape" of the collective, and study the reduced dynamics on a representative manifold.

1711.04683 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Tensor Decompositions for Modeling Inverse Dynamics

张量分解用于逆动力学建模

Stephan Baier, Volker Tresp

AI总结 本文提出利用张量分解方法建模逆动力学,通过处理位置、速度和加速度的三重交互,实现对高非线性函数的近似,并在SARCOS机械臂数据集上验证了其优越性。

详情
AI中文摘要

建模逆动力学对于精确的前馈机器人控制至关重要。该模型计算所需的关节扭矩,以执行预期的运动。高度非线性的动态系统逆函数可以通过回归技术近似。我们提出了一种回归方法,即利用位置x速度x加速度的三重交互的张量分解模型。大多数张量分解工作都解决了密集张量的分解问题。本文在稀疏张量的分解基础上进行扩展,仅包含少量非零条目。稀疏张量的分解已成功应用于关系学习,例如大规模知识图谱的建模。最近,该方法已扩展到多类分类问题,涉及离散输入变量。在高维稀疏张量中表示数据可以近似复杂的高非线性函数。本文展示了稀疏张量分解如何应用于回归问题。此外,我们通过学习从连续输入到张量分解的潜在表示的映射,利用基函数将方法扩展到连续输入。我们在具有七自由度SARCOS机械臂轨迹的数据集上评估了所提出的模型。实验结果表明,所提出的功能张量模型相比挑战性的最新方法具有优越的性能。

英文摘要

Modeling inverse dynamics is crucial for accurate feedforward robot control. The model computes the necessary joint torques, to perform a desired movement. The highly non-linear inverse function of the dynamical system can be approximated using regression techniques. We propose as regression method a tensor decomposition model that exploits the inherent three-way interaction of positions x velocities x accelerations. Most work in tensor factorization has addressed the decomposition of dense tensors. In this paper, we build upon the decomposition of sparse tensors, with only small amounts of nonzero entries. The decomposition of sparse tensors has successfully been used in relational learning, e.g., the modeling of large knowledge graphs. Recently, the approach has been extended to multi-class classification with discrete input variables. Representing the data in high dimensional sparse tensors enables the approximation of complex highly non-linear functions. In this paper we show how the decomposition of sparse tensors can be applied to regression problems. Furthermore, we extend the method to continuous inputs, by learning a mapping from the continuous inputs to the latent representations of the tensor decomposition, using basis functions. We evaluate our proposed model on a dataset with trajectories from a seven degrees of freedom SARCOS robot arm. Our experimental results show superior performance of the proposed functional tensor model, compared to challenging state-of-the art methods.

1705.01426 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Nonlinear Model Predictive Control Scheme for Cooperative Manipulation with Singularity and Collision Avoidance

一种用于协同操作的非线性模型预测控制方案,包含奇异性和避障

Alexandros Nikou, Christos Verginis, Shahab Heshmati-alamdari, Dimos V. Dimarogonas

AI总结 本文提出了一种非线性模型预测控制方案,用于在存在障碍物的有限工作空间中安全导航被N个机器人代理抓取的物体,同时避免碰撞和奇异配置。

Comments Simulation results with 3 agents added

详情
AI中文摘要

本文解决了由N个机器人代理共同运输一个刚性抓取物体的问题。特别地,我们提出了一种非线性模型预测控制(NMPC)方案,该方案保证在有障碍物的有限工作空间中将物体导航到期望姿态,同时满足代理的输入饱和条件。此外,所提出的方法确保代理之间及与工作空间障碍物之间不发生碰撞,且不进入奇异配置。NMPC的可行性和收敛性分析被明确提供。最后,仿真结果展示了所提方法的有效性和效率。

英文摘要

This paper addresses the problem of cooperative transportation of an object rigidly grasped by $N$ robotic agents. In particular, we propose a Nonlinear Model Predictive Control (NMPC) scheme that guarantees the navigation of the object to a desired pose in a bounded workspace with obstacles, while complying with certain input saturations of the agents. Moreover, the proposed methodology ensures that the agents do not collide with each other or with the workspace obstacles as well as that they do not pass through singular configurations. The feasibility and convergence analysis of the NMPC are explicitly provided. Finally, simulation results illustrate the validity and efficiency of the proposed method.

1602.06667 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

A Motion Planning Strategy for the Active Vision-Based Mapping of Ground-Level Structures

一种用于主动视觉建图的地面结构运动规划策略

Manikandasriram Srinivasan Ramanagopal, André Phu-Van Nguyen, Jerome Le Ny

AI总结 本文提出了一种指导配备摄像头或深度传感器的地面机器人自主建图有限三维结构可见部分的策略,通过运动规划算法确定合适视角并自动填补点云中的空洞,适用于建筑、施工和检测领域。

Comments Accepted for publication in IEEE Transactions on Automation Science and Engineering. Available in IEEE Xplore at http://ieeexplore.ieee.org/document/8093664

详情
AI中文摘要

本文提出了一种策略,用于指导配备摄像头或深度传感器的地面机器人,以自主建图有限三维结构的可见部分。我们描述了确定合适连续视角的运动规划算法,并尝试自动填补由感知和感知层产生的点云中的空洞。重点是准确重建中等大小结构的3D模型,而非映射大型开放环境。所提出的算法不需要以网格模型或包围盒形式的初始化,生成的路径适用于视觉传感器同时用于建图和机器人局部化的情况,特别是在没有额外绝对定位系统时。我们分析了我们的策略的覆盖性质,并将其性能与经典前沿探索算法进行比较。我们展示了其在不同结构大小、局部化精度水平和深度传感器范围下的有效性,并在真实世界实验中验证了我们的设计。

英文摘要

This paper presents a strategy to guide a mobile ground robot equipped with a camera or depth sensor, in order to autonomously map the visible part of a bounded three-dimensional structure. We describe motion planning algorithms that determine appropriate successive viewpoints and attempt to fill holes automatically in a point cloud produced by the sensing and perception layer. The emphasis is on accurately reconstructing a 3D model of a structure of moderate size rather than mapping large open environments, with applications for example in architecture, construction and inspection. The proposed algorithms do not require any initialization in the form of a mesh model or a bounding box, and the paths generated are well adapted to situations where the vision sensor is used simultaneously for mapping and for localizing the robot, in the absence of additional absolute positioning system. We analyze the coverage properties of our policy, and compare its performance to the classic frontier based exploration algorithm. We illustrate its efficacy for different structure sizes, levels of localization accuracy and range of the depth sensor, and validate our design on a real-world experiment.

1711.03906 2026-06-04 cs.LG cs.DC cs.NI cs.RO cs.SY eess.SY 版本更新

D-SLATS: Distributed Simultaneous Localization and Time Synchronization

D-SLATS:分布式的同时定位与时间同步

Amr Alanwar, Henrique Ferraz, Kevin Hsieh, Rohit Thazhath, Paul Martin, Joao Hespanha, Mani Srivastava

AI总结 本文提出D-SLATS框架,通过分布式扩展卡尔曼滤波和优化技术联合解决时间同步与定位问题,实现3微秒精度和30厘米误差。

详情
AI中文摘要

通过过去十年,我们见证了物联网(IoT)设备数量的激增,随之而来的是一次对时间和空间上协同行动的更大需求。尽管时间同步和定位这两个问题在许多方面有共同点,但传统上它们被分别处理或在集中式方法中结合,导致资源利用效率低下,或在设备数量方面不可扩展的解决方案。因此,我们提出D-SLATS,一个由三种不同且独立算法组成的框架,以分布式方式联合解决时间和定位问题。前两个算法主要基于分布式扩展卡尔曼滤波(EKF),而第三个算法使用优化技术。不需要融合中心,设备仅与邻居通信。所提出的方法在定制的超宽带通信测试平台和四旋翼无人机上进行了评估,代表了静态和移动节点的网络。我们的算法实现了高达三微秒的时间同步精度和30厘米的定位误差。

英文摘要

Through the last decade, we have witnessed a surge of Internet of Things (IoT) devices, and with that a greater need to choreograph their actions across both time and space. Although these two problems, namely time synchronization and localization, share many aspects in common, they are traditionally treated separately or combined on centralized approaches that results in an ineffcient use of resources, or in solutions that are not scalable in terms of the number of IoT devices. Therefore, we propose D-SLATS, a framework comprised of three different and independent algorithms to jointly solve time synchronization and localization problems in a distributed fashion. The First two algorithms are based mainly on the distributed Extended Kalman Filter (EKF) whereas the third one uses optimization techniques. No fusion center is required, and the devices only communicate with their neighbors. The proposed methods are evaluated on custom Ultra-Wideband communication Testbed and a quadrotor, representing a network of both static and mobile nodes. Our algorithms achieve up to three microseconds time synchronization accuracy and 30 cm localization error.

1711.03819 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Cooperative control of multi-agent systems to locate source of an odor

多智能体系统协作控制以定位气味源

Abhinav Sinha, Rishemjit Kaur, Ritesh Kumar, Amol P. Bhondekar

AI总结 本文提出分层协作控制方法,通过多智能体系统定位气味源,结合粒子群算法和滑模控制器实现决策与控制层的协同,验证了分层分布式控制的有效性。

Comments 8 pages, initial results on our work

详情
AI中文摘要

本文提出了一种分层协作控制方法,通过多智能体系统定位气味源。通过粒子群算法和滑模控制器实现决策与控制层的协同,验证了分层分布式控制的有效性。

英文摘要

This work targets the problem of odor source localization by multi-agent systems. A hierarchical cooperative control has been put forward to solve the problem of locating source of an odor by driving the agents in consensus when at least one agent obtains information about location of the source. Synthesis of the proposed controller has been carried out in a hierarchical manner of group decision making, path planning and control. Decision making utilizes information of the agents using conventional Particle Swarm Algorithm and information of the movement of filaments to predict the location of the odor source. The predicted source location in the decision level is then utilized to map a trajectory and pass that information to the control level. The distributed control layer uses sliding mode controllers known for their inherent robustness and the ability to reject matched disturbances completely. Two cases of movement of agents towards the source, i.e., under consensus and formation have been discussed herein. Finally, numerical simulations demonstrate the efficacy of the proposed hierarchical distributed control.

1711.03605 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Stability and Transparency Analysis of a Bilateral Teleoperation in Presence of Data Loss

双边远程操作中数据丢失情况下稳定性与透明度分析

A. Bakhshi, H. A. Talebi, A. A. Suratgar, M. Abdeetedal

AI总结 本文提出一种新的方法,用于分析通信介质中数据丢失情况下双边远程操作的稳定性与透明度。通过周期连续脉冲的集合及其有限级数表示提出新的数据丢失模型,并利用波变量方法证明了系统整体的被动性。

详情
AI中文摘要

本文提出了一种新的方法,用于双边远程操作在通信介质中存在数据丢失情况下的稳定性与透明度分析。本文提出了一种基于一组周期连续脉冲的集合及其有限级数表示的新数据丢失模型。利用波变量方法,包括新定义的数据丢失模型,证明了整体系统的被动性。给出了仿真结果以展示所提方法的有效性。

英文摘要

This paper presents a novel approach for stability and transparency analysis for bilateral teleoperation in the presence of data loss in communication media. A new model for data loss is proposed based on a set of periodic continuous pulses and its finite series representation. The passivity of the overall system is shown using wave variable approach including the newly defined model for data loss. Simulation results are presented to show the effectiveness of the proposed approach.

1708.01930 2026-06-04 cs.AI cs.MA cs.RO cs.SY eess.SY 版本更新

Enhanced Emotion Enabled Cognitive Agent Based Rear End Collision Avoidance Controller for Autonomous Vehicles

增强型情感驱动认知代理基于后方碰撞避免控制器用于自动驾驶车辆

Faisal Riaz, Muaz A. Niazi

AI总结 本文提出一种基于增强型情感驱动认知代理的后方碰撞避免控制器,通过引入恐惧情绪生成机制,提高自动驾驶车辆的碰撞避免效率和规则数量。

Comments 39 pages, 17 figures

详情
AI中文摘要

后方碰撞是自然中最致命的事故,导致大多数交通伤亡和伤害。现有研究提出了许多后方碰撞避免解决方案,但这些方案高度依赖精确的数学模型。然而,实际道路驾驶受非线性因素如路面状况、驾驶员反应时间、行人流量和车辆动力学影响,因此获得车辆控制系统精确数学模型具有挑战性。这个问题通过模糊逻辑解决了,但过多的模糊规则直接影响其效率。此外,这些基于模糊逻辑的控制器未使用适当的代理建模来模拟人工驾驶员执行这些模糊规则的功能。鉴于这些限制,我们提出了一种增强型情感驱动认知代理(EEEC_Agent)控制器,帮助自动驾驶车辆(AVs)以较少的规则进行后方碰撞避免,设计基于恐惧情绪,并具有高效率。为了在EEEC_Agent中引入恐惧情绪生成机制,采用了Orton, Clore & Collins(OCC)模型。EEEC_Agent的恐惧生成机制通过NetLogo模拟验证。此外,通过特别设计的原型AV平台对EEEC_Agent的功能进行了实际验证。最终,与现有最先进研究的定性比较研究表明,所提出的模型优于近期研究。

英文摘要

Rear end collisions are deadliest in nature and cause most of traffic casualties and injuries. In the existing research, many rear end collision avoidance solutions have been proposed. However, the problem with these proposed solutions is that they are highly dependent on precise mathematical models. Whereas, the real road driving is influenced by non-linear factors such as road surface situations, driver reaction time, pedestrian flow and vehicle dynamics, hence obtaining the accurate mathematical model of the vehicle control system is challenging. This problem with precise control based rear end collision avoidance schemes has been addressed using fuzzy logic, but the excessive number of fuzzy rules straightforwardly prejudice their efficiency. Furthermore, these fuzzy logic based controllers have been proposed without using proper agent based modeling that helps in mimicking the functions of an artificial human driver executing these fuzzy rules. Keeping in view these limitations, we have proposed an Enhanced Emotion Enabled Cognitive Agent (EEEC_Agent) based controller that helps the Autonomous Vehicles (AVs) to perform rear end collision avoidance with less number of rules, designed after fear emotion, and high efficiency. To introduce a fear emotion generation mechanism in EEEC_Agent, Orton, Clore & Collins (OCC) model has been employed. The fear generation mechanism of EEEC_Agent has been verified using NetLogo simulation. Furthermore, practical validation of EEEC_Agent functions has been performed using specially built prototype AV platform. Eventually, the qualitative comparative study with existing state of the art research works reflect that proposed model outperforms recent research.

1710.02555 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Model Predictive Path-Following for Constrained Differentially Flat Systems

基于约束微分平坦系统的模型预测路径跟踪

Melissa Greeff, Angela P. Schoellig

AI总结 本文提出一种结合前馈线性化与基于路径的模型预测控制的新型预测路径跟踪方法,通过微分平坦性将非线性问题转化为凸优化问题,并通过动态路径参考提高系统鲁棒性,实验验证了在四旋翼上优于传统轨迹跟踪控制器的性能。

Comments 8 pages, submitted to ICRA 2018

详情
AI中文摘要

对于许多任务,预测路径跟踪控制可以通过优先考虑接近路径而非沿路径的时间进展,并提前考虑路径变化来显著提高自主机器人性能和鲁棒性。本文提出了一种新颖的预测路径跟踪方法,将前馈线性化与基于路径的模型预测控制相结合。我们的方法有几个关键优势。通过利用微分平坦性,我们将基于路径的模型预测控制问题从非线性问题转化为凸优化问题。通过动态路径参考,可以实现对干扰的鲁棒性,该参考根据机器人进展调整其速度。我们还考虑了关键系统约束。我们在四旋翼上进行了实验,展示了在正常条件、初始偏移和风扰情况下,相比传统轨迹跟踪控制器,保持四旋翼更接近期望路径的改进性能。

英文摘要

For many tasks, predictive path-following control can significantly improve the performance and robustness of autonomous robots over traditional trajectory tracking control. It does this by prioritizing closeness to the path over timed progress along the path and by looking ahead to account for changes in the path. We propose a novel predictive path-following approach that couples feedforward linearization with path-based model predictive control. Our approach has a few key advantages. By utilizing the differential flatness property, we reduce the path-based model predictive control problem from a nonlinear to a convex optimization problem. Robustness to disturbances is achieved by a dynamic path reference, which adjusts its speed based on the robot's progress. We also account for key system constraints. We demonstrate these advantages in experiment on a quadrotor. We show improved performance over a baseline trajectory tracking controller by keeping the quadrotor closer to the desired path under nominal conditions, with an initial offset and under a wind disturbance.

1710.11040 2026-06-04 cs.RO cs.AI cs.SY eess.SY math.OC 版本更新

How Should a Robot Assess Risk? Towards an Axiomatic Theory of Risk in Robotics

机器人应如何评估风险?迈向机器人学中的风险轴理论

Anirudha Majumdar, Marco Pavone

AI总结 本文探讨了机器人风险评估的理论基础,提出风险度量应满足的公理,讨论了风险度量的表示定理及其在机器人应用中的实例,并分析了常用风险度量的局限性。

Comments Extended version of paper published in International Symposium on Robotics Research (ISRR) 2017

详情
AI中文摘要

赋予机器人评估风险和做出风险感知决策的能力被视为确保在不确定环境下运作的机器人安全的关键步骤。但,机器人应如何量化风险?一种自然且常见的方法是考虑一种框架,即随机结果被赋予成本——这种分配由一个成本随机变量捕捉。量化风险则对应于评估风险度量,即从成本随机变量到实数的映射。然而,什么是构成

英文摘要

Endowing robots with the capability of assessing risk and making risk-aware decisions is widely considered a key step toward ensuring safety for robots operating under uncertainty. But, how should a robot quantify risk? A natural and common approach is to consider the framework whereby costs are assigned to stochastic outcomes - an assignment captured by a cost random variable. Quantifying risk then corresponds to evaluating a risk metric, i.e., a mapping from the cost random variable to a real number. Yet, the question of what constitutes a "good" risk metric has received little attention within the robotics community. The goal of this paper is to explore and partially address this question by advocating axioms that risk metrics in robotics applications should satisfy in order to be employed as rational assessments of risk. We discuss general representation theorems that precisely characterize the class of metrics that satisfy these axioms (referred to as distortion risk metrics), and provide instantiations that can be used in applications. We further discuss pitfalls of commonly used risk metrics in robotics, and discuss additional properties that one must consider in sequential decision making tasks. Our hope is that the ideas presented here will lead to a foundational framework for quantifying risk (and hence safety) in robotics applications.

1709.03153 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY 版本更新

MBMF: Model-Based Priors for Model-Free Reinforcement Learning

MBMF:基于模型的先验用于无模型强化学习

Somil Bansal, Roberto Calandra, Kurtland Chua, Sergey Levine, Claire Tomlin

AI总结 本文提出一种结合模型与无模型强化学习的方法,通过学习概率动力学模型作为先验,提升数据效率和成本效益。

Comments After we submitted the paper for consideration in CoRL 2017 we found a paper published in the recent past with a similar method (see related work for a discussion). Considering the similarities between the two papers, we have decided to retract our paper from CoRL 2017

详情
AI中文摘要

强化学习主要分为无模型和有模型两种范式。每种范式都有其优势和局限性,并已成功应用于适合其相应优势的真实世界领域。本文提出一种新方法,旨在弥合这两种范式的差距。我们通过学习概率动力学模型,并将其作为交织的无模型优化的先验,结合两种范式的优点,从而实现数据高效和成本节约。结果表明,我们的方法在性能上优于纯有模型和纯无模型方法,以及简单切换范式的方法。

英文摘要

Reinforcement Learning is divided in two main paradigms: model-free and model-based. Each of these two paradigms has strengths and limitations, and has been successfully applied to real world domains that are appropriate to its corresponding strengths. In this paper, we present a new approach aimed at bridging the gap between these two paradigms. We aim to take the best of the two paradigms and combine them in an approach that is at the same time data-efficient and cost-savvy. We do so by learning a probabilistic dynamics model and leveraging it as a prior for the intertwined model-free optimization. As a result, our approach can exploit the generality and structure of the dynamics model, but is also capable of ignoring its inevitable inaccuracies, by directly incorporating the evidence provided by the direct observation of the cost. Preliminary results demonstrate that our approach outperforms purely model-based and model-free approaches, as well as the approach of simply switching from a model-based to a model-free setting.

1709.04889 2026-06-04 math.OC cs.LG cs.RO cs.SY eess.SY 版本更新

Control-Oriented Learning on the Fly

实时控制导向学习

Melkior Ornik, Arie Israel, Ufuk Topcu

AI总结 本文提出一种实时控制导向学习方法,用于在系统动力学几乎未知的情况下实现控制目标,通过小扰动学习局部动态并保证近似最优方向,验证了其在受损飞机避撞和Van der Pol振荡器中的有效性。

Comments Extended version of M. Ornik, A. Israel, U. Topcu, "Myopic Control of Systems with Unknown Dynamics". Detailed list of differences from that paper given within the manuscript. Changes in v2 include a discussion of myopic control in an LTL context and a correction of the bound for suboptimality of the algorithm

详情
AI中文摘要

本文聚焦于开发一种策略,用于控制其动力学几乎完全未知的系统。这种情况自然出现在系统经历关键故障的场景中。在这种情况下,保留满足基本控制目标的能力以避免即将来临的灾难至关重要。一个典型的此类目标是可达避障问题,其中系统需要在受限的状态空间中移动到某个状态。为了应对对系统动力学知识的限制,我们开发了一种贪心控制理论。贪心控制的主要目标是在任何给定时间,仅根据到目前为止获得的系统信息,优化系统的轨迹方向。我们提出了一种算法,利用小扰动的控制努力来学习局部动态,同时确保系统朝着看似近似最优的方向移动,并为其次优性能提供硬性界限。我们还验证了该算法在受损飞机避撞模拟以及Van der Pol振荡器示例中的有效性。

英文摘要

This paper focuses on developing a strategy for control of systems whose dynamics are almost entirely unknown. This situation arises naturally in a scenario where a system undergoes a critical failure. In that case, it is imperative to retain the ability to satisfy basic control objectives in order to avert an imminent catastrophe. A prime example of such an objective is the reach-avoid problem, where a system needs to move to a certain state in a constrained state space. To deal with limitations on our knowledge of system dynamics, we develop a theory of myopic control. The primary goal of myopic control is to, at any given time, optimize the current direction of the system trajectory, given solely the information obtained about the system until that time. We propose an algorithm that uses small perturbations in the control effort to learn local dynamics while simultaneously ensuring that the system moves in a direction that appears to be nearly optimal, and provide hard bounds for its suboptimality. We additionally verify the usefulness of the algorithm on a simulation of a damaged aircraft seeking to avoid a crash, as well as on an example of a Van der Pol oscillator.

1710.04052 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robotic Control for Cognitive UWB Radar

具备认知能力的超宽频雷达机器人控制

Stefan Brüggenwirth, Fernando Rial

AI总结 本文提出了一种用于6自由度机械臂的轨迹规划方法,该机械臂携带合成孔径超宽频雷达传感器,通过优化问题实现碰撞-free的末端执行器轨迹,以最小化观测时间,适用于三维重建和穿墙雷达扫描及IED检查。

Comments 4 pages, 9 figures, submitted to IEEE IRC 2018

详情
AI中文摘要

本文描述了一种用于携带合成孔径超宽频雷达传感器的6自由度机械臂的轨迹规划问题。分辨率取决于传感器头部的轨迹和速度剖面。约束可以建模为优化问题,以获得在笛卡尔坐标中可行且无碰撞的末端执行器目标轨迹,该轨迹最小化观测时间。对于三维重建,目标在多个高度切片中被观测。对于穿墙雷达,传感器可以以滑模方式扫描更大区域。对于IED检查,首选点光源模式,持续将天线指向目标以获得最大方位分辨率。

英文摘要

In the article, we describe a trajectory planning problem for a 6-DOF robotic manipulator arm that carries an ultra-wideband (UWB) radar sensor with synthetic aperture (SAR). The resolution depends on the trajectory and velocity profile of the sensor head. The constraints can be modelled as an optimization problem to obtain a feasible, collision-free target trajectory of the end-effector of the manipulator arm in Cartesian coordinates that minimizes observation time. For 3D-reconstruction, the target is observed in multiple height slices. For Through-the-Wall radar the sensor can be operated in sliding mode for scanning larger areas. For IED inspection the spot-light mode is preferred, constantly pointing the antennas towards the target to obtain maximum azimuth resolution.

1502.02860 2026-06-04 stat.ML cs.LG cs.RO cs.SY eess.SY 版本更新

Gaussian Processes for Data-Efficient Learning in Robotics and Control

高斯过程在机器人和控制中的数据高效学习

Marc Peter Deisenroth, Dieter Fox, Carl Edward Rasmussen

AI总结 本文提出基于高斯过程的非参数转移模型,通过提取更多数据信息加速学习,减少模型误差影响,实现高效自主学习。

Comments 20 pages, 29 figures; fixed a typo in equation on page 8

详情
Journal ref
IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, issue no 2, pages 408-423, February 2015
AI中文摘要

自主学习在控制和机器人领域已持续十多年,数据驱动学习可减少工程知识需求。然而,自主强化学习通常需要大量系统交互,这在实际系统中(如机器人)不现实。本文提出通过高斯过程转移模型提取更多数据信息,显式纳入模型不确定性以减少误差影响,相比现有RL方法,模型基于策略搜索方法实现了前所未有的学习速度,并在真实机器人和控制任务中展示了应用价值。

英文摘要

Autonomous learning has been a promising direction in control and robotics for more than a decade since data-driven learning allows to reduce the amount of engineering knowledge, which is otherwise required. However, autonomous reinforcement learning (RL) approaches typically require many interactions with the system to learn controllers, which is a practical limitation in real systems, such as robots, where many interactions can be impractical and time consuming. To address this problem, current learning approaches typically require task-specific knowledge in form of expert demonstrations, realistic simulators, pre-shaped policies, or specific knowledge about the underlying dynamics. In this article, we follow a different approach and speed up learning by extracting more information from data. In particular, we learn a probabilistic, non-parametric Gaussian process transition model of the system. By explicitly incorporating model uncertainty into long-term planning and controller learning our approach reduces the effects of model errors, a key problem in model-based learning. Compared to state-of-the art RL our model-based policy search method achieves an unprecedented speed of learning. We demonstrate its applicability to autonomous learning in real robot and control tasks.

1508.00952 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Graphical Newton

图牛顿法

Akshay Srinivasan, Emanuel Todorov

AI总结 本文研究如何通过利用函数计算结构减少牛顿步计算复杂度,提出基于计算图线性规模和树宽立方的高效方法。

详情
AI中文摘要

计算通用函数$f: \mathbb{R}^N \rightarrow \mathbb{R}$的牛顿步需要$O(N^{3})$次浮点运算。本文探讨在已知函数计算结构的情况下减少此界限的途径。证明在计算图的线性规模和树宽立方时间内可计算牛顿步。

英文摘要

Computing the Newton step for a generic function $f: \mathbb{R}^N \rightarrow \mathbb{R}$ takes $O(N^{3})$ flops. In this paper, we explore avenues for reducing this bound, when the computational structure of $f$ is known beforehand. It is shown that the Newton step can be computed in time, linear in the size of the computational-graph, and cubic in its tree-width.

1710.00489 2026-06-04 cs.RO cs.AI cs.CV cs.NE cs.SY eess.SY 版本更新

SE3-Pose-Nets: Structured Deep Dynamics Models for Visuomotor Planning and Control

SE3-姿态网络:用于视觉-运动规划和控制的结构深度动力学模型

Arunkumar Byravan, Felix Leeb, Franziska Meier, Dieter Fox

AI总结 本文提出了一种基于结构深度动力学模型的深度视觉-运动控制方法,通过编码器-解码器结构学习低维姿态嵌入,实现场景分割和姿态预测,并在现实世界中实现了闭环控制。

Comments 8 pages, Initial submission to IEEE International Conference on Robotics and Automation (ICRA) 2018

详情
AI中文摘要

本文提出了一种基于结构深度动力学模型的深度视觉-运动控制方法。我们的深度动力学模型是一种SE3-Nets的变体,通过编码器-解码器结构学习低维姿态嵌入用于视觉-运动控制。与以往工作不同,我们的动力学模型是结构化的:给定一个输入场景,我们的网络明确学习分割显著部分并预测其姿态嵌入以及其运动作为姿态空间中的变化。我们通过一对相隔动作的点云训练我们的模型,并展示在仅提供帧间点对数据关联的监督下,我们的网络能够学习有意义的场景分割以及一致的姿态。我们进一步展示我们的模型可以直接在学习的低维姿态空间中用于闭环控制,其中动作通过最小化姿态空间中的误差使用基于梯度的方法计算,类似于传统模型驱动控制。我们展示了在模拟和现实世界中控制Baxter机器人从原始深度数据的结果,并与两种基线深度网络进行了比较。我们的方法在实时运行,实现了良好的场景动态预测,并在多个控制运行中优于基线方法。视频结果可在:https://rse-lab.cs.washington.edu/se3-structured-deep-ctrl/

英文摘要

In this work, we present an approach to deep visuomotor control using structured deep dynamics models. Our deep dynamics model, a variant of SE3-Nets, learns a low-dimensional pose embedding for visuomotor control via an encoder-decoder structure. Unlike prior work, our dynamics model is structured: given an input scene, our network explicitly learns to segment salient parts and predict their pose-embedding along with their motion modeled as a change in the pose space due to the applied actions. We train our model using a pair of point clouds separated by an action and show that given supervision only in the form of point-wise data associations between the frames our network is able to learn a meaningful segmentation of the scene along with consistent poses. We further show that our model can be used for closed-loop control directly in the learned low-dimensional pose space, where the actions are computed by minimizing error in the pose space using gradient-based methods, similar to traditional model-based control. We present results on controlling a Baxter robot from raw depth data in simulation and in the real world and compare against two baseline deep networks. Our method runs in real-time, achieves good prediction of scene dynamics and outperforms the baseline methods on multiple control runs. Video results can be found at: https://rse-lab.cs.washington.edu/se3-structured-deep-ctrl/

1709.10237 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Beacon-referenced Mutual Pursuit in Three Dimensions

三维环境中基于信标的目标互追策略

Kevin S. Galloway, Biswadip Dey

AI总结 本文提出一种三维环境中基于信标的互追控制律,通过引入额外项关注信标,分析了双体互追系统的闭环动态行为,证明在特定参数条件下,系统可形成具有共同半径的圆轨道稳定状态。

详情
AI中文摘要

受各种无人系统维持位置应用的启发,本文介绍了一种用于近固定信标工作的双体agent的转向控制律。该反馈律是对先前研究的三维恒向(CB)追捕律的修改,加入了额外项以关注信标。我们研究了双体互追系统闭环动态行为,其中每个agent使用针对另一个agent和静止信标的目标参考CB追捕律。在某些控制参数假设下,证明该问题存在环形平衡状态,其中agent在垂直于共同轴线并通过信标的平面内沿具有共同半径的圆轨道运动。由于共同半径和距离信标距离由反馈律中的参数选择决定,这种方法提供了一种在三维环境中设计所需编队的方法。

英文摘要

Motivated by station-keeping applications in various unmanned settings, this paper introduces a steering control law for a pair of agents operating in the vicinity of a fixed beacon in a three-dimensional environment. This feedback law is a modification of the previously studied three-dimensional constant bearing (CB) pursuit law, in the sense that it incorporates an additional term to allocate attention to the beacon. We investigate the behavior of the closed-loop dynamics for a two agent mutual pursuit system in which each agent employs the beacon-referenced CB pursuit law with regards to the other agent and a stationary beacon. Under certain assumptions on the associated control parameters, we demonstrate that this problem admits circling equilibria wherein the agents move on circular orbits with a common radius, in planes perpendicular to a common axis passing through the beacon. As the common radius and distances from the beacon are determined by choice of parameters in the feedback law, this approach provides a means to engineer desired formations in a three-dimensional setting.

1709.07610 2026-06-04 cs.CG cs.CC cs.RO cs.SY eess.SY 版本更新

Efficient Nearest-Neighbor Search for Dynamical Systems with Nonholonomic Constraints

动态系统非holonomic约束下的高效最近邻搜索

Valerio Varricchio, Brian Paden, Dmitry Yershov, Emilio Frazzoli

AI总结 本文研究了非holonomic约束动态系统中最近邻搜索的复杂性,发现传统k-d树在子里曼度量下查询复杂度为Θ(N^p log N),提出针对性改进策略提升效率。

Comments 16 pages, 3 figures, the 12th Workshop on the Algorithmic Foundations of Robotics (WAFR) 2016

详情
AI中文摘要

最近邻搜索主导采样基运动规划算法的渐近复杂度,通常使用k-d树数据结构处理。本文发现当使用经典k-d树方法与子里曼度量时,查询复杂度实际上为Θ(N^p log N),其中p由系统非holonomy程度决定。这些度量自然出现在非holonomic机械系统中,包括经典轮式机器人模型。为解决此负结果,本文提出针对子里曼度量的k-d树构建和查询策略,显著提升最近邻搜索查询时间。

英文摘要

Nearest-neighbor search dominates the asymptotic complexity of sampling-based motion planning algorithms and is often addressed with k-d tree data structures. While it is generally believed that the expected complexity of nearest-neighbor queries is $O(log(N))$ in the size of the tree, this paper reveals that when a classic k-d tree approach is used with sub-Riemannian metrics, the expected query complexity is in fact $Θ(N^p \log(N))$ for a number $p \in [0, 1)$ determined by the degree of nonholonomy of the system. These metrics arise naturally in nonholonomic mechanical systems, including classic wheeled robot models. To address this negative result, we propose novel k-d tree build and query strategies tailored to sub-Riemannian metrics and demonstrate significant improvements in the running time of nearest-neighbor search queries.

1709.07310 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Constant Bearing Pursuit on Branching Graphs

在分支图上的恒定方位追捕

Kevin S. Galloway, Biswadip Dey

AI总结 本文通过在单一循环上附加多个分支,扩展了恒定方位追捕框架,允许考虑任意弱连通的追捕图,其中每个节点出度为1,并研究了相对平衡、纯形状平衡和周期轨道,推导了三体集体的稳定性条件。

详情
AI中文摘要

循环追捕框架提供了一种通过群体中自主机器人对间相互作用生成有用全局行为的高效方法。先前工作研究了恒定方位(CB)追捕规则的循环追捕,并展示了相应动力学下各种有趣行为的存在。在本文中,通过在单一循环上附加多个分支,我们引入了该框架的修改版本,允许考虑任意弱连通的追捕图,其中每个节点的出度为1。这进一步扩展了循环追捕设置。然后,在展示相对平衡(直线或环形运动)、纯形状平衡(螺旋运动)和周期轨道存在后,我们还推导了三体集体稳定性的必要条件。通过为个体代理提供加入或离开集体而不扰动他人运动的途径,我们的方法提高了整体系统的可靠性。

英文摘要

Cyclic pursuit frameworks provide an efficient way to create useful global behaviors out of pairwise interactions in a collective of autonomous robots. Earlier work studied cyclic pursuit with a constant bearing (CB) pursuit law, and has demonstrated the existence of a variety of interesting behaviors for the corresponding dynamics. In this work, by attaching multiple branches to a single cycle, we introduce a modified version of this framework which allows us to consider any weakly connected pursuit graph where each node has an outdegree of 1. This provides a further generalization of the cyclic pursuit setting. Then, after showing existence of relative equilibria (rectilinear or circling motion), pure shape equilibria (spiraling motion) and periodic orbits, we also derive necessary conditions for stability of a 3-agent collective. By paving a way for individual agents to join or leave a collective without perturbing the motion of others, our approach leads to improved reliability of the overall system.

1709.07032 2026-06-04 cs.RO cs.MA cs.SY eess.SY stat.AP 版本更新

Data-Driven Model Predictive Control of Autonomous Mobility-on-Demand Systems

数据驱动的自动驾驶按需出行系统模型预测控制

Ramon Iglesias, Federico Rossi, Kevin Wang, David Hallac, Jure Leskovec, Marco Pavone

AI总结 本文提出一种端到端的数据驱动框架,用于控制自动驾驶按需出行系统,通过时间扩展网络建模并设计MPC算法,利用历史数据预测短期需求,减少乘客等待时间达89.6%。

Comments Submitted to the International Conference on Robotics and Automation 2018

详情
AI中文摘要

本文旨在提出一种端到端的数据驱动框架,用于控制自动驾驶按需出行系统(AMoD,即自动驾驶车队)。我们首先使用时间扩展网络建模AMoD系统,并提出一种计算最优再平衡策略(即预置重新定位)和给定旅行需求的最小可行车队规模的公式。然后,我们适应此公式,设计出一种模型预测控制(MPC)算法,利用基于历史数据的短期需求预测来计算再平衡策略。我们使用最先进的LSTM神经网络和滴滴出行的真实客户数据测试该控制器的端到端性能,证明该方法在大规模系统中表现优异(MPC算法的计算复杂度不依赖于系统中的客户和车辆数量),并且在减少平均乘客等待时间方面优于现有再平衡策略,最高可减少89.6%。

英文摘要

The goal of this paper is to present an end-to-end, data-driven framework to control Autonomous Mobility-on-Demand systems (AMoD, i.e. fleets of self-driving vehicles). We first model the AMoD system using a time-expanded network, and present a formulation that computes the optimal rebalancing strategy (i.e., preemptive repositioning) and the minimum feasible fleet size for a given travel demand. Then, we adapt this formulation to devise a Model Predictive Control (MPC) algorithm that leverages short-term demand forecasts based on historical data to compute rebalancing strategies. We test the end-to-end performance of this controller with a state-of-the-art LSTM neural network to predict customer demand and real customer data from DiDi Chuxing: we show that this approach scales very well for large systems (indeed, the computational complexity of the MPC algorithm does not depend on the number of customers and of vehicles in the system) and outperforms state-of-the-art rebalancing strategies by reducing the mean customer wait time by up to to 89.6%.

1703.01250 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

虚拟与现实:在强化学习中权衡模拟与物理实验

Alonso Marco, Felix Berkenkamp, Philipp Hennig, Angela P. Schoellig, Andreas Krause, Stefan Schaal, Sebastian Trimpe

AI总结 本文提出利用模拟数据优化强化学习,通过结合低成本但不准确的模拟信息与高成本但准确的物理实验,提高效率。

Comments 7 pages, 6 figures, to appear in IEEE 2017 International Conference on Robotics and Automation (ICRA)

详情
AI中文摘要

在实践中,控制策略的参数通常手动调整,这耗时且令人沮丧。强化学习是一种有前途的替代方法,旨在自动化此过程,但通常需要太多实验才实用。本文提出了一种解决方案,通过利用可用于大多数机器人平台的模拟先验知识。具体而言,我们扩展了熵搜索,一种最大化每次实验信息增益的贝叶斯优化算法,以处理多个信息源的情况。结果是一种原则性的方法,可以有效地将低成本但不准确的模拟信息与高成本且准确的物理实验结合起来。我们将其应用于摆杆系统,证明该算法可以在比仅使用物理系统标准贝叶斯优化更少的实验中找到良好的控制策略。

英文摘要

In practice, the parameters of control policies are often tuned manually. This is time-consuming and frustrating. Reinforcement learning is a promising alternative that aims to automate this process, yet often requires too many experiments to be practical. In this paper, we propose a solution to this problem by exploiting prior knowledge from simulations, which are readily available for most robotic platforms. Specifically, we extend Entropy Search, a Bayesian optimization algorithm that maximizes information gain from each experiment, to the case of multiple information sources. The result is a principled way to automatically combine cheap, but inaccurate information from simulations with expensive and accurate physical experiments in a cost-effective manner. We apply the resulting method to a cart-pole system, which confirms that the algorithm can find good control policies with fewer experiments than standard Bayesian optimization on the physical system only.

1605.01950 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Automatic LQR Tuning Based on Gaussian Process Global Optimization

基于高斯过程全局优化的自动LQR调优

Alonso Marco, Philipp Hennig, Jeannette Bohg, Stefan Schaal, Sebastian Trimpe

AI总结 本文提出一种结合线性最优控制的自动控制器调优框架,利用贝叶斯优化算法提升控制器参数,通过实验数据优化性能目标,以七自由度机械臂平衡倒立杆为例验证方法有效性。

Comments 8 pages, 5 figures, to appear in IEEE 2016 International Conference on Robotics and Automation. Video demonstration of the experiments available at https://am.is.tuebingen.mpg.de/publications/marco_icra_2016

详情
AI中文摘要

本文提出一种基于线性最优控制与贝叶斯优化的自动控制器调优框架。该框架根据预定义的性能目标,利用实验数据自动改进初始控制器参数。所采用的贝叶斯优化算法为熵搜索,将潜在目标表示为高斯过程,并构建关于目标最小值位置的显式信念。通过最大化每次实验评估的信息增益,该框架能够在较少评估次数下获得改进的控制器。实验演示使用了七自由度机械臂平衡倒立杆的任务,二、四维调优问题的结果展示了该方法在机器人平台上的自动控制器调优潜力。

英文摘要

This paper proposes an automatic controller tuning framework based on linear optimal control combined with Bayesian optimization. With this framework, an initial set of controller gains is automatically improved according to a pre-defined performance objective evaluated from experimental data. The underlying Bayesian optimization algorithm is Entropy Search, which represents the latent objective as a Gaussian process and constructs an explicit belief over the location of the objective minimum. This is used to maximize the information gain from each experimental evaluation. Thus, this framework shall yield improved controllers with fewer evaluations compared to alternative approaches. A seven-degree-of-freedom robot arm balancing an inverted pole is used as the experimental demonstrator. Results of a two- and four-dimensional tuning problems highlight the method's potential for automatic controller tuning on robotic platforms.

1709.05843 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

Decentralized Collision-Free Control of Multiple Robots in 2D and 3D Spaces

多机器人在二维和三维空间中去碰撞的去中心化控制

Xiaotian Yang

AI总结 本文提出在任意未知二维和三维区域中实现多机器人去中心化无碰撞控制的方法,通过网格选择和算法设计解决完全覆盖和搜索任务,算法在MATLAB中验证并与其他算法对比。

详情
AI中文摘要

去中心化机器人控制吸引了大量研究兴趣。然而,一些研究使用了不现实的假设而未考虑碰撞避免。本报告聚焦于在任意未知的二维和三维区域内实现多机器人无碰撞控制,以解决完全覆盖和搜索任务。所有算法均为去中心化,因为机器人能力有限,并且已数学证明。报告首先讨论了两种任务中的网格选择。网格模式简化了区域表示,机器人只需在邻居顶点之间直线移动。对于100%完全二维覆盖,提出了等边三角形网格。对于忽略边界效应的完全覆盖,每种情况都计算出在二维和三维区域中顶点最少的网格。第二部分针对二维和三维区域的完全覆盖,提出了一种去中心化无碰撞算法,驱动机器人前往离参考点最远的区域。该区域可以是静态或扩展的,并在MATLAB中模拟。第三部分提供了三种基于网格的去中心化随机算法,用于在二维或三维区域中搜索目标。目标数量可以是已知或未知的。在第一个算法中,机器人随机选择空闲邻居,优先选择未访问的邻居。第二个算法在机器人靠近时添加排斥力以分散机器人。第三个算法中,如果被已访问的顶点包围,机器人将使用广度优先搜索算法前往最近的未访问顶点。第二个搜索算法在Pioneer 3-DX机器人上验证。展示了生成公式以估计搜索时间的一般方法。算法在MATLAB中与其他五个算法比较,以展示其有效性。

英文摘要

Decentralized control of robots has attracted huge research interests. However, some of the research used unrealistic assumptions without collision avoidance. This report focuses on the collision-free control for multiple robots in both complete coverage and search tasks in 2D and 3D areas which are arbitrary unknown. All algorithms are decentralized as robots have limited abilities and they are mathematically proved. The report starts with the grid selection in the two tasks. Grid patterns simplify the representation of the area and robots only need to move straightly between neighbor vertices. For the 100% complete 2D coverage, the equilateral triangular grid is proposed. For the complete coverage ignoring the boundary effect, the grid with the fewest vertices is calculated in every situation for both 2D and 3D areas. The second part is for the complete coverage in 2D and 3D areas. A decentralized collision-free algorithm with the above selected grid is presented driving robots to sections which are furthest from the reference point. The area can be static or expanding, and the algorithm is simulated in MATLAB. Thirdly, three grid-based decentralized random algorithms with collision avoidance are provided to search targets in 2D or 3D areas. The number of targets can be known or unknown. In the first algorithm, robots choose vacant neighbors randomly with priorities on unvisited ones while the second one adds the repulsive force to disperse robots if they are close. In the third algorithm, if surrounded by visited vertices, the robot will use the breadth-first search algorithm to go to one of the nearest unvisited vertices via the grid. The second search algorithm is verified on Pioneer 3-DX robots. The general way to generate the formula to estimate the search time is demonstrated. Algorithms are compared with five other algorithms in MATLAB to show their effectiveness.

1709.03248 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Vector Field Guidance for Convoy Monitoring Using Elliptical Orbits

利用椭圆轨道的矢量场引导用于车队监控

Aseem V. Borkar, Vivek S. Borkar, Arpita Sinha

AI总结 本文提出了一种基于矢量场的新型引导方案,用于无人机沿可能非线性轨迹跟踪和监控地面车队。通过回归算法计算时间变化的椭圆,确保无人机轨迹反复穿越该移动椭圆。

详情
AI中文摘要

我们提出了一种基于矢量场的新型引导方案,用于跟踪和监控沿可能非线性轨迹移动的车队,由空中代理执行。该方案首先使用简单的回归算法计算一个时间变化的椭圆,该椭圆包含车队中的所有目标。然后确保代理收敛到一个轨迹,该轨迹反复穿越这个移动的椭圆。该方案通过非线性微分方程的扰动理论进行分析,并提供了支持性模拟。还讨论了一些相关实现问题,并强调了该方案的优势。

英文摘要

We propose a novel vector field based guidance scheme for tracking and surveillance of a convoy, moving along a possibly nonlinear trajectory on the ground, by an aerial agent. The scheme first computes a time varying ellipse that encompasses all the targets in the convoy using a simple regression based algorithm. It then ensures convergence of the agent to a trajectory that repeatedly traverses this moving ellipse. The scheme is analyzed using perturbation theory of nonlinear differential equations and supporting simulations are provided. Some related implementation issues are discussed and advantages of the scheme are highlighted.

1709.03426 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Trajectory Synthesis for Fisher Information Maximization

信息最大化的轨迹合成

Andrew D. Wilson, Jarvis A. Schultz, Todd D. Murphey

AI总结 本文提出一种连续时间优化方法,通过改进Fisher信息矩阵的范数来生成局部最优轨迹,用于动态系统参数估计,实验验证显示轨迹优化显著提升了参数估计精度。

Comments 12 pages

详情
Journal ref
IEEE Transactions on Robotics, vol. 30, no. 6, pp. 1358-1370, 2014
AI中文摘要

动态系统模型参数估计可通过实验轨迹的选择得到显著提升。对于一般的非线性动态系统,找到全局最优轨迹通常不可行;然而,在给定初始模型参数估计和初始轨迹的情况下,本文提出了一种连续时间优化方法,生成在存在测量噪声下的局部最优轨迹。该优化算法旨在寻找能改进Fisher信息矩阵范数的系统轨迹。通过双臂小车装置进行数值和实验验证。在模拟中,优化轨迹使Fisher信息矩阵的最小特征值比初始轨迹提高了三个数量级。实验结果表明,优化轨迹在实践中使参数估计误差改善了一个数量级。

英文摘要

Estimation of model parameters in a dynamic system can be significantly improved with the choice of experimental trajectory. For general, nonlinear dynamic systems, finding globally "best" trajectories is typically not feasible; however, given an initial estimate of the model parameters and an initial trajectory, we present a continuous-time optimization method that produces a locally optimal trajectory for parameter estimation in the presence of measurement noise. The optimization algorithm is formulated to find system trajectories that improve a norm on the Fisher information matrix. A double-pendulum cart apparatus is used to numerically and experimentally validate this technique. In simulation, the optimized trajectory increases the minimum eigenvalue of the Fisher information matrix by three orders of magnitude compared to the initial trajectory. Experimental results show that this optimized trajectory translates to an order of magnitude improvement in the parameter estimate error in practice.

1709.02561 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Formal Verification of Station Keeping Maneuvers for a Planar Autonomous Hybrid System

平面自主混合系统的轨道保持 maneuver 的形式验证

Benjamin Martin, Khalil Ghorbal, Eric Goubault, Sylvie Putot

AI总结 本文研究了平面自主混合系统轨道保持 maneuver 的形式验证,通过混合程序模型验证可达性和安全性属性,并利用Keymaera X自动生成不变量区域以满足验证需求。

Comments In Proceedings FVAV 2017, arXiv:1709.02126

详情
Journal ref
EPTCS 257, 2017, pp. 91-104
AI中文摘要

我们正式验证了一个混合控制律,用于执行平面车辆的轨道保持 maneuver。此类 maneuver 要求车辆在有限时间内到达其站台的邻域并保持在该区域等待进一步指令。我们将动力学以及控制律建模为混合程序,并正式验证涉及的可达性和安全性属性。特别强调自动产生不变量区域,这在执行此类验证中至关重要。我们使用定理证明器 Keymaera X 来消解一些生成的证明义务。

英文摘要

We formally verify a hybrid control law designed to perform a station keeping maneuver for a planar vehicle. Such maneuver requires that the vehicle reaches a neighborhood of its station in finite time and remains in it while waiting for further instructions. We model the dynamics as well as the control law as a hybrid program and formally verify both the reachability and safety properties involved. We highlight in particular the automated generation of invariant regions which turns out to be crucial in performing such verification. We use the theorem prover Keymaera X to discharge some of the generated proof obligations.

1709.02560 2026-06-04 eess.SY cs.RO cs.SE cs.SY 版本更新

Run-Time Risk Mitigation in Automated Vehicles: A Model for Studying Preparatory Steps

自动驾驶车辆运行时的风险缓解:一种研究预备步骤的模型

Mario Gleirscher

AI总结 本文研究自动驾驶车辆运行时的风险缓解,通过模型分析驾驶过程及控制回路,识别潜在危害并提出缓解措施,为安全控制器的自动化合成提供方法。

Comments In Proceedings FVAV 2017, arXiv:1709.02126

详情
Journal ref
EPTCS 257, 2017, pp. 75-90
AI中文摘要

本文假设自动驾驶或高度自动化驾驶(AD)将伴随着超越ISO 26262或SOTIF最新修订要求的严格保证义务。因此,汽车控制和安全工程师必须(i)全面分析驾驶过程及其控制回路,(ii)识别源自该回路的相关危害,(iii)建立可行的自动化措施以有效缓解这些危害或减轻其后果。通过研究一个例子,本文探讨了可用于形式验证 desired properties 的模型,这些 properties 源自潜在保证义务,如有效缓解策略的全局存在性。此外,所提出的方法旨在逐步细化,以实现 AD 安全控制器的自动化合成,这些控制器实现此类属性。

英文摘要

We assume that autonomous or highly automated driving (AD) will be accompanied by tough assurance obligations exceeding the requirements of even recent revisions of ISO 26262 or SOTIF. Hence, automotive control and safety engineers have to (i) comprehensively analyze the driving process and its control loop, (ii) identify relevant hazards stemming from this loop, (iii) establish feasible automated measures for the effective mitigation of these hazards or the alleviation of their consequences. By studying an example, this article investigates some achievements in the modeling for the steps (i), (ii), and (iii), amenable to formal verification of desired properties derived from potential assurance obligations such as the global existence of an effective mitigation strategy. In addition, the proposed approach is meant for step-wise refinement towards the automated synthesis of AD safety controllers implementing such properties.

1709.02456 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Attack-Aware Multi-Sensor Integration Algorithm for Autonomous Vehicle Navigation Systems

面向攻击的多传感器集成算法用于自动驾驶导航系统

Sangjun Lee, Yongbum Cho, Byung-Cheol Min

AI总结 本文提出了一种基于故障检测与隔离的攻击感知多传感器集成算法,用于检测自动驾驶导航系统中的网络攻击。算法利用扩展卡尔曼滤波构建鲁棒残差,并通过参数统计工具识别攻击,通过软件在环仿真验证了其在动态系统中的快速检测和低误报率性能。

Comments "Copyright 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works."

详情
AI中文摘要

本文提出了一种基于故障检测与隔离的攻击感知多传感器集成算法,用于检测自动驾驶导航系统中的网络攻击。该算法利用扩展卡尔曼滤波在存在噪声的情况下构建鲁棒残差,然后使用参数统计工具识别网络攻击。参数统计工具基于测量历史构建的残差,而非每次单次测量,在离散时间信号和动态系统的特性基础上进行。这种方法使所提出的多传感器集成算法能够为动态系统应用提供快速检测和低误报率。通过自主导航系统INS/GNSS集成的示例,利用软件在环仿真验证了所提算法。

英文摘要

In this paper, we propose a fault detection and isolation based attack-aware multi-sensor integration algorithm for the detection of cyberattacks in autonomous vehicle navigation systems. The proposed algorithm uses an extended Kalman filter to construct robust residuals in the presence of noise, and then uses a parametric statistical tool to identify cyberattacks. The parametric statistical tool is based on the residuals constructed by the measurement history rather than one measurement at a time in the properties of discrete-time signals and dynamic systems. This approach allows the proposed multi-sensor integration algorithm to provide quick detection and low false alarm rates for applications in dynamic systems. An example of INS/GNSS integration of autonomous navigation systems is presented to validate the proposed algorithm by using a software-in-the-loop simulation.

1708.09347 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Sequential Action Control: Closed-Form Optimal Control for Nonlinear and Nonsmooth Systems

顺序动作控制:非线性与非光滑系统的闭环最优控制

Alex Ansari, Todd Murphey

AI总结 本文提出一种基于模型的算法,用于在线闭环计算非线性系统的预测最优控制。通过高阶模型和轨迹目标控制混合脉冲、欠驱动和约束系统,采用闭式表达式优化短期控制动作,保证全局唯一性和稳定性。

Comments 19 pages

详情
Journal ref
IEEE Transactions on Robotics, vol. 32, no. 5, pp. 1196-1214, 2016
AI中文摘要

本文提出了一种新的基于模型的算法,用于在线闭环计算传统上具有挑战性的非线性系统的预测最优控制。示例展示了该算法如何使用仅有的高层模型和轨迹目标来控制混合脉冲、欠驱动和约束系统。与迭代优化有限时间域控制序列不同,本文推导出个体控制动作的闭式表达式,即可以应用于短时间的控制值,以在长时间域内最优改善跟踪目标。在温和假设下,这些动作在平衡点附近变为线性反馈律,允许稳定性分析和基于性能的参数选择。全局上,最优动作保证存在性和唯一性。通过在线递推时间域的方式对这些动作进行序列化,所提出的控制器提供了一种最小-最大约束响应,避免了通常需要施加控制约束的开销。基准示例显示,该方法能够避免局部极小值,并在跟踪性能方面优于非线性最优控制器和最近的特定案例方法,且速度比传统方法快多个数量级。

英文摘要

This paper presents a new model-based algorithm that computes predictive optimal controls on-line and in closed loop for traditionally challenging nonlinear systems. Examples demonstrate the same algorithm controlling hybrid impulsive, underactuated, and constrained systems using only high-level models and trajectory goals. Rather than iteratively optimize finite horizon control sequences to minimize an objective, this paper derives a closed-form expression for individual control actions, i.e., control values that can be applied for short duration, that optimally improve a tracking objective over a long time horizon. Under mild assumptions, actions become linear feedback laws near equilibria that permit stability analysis and performance-based parameter selection. Globally, optimal actions are guaranteed existence and uniqueness. By sequencing these actions on-line, in receding horizon fashion, the proposed controller provides a min-max constrained response to state that avoids the overhead typically required to impose control constraints. Benchmark examples show the approach can avoid local minima and outperform nonlinear optimal controllers and recent, case-specific methods in terms of tracking performance, and at speeds orders of magnitude faster than traditionally achievable.

1708.09342 2026-06-04 eess.SY cs.LG cs.RO cs.SY math.OC 版本更新

Optimal and Learning Control for Autonomous Robots

自主机器人最优与学习控制

Jonas Buchli, Farbod Farshidian, Alexander Winkler, Timothy Sandy, Markus Giftthaler

AI总结 本文基于最优控制与强化学习,从统一视角探讨自主机器人闭环控制问题,提供统一符号和术语对比,帮助理解不同领域方法。

Comments Lecture Notes, 101 pages

详情
AI中文摘要

自主机器人最优与学习控制课程在苏黎世联邦理工学院的机器人、系统与控制硕士项目中教授,旨在从统一视角教授最优控制和强化学习以解决闭环控制问题。起始点是制定最优控制问题并由此推导出不同类型的解决方案和算法。这些讲义力求在可能的情况下使用统一的符号,并提供一些术语和符号的翻译帮助,以比较不同领域的术语和符号。该课程假定具备控制理论、线性代数和随机微积分的基础知识。

英文摘要

Optimal and Learning Control for Autonomous Robots has been taught in the Robotics, Systems and Controls Masters at ETH Zurich with the aim to teach optimal control and reinforcement learning for closed loop control problems from a unified point of view. The starting point is the formulation of of an optimal control problem and deriving the different types of solutions and algorithms from there. These lecture notes aim at supporting this unified view with a unified notation wherever possible, and a bit of a translation help to compare the terminology and notation in the different fields. The course assumes basic knowledge of Control Theory, Linear Algebra and Stochastic Calculus.

1708.06207 2026-06-04 cs.HC cs.CY cs.RO cs.SY eess.SY 版本更新

Givers & Receivers perceive handover tasks differently: Implications for Human-Robot collaborative system design

施与者与接收者对交接任务的认知不同:对人机协作系统设计的启示

Roy Someshwar, Yael Edan

AI总结 研究通过三种方法探讨了超市瓶类交接任务中人类协作行为,揭示了角色差异对共同行动认知的影响,为短周期重复任务的人机协作系统设计提供系统分析方法。

Comments 16 pages, 8 figures, This manuscript is a prior version of the article accepted in the IJSR

详情
AI中文摘要

针对短周期重复交接任务中的人类-人类联合行动,本研究采用三种方法进行探讨:在多个超市进行的工作方法实地研究、使用人体工程学软件进行的模拟分析,以及在实验室中重现超市环境条件下的人类-人类协作实验。评估包括客观和主观措施。主观评估从心理角度出发,展示了团队成员因角色(施与者或接收者)不同而对共同行动认知的差异。所提出的方法可为类似任务提供系统分析方法。结合三种分析结果,本研究为人机协作系统设计提供了短周期重复任务联合行动的科学见解。

英文摘要

Human-human joint-action in short-cycle repetitive handover tasks was investigated for a bottle handover task using a three-fold approach: work-methods field studies in multiple supermarkets, simulation analysis using an ergonomics software package and by conducting an in-house lab experiment on human-human collaboration by re-creating the environment and conditions of a supermarket. Evaluation included both objective and subjective measures. Subjective evaluation was done taking a psychological perspective and showcases among other things, the differences in the way a common joint-action is being perceived by individual team partners depending upon their role (giver or receiver). The proposed approach can provide a systematic method to analyze similar tasks. Combining the results of all the three analyses, this research gives insight into the science of joint-action for short-cycle repetitive tasks and its implications for human-robot collaborative system design.

1708.06345 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Robust Optimal Planning and Control of Non-Periodic Bipedal Locomotion with A Centroidal Momentum Model

非周期双足运动的鲁棒最优规划与控制:基于质心动量模型

Ye Zhao, Benito R. Fernandez, Luis Sentis

AI总结 本文提出基于质心动量动力学的混合相空间规划与控制方法,通过非周期关键帧状态的鲁棒跟踪实现双足运动的稳健控制,重点解决非周期步态生成和扰动鲁棒性问题。

Comments 43 pages, 22 figures, journal, International Journal of Robotics Research, 2017. arXiv admin note: substantial text overlap with arXiv:1701.05929, arXiv:1511.04628

详情
AI中文摘要

本文提出基于质心动量动力学的混合相空间规划与控制方法,通过非周期关键帧状态的鲁棒跟踪实现双足运动的稳健控制,重点解决非周期步态生成和扰动鲁棒性问题。

英文摘要

This study presents a theoretical method for planning and controlling agile bipedal locomotion based on robustly tracking a set of non-periodic keyframe states. Based on centroidal momentum dynamics, we formulate a hybrid phase-space planning and control method which includes the following key components: (i) a step transition solver that enables dynamically tracking non-periodic keyframe states over various types of terrains, (ii) a robust hybrid automaton to effectively formulate planning and control algorithms, (iii) a steering direction model to control the robot's heading, (iv) a phase-space metric to measure distance to the planned locomotion manifolds, and (v) a hybrid control method based on the previous distance metric to produce robust dynamic locomotion under external disturbances. Compared to other locomotion methodologies, we have a large focus on non-periodic gait generation and robustness metrics to deal with disturbances. Such focus enables the proposed control method to robustly track non-periodic keyframe states over various challenging terrains and under external disturbances as illustrated through several simulations.

1708.06252 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Mixture Reduction on Matrix Lie Groups

矩阵李群上的混合减少

Josip Cesic, Ivan Markovic, Ivan Petrovic

AI总结 本文提出在矩阵李群上用于混合滤波的集中高斯分布(CGDs)减少方法,通过重新参数化计算KL散度并合并混合组件,用于多目标跟踪滤波研究。

Comments IEEE Signal Processing Letters

详情
AI中文摘要

许多物理系统在矩阵李群上演化,为这类流形设计的混合滤波是解决挑战性估计问题的必要工具。然而,混合滤波面临组件数量持续增长的问题,因此需要适当的混合减少技术。本文提出了一种用于矩阵李群上分布的混合减少方法,称为集中高斯分布(CGDs)。这包括适当重新参数化CGD参数以计算KL散度,选择并合并混合组件。此外,我们还引入了在李群上的多目标跟踪滤波器作为所提减少方法的混合滤波研究示例。特别是,我们实现了在矩阵李群上的概率假说密度滤波器。我们使用最优子图案分配度量在由100个随机生成多目标场景组成的合成数据集上验证了滤波器性能。

英文摘要

Many physical systems evolve on matrix Lie groups and mixture filtering designed for such manifolds represent an inevitable tool for challenging estimation problems. However, mixture filtering faces the issue of a constantly growing number of components, hence require appropriate mixture reduction techniques. In this letter we propose a mixture reduction approach for distributions on matrix Lie groups, called the concentrated Gaussian distributions (CGDs). This entails appropriate reparametrization of CGD parameters to compute the KL divergence, pick and merge the mixture components. Furthermore, we also introduce a multitarget tracking filter on Lie groups as a mixture filtering study example for the proposed reduction method. In particular, we implemented the probability hypothesis density filter on matrix Lie groups. We validate the filter performance using the optimal subpattern assignment metric on a synthetic dataset consisting of 100 randomly generated multitarget scenarios.

1708.01925 2026-06-04 cs.MA cs.AI cs.CY cs.RO cs.SY eess.SY 版本更新

Designing Autonomous Vehicles: Evaluating the Role of Human Emotions and Social Norms

设计自动驾驶车辆:评估人类情感与社会规范的作用

Faisal Riaz, Muaz A. Niazi

AI总结 本文提出通过引入社会规范合规机制,使自动驾驶车辆遵循道路与社会规则,利用模糊逻辑和情绪计算提升决策能力,通过模拟验证其在减少碰撞方面的有效性。

Comments 42 pages, 12 figures

详情
AI中文摘要

人类即将在未来不久将驾驶权利委托给自动驾驶车辆。然而,为完成这一复杂任务,需要一种机制,迫使自动驾驶车辆遵守由良好驾驶者实践的道路和社会规则。此任务可通过在自动驾驶车辆中引入社会规范合规机制来实现。本文提出一个自动驾驶车辆的人工社会作为人类社会的类比。每个AV被分配了具有不同社会影响的社会性格。社会规范被引入,帮助AV在受情绪影响的情况下做出道路避障决策。此外,通过基于前景的情绪(即恐惧)的社交规范合规机制,利用模糊逻辑计算情绪,并通过SimConnect方法将恐惧的模糊值提供给Netlogo模拟环境,以模拟自动驾驶车辆的人工社会。通过行为空间工具进行了广泛的测试,以确定所提出方法在碰撞数量方面的性能。此外,还提出了基于随机漫步模型的人工社会作为比较。与随机漫步的比较证明,所提出的方法为未来自动驾驶车辆的自动驾驶系统提供了更好的选择,这些系统在安全道路旅行方面将更具社会接受性和信任度。

英文摘要

Humans are going to delegate the rights of driving to the autonomous vehicles in near future. However, to fulfill this complicated task, there is a need for a mechanism, which enforces the autonomous vehicles to obey the road and social rules that have been practiced by well-behaved drivers. This task can be achieved by introducing social norms compliance mechanism in the autonomous vehicles. This research paper is proposing an artificial society of autonomous vehicles as an analogy of human social society. Each AV has been assigned a social personality having different social influence. Social norms have been introduced which help the AVs in making the decisions, influenced by emotions, regarding road collision avoidance. Furthermore, social norms compliance mechanism, by artificial social AVs, has been proposed using prospect based emotion i.e. fear, which is conceived from OCC model. Fuzzy logic has been employed to compute the emotions quantitatively. Then, using SimConnect approach, fuzzy values of fear has been provided to the Netlogo simulation environment to simulate artificial society of AVs. Extensive testing has been performed using the behavior space tool to find out the performance of the proposed approach in terms of the number of collisions. For comparison, the random-walk model based artificial society of AVs has been proposed as well. A comparative study with a random walk, prove that proposed approach provides a better option to tailor the autopilots of future AVS, Which will be more socially acceptable and trustworthy by their riders in terms of safe road travel.

1708.04790 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Evaluation of Human-Robot Collaboration Models for Fluent Operations in Industrial Tasks

工业任务中流畅操作的人机协作模型评估

Lior Sayfeld, Ygal Peretz, Roy Someshwar, Yael Edan

AI总结 本文评估了人机协作模型在工业任务中的性能,比较了基于定时和传感器的模型与自适应模型,发现自适应模型在总装配时间和空闲时间上有显著提升。

Comments Robotics: Science and Systems, Human-Robot Hand-Over Workshop 2015

详情
AI中文摘要

在本研究中,我们评估了集成人机操作系统中的人机协作模型。设计了一个包含协作机器人臂和人类工人的集成工作单元,用于执行实时装配任务。80名22-27岁的工业工程学生参与了实验,比较了基于定时和传感器的模型与在此框架内开发的自适应模型。性能指标包括总装配时间和总空闲时间。结果显示,自适应系统显著提高了所检查的参数,并与基于定时和传感器的模型相比,总装配时间减少了7%,总空闲时间减少了60%。

英文摘要

In this study we evaluated human-robot collaboration models in an integrated human-robot operational system. An integrated work cell which includes a robotic arm working collaboratively with a human worker was specially designed for executing a real-time assembly task. Eighty industrial engineering students aged 22-27 participated in experiments in which timing and sensor based models were compared to an adaptive model developed within this framework. Performance measures included total assembly time and total idle time. The results showed conclusively that the adaptive system improved the examined parameters and provided an improvement of 7% in total assembly time and 60% in total idle time when compared to timing and sensory based models.

1708.03055 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Optimal Control for Constrained Coverage Path Planning

受约束覆盖路径规划的最优控制

Ankit Manerikar, Debasmit Das, Pranay Banerjee

AI总结 本文研究了在地图中存在障碍物约束下,如何通过改进线性扫掠覆盖方法实现最小能量/时间最优和最大面积覆盖,并分析不同参数对性能的影响。

Comments Report for AAE 568 (Applied Optimal Control) at Purdue

详情
AI中文摘要

受约束覆盖路径规划问题涉及机器人在地图中存在某些作为障碍物出现的约束条件下,试图覆盖最大面积。在多种覆盖路径规划方法中,我们考虑将线性扫掠覆盖方法进行增强,以实现最小能量/时间最优的同时实现最大面积覆盖。此外,我们还研究了不同参数变化对改进方法性能的影响。

英文摘要

The problem of constrained coverage path planning involves a robot trying to cover maximum area of an environment under some constraints that appear as obstacles in the map. Out of the several coverage path planning methods, we consider augmenting the linear sweep-based coverage method to achieve minimum energy/ time optimality along with maximum area coverage. In addition, we also study the effects of variation of different parameters on the performance of the modified method.

1708.00675 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Optical Target Tracking by Scheduled Range Measurements

通过计划范围测量进行光学目标跟踪

Mohammad Hossein Ferdowsi, Ebrahim Sabzikar

AI总结 本文研究了通过定期目标方位测量和计划的范围测量率进行光学目标跟踪,利用目标范围估计误差方差作为调度准则,采用改进的球坐标系下目标动态状态向量,推导了近常速、近常加速度和协调转弯率运动模型的动力学方程,并利用UKF-IMM滤波器进行跟踪。

详情
Journal ref
Optical Engineering 54.4 (2015): 044101-044101
AI中文摘要

本文研究了通过定期目标方位测量和计划的范围测量率进行光学目标跟踪。利用目标范围估计误差方差作为调度准则。为此,将目标动态状态向量以改进的球坐标系表示,使得所有目标状态与范围相关的目标状态解耦。推导了近常速、近常加速度和协调转弯率运动模型在改进球坐标系下的目标状态动力学方程。对于所得到的状态动力学方程,采用具有范围测量调度的UKF-IMM滤波器作为跟踪滤波器。结果显示目标状态被正确估计,所用滤波器在机动目标跟踪中具有高性能。

英文摘要

In this paper, optical target tracking, by regular target bearing measurements and target range in a lower and scheduled measurement rate is considered. Variance of the target range estimation error is used as scheduling criterion. For this purpose, target dynamic state vector in modified spherical coordinates is stated in such a way that all target states be decoupled from range-related target state. Target state dynamic equations in modified spherical coordinates for nearly constant velocity, nearly constant acceleration and coordinated turn rate kinematic models, are analytically derived. For resulted state dynamic equations, a UKF-IMM filter with range measurement scheduling is utilized as a tracking filter. It is shown that target states are estimated properly and applied filter has high performance in maneuvering target tracking.

1605.05120 2026-06-04 cs.HC cs.RO cs.SY eess.SY 版本更新

Pushing the limits of the CyberGrasp for haptic rendering

突破CyberGrasp在触觉渲染中的极限

Manuel Aiple, André Schiele

AI总结 本文提出ExHand Box,通过扩展带宽和定制控制器提升CyberGrasp的触觉反馈性能,实现在硬接触场景中更高的接触刚度和更稳定的反馈效果。

Comments 7 pages, 12 figures

详情
Journal ref
Robotics and Automation (ICRA), 2013 IEEE International Conference on, Karlsruhe, 2013, pp. 3541-3546
AI中文摘要

CyberGrasp是一种已知的数据手套-外骨骼设备组合,允许向人类手指渲染触觉反馈。然而,其设计限制了通过有限的控制带宽和位置传感器分辨率进行远程操作的实用性。因此,系统在触觉渲染中受限于低可实现的接触刚度和反馈增益幅度。此外,系统禁止其控制器实现的简单适应。在本文中,提出了ExHand Box,一种新设计的后端,以扩大CyberGrasp的带宽限制,并使其能够完全定制控制器实现。ExHand Box为否则未修改的CyberGlove和CyberGrasp手系统提供了新的计算机、接口电子和电机控制器。新系统的回路频率可以自由变化,最高可达2kHz,并且可以通过自动代码生成接口实现自定义控制器。系统性能识别实验展示了在不同采样周期范围内改进的硬接触行为。在稳定状态下演示了高达50kN/m的最大接触刚度,这显著高于非定制原始系统版本所能实现的水平。此外,进行了一项双侧控制实验,以展示新系统在通用远程操作研究中的实用性。在该实验中,引入了一种射线投射算法用于预接触检测,以补偿主从设备之间在以太网网络中出现的高延迟和抖动通信链路。证明了在稳定状态下接触刚度可以保持在系统性能识别的量级,演示了在稳定状态下达到41kN/m的刚度。

英文摘要

The CyberGrasp is a well known dataglove-exoskeleton device combination that allows to render haptic feedback to the human fingers. Its design, however, restricts its usability for teleoperation through a limited control bandwidth and position sensor resolution. Therefore the system is restricted to low achievable contact stiffness and feedback gain magnitudes in haptic rendering. Moreover, the system prohibits simple adaption of its controller implementation. In this paper, the ExHand Box is presented, a newly designed back-end to widen the CyberGrasp's bandwidth restrictions and to open it up for fully customized controller implementations. The ExHand Box provides a new computer, interface electronics and motor controllers for the otherwise unmodified CyberGlove and CyberGrasp hand systems. The loop frequency of the new system can be freely varied up to 2 kHz and custom controllers can be implemented through an automatic code generation interface. System performance identification experiments are presented that demonstrate improved behavior in hard contact situations over a range of sampling periods. Maximum contact stiffnesses of up to 50kN/m in a stable condition are demonstrated, which is significantly higher than what could be achieved with the non-customized original system version. Moreover, a bilateral control experiment is conducted to demonstrate the new system's usability for generic teleoperation research. In this experiment a raycasting algorithm is introduced for pre-contact detection in order to compensate for high delay and jitter communication links between master and slave as they appear in an Ethernet network. It is demonstrated that the contact stiffness can be maintained in the order of magnitude of the system performance identification with a demonstrated stiffness of 41kN/m in a stable condition.

1707.08689 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Multi-Robot Transfer Learning: A Dynamical System Perspective

多机器人迁移学习:动态系统视角

Mohamed K. Helwa, Angela P. Schoellig

AI总结 本文从动态系统角度研究多机器人迁移学习中的最优转移映射性质,提出无需详细动力学知识的算法,通过实验验证该算法在四旋翼平台间迁移学习中减少60-70%的误差。

Comments 7 pages, 6 figures, accepted at the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems

详情
AI中文摘要

多机器人迁移学习允许一个机器人利用第二个相似机器人生成的数据来改进自身行为。潜在优势是减少训练时间并降低训练阶段不可避免的风险。迁移学习算法旨在找到不同机器人之间的最优转移映射。本文通过单输入单输出(SISO)系统的理论研究,探讨了此类最优转移映射的性质。我们首先证明最优迁移学习映射通常是一个动态系统。本文的主要贡献是提供一种确定该最优动态映射性质的算法,包括其阶数和回归器(即它所依赖的变量)。所提出的算法不需要详细的机器人动力学知识,但依赖于通过简单实验测试可获得的基本系统属性。我们通过两个不同四旋翼平台间的迁移学习示例验证了所提算法。实验结果表明,通过我们的算法获得的最优动态映射在减少迁移学习误差方面比直接转移数据或使用最优静态映射的情况减少了60-70%。

英文摘要

Multi-robot transfer learning allows a robot to use data generated by a second, similar robot to improve its own behavior. The potential advantages are reducing the time of training and the unavoidable risks that exist during the training phase. Transfer learning algorithms aim to find an optimal transfer map between different robots. In this paper, we investigate, through a theoretical study of single-input single-output (SISO) systems, the properties of such optimal transfer maps. We first show that the optimal transfer learning map is, in general, a dynamic system. The main contribution of the paper is to provide an algorithm for determining the properties of this optimal dynamic map including its order and regressors (i.e., the variables it depends on). The proposed algorithm does not require detailed knowledge of the robots' dynamics, but relies on basic system properties easily obtainable through simple experimental tests. We validate the proposed algorithm experimentally through an example of transfer learning between two different quadrotor platforms. Experimental results show that an optimal dynamic map, with correct properties obtained from our proposed algorithm, achieves 60-70% reduction of transfer learning error compared to the cases when the data is directly transferred or transferred using an optimal static map.

1707.08488 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A New Framework for Synthetic Aperture Sonar Micronavigation

合成孔径声纳微导航的新框架

Salvatore Caporale, Yvan Petillot

AI总结 本文提出基于误差函数最小化的新型微导航方法,通过比较连续回波的向量空间交集实现高精度定位,通过仿真和受控环境实验验证其有效性。

Comments 12 pages, 19 figures

详情
AI中文摘要

合成孔径成像系统通过相干叠加沿孔径路径获取的观测数据实现恒定的方位分辨率。为此,其位置必须以次波长精度确定。在水下合成孔径声纳(SAS)中,水中的传播和导航特性使获取此信息具有挑战性。需结合惯性传感器与信号处理技术,通常称为微导航。本文提出一种基于两个连续回波之间具有相互信息的误差函数最小化的新微导航方法。该误差通过比较回波正交投影器之间的向量空间交集获得。通过仿真和受控环境实验验证了所提方法的有效性和通用性。

英文摘要

Synthetic aperture imaging systems achieve constant azimuth resolution by coherently summating the observations acquired along the aperture path. At this aim, their locations have to be known with subwavelength accuracy. In underwater Synthetic Aperture Sonar (SAS), the nature of propagation and navigation in water makes the retrieval of this information challenging. Inertial sensors have to be employed in combination with signal processing techniques, which are usually referred to as micronavigation. In this paper we propose a novel micronavigation approach based on the minimization of an error function between two contiguous pings having some mutual information. This error is obtained by comparing the vector space intersections between the pings orthogonal projectors. The effectiveness and generality of the proposed approach is demonstrated by means of simulations and by means of an experiment performed in a controlled environment.

1610.06283 2026-06-04 cs.RO cs.LG cs.NE cs.SY eess.SY 版本更新

Deep Neural Networks for Improved, Impromptu Trajectory Tracking of Quadrotors

用于四旋翼机即时轨迹跟踪的深度神经网络

Qiyang Li, Jingxing Qian, Zining Zhu, Xuchan Bao, Mohamed K. Helwa, Angela P. Schoellig

AI总结 本文提出基于深度神经网络的算法,通过提供定制参考输入提升经典反馈控制器的轨迹跟踪性能,实验表明该方法能有效减少跟踪误差,适用于实时轨迹跟踪应用。

Comments 7 pages, 8 figures. Accepted final version. To appear in the proc. of the 2017 IEEE International Conference on Robotics and Automation

详情
AI中文摘要

四旋翼机的轨迹跟踪控制对于应用范围从勘测和检查到影视制作都至关重要。然而,设计和调优经典控制器,如比例-积分-微分(PID)控制器,以实现高跟踪精度可能耗时且困难,由于隐藏动态和其他非理想因素。深度神经网络(DNN)凭借其卓越的近似抽象、非线性函数的能力,提出了一种增强轨迹跟踪控制的新方法。本文提出了一种基于DNN的算法作为附加模块,以提高经典反馈控制器的跟踪性能。给定期望轨迹,DNNs根据其获得的经验为控制器提供定制参考输入。输入旨在实现期望轨迹与输出轨迹之间的单位映射。这项工作的动机是交互式“画即飞”应用,用户在移动设备上绘制轨迹,四旋翼机即时飞越该轨迹,通过DNN增强的控制系统。实验结果表明,所提出的方法在DNNs在选定的周期轨迹上训练后,能够提高用户绘制轨迹的跟踪精度,表明该方法在现实应用中的潜力。跟踪误差在训练和测试轨迹上分别减少约40-50%,突显了DNNs在知识泛化方面的能力。

英文摘要

Trajectory tracking control for quadrotors is important for applications ranging from surveying and inspection, to film making. However, designing and tuning classical controllers, such as proportional-integral-derivative (PID) controllers, to achieve high tracking precision can be time-consuming and difficult, due to hidden dynamics and other non-idealities. The Deep Neural Network (DNN), with its superior capability of approximating abstract, nonlinear functions, proposes a novel approach for enhancing trajectory tracking control. This paper presents a DNN-based algorithm as an add-on module that improves the tracking performance of a classical feedback controller. Given a desired trajectory, the DNNs provide a tailored reference input to the controller based on their gained experience. The input aims to achieve a unity map between the desired and the output trajectory. The motivation for this work is an interactive "fly-as-you-draw" application, in which a user draws a trajectory on a mobile device, and a quadrotor instantly flies that trajectory with the DNN-enhanced control system. Experimental results demonstrate that the proposed approach improves the tracking precision for user-drawn trajectories after the DNNs are trained on selected periodic trajectories, suggesting the method's potential in real-world applications. Tracking errors are reduced by around 40-50% for both training and testing trajectories from users, highlighting the DNNs' capability of generalizing knowledge.

1707.05458 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Stabilization Control of the Differential Mobile Robot Using Lyapunov Function and Extended Kalman Filter

微分移动机器人稳定控制的Lyapunov函数与扩展卡尔曼滤波应用

T. T. Hoang, P. M. Duong, N. T. T. Van, T. Q. Vinh

AI总结 本文设计了导航微分移动机器人到达目标位置的控制模型,采用扩展卡尔曼滤波进行状态估计,结合Lyapunov函数实现稳定控制,确保闭环系统渐近稳定与鲁棒性。

Comments arXiv admin note: text overlap with arXiv:1611.07112, arXiv:1611.07114

详情
Journal ref
Journal of Science and Technology, pp.441-452, Vol. 50 no.4, 2012
AI中文摘要

本文提出了一种控制模型,用于使微分移动机器人从任意初始姿态导航至目标位置。该模型分为两个阶段:状态估计和稳定控制。在状态估计中,采用扩展卡尔曼滤波来最优结合系统动力学和测量信息。构造了两个Lyapunov函数,允许混合反馈控制律执行机器人运动。闭环系统的渐近稳定性和鲁棒性得到保证。通过仿真和实验验证了所提方法的有效性和实用性。

英文摘要

This paper presents the design of a control model to navigate the differential mobile robot to reach the desired destination from an arbitrary initial pose. The designed model is divided into two stages: the state estimation and the stabilization control. In the state estimation, an extended Kalman filter is employed to optimally combine the information from the system dynamics and measurements. Two Lyapunov functions are constructed that allow a hybrid feedback control law to execute the robot movements. The asymptotical stability and robustness of the closed loop system are assured. Simulations and experiments are carried out to validate the effectiveness and applicability of the proposed approach.

1707.05456 2026-06-04 cs.RO cs.NI cs.SY eess.SY 版本更新

Control of an Internet-based Robot System Using the Real-time Transport Protocol

基于实时传输协议的互联网机器人系统控制

P. M. Duong, T. T. Hoang, T. Q. Vinh

AI总结 本文提出利用实时传输协议替代传统TCP和UDP进行机器人系统控制,通过理论分析、仿真和实验验证其可行性与有效性。

Comments in Proceeding of The 5th Vietnam Conference on Mechatronics, Ho chi minh City, Vietnam, 2010

详情
AI中文摘要

本文介绍了一种新的方法,用于在互联网上控制机器人系统。实时传输协议(RTP)被用作通信协议,而不是传统上使用TCP和UDP。理论分析、仿真研究和实验实现已执行,以评估所提出方法在实际应用中的可行性和有效性。

英文摘要

In this paper, we introduce a novel approach in controlling robot systems over the Internet. The Real-time Transport Protocol (RTP) is used as the communication protocol instead of traditionally using TCP and UDP. The theoretic analyses, the simulation studies and the experimental implementation have been performed to evaluate the feasibility and effectiveness of the proposed approach for practical uses.

1705.10259 2026-06-04 cs.MA cs.LO cs.RO cs.SY eess.SY 版本更新

Distributed Communication-aware Motion Planning for Multi-agent Systems from STL and SpaTeL Specifications

分布式通信感知多智能体系统运动规划

Zhiyu Liu, Bo Wu, Jin Dai, Hai Lin

AI总结 本文提出一种分布式运动规划框架,结合STL和SpaTeL规格,实现通信质量与运动控制的协同设计。

Comments Submitted for publication on 2017 IEEE Conference on Decision and Control (CDC2017)

详情
AI中文摘要

在未来智能交通系统中,联网车辆通过假设通信可靠来实现安全操作。传统方法将控制设计与通信网络分开处理,但车辆运动会影响通信质量。本文提出一种控制理论框架,用于多智能体系统的分布式运动规划,满足复杂的时空规格并考虑通信质量。将期望运动规格和通信性能分别编码为信号临时逻辑(STL)和时空逻辑(SpaTeL)公式,并作为混合整数线性规划(MILP)系统和环境状态变量的约束。通过分布式模型预测控制(MPC)框架为每个智能体生成满足STL和SpaTeL规格的控制策略。通过多智能体系统的分布式通信感知运动规划仿真验证了所提框架的有效性。

英文摘要

In future intelligent transportation systems, networked vehicles coordinate with each other to achieve safe operations based on an assumption that communications among vehicles and infrastructure are reliable. Traditional methods usually deal with the design of control systems and communication networks in a separated manner. However, control and communication systems are tightly coupled as the motions of vehicles will affect the overall communication quality. Hence, we are motivated to study the co-design of both control and communication systems. In particular, we propose a control theoretical framework for distributed motion planning for multi-agent systems which satisfies complex and high-level spatial and temporal specifications while accounting for communication quality at the same time. Towards this end, desired motion specifications and communication performances are formulated as signal temporal logic (STL) and spatial-temporal logic (SpaTeL) formulas, respectively. The specifications are encoded as constraints on system and environment state variables of mixed integer linear programs (MILP), and upon which control strategies satisfying both STL and SpaTeL specifications are generated for each agent by employing a distributed model predictive control (MPC) framework. Effectiveness of the proposed framework is validated by a simulation of distributed communication-aware motion planning for multi-agent systems.

1705.05065 2026-06-04 cs.RO cs.AI cs.CV cs.SY eess.SY 版本更新

AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles

AirSim:面向自动驾驶车辆的高保真视觉与物理模拟

Shital Shah, Debadeepta Dey, Chris Lovett, Ashish Kapoor

AI总结 本文提出基于Unreal引擎的AirSim模拟器,用于高效开发和测试自动驾驶算法,支持高频率物理模拟和多种协议,通过四旋翼实验验证其有效性。

Comments Accepted for Field and Service Robotics conference 2017 (FSR 2017)

详情
AI中文摘要

为自动驾驶车辆开发和测试算法在现实世界中成本高且耗时。为利用最新机器智能和深度学习进展,需收集大量标注训练数据。本文提出基于Unreal引擎的新模拟器,提供真实的物理和视觉模拟。模拟器包含可实现实时硬件在环(HITL)模拟的物理引擎,支持MavLink等流行协议。模拟器从零开始设计,可扩展以适应新车辆类型、硬件平台和软件协议。模块化设计使各组件可独立用于其他项目。通过实现四旋翼自动驾驶车辆并实验性比较软件组件与真实飞行,验证了模拟器的有效性。

英文摘要

Developing and testing algorithms for autonomous vehicles in real world is an expensive and time consuming process. Also, in order to utilize recent advances in machine intelligence and deep learning we need to collect a large amount of annotated training data in a variety of conditions and environments. We present a new simulator built on Unreal Engine that offers physically and visually realistic simulations for both of these goals. Our simulator includes a physics engine that can operate at a high frequency for real-time hardware-in-the-loop (HITL) simulations with support for popular protocols (e.g. MavLink). The simulator is designed from the ground up to be extensible to accommodate new types of vehicles, hardware platforms and software protocols. In addition, the modular design enables various components to be easily usable independently in other projects. We demonstrate the simulator by first implementing a quadrotor as an autonomous vehicle and then experimentally comparing the software components with real-world flights.

1403.5204 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Adaptive Control of Robot Manipulators With Uncertain Kinematics and Dynamics

具有不确定运动学和动力学的机器人操作机自适应控制

Hanlei Wang

AI总结 本文提出两种自适应控制方案,实现不管运动学和动力学不确定性的任务空间轨迹跟踪,通过数值模拟验证了控制器性能。

Comments 18 pages, 7 figures, revised for improving the presentation and adding some contents and references based on the reviewers' and AE's comments from IEEE Transactions on Automatic Control

详情
Journal ref
IEEE Transactions on Automatic Control, 62(2): 948-954, Feb. 2017
AI中文摘要

在本文中,我们研究了具有不确定运动学和动力学的机器人操作机的自适应控制问题。我们提出两种自适应控制方案,以实现任务空间轨迹跟踪的目标,无论运动学和动力学的不确定性。所提出的控制器具有良好的分离特性,我们还表明,通过适当修改的第一种自适应控制器可以产生改进的性能,而无需牺牲保守的增益选择。所提出控制器的性能通过数值模拟展示。

英文摘要

In this paper, we investigate the adaptive control problem for robot manipulators with both the uncertain kinematics and dynamics. We propose two adaptive control schemes to realize the objective of task-space trajectory tracking irrespective of the uncertain kinematics and dynamics. The proposed controllers have the desirable separation property, and we also show that the first adaptive controller with appropriate modifications can yield improved performance, without the expense of conservative gain choice. The performance of the proposed controllers is shown by numerical simulations.

1707.03092 2026-06-04 eess.SY cs.LG cs.RO cs.SY 版本更新

A Separation-Based Design to Data-Driven Control for Large-Scale Partially Observed Systems

基于分离的设计到数据驱动控制用于大规模部分观测系统

Dan Yu, Mohammadhussein Rafieisakhaei, Suman Chakravorty

AI总结 本文研究了由偏微分方程(PDE)描述的状态动力学导致的 partially observed 随机最优控制问题,通过黑盒模拟模型求解开环确定性轨迹优化问题,并基于输入输出实验数据设计线性二次高斯控制器。

Comments 3 pages, 6 figures, In Robotics: Science and Systems (RSS) 2017 Workshop of "POMDPs in Robotics: State of The Art, Challenges, and Opportunities"

详情
AI中文摘要

本文研究了由偏微分方程(PDE)描述的状态动力学导致的 partially observed 随机最优控制问题,该问题导致极大规模的问题。首先,使用动态系统的黑盒模拟模型求解开环确定性轨迹优化问题。接着,针对依赖于名义轨迹的线性化系统,设计线性二次高斯(LQG)控制器,该控制器通过由优化名义系统冲击响应组成的输入输出实验数据进行识别。通过计算非线性热例示该方法的性能。

英文摘要

This paper studies the partially observed stochastic optimal control problem for systems with state dynamics governed by Partial Differential Equations (PDEs) that leads to an extremely large problem. First, an open-loop deterministic trajectory optimization problem is solved using a black box simulation model of the dynamical system. Next, a Linear Quadratic Gaussian (LQG) controller is designed for the nominal trajectory-dependent linearized system, which is identified using input-output experimental data consisting of the impulse responses of the optimized nominal system. A computational nonlinear heat example is used to illustrate the performance of the approach.

1707.02201 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Learning human behaviors from motion capture by adversarial imitation

通过对抗模仿学习学习人类行为

Josh Merel, Yuval Tassa, Dhruva TB, Sriram Srinivasan, Jay Lemmon, Ziyu Wang, Greg Wayne, Nicolas Heess

AI总结 本文提出利用生成对抗模仿学习训练神经网络策略,从有限的不完全观测状态特征中生成人类化运动模式,即使演示来自不同物理参数的躯体,也能通过子技能策略解决任务。

详情
AI中文摘要

深度强化学习的快速进展使训练高维人形身体控制器变得越来越可行。然而,纯强化学习方法使用简单的奖励函数往往会产生非人类化且过于刻板的运动行为。在本文中,我们扩展了生成对抗模仿学习,以使训练通用神经网络策略成为可能,从而从仅包含部分观测状态特征的有限演示中生成人类化运动模式,即使在没有动作信息且演示来自具有不同且未知物理参数的躯体时也是如此。我们利用这种方法从动作捕捉数据构建子技能策略,并展示这些策略在由更高层次控制器控制时可以用于解决任务。

英文摘要

Rapid progress in deep reinforcement learning has made it increasingly feasible to train controllers for high-dimensional humanoid bodies. However, methods that use pure reinforcement learning with simple reward functions tend to produce non-humanlike and overly stereotyped movement behaviors. In this work, we extend generative adversarial imitation learning to enable training of generic neural network policies to produce humanlike movement patterns from limited demonstrations consisting only of partially observed state features, without access to actions, even when the demonstrations come from a body with different and unknown physical parameters. We leverage this approach to build sub-skill policies from motion capture data and show that they can be reused to solve tasks when controlled by a higher level controller.

1705.09415 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Near-Optimal Belief Space Planning via T-LQG

通过T-LQG实现接近最优的信念空间规划

Mohammadhussein Rafieisakhaei, Suman Chakravorty, P. R. Kumar

AI总结 本文提出T-LQG方法,用于非线性机器人系统在观测和运动不确定性下的规划问题,提供近优反馈控制策略,解决POMDP问题。

Comments 3 pages, 3 figures, In Robotics: Science and Systems (RSS) 2017 Workshop of "POMDPs in Robotics: State of The Art, Challenges, and Opportunities"

详情
AI中文摘要

本文考虑了在观测和运动不确定性下非线性机器人系统的规划问题。通常将此问题形式化为部分观察马尔可夫决策过程(POMDP),确定最优解在计算上是不可行的。我们提出了一种轨迹优化线性二次高斯(T-LQG)方法,该方法能够为POMDP问题提供可量化近优的解决方案。我们提供了一个新的'分离原理',用于设计一个最优的开环轨迹随后是最佳反馈控制律,这为涉及多项式阶数最小阶数的信念空间规划问题提供了一个近优反馈控制策略。

英文摘要

We consider the problem of planning under observation and motion uncertainty for nonlinear robotics systems. Determining the optimal solution to this problem, generally formulated as a Partially Observed Markov Decision Process (POMDP), is computationally intractable. We propose a Trajectory-optimized Linear Quadratic Gaussian (T-LQG) approach that leads to quantifiably near-optimal solutions for the POMDP problem. We provide a novel "separation principle" for the design of an optimal nominal open-loop trajectory followed by an optimal feedback control law, which provides a near-optimal feedback control policy for belief space planning problems involving a polynomial order of calculations of minimum order.

1405.6341 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Efficient Model Learning for Human-Robot Collaborative Tasks

高效的人机协作任务模型学习

Stefanos Nikolaidis, Keren Gu, Ramya Ramakrishnan, Julie Shah

AI总结 本文提出一种框架,通过联合动作演示学习人类用户模型,使机器人能自动计算稳健的协作策略。采用无监督学习聚类动作序列,学习逆强化学习奖励函数,并在混合可观测马尔可夫决策过程框架中应用,实现对新用户的类型推断和策略计算。

详情
Journal ref
Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction (HRI 2015)
AI中文摘要

我们提出了一种框架,用于从联合动作演示中学习人类用户模型,使机器人能够计算协作任务的稳健策略。学习过程完全自动,无需人工干预。首先,我们描述了使用无监督学习算法将演示的动作序列聚类为不同的人类类型。这些演示序列还被机器人用来通过逆强化学习算法学习代表每种类型的奖励函数。学习的模型随后作为混合可观测马尔可夫决策过程(MO-MDP)的一部分使用,其中人类类型是部分可观测变量。通过该框架,我们可以推断新用户类型(未包含在训练集中),并计算与新用户偏好一致且对人类动作偏离具有鲁棒性的机器人策略。最后,我们通过人类受试者实验数据验证了该方法,并进行了概念验证演示,其中一个人与小型工业机器人进行协作任务。

英文摘要

We present a framework for learning human user models from joint-action demonstrations that enables the robot to compute a robust policy for a collaborative task with a human. The learning takes place completely automatically, without any human intervention. First, we describe the clustering of demonstrated action sequences into different human types using an unsupervised learning algorithm. These demonstrated sequences are also used by the robot to learn a reward function that is representative for each type, through the employment of an inverse reinforcement learning algorithm. The learned model is then used as part of a Mixed Observability Markov Decision Process formulation, wherein the human type is a partially observable variable. With this framework, we can infer, either offline or online, the human type of a new user that was not included in the training set, and can compute a policy for the robot that will be aligned to the preference of this new user and will be robust to deviations of the human actions from prior demonstrations. Finally we validate the approach using data collected in human subject experiments, and conduct proof-of-concept demonstrations in which a person performs a collaborative task with a small industrial robot.

1706.01127 2026-06-04 cs.RO cs.SY eess.SY math.DS math.OC 版本更新

Virtual Constraints and Hybrid Zero Dynamics for Realizing Underactuated Bipedal Locomotion

虚拟约束与混合零动力学用于实现欠驱动双足运动

Jessy W Grizzle, Christine Chevallereau

AI总结 本文提出了一种协调理论,用于设计反馈控制器实现欠驱动双足机器人稳定行走。通过引入虚拟约束和混合零动力学,同步关节相位变量并捕捉欠驱动特性。

Comments 17 pages, 4 figures, bookchapter

详情
AI中文摘要

欠驱动在人类运动中普遍存在,也应普遍存在于双足机器人运动中。本章提出了一种协调理论,用于设计反馈控制器实现欠驱动双足机器人稳定行走。引入了两个基本工具:虚拟约束和混合零动力学。虚拟约束是通过时间不变反馈控制器施加在机械模型状态变量上的关系。其作用之一是同步机器人关节到内部步态相位变量。另一个作用是诱导一个低维系统,即零动力学,捕捉机器人模型的欠驱动特性,而无需任何近似。为增强直观,首先建立了物理约束与虚拟约束之间的关系。从这里,开发了欠驱动双足模型的混合零动力学,并确立了其在设计渐近稳定的行走运动中的基本作用。本章包含大量参考了已实现这些技术的机器人。

英文摘要

Underactuation is ubiquitous in human locomotion and should be ubiquitous in bipedal robotic locomotion as well. This chapter presents a coherent theory for the design of feedback controllers that achieve stable walking gaits in underactuated bipedal robots. Two fundamental tools are introduced, virtual constraints and hybrid zero dynamics. Virtual constraints are relations on the state variables of a mechanical model that are imposed through a time-invariant feedback controller. One of their roles is to synchronize the robot's joints to an internal gait phasing variable. A second role is to induce a low dimensional system, the zero dynamics, that captures the underactuated aspects of a robot's model, without any approximations. To enhance intuition, the relation between physical constraints and virtual constraints is first established. From here, the hybrid zero dynamics of an underactuated bipedal model is developed, and its fundamental role in the design of asymptotically stable walking motions is established. The chapter includes numerous references to robots on which the highlighted techniques have been implemented.

1705.10432 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Fine-grained acceleration control for autonomous intersection management using deep reinforcement learning

基于深度强化学习的细粒度加速控制用于自动驾驶交叉口管理

Hamid Mirzaei, Tony Givargis

AI总结 本文利用信任区域策略优化方法,实现自动驾驶车辆在网格街道中的细粒度加速控制,以达成全局管理目标。

Comments Accepted in IEEE Smart World Congress 2017

详情
AI中文摘要

近年来,结合深度学习和强化学习的进展为设计新的控制代理提供了有前景的路径,这些代理能够学习复杂控制任务的最优策略。这些新方法解决了传统强化学习方法的主要限制,如定制特征工程和小动作/状态空间维度要求。在本文中,我们利用一种最先进的强化学习方法,即信任区域策略优化,来解决自动驾驶车辆的交叉口管理问题。我们展示了使用该方法可以对自动驾驶车辆进行网格街道计划中的细粒度加速控制,以实现全局设计目标。

英文摘要

Recent advances in combining deep learning and Reinforcement Learning have shown a promising path for designing new control agents that can learn optimal policies for challenging control tasks. These new methods address the main limitations of conventional Reinforcement Learning methods such as customized feature engineering and small action/state space dimension requirements. In this paper, we leverage one of the state-of-the-art Reinforcement Learning methods, known as Trust Region Policy Optimization, to tackle intersection management for autonomous vehicles. We show that using this method, we can perform fine-grained acceleration control of autonomous vehicles in a grid street plan to achieve a global design objective.

1705.08566 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Near-Optimal Separation Principle for Nonlinear Stochastic Systems Arising in Robotic Path Planning and Control

非线性随机系统在机器人路径规划与控制中的近最优分离原理

Mohammadhussein Rafieisakhaei, Suman Chakravorty, P. R. Kumar

AI总结 本文提出了一种针对机器人路径规划与控制中非线性随机系统的近最优分离方法,通过小噪声假设实现可计算的近最优控制设计,同时推导出具有高斯噪声的非线性随机系统的轨迹优化线性二次调节器设计。

Comments 7 pages, 4 Figures, Submitted to 56th IEEE Conference on Decision and Control (CDC), 2017

详情
AI中文摘要

我们考虑出现在移动机器人路径规划和控制中的非线性随机系统。与几乎所有非线性随机系统类似,最优求解问题是不可行的。我们提供了一种设计方法,该方法产生了一个可计算的设计,其性能在可量化地接近最优。我们展示了一个在小噪声假设下的

英文摘要

We consider nonlinear stochastic systems that arise in path planning and control of mobile robots. As is typical of almost all nonlinear stochastic systems, the optimally solving problem is intractable. We provide a design approach which yields a tractable design that is quantifiably near-optimal. We exhibit a "separation" principle under a small noise assumption consisting of the optimal open-loop design of nominal trajectory followed by an optimal feedback law to track this trajectory, which is different from the usual effort of separating estimation from control. As a corollary, we obtain a trajectory-optimized linear quadratic regulator design for stochastic nonlinear systems with Gaussian noise.

1705.05727 2026-06-04 cs.RO cs.SY eess.SY nlin.CD 版本更新

A General Scheme Implicit Force Control for a Flexible-Link Manipulator

一种灵活连杆机械臂的通用隐式力控方案

Cecilia Murrugarra, Osberth De Castro, Juan Carlos Grieco, Gerardo Fernandez

AI总结 本文提出了一种针对与柔顺环境交互的单连杆柔顺机械臂的隐式力控方案,基于机械臂的数学模型,考虑了柔性梁的动力学和重力作用,通过结构参数确定控制器参数,利用李雅普诺夫理论保证稳定性,采用内外闭环控制结构实现位置和力的跟踪控制及振动抑制。

Comments 16 pages, 14 figures

详情
AI中文摘要

本文提出了一种隐式力控方案,用于与柔顺环境交互的单连杆柔顺机械臂。控制器基于机械臂的数学模型,考虑了柔性梁的动力学和重力作用。控制器参数通过机械臂梁的结构参数确定。该控制器基于李雅普诺夫理论保证稳定性。控制器包含两个闭环:内环为具有重力和振动频率补偿的跟踪控制,外环为隐式力控。通过三种不同机械臂(长度和直径不同)和三种不同柔顺环境的仿真验证,结果表明控制器能够实现渐近跟踪和位置与力的调节,并在有限时间内抑制梁的振动。

英文摘要

In this paper we propose an implicit force control scheme for a one-link flexible manipulator that interact with a compliant environment. The controller was based in the mathematical model of the manipulator, considering the dynamics of the beam flexible and the gravitational force. With this method, the controller parameters are obtained from the structural parameters of the beam (link) of the manipulator. This controller ensure the stability based in the Lyapunov Theory. The controller proposed has two closed loops: the inner loop is a tracking control with gravitational force and vibration frequencies compensation and the outer loop is a implicit force control. To evaluate the performance of the controller, we have considered to three different manipulators (the length, the diameter were modified) and three environments with compliance modified. The results obtained from simulations verify the asymptotic tracking and regulated in position and force respectively and the vibrations suppression of the beam in a finite time.

1705.05344 2026-06-04 cs.RO cs.SY eess.SY 版本更新

GP-ILQG: Data-driven Robust Optimal Control for Uncertain Nonlinear Dynamical Systems

GP-ILQG:基于数据的鲁棒最优控制用于不确定非线性动力学系统

Gilwoo Lee, Siddhartha S. Srinivasa, Matthew T. Mason

AI总结 本文提出GP-ILQG算法,结合数据驱动的系统辨识方法与基于微分动态规划的鲁棒最优控制方法,有效解决现实与仿真之间的差距问题,实现快速修正模型并提升控制鲁棒性。

详情
AI中文摘要

随着复杂系统控制需求的增加,基于模型的强化学习中使用仿真器变得更为常见。然而,克服现实差距(源于非线性模型偏差和对扰动的易感性)具有挑战性。为此,我们提出一种新的算法,将数据驱动的系统辨识方法(高斯过程)与基于微分动态规划的鲁棒最优控制方法(迭代线性二次控制)相结合。我们的算法将仿真器的模型作为高斯过程的均值函数,并仅学习仿真器预测与实际观测之间的差异,使其成为仿真与现实观测的自然混合。我们证明了该方法能够快速修正错误模型,生成鲁棒最优控制器,并高效地将所学的模型知识转移到新任务中。

英文摘要

As we aim to control complex systems, use of a simulator in model-based reinforcement learning is becoming more common. However, it has been challenging to overcome the Reality Gap, which comes from nonlinear model bias and susceptibility to disturbance. To address these problems, we propose a novel algorithm that combines data-driven system identification approach (Gaussian Process) with a Differential-Dynamic-Programming-based robust optimal control method (Iterative Linear Quadratic Control). Our algorithm uses the simulator's model as the mean function for a Gaussian Process and learns only the difference between the simulator's prediction and actual observations, making it a natural hybrid of simulation and real-world observation. We show that our approach quickly corrects incorrect models, comes up with robust optimal controllers, and transfers its acquired model knowledge to new tasks efficiently.

1705.05116 2026-06-04 cs.RO cs.AI cs.CV cs.LG cs.SY eess.SY 版本更新

Tuning Modular Networks with Weighted Losses for Hand-Eye Coordination

通过加权损失调节模块网络以提升手眼协调

Fangyi Zhang, Jürgen Leitner, Michael Milford, Peter I. Corke

AI总结 本文提出端到端微调方法,通过加权损失提升模块化深度视觉-运动策略在平面抓取任务中的手眼协调性能。

Comments 2 pages, to appear in the Deep Learning for Robotic Vision (DLRV) Workshop in CVPR 2017

详情
AI中文摘要

本文介绍了一种端到端微调方法,用于改进模块化深度视觉-运动策略(模块网络)中的手眼协调能力,其中每个模块独立训练。得益于加权损失,该微调方法显著提升了策略在机器人平面抓取任务中的性能。

英文摘要

This paper introduces an end-to-end fine-tuning method to improve hand-eye coordination in modular deep visuo-motor policies (modular networks) where each module is trained independently. Benefiting from weighted losses, the fine-tuning method significantly improves the performance of the policies for a robotic planar reaching task.

1705.04763 2026-06-04 cs.RO cs.SY eess.SY 版本更新

High-Precision Trajectory Tracking in Changing Environments Through $\mathcal{L}_1$ Adaptive Feedback and Iterative Learning

通过L1自适应反馈和迭代学习实现高精度轨迹跟踪

Karime Pereida, Rikky R. P. R. Duivenvoorden, Angela P. Schoellig

AI总结 本文提出结合L1自适应反馈与迭代学习控制的框架,提升系统在未知动态扰动下的轨迹跟踪性能,通过实验验证其优于纯ILC方法的鲁棒性和泛化能力。

Comments 7 pages, 5 figures, Proc. of the 2017 IEEE International Conference on Robotics and Automation

详情
AI中文摘要

随着机器人等自动化系统被引入未知和动态环境,需要鲁棒且自适应的控制策略以应对扰动、未建模动态和参数不确定性。本文提出并证明了一种结合L1自适应反馈和迭代学习控制(ILC)的框架,以提高受未知和变化扰动影响的系统轨迹跟踪性能。L1自适应控制器使系统在存在未知和变化扰动的情况下表现出可重复的预定义行为,但并不意味着实现完美轨迹跟踪。ILC通过以往执行的经验改进跟踪性能。ILC的性能受限于底层系统的鲁棒性和可重复性,而在本方法中,这由L1自适应控制器处理。特别地,我们能够将学习的轨迹泛化到不同的系统配置,因为L1自适应控制器处理了系统底层的变化。我们通过四旋翼在未知动态扰动下的实验,展示了结合方法相比纯ILC在轨迹跟踪性能和泛化能力上的改进。这是首次在实验中展示L1自适应控制与ILC结合的工作。

英文摘要

As robots and other automated systems are introduced to unknown and dynamic environments, robust and adaptive control strategies are required to cope with disturbances, unmodeled dynamics and parametric uncertainties. In this paper, we propose and provide theoretical proofs of a combined $\mathcal{L}_1$ adaptive feedback and iterative learning control (ILC) framework to improve trajectory tracking of a system subject to unknown and changing disturbances. The $\mathcal{L}_1$ adaptive controller forces the system to behave in a repeatable, predefined way, even in the presence of unknown and changing disturbances; however, this does not imply that perfect trajectory tracking is achieved. ILC improves the tracking performance based on experience from previous executions. The performance of ILC is limited by the robustness and repeatability of the underlying system, which, in this approach, is handled by the $\mathcal{L}_1$ adaptive controller. In particular, we are able to generalize learned trajectories across different system configurations because the $\mathcal{L}_1$ adaptive controller handles the underlying changes in the system. We demonstrate the improved trajectory tracking performance and generalization capabilities of the combined method compared to pure ILC in experiments with a quadrotor subject to unknown, dynamic disturbances. This is the first work to show $\mathcal{L}_1$ adaptive control combined with ILC in experiment.

1702.02453 2026-06-04 cs.LG cs.RO cs.SY eess.SY 版本更新

Preparing for the Unknown: Learning a Universal Policy with Online System Identification

为未知做准备:学习通用策略与在线系统识别

Wenhao Yu, Jie Tan, C. Karen Liu, Greg Turk

AI总结 本文提出了一种学习通用策略的方法,通过在线系统识别和大量训练示例,使策略在未知动态模型下具备鲁棒性,适用于多种动态模型和环境变化。

Comments Accepted as a conference paper at RSS 2017

详情
AI中文摘要

我们提出了一种学习控制策略的新方法,该方法能够在未知动态模型下有效运行。我们通过利用大量由物理模拟器生成的训练示例来创建此类策略。系统由两个组件组成:通用策略(UP)和在线系统识别(OSI)函数。我们描述我们的控制策略为通用,因为它是在广泛动态模型上训练的。这些动态模型的变化可能包括机器人组件的质量和惯性差异、摩擦系数变化或未知被操作物体的质量。通过在这些变化上训练通用策略,控制策略在未知环境中准备了更广泛的可能条件。系统第二部分利用系统的近期状态和动作历史来预测动态模型参数mu。在线系统识别的mu值然后作为输入提供给控制策略(连同系统状态)。UP-OSI是一种在广泛动态模型上适用且对环境突然变化具有响应性的稳健控制策略。我们评估了该系统在多种任务上的性能,包括cart-pole翻转问题、双倒立摆、跳蛙器的运动和机械臂的块投掷任务。UP-OSI在各种动态模型上均有效。此外,当测试动态模型超出训练范围时,UP-OSI在UP单独的情况下表现更优,即使UP被给予实际的动态模型值。除了创建更稳健的控制器的好处外,UP-OSI还具有缩小模拟与真实物理系统现实差距的潜力。

英文摘要

We present a new method of learning control policies that successfully operate under unknown dynamic models. We create such policies by leveraging a large number of training examples that are generated using a physical simulator. Our system is made of two components: a Universal Policy (UP) and a function for Online System Identification (OSI). We describe our control policy as universal because it is trained over a wide array of dynamic models. These variations in the dynamic model may include differences in mass and inertia of the robots' components, variable friction coefficients, or unknown mass of an object to be manipulated. By training the Universal Policy with this variation, the control policy is prepared for a wider array of possible conditions when executed in an unknown environment. The second part of our system uses the recent state and action history of the system to predict the dynamics model parameters mu. The value of mu from the Online System Identification is then provided as input to the control policy (along with the system state). Together, UP-OSI is a robust control policy that can be used across a wide range of dynamic models, and that is also responsive to sudden changes in the environment. We have evaluated the performance of this system on a variety of tasks, including the problem of cart-pole swing-up, the double inverted pendulum, locomotion of a hopper, and block-throwing of a manipulator. UP-OSI is effective at these tasks across a wide range of dynamic models. Moreover, when tested with dynamic models outside of the training range, UP-OSI outperforms the Universal Policy alone, even when UP is given the actual value of the model dynamics. In addition to the benefits of creating more robust controllers, UP-OSI also holds out promise of narrowing the Reality Gap between simulated and real physical systems.

1704.04722 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Bounded Distributed Flocking Control of Nonholonomic Mobile Robots

非holonomic移动机器人有界分布式编队控制

Thang Nguyen, Hung La, Vahid Azimi, Thanh-Trung Han

AI总结 本文提出基于有界反馈的非holonomic移动机器人分布式编队控制方法,解决动态特性带来的挑战,实现速度一致性、避障和聚拢维持。

详情
AI中文摘要

针对多智能体系统中简化模型的编队控制问题,本文考虑实际约束,提出基于邻近机器人信息的编队控制协议。通过Lyapunov-like函数和图论进行理论分析,仿真结果验证了所提分布式编队控制方案的有效性。

英文摘要

There have been numerous studies on the problem of flocking control for multiagent systems whose simplified models are presented in terms of point-mass elements. Meanwhile, full dynamic models pose some challenging problems in addressing the flocking control problem of mobile robots due to their nonholonomic dynamic properties. Taking practical constraints into consideration, we propose a novel approach to distributed flocking control of nonholonomic mobile robots by bounded feedback. The flocking control objectives consist of velocity consensus, collision avoidance, and cohesion maintenance among mobile robots. A flocking control protocol which is based on the information of neighbor mobile robots is constructed. The theoretical analysis is conducted with the help of a Lyapunov-like function and graph theory. Simulation results are shown to demonstrate the efficacy of the proposed distributed flocking control scheme.

1705.01332 2026-06-04 eess.SY cs.RO cs.SY 版本更新

LiDAR-based Control of Autonomous Rotorcraft for the Inspection of Pier-like Structures: Proofs

基于LiDAR的自主旋翼飞行器对类似桥体结构的检测控制:证明

Bruno J. Guerreiro, Carlos Silvestre, Rita Cunha, David Cabecinhas

AI总结 本文提出基于LiDAR的自主旋翼飞行器轨迹跟踪控制方法,通过定义基于LiDAR测量的运动学模型,结合LPV表示和H2控制理论,实现对桥体结构的高效检测。

Comments [1] B. J. Guerreiro, C. Silvestre, R. Cunha, and D. Cabecinhas, Lidar-based control of autonomous rotorcraft for the inspection of pier-like structures, IEEE Transactions in Control Systems Technology, 2017. (to appear)

详情
AI中文摘要

本文是[1]论文的补充文档,旨在提供部分结果的更详细证明。主要论文解决了在仅能通过LiDAR传感器获取相对位置测量的情况下,自主旋翼飞行器的轨迹跟踪控制问题。所提出的方法定义了一个基于LiDAR测量的替代运动学模型,并使用依赖轨迹的误差空间来表达飞行器的动态模型。采用具有分段仿射参数依赖性的LPV表示来描述在预定义操作区域内的误差动态,并利用LMIs解决连续时间H2控制问题,该问题在增益调度控制理论的框架内得以实现。

英文摘要

This is a complementary document to the paper presented in [1], to provide more detailed proofs for some results. The main paper addresses the problem of trajectory tracking control of autonomous rotorcraft in operation scenarios where only relative position measurements obtained from LiDAR sensors are possible. The proposed approach defines an alternative kinematic model, directly based on LiDAR measurements, and uses a trajectory-dependent error space to express the dynamic model of the vehicle. An LPV representation with piecewise affine dependence on the parameters is adopted to describe the error dynamics over a set of predefined operating regions, and a continuous-time $H_2$ control problem is solved using LMIs and implemented within the scope of gain-scheduling control theory.

1603.06716 2026-06-04 eess.SY cs.LO cs.RO cs.SY 版本更新

Risk-Averse $ω$-regular Markov Decision Process Control

风险规避的ω-正则马尔可夫决策过程控制

Ruediger Ehlers, Salar Moarref, Ufuk Topcu

AI总结 本文提出一种新的优化准则,用于处理无限时间 horizon 的ω-正则规范满足问题,通过风险规避策略在机器人控制中验证其有效性。

详情
AI中文摘要

许多在马尔可夫决策过程(MDPs)建模的环境中出现的控制问题涉及无限时间 horizon 规范。在这一背景下,经典目标是计算一个最大化满足规范概率的控制策略。然而,在许多情况下,系统执行的每一步都有非零失败概率。对于无限时间 horizon 规范,这导致无论选择何种策略,规范在长期内都会被违反,这使得以往的策略计算方法在这些场景中不再适用。本文引入了一种新的优化准则,用于MDPs中无法避免长期规范违反的情况。我们给出了计算在该准则下最优策略的算法,并展示了该准则捕捉了MDP控制中的乐观和风险规避思想:虽然计算出的策略在失败状态进入较晚,但它们通过最大化到达各自下一个目标状态的概率表现出风险规避特性。我们通过两个机器人控制场景的结果验证了风险规避MDP策略的实用性。

英文摘要

Many control problems in environments that can be modeled as Markov decision processes (MDPs) concern infinite-time horizon specifications. The classical aim in this context is to compute a control policy that maximizes the probability of satisfying the specification. In many scenarios, there is however a non-zero probability of failure in every step of the system's execution. For infinite-time horizon specifications, this implies that the specification is violated with probability 1 in the long run no matter what policy is chosen, which prevents previous policy computation methods from being useful in these scenarios. In this paper, we introduce a new optimization criterion for MDP policies that captures the task of working towards the satisfaction of some infinite-time horizon $ω$-regular specification. The new criterion is applicable to MDPs in which the violation of the specification cannot be avoided in the long run. We give an algorithm to compute policies that are optimal in this criterion and show that it captures the ideas of optimism and risk-averseness in MDP control: while the computed policies are optimistic in that a MDP run enters a failure state relatively late, they are risk-averse by always maximizing the probability to reach their respective next goal state. We give results on two robot control scenarios to validate the usability of risk-averse MDP policies.

1705.00091 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Safe Trajectory Synthesis for Autonomous Driving in Unforeseen Environments

自主驾驶在不可预见环境中的安全轨迹合成

Shreyas Kousik, Sean Vaskov, Matthew Johnson-Roberson, Ramanarayan Vasudevan

AI总结 本文提出一种基于低保真模型的轨迹规划方法,通过保守的可达集和障碍物交集确保安全,同时证明了时间下界以保证安全性。

Comments Submitted to DSCC 2017

详情
AI中文摘要

自动驾驶在任意环境中的路径规划需要安全性的保证,但当车辆用高保真模型描述时,在实时情况下确保安全性可能不切实际。为了解决这个问题,本文提出了一种基于低保真模型的轨迹规划方法,通过计算保守的可达集(FRS)来确保安全。该方法首先计算在有限时间范围内跟踪低保真模型轨迹的高保真模型轨迹的保守可达集。在运行时,车辆将此FRS与环境中的障碍物进行交集运算,以消除可能导致碰撞的轨迹,然后从剩余的安全集合中选择最优计划。通过对集合交集和后续路径选择的时间进行界限,本文证明了FRS时间范围和感知时间范围的下界以保证安全。该方法通过动态双轮车作为低保真模型和动态三轮车作为高保真模型在仿真中进行了演示。

英文摘要

Path planning for autonomous vehicles in arbitrary environments requires a guarantee of safety, but this can be impractical to ensure in real-time when the vehicle is described with a high-fidelity model. To address this problem, this paper develops a method to perform trajectory design by considering a low-fidelity model that accounts for model mismatch. The presented method begins by computing a conservative Forward Reachable Set (FRS) of a high-fidelity model's trajectories produced when tracking trajectories of a low-fidelity model over a finite time horizon. At runtime, the vehicle intersects this FRS with obstacles in the environment to eliminate trajectories that can lead to a collision, then selects an optimal plan from the remaining safe set. By bounding the time for this set intersection and subsequent path selection, this paper proves a lower bound for the FRS time horizon and sensing horizon to guarantee safety. This method is demonstrated in simulation using a kinematic Dubin's car as the low-fidelity model and a dynamic unicycle as the high-fidelity model.

1601.04037 2026-06-04 cs.RO cs.AI cs.SY eess.SY math.DS math.OC 版本更新

Funnel Libraries for Real-Time Robust Feedback Motion Planning

用于实时鲁棒反馈运动规划的 funnel 库

Anirudha Majumdar, Russ Tedrake

AI总结 本文提出利用预计算的 funnel 库实现实时鲁棒反馈运动规划,通过凸优化计算 funnel 并在运行时安全组合运动计划,验证了在复杂环境中高动态机器人系统鲁棒性和安全性。

Comments International Journal of Robotics Research (To Appear)

详情
AI中文摘要

我们考虑了在存在环境不确定性、参数模型不确定性和扰动时,生成保证成功的机器人运动计划的问题。此外,我们还考虑了必须在实时中生成这些计划的场景,因为环境中的约束(如障碍物)可能在运行时通过有噪声的传感器感知到。我们的方法是预先计算不同系统操作的“funnels”库,这些 funnels 确保在执行对应操作的反馈控制器时,状态在扰动范围内保持。我们利用凸优化(特别是求和平方编程)的强大计算能力来计算这些 funnels。所得到的 funnel 库然后在运行时被顺序组合以生成运动计划,同时确保机器人的安全性。本文的一个主要优势是通过显式考虑不确定性的影响,机器人可以根据运动计划对扰动的脆弱性来评估。我们通过大量硬件实验(在高速(约12英里/小时)下避障的小型固定翼飞机)和彻底的仿真实验(地面车辆和四旋翼模型在复杂环境中导航)来演示和验证我们的方法。据我们所知,这些演示构成了首次证明安全且鲁棒的控制方法,用于具有复杂非线性动力学的机器人系统,在具有复杂几何约束的环境中实时规划。

英文摘要

We consider the problem of generating motion plans for a robot that are guaranteed to succeed despite uncertainty in the environment, parametric model uncertainty, and disturbances. Furthermore, we consider scenarios where these plans must be generated in real-time, because constraints such as obstacles in the environment may not be known until they are perceived (with a noisy sensor) at runtime. Our approach is to pre-compute a library of "funnels" along different maneuvers of the system that the state is guaranteed to remain within (despite bounded disturbances) when the feedback controller corresponding to the maneuver is executed. We leverage powerful computational machinery from convex optimization (sums-of-squares programming in particular) to compute these funnels. The resulting funnel library is then used to sequentially compose motion plans at runtime while ensuring the safety of the robot. A major advantage of the work presented here is that by explicitly taking into account the effect of uncertainty, the robot can evaluate motion plans based on how vulnerable they are to disturbances. We demonstrate and validate our method using extensive hardware experiments on a small fixed-wing airplane avoiding obstacles at high speed (~12 mph), along with thorough simulation experiments of ground vehicle and quadrotor models navigating through cluttered environments. To our knowledge, these demonstrations constitute one of the first examples of provably safe and robust control for robotic systems with complex nonlinear dynamics that need to plan in real-time in environments with complex geometric constraints.

1704.08695 2026-06-04 eess.SY cs.RO cs.SY 版本更新

AWEsome: An open-source test platform for airborne wind energy systems

AWEsome:一种用于空载风能系统的开源测试平台

Philip Bechtle, Thomas Gehrmann, Christoph Sieg, Udo Zillmann

AI总结 本文介绍了一种低成本、基于开源软件的空载风能系统测试平台AWEsome,可为研究团队和初创企业提供无需大额投资的飞行测试和控制策略验证机会。

Comments pdflatex, 25 pages, 9 figures

详情
AI中文摘要

本文介绍了AWEsome(Airborne Wind Energy Standardized Open-source Model Environment),一种用于空载风能系统的测试平台,由低成本硬件和完全基于开源软件组成。因此,无需大额资金投入,特别适用于研究团队和初创企业获取飞行操作经验、测试新型控制策略或技术设计,或用于公共关系。我们的系统由经过修改的现成模型飞机组成,该飞机由pixhawk自动驾驶硬件和ardupilot软件控制,用于固定翼飞机。飞机通过绳索连接到地面。我们实现了新的飞行模式,用于自主沿周期性图案飞行。我们展示了我们算法的主要功能。我们报告了这些模式在真实飞行中的首次成功测试。

英文摘要

In this paper we present AWEsome (Airborne Wind Energy Standardized Open-source Model Environment), a test platform for airborne wind energy systems that consists of low-cost hardware and is entirely based on open-source software. It can hence be used without the need of large financial investments, in particular by research groups and startups to acquire first experiences in their flight operations, to test novel control strategies or technical designs, or for usage in public relations. Our system consists of a modified off-the-shelf model aircraft that is controlled by the pixhawk autopilot hardware and the ardupilot software for fixed wing aircraft. The aircraft is attached to the ground by a tether. We have implemented new flight modes for the autonomous tethered flight of the aircraft along periodic patterns. We present the principal functionality of our algorithms. We report on first successful tests of these modes in real flights.

1704.03103 2026-06-04 cs.RO cs.AI cs.CG cs.SY eess.SY 版本更新

Minkowski Operations of Sets with Application to Robot Localization

Minkowski运算与机器人定位的应用

Benoit Desrochers, Luc Jaulin

AI总结 本文通过引入Minkowski和与差的分离器,高效解决机器人在非结构化环境中基于声呐测量的定位问题,并通过测试案例验证了方法的有效性。

Comments In Proceedings SNR 2017, arXiv:1704.02421

详情
Journal ref
EPTCS 247, 2017, pp. 34-45
AI中文摘要

本文展示使用分离器(由两个互补约束器组成)可以高效解决机器人在非结构化环境中基于声呐测量的定位问题。我们引入与Minkowski和与差相关的分离器以促进问题解决。通过测试案例说明了该方法的原理。

英文摘要

This papers shows that using separators, which is a pair of two complementary contractors, we can easily and efficiently solve the localization problem of a robot with sonar measurements in an unstructured environment. We introduce separators associated with the Minkowski sum and the Minkowski difference in order to facilitate the resolution. A test-case is given in order to illustrate the principle of the approach.

1704.02075 2026-06-04 cs.RO cs.SY eess.SY 版本更新

On Sensing, Agility, and Computation Requirements for a Data-gathering Agile Robotic Vehicle

关于数据收集敏捷机器人车辆的感知、敏捷性和计算需求

Fangchang Ma, Sertac Karaman

AI总结 本文研究了机器人车辆在未知数据源位置下收集信息的性能,分析了感知、敏捷性和计算能力对信息获取的影响,提出了理论结果和实验验证。

Comments 22 pages, 11 figures

详情
AI中文摘要

我们考虑了一种任务为通过访问一组空间分布的数据源来收集信息的机器人车辆,这些数据源的位置在事先未知,但可以在飞行中发现。我们假设涉及漂移的一阶机器人动力学,并假设数据源的位置是泊松分布的。在此设定下,我们以感知、敏捷性和计算能力来表征机器人的性能。更具体地说,机器人性能的表征包括其从远处感知目标位置的能力、快速移动的能力以及进行推理和规划的计算能力。我们还以每个数据源可以获取的信息量和分布来表征机器人的性能。我们的理论结果包括:目标位置的信息量分布对从远处感知目标的需求影响巨大;性能随着移动能力的增加而增加,但回报递减;并且计算需求随着规划的增加而比推理更快增加,同时随着感知范围和移动能力的增加。我们提供了计算实验来验证我们的理论结果。最后,我们展示了这些结果可以用于移动机器人系统的感知、驱动和计算能力的协同设计,以执行信息收集任务。我们的证明技术建立了机器人信息收集基本问题与统计力学中的最后通路渗透问题之间的新联系,这可能本身就有价值。

英文摘要

We consider a robotic vehicle tasked with gathering information by visiting a set of spatially-distributed data sources, the locations of which are not known a priori, but are discovered on the fly. We assume a first-order robot dynamics involving drift and that the locations of the data sources are Poisson-distributed. In this setting, we characterize the performance of the robot in terms of its sensing, agility, and computation capabilities. More specifically, the robot's performance is characterized in terms of its ability to sense the target locations from a distance, to maneuver quickly, and to perform computations for inference and planning. We also characterize the performance of the robot in terms of the amount and distribution of information that can be acquired at each data source. The following are among our theoretical results: the distribution of the amount of information among the target locations immensely impacts the requirements for sensing targets from a distance; performance increases with increasing maneuvering capability, but with diminishing returns; and the computation requirements increase more rapidly for planning as opposed to inference, with both increasing sensing range and maneuvering ability. We provide computational experiments to validate our theoretical results. Finally, we demonstrate that these results can be utilized in the co-design of sensing, actuation, and computation capabilities of mobile robotic systems for an information-gathering mission. Our proof techniques establish novel connections between the fundamental problems of robotic information-gathering and the last-passage percolation problem of statistical mechanics, which may be of interest on its own right.

1307.0276 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Controllability Analysis and Degraded Control for a Class of Hexacopters Subject to Rotor Failures

六旋翼无人机在旋翼失效情况下的可控性分析与退化控制

Guang-Xun Du, Quan Quan, Kai-Yuan Cai

AI总结 本文分析了六旋翼无人机在旋翼失效时的可控性问题,提出了一种退化控制策略,证明当旋翼最大升力超过一定值时系统可控。

Comments 21 pages, 7 figures, submitted to Journal of Intelligent & Robotic Systems

详情
Journal ref
Journal of Intelligent & Robotic Systems, 2015, 78(1): 143-157
AI中文摘要

本文考虑了某一类六旋翼无人机的可控性分析及故障容错控制问题。证明当一个旋翼失效时,该六旋翼无人机不可控,尽管其过量驱动且可控性矩阵行满秩。根据此,提出了一种控制退化系统的故障容错控制策略,其中忽略六旋翼的偏航状态。理论分析表明,退化系统可控当且仅当每个旋翼的最大升力大于某一值。对原型六旋翼无人机的仿真和实验结果展示了本文可控性分析和退化控制策略的可行性。

英文摘要

This paper considers the controllability analysis and fault tolerant control problem for a class of hexacopters. It is shown that the considered hexacopter is uncontrollable when one rotor fails, even though the hexacopter is over-actuated and its controllability matrix is row full rank. According to this, a fault tolerant control strategy is proposed to control a degraded system, where the yaw states of the considered hexacopter are ignored. Theoretical analysis indicates that the degraded system is controllable if and only if the maximum lift of each rotor is greater than a certain value. The simulation and experiment results on a prototype hexacopter show the feasibility of our controllability analysis and degraded control strategy.

1704.00888 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Discrete-Time Attitude Observer on SO(3) for Vision and GPS Fusion

基于SO(3)的离散时间姿态观测器用于视觉与GPS融合

Alireza Khosravian, Tat-Jun Chin, Ian Reid, Robert Mahony

AI总结 本文提出一种离散时间几何姿态观测器,融合单目视觉与GPS速度测量,通过减轻视觉里程计姿态估计的固有漂移并直接估计北东下坐标系下的姿态,提供收敛性分析和收敛率下界。

Comments To appear in IEEE Conference on Robotics and Automation 2017

详情
Journal ref
Alireza Khosravian, Tat-Jun Chin, Ian Reid, Robert Mahony "A Discrete-Time Attitude Observer on SO(3) for Vision and GPS Fusion", in Proc. IEEE Conf. Robotics and Automation, 2017
AI中文摘要

本文提出了一种离散时间几何姿态观测器,用于融合单目视觉与GPS速度测量。该观测器利用视觉里程计处理单目图像得到的相对变换,并将其与GPS速度测量融合。该传感器融合的目标是减轻视觉里程计姿态估计的固有漂移,并直接估计北东下坐标系下的姿态。本文的主要贡献是提供严谨的稳定性分析,证明观测器的姿态估计以指数形式收敛到真实姿态,并提供观测器收敛率的下界。通过实验研究,我们证明该观测器能有效补偿纯单目视觉姿态估计的固有漂移,并且即使初始化时具有非常大的姿态误差,也能恢复北东下坐标系下的姿态。

英文摘要

This paper proposes a discrete-time geometric attitude observer for fusing monocular vision with GPS velocity measurements. The observer takes the relative transformations obtained from processing monocular images with any visual odometry algorithm and fuses them with GPS velocity measurements. The objectives of this sensor fusion are twofold; first to mitigate the inherent drift of the attitude estimates of the visual odometry, and second, to estimate the orientation directly with respect to the North-East-Down frame. A key contribution of the paper is to present a rigorous stability analysis showing that the attitude estimates of the observer converge exponentially to the true attitude and to provide a lower bound for the convergence rate of the observer. Through experimental studies, we demonstrate that the observer effectively compensates for the inherent drift of the pure monocular vision based attitude estimation and is able to recover the North-East-Down orientation even if it is initialized with a very large attitude error.

1704.00534 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Controlling a triangular flexible formation of autonomous agents

控制自主体的三角形柔性编队

Hector Garcia de Marina, Zhiyong Sun, Ming Cao, Brian D. O. Anderson

AI总结 本文提出结合位置和距离梯度下降控制的算法,通过添加人工偏置使自主体形成共线配置,并引入旋转矩阵控制邻近自主体的方位角。

Comments 7 pages, accepted in the 20th World Congress of the International Federation of Automatic Control (IFAC)

详情
AI中文摘要

在编队控制中,由三个自主体构成的三角形编队作为基准,可用于测试和比较不同控制器的性能。我们提出一种结合位置和距离梯度下降控制优势的算法。例如,仅需控制两对邻近自主体,自主体可在自身局部坐标系中工作,且编队相对于全局坐标系的方位不预设。我们首先提出一种基于向邻近自主体的范围传感器添加人工偏置的新技术,使得最终位置对应共线配置。随后,通过在偏置项中引入预设旋转矩阵,可控制邻近自主体的方位角。

英文摘要

In formation control, triangular formations consisting of three autonomous agents serve as a class of benchmarks that can be used to test and compare the performances of different controllers. We present an algorithm that combines the advantages of both position- and distance-based gradient descent control laws. For example, only two pairs of neighboring agents need to be controlled, agents can work in their own local frame of coordinates and the orientation of the formation with respect to a global frame of coordinates is not prescribed. We first present a novel technique based on adding artificial biases to neighboring agents' range sensors such that their eventual positions correspond to a collinear configuration. Right after, a small modification in the bias terms by introducing a prescribed rotation matrix will allow the control of the bearing of the neighboring agents.

1703.08557 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Towards a Functional System Architecture for Automated Vehicles

迈向自动驾驶车辆的功能系统架构

Simon Ulbrich, Andreas Reschka, Jens Rieken, Susanne Ernst, Gerrit Bagschik, Frank Dierkes, Marcus Nolte, Markus Maurer

AI总结 本文提出了一种通用功能架构,用于自动驾驶车辆,该架构独立于具体实现,并基于Stadtpilot项目中的真实世界实现进行验证,涵盖环境感知、规划控制、定位、地图提供及人机交互等方面。

Comments Submitted for review to IEEE Transactions on Intelligent Transportation Systems, 16 pages, 4 figures

详情
AI中文摘要

本文提出了一种自动驾驶车辆的功能系统架构。该架构提供了一个总体的、通用的结构,独立于特定车辆项目的具体实现。然而,它受到并经过Stadtpilot项目在德累斯顿工业大学中的真实世界自动驾驶实现的启发和交叉验证。该架构涵盖环境和自我感知、规划和控制、定位、地图提供、车辆至X通信以及与人类操作员的交互等方面。

英文摘要

This paper presents a functional system architecture for an automated vehicle. It provides an overall, generic structure that is independent of a specific implementation of a particular vehicle project. Yet, it has been inspired and cross-checked with a real world automated driving implementation in the Stadtpilot project at the Technische Universität Braunschweig. The architecture entails aspects like environment and self perception, planning and control, localization, map provision, Vehicle-To-X-communication, and interaction with human operators.

1703.08515 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Mean-Field Controllability and Decentralized Stabilization of Markov Chains, Part II: Asymptotic Controllability and Polynomial Feedbacks

均场可控性与马尔可夫链的去中心化稳定化,第二部分:渐近可控性与多项式反馈

Shiba Biswal, Karthik Elamvazhuthi, Spring Berman

AI总结 本文研究了有限状态空间中群体代理连续时间马尔可夫链的均场反馈稳定化方法,证明了强连通支持概率分布可被时间不变输入稳定,并展示了渐近可控性及去中心化密度反馈律的存在性。

Comments Submitted to IEEE Conference on Decision and Control, 2017

详情
AI中文摘要

本文是两部分系列论文的第二部分,提出了一种方法用于有限状态空间中群体代理连续时间马尔可夫链(CTMC)的均场反馈稳定化。所得到的(均场)控制问题为控制具有期望全局稳定性性质的非线性系统。首先证明任何具有强连通支持的概率分布均可通过时间不变输入稳定。其次展示所有可能概率分布的渐近可控性,包括分配零密度到某些状态且不一定具有强连通支持的分布。最后证明当图是强连通且双向时,总存在一个全局渐近稳定的去中心化密度反馈律,其控制输入在平衡时为零。然后将闭环多项式反馈的合成问题框架为使用最新求和平方优化工具的优化问题。该优化问题寻找多项式反馈律,使候选李雅普诺夫函数成为闭环系统稳定性证书。本文方法在五顶点图的两个案例中进行了测试,并通过相应常微分方程组的数值模拟验证了所构造控制律的稳定性特性。

英文摘要

This paper, the second of a two-part series, presents a method for mean-field feedback stabilization of a swarm of agents on a finite state space whose time evolution is modeled as a continuous time Markov chain (CTMC). The resulting (mean-field) control problem is that of controlling a nonlinear system with desired global stability properties. We first prove that any probability distribution with a strongly connected support can be stabilized using time-invariant inputs. Secondly, we show the asymptotic controllability of all possible probability distributions, including distributions that assign zero density to some states and which do not necessarily have a strongly connected support. Lastly, we demonstrate that there always exists a globally asymptotically stabilizing decentralized density feedback law with the additional property that the control inputs are zero at equilibrium, whenever the graph is strongly connected and bidirected. Then the problem of synthesizing closed-loop polynomial feedback is framed as a optimization problem using state-of-the-art sum-of-squares optimization tools. The optimization problem searches for polynomial feedback laws that make the candidate Lyapunov function a stability certificate for the resulting closed-loop system. Our methodology is tested for two cases on a five vertex graph, and the stabilization properties of the constructed control laws are validated with numerical simulations of the corresponding system of ordinary differential equations.

1703.08243 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Mean-Field Controllability and Decentralized Stabilization of Markov Chains, Part I: Global Controllability and Rational Feedbacks

马尔可夫链的均场可控性与去中心化稳定化,第一部分:全局可控性与理性反馈

Karthik Elamvazhuthi, Vaibhav Deshmukh, Matthias Kawski, Spring Berman

AI总结 本文研究了连续时间马尔可夫链的可控性和稳定化性质,通过转移率作为控制参数。证明了在强连通图下,小时间局部和全局可控性,以及存在去中心化线性反馈法稳定概率密度。

Comments Submitted to IEEE Conference on Decision and Control, 2017

详情
AI中文摘要

本文研究了连续时间马尔可夫链的可控性和稳定化性质,通过转移率作为控制参数。首先,我们证明了在强连通图下,从和到严格正的平衡配置的小时间局部和全局可控性。其次,我们证明了当允许转移率为负且目标密度位于概率密度集内部时,总是存在一个局部指数稳定的去中心化线性(密度)反馈律,该反馈律在平衡点处取零值并尊重图结构。对于双向图,即存在反向边的图,我们展示了这种线性控制律可通过形式为k(x) = a(x) + b(x)f(x)/g(x)的去中心化理性反馈律实现,该反馈律也尊重图结构和控制约束(正性和平衡点处零值)。这使得可以利用线性矩阵不等式(LMI)工具算法性地构造去中心化密度反馈控制器,以稳定机器人集群到目标任务分布,且在平衡点处无任务切换,如通过多个数值示例所展示的。

英文摘要

In this paper, we study the controllability and stabilizability properties of the Kolmogorov forward equation of a continuous time Markov chain (CTMC) evolving on a finite state space, using the transition rates as the control parameters. Firstly, we prove small-time local and global controllability from and to strictly positive equilibrium configurations when the underlying graph is strongly connected. Secondly, we show that there always exists a locally exponentially stabilizing decentralized linear (density-)feedback law that takes zero valu at equilibrium and respects the graph structure, provided that the transition rates are allowed to be negative and the desired target density lies in the interior of the set of probability densities. For bidirected graphs, that is, graphs where a directed edge in one direction implies an edge in the opposite direction, we show that this linear control law can be realized using a decentralized rational feedback law of the form k(x) = a(x) + b(x)f(x)/g(x) that also respects the graph structure and control constraints (positivity and zero at equilibrium). This enables the possibility of using Linear Matrix Inequality (LMI) based tools to algorithmically construct decentralized density feedback controllers for stabilization of a robotic swarm to a target task distribution with no task-switching at equilibrium, as we demonstrate with several numerical examples.

1702.00785 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Evaluation of Automated Vehicles Encountering Pedestrians at Unsignalized Crossings

自动驾驶车辆在无信号交叉口与行人互动的评估

Baiming Chen, Ding Zhao, Huei Peng

AI总结 本文提出了一种评估自动驾驶车辆在无信号交叉口与行人互动安全性和可行性的新方法,通过分析数据和模拟行人反应,验证了自动驾驶策略的有效性。

详情
AI中文摘要

车辆与行人之间的互动一直是交通安全的主要问题。经验丰富的驾驶员能够分析环境并选择有助于避免碰撞的驾驶策略。然而,尚不清楚自动驾驶车辆如何与行人互动。本文提出了一种新的方法,用于评估自动驾驶车辆在无信号交叉口的驾驶策略的安全性和可行性。在密歇根州安阿伯市的公交车上安装了MobilEye传感器,收集了2973个有效的交叉事件数据。然后使用多元高斯混合模型创建了一个随机交互模型。该模型允许我们模拟行人对迎面而来的车辆做出反应时的移动,并评估自动驾驶车辆的通过策略。然后进行了模拟以演示评估过程。

英文摘要

Interactions between vehicles and pedestrians have always been a major problem in traffic safety. Experienced human drivers are able to analyze the environment and choose driving strategies that will help them avoid crashes. What is not yet clear, however, is how automated vehicles will interact with pedestrians. This paper proposes a new method for evaluating the safety and feasibility of the driving strategy of automated vehicles when encountering unsignalized crossings. MobilEye sensors installed on buses in Ann Arbor, Michigan, collected data on 2,973 valid crossing events. A stochastic interaction model was then created using a multivariate Gaussian mixture model. This model allowed us to simulate the movements of pedestrians reacting to an oncoming vehicle when approaching unsignalized crossings, and to evaluate the passing strategies of automated vehicles. A simulation was then conducted to demonstrate the evaluation procedure.

1703.07736 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Circular formation control of fixed-wing UAVs with constant speeds

固定翼无人机恒速圆形成形控制

Hector Garcia de Marina, Zhiyong Sun, Murat Bronz, Gautier Hattenberger

AI总结 本文提出了一种稳定固定翼无人机恒速圆形成形的算法,通过跟踪不同半径的圆来控制车辆相对于目标圆周的相位,确保团队在特定区域内保持约束。

Comments 6 pages, submitted to IROS 2017

详情
AI中文摘要

本文提出了一种稳定固定翼无人机恒速圆形成形的算法。该算法基于跟踪不同半径的圆以控制车辆相对于目标圆周的相位。我们证明了所求的平衡状态是指数稳定的,并且由于引导向量场引导车辆,该算法可以扩展到其他闭合轨迹。该方法的主要优势是,即使在车辆间通信或感知丢失时,算法也能保证团队在特定区域内保持约束。我们通过实际的三架飞机编队飞行展示了该算法的有效性。该算法已准备好在开源的Paparazzi自动驾驶系统中供公众使用。

英文摘要

In this paper we propose an algorithm for stabilizing circular formations of fixed-wing UAVs with constant speeds. The algorithm is based on the idea of tracking circles with different radii in order to control the inter-vehicle phases with respect to a target circumference. We prove that the desired equilibrium is exponentially stable and thanks to the guidance vector field that guides the vehicles, the algorithm can be extended to other closed trajectories. One of the main advantages of this approach is that the algorithm guarantees the confinement of the team in a specific area, even when communications or sensing among vehicles are lost. We show the effectiveness of the algorithm with an actual formation flight of three aircraft. The algorithm is ready to use for the general public in the open-source Paparazzi autopilot.

1703.06416 2026-06-04 cs.MA cs.DC cs.RO cs.SY eess.SY 版本更新

A Passivity-Based Distributed Reference Governor for Constrained Robotic Networks

基于被动性的分布式参考管理者用于受约束的机器人网络

Tam Nguyen, Takeshi Hatanaka, Mamoru Doi, Emanuele Garone, Masayuki Fujita

AI总结 本文提出了一种基于被动性的分布式参考管理者,用于预稳定化的移动机器人网络,通过梯度下降法和对偶上升法解决参考管理者问题,并通过仿真实验验证了其有效性。

Comments 8 pages, International Federation of Automatic Conference 2017, 8 figures

详情
AI中文摘要

本文聚焦于应用于预稳定化移动机器人网络的基于被动性的分布式参考管理者(RG)。本文的创新点在于解决RG问题的方法,提出了一种基于被动性的分布式优化方案。具体而言,梯度下降法用于最小化全局目标函数,而对偶上升法用于最大化哈密顿量。为使代理收敛到约定的最优解,使用了比例-积分一致性估计器。本文通过被动性论证证明了RG状态估计收敛到最优解,考虑了物理系统的静态特性。然后,通过仿真实验和测试验证了考虑物理系统动态特性的方案的有效性。

英文摘要

This paper focuses on a passivity-based distributed reference governor (RG) applied to a pre-stabilized mobile robotic network. The novelty of this paper lies in the method used to solve the RG problem, where a passivity-based distributed optimization scheme is proposed. In particular, the gradient descent method minimizes the global objective function while the dual ascent method maximizes the Hamiltonian. To make the agents converge to the agreed optimal solution, a proportional-integral consensus estimator is used. This paper proves the convergence of the state estimates of the RG to the optimal solution through passivity arguments, considering the physical system static. Then, the effectiveness of the scheme considering the dynamics of the physical system is demonstrated through simulations and experiments.

1703.06387 2026-06-04 cs.RO cs.SY eess.SY 版本更新

An opportunistic linear-convex algorithm for localization in mobile robot networks

一种机会主义的线性-凸算法用于移动机器人网络的定位

Sam Safavi, Usman Khan

AI总结 本文提出一种分布式算法用于定位在有界区域内任意移动的机器人网络,通过机会主义策略在附近机器人存在时更新位置,基于重心坐标设计线性-凸更新方法,并证明其在噪声下的渐近收敛性。

详情
AI中文摘要

本文开发了一种分布式算法,用于定位在有界区域内任意移动的机器人网络。在移动网络中,主要挑战是机器人可能无法找到附近的机器人来实现分布式算法。我们通过提供一种机会主义算法来解决这个问题,该算法仅在附近有机器人时进行位置更新,否则不更新。我们假设每个机器人测量其运动和到附近机器人的距离的噪声版本。为了在$\mathbb{R}^m$中定位移动机器人网络,我们提供了一个简单的线性更新,基于重心坐标并具有线性-凸性质。我们将相应的定位算法抽象为线性时变(LTV)系统,并证明其渐近收敛到真实位置。我们首先考虑噪声less情况,其中距离和运动向量已知(测量完美),并提供收敛性的充分条件。然后评估算法在存在噪声时的性能,并提供修改以抵消噪声的不利影响。我们进一步证明,只要至少有一个已知的基站(位置完全已知的节点)存在,我们的算法可以精确跟踪移动网络。

英文摘要

In this paper, we develop a \textcolor{black}{\emph{distributed}} algorithm to localize a network of robots moving arbitrarily in a bounded region. In the case of such mobile networks, the main challenge is that the robots may not be able to find nearby robots to implement a distributed algorithm. We address this issue by providing an opportunistic algorithm that only implements a location update when there are nearby robots and does not update otherwise. We assume that each robot measures a noisy version of its motion and the distances to the nearby robots. To localize a network of mobile robots in~$\mathbb{R}^m$, we provide a simple \emph{linear} update, which is based on barycentric coordinates and is linear-convex. We abstract the corresponding localization algorithm as a Linear Time-Varying (LTV) system and show that it asymptotically converges to the true locations~of~the robots. We first focus on the noiseless case, where the distance and motion vectors are known (measured) perfectly, and provide sufficient conditions on the convergence of the algorithm. We then evaluate the performance of the algorithm in the presence of noise and provide modifications to counter the undesirable effects of noise. \textcolor{black}{We further show that our algorithm precisely tracks a mobile network as long as there is at least one known beacon (a node whose location is perfectly known).

1703.05019 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Finding a Feasible Initial Solution for Flatness-Based Multi-Link Manipulator Motion Planning under State and Control Constraints

为基于平坦度的多连杆机械臂运动规划在状态和控制约束下寻找可行的初始解

Keisuke Uto, Makoto Obayashi, Gaku Takano

AI总结 本文提出一种方法,在非凸优化问题中寻找可行点并可靠求解,用于在状态和控制约束下规划多连杆机械臂的设定点运动。通过分析最终时间效应,构造初始可行解,结合线性规划和递归逆动力学算法进行数值实验验证。

Comments accepted to the SICE International Symposium on Control Systems 2017

详情
AI中文摘要

本文提出了一种方法,用于在非凸优化问题中初始化可行点并可靠求解,该问题涉及在状态和控制约束下为多连杆机械臂规划设定点运动。我们通过分析可行性问题的最终时间效应来构建初始可行解。具体来说,我们首先通过求解线性规划问题,在无控制约束的情况下找到可行的时间最优轨迹。然后,通过缩放轨迹来找到受控制约束的可行轨迹。为了评估所提方法的实用性,我们通过将该方法与递归逆动力学算法结合,进行了数值实验以解决多连杆机械臂运动规划问题。

英文摘要

In this paper, we present a method to initialize at a feasible point and unfailingly solve a non-convex optimization problem in which a set-point motion is planned for a multi-link manipulator under state and control constraints. We construct an initial feasible solution by analyzing the final time effect for feasibility problems of flatness based motion planning problems. More specifically, we first find a feasible time-optimal trajectory under state constraints without a control constraint by solving a linear programming problem. Then, we find a feasible trajectory under control constraints by scaling the trajectory. To evaluate the practical applicability of the proposed method, we did numerical experiments to solve a multi-link manipulator motion planning problem by combining the method with recursive inverse dynamics algorithms.

1703.04881 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Multi-Objective Cooperative Search of Spatially Diverse Routes in Uncertain Environments

多目标协作搜索空间多样路线以应对不确定环境

Johnathan Votion, Yongcan Cao

AI总结 本文提出一种多车辆协作系统,用于在未知环境中搜索有价值路线,通过调整权重和控制增益实现路径多样性,最终选择最优路径。

Comments submitted to a conference

详情
AI中文摘要

本文旨在为不确定环境中的协作无人系统开发新的导航和侦察能力。目标是设计一种协作多车辆系统,能够探测未知环境并找到对人员最有价值的路线。为此,多车辆系统首先探索空间多样路线,然后选择最安全的路线。所提出的协作路径规划器根据多个因素(如行驶距离、通行便利性及通行便利性的不确定性)依次生成一组空间多样路线。规划器对每个因素的依赖性通过加权分数改变,从而改变最优路线的判断标准。为惩罚不同车辆选择相同路径,使用控制增益增加其他车辆分配路线附近路径的成本。通过调整控制增益,可以实现路线的空间多样性。通过反复协作搜索不同路径,最终可选择出最有价值的最优路径。

英文摘要

This paper focuses on developing new navigation and reconnaissance capabilities for cooperative unmanned systems in uncertain environments. The goal is to design a cooperative multi-vehicle system that can survey an unknown environment and find the most valuable route for personnel to travel. To accomplish the goal, the multi-vehicle system first explores spatially diverse routes and then selects the safest route. In particular, the proposed cooperative path planner sequentially generates a set of spatially diverse routes according to a number of factors, including travel distance, ease of travel, and uncertainty associated with the ease of travel. The planner's dependence on each of these factors is altered by a weighted score, doing so changes the criteria for determining an optimum route. To penalize the selection of same paths by different vehicles, a control gain is used to increase the cost of paths that lie near the route(s) assigned to other vehicles. By varying the control gain, the spatial diversity among routes can be accomplished. By repeatedly searching for different paths cooperatively, an optimal path can be selected that yields the most valuable route.

1703.04550 2026-06-04 cs.RO cs.LG cs.NE cs.SY eess.SY 版本更新

Sensor Fusion for Robot Control through Deep Reinforcement Learning

通过深度强化学习实现机器人控制的传感器融合

Steven Bohez, Tim Verbelen, Elias De Coninck, Bert Vankeirsbilck, Pieter Simoens, Bart Dhoedt

AI总结 本文提出通过深度强化学习实现机器人传感器信息融合,提升机器人在搜索和拾取任务中的鲁棒性和性能。

Comments 6 pages, 6 figures, submitted to IROS 2017

详情
AI中文摘要

深度强化学习正日益成为机器人控制算法的热门方法,旨在使机器人能够从非结构化感官输入中自学习有用的特征表示,从而获得最优的操作策略。除了机器人上的传感器外,环境中的传感器也可能被部署,尽管这些可能需要通过不可靠的无线连接访问。在本文中,我们展示了能够融合多个传感器信息并具有运行时传感器故障鲁棒性的深度神经网络架构。我们评估了我们的方法在机器人搜索和拾取任务中的性能,包括仿真和现实世界中的测试。

英文摘要

Deep reinforcement learning is becoming increasingly popular for robot control algorithms, with the aim for a robot to self-learn useful feature representations from unstructured sensory input leading to the optimal actuation policy. In addition to sensors mounted on the robot, sensors might also be deployed in the environment, although these might need to be accessed via an unreliable wireless connection. In this paper, we demonstrate deep neural network architectures that are able to fuse information coming from multiple sensors and are robust to sensor failures at runtime. We evaluate our method on a search and pick task for a robot both in simulation and the real world.

1604.02943 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Taming mismatches in inter-agent distances for the formation-motion control of second-order agents

消除交互距离不匹配以实现二阶代理的形成-运动控制

Hector Garcia de Marina, Bayu Jayawardhana, Ming Cao

AI总结 本文分析距离不匹配对二阶代理标准梯度刚性形成控制的影响,指出其导致最终形状扭曲和群体稳态运动,并通过结合标准控制律与分布式估计器消除这些行为,同时利用不匹配参数控制群体的平移和旋转运动。

Comments 14 pages, conditionally accepted in Automatic Control, IEEE Transactions on

详情
AI中文摘要

本文分析了距离不匹配对标准梯度基于刚性形成控制在二阶代理中的影响。研究表明,类似于文献中最近讨论的一阶情况,这些不匹配引入了两种不希望的群体行为:最终形状的扭曲和群体形成中的稳态运动。我们展示通过将标准形成控制律与分布式估计器结合可以消除这些不希望的行为。最后,我们展示了如何有效利用这些不匹配作为设计参数,以控制群体的结合平移和旋转运动。

英文摘要

This paper presents the analysis on the influence of distance mismatches on the standard gradient-based rigid formation control for second-order agents. It is shown that, similar to the first-order case as recently discussed in the literature, these mismatches introduce two undesired group behaviors: a distorted final shape and a steady-state motion of the group formation. We show that such undesired behaviors can be eliminated by combining the standard formation control law with distributed estimators. Finally, we show how the mismatches can be effectively employed as design parameters in order to control a combined translational and rotational motion of the formation.

1703.03161 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Behavior-based Navigation of Mobile Robot in Unknown Environments Using Fuzzy Logic and Multi-Objective Optimization

基于模糊逻辑和多目标优化的未知环境中移动机器人行为导航

Thi Thanh Van Nguyen, Manh Duong Phung, Quang Vinh Tran

AI总结 本文提出BBFM架构,通过模糊控制器和多目标优化协调机器人在未知环境中避障和避开局部极小值的问题,提升了导航精度和效率。

详情
Journal ref
International Journal of Control and Automation, Vol. 10, No. 2 (2017), pp.349-364
AI中文摘要

本文提出一种名为BBFM的行为导航架构,用于解决在存在障碍物和局部极小值区域的未知环境中移动机器人的导航问题。在该架构中,复杂导航任务被分解为主要子任务或行为。每个行为由模糊控制器实现并独立执行以处理特定导航问题。模糊控制器被修改为仅包含模糊化和推理过程,使其输出表示行为的目标的隶属函数。所有控制器的隶属函数随后用作多目标优化过程的目标函数以协调所有行为。该过程的结果是整体控制信号,即帕累托最优的控制信号,用于控制机器人。进行了大量模拟、比较和实验。结果表明,所提出的架构在精度、平滑度、行驶距离和时间响应方面优于一些流行的基于行为的架构。

英文摘要

This study proposes behavior-based navigation architecture, named BBFM, to deal with the problem of navigating the mobile robot in unknown environments in the presence of obstacles and local minimum regions. In the architecture, the complex navigation task is split into principal sub-tasks or behaviors. Each behavior is implemented by a fuzzy controller and executed independently to deal with a specific problem of navigation. The fuzzy controller is modified to contain only the fuzzification and inference procedures so that its output is a membership function representing the behavior's objective. The membership functions of all controllers are then used as the objective functions for a multi-objective optimization process to coordinate all behaviors. The result of this process is an overall control signal, which is Pareto-optimal, used to control the robot. A number of simulations, comparisons, and experiments were conducted. The results show that the proposed architecture outperforms some popular behavior-based architectures in term of accuracy, smoothness, traveled distance, and time response.

1703.02899 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Model-Based Policy Search for Automatic Tuning of Multivariate PID Controllers

基于模型的策略搜索用于多变量PID控制器的自动调优

Andreas Doerr, Duy Nguyen-Tuong, Alonso Marco, Stefan Schaal, Sebastian Trimpe

AI总结 本文提出基于模型的策略搜索框架,用于自动调优多变量PID控制器,通过数据驱动的方法解决复杂系统的控制器调优问题。

Comments Accepted final version to appear in 2017 IEEE International Conference on Robotics and Automation (ICRA)

详情
AI中文摘要

PID控制架构在工业应用中被广泛使用。尽管其开放参数数量较少,但实际中调优多个耦合的PID控制器可能变得繁琐。本文扩展了PILCO,一种基于模型的策略搜索框架,以纯数据驱动的方式自动调优多变量PID控制器,无需事先了解系统。通过适当扩展系统状态,将PID策略框架为静态状态反馈策略,从而将PID调优视为有限时间最优控制问题的解法,无需进一步先验知识。该框架应用于平衡倒立摆于七自由度机械臂的任务,展示了其在复杂现实问题中快速且数据高效的学习能力。

英文摘要

PID control architectures are widely used in industrial applications. Despite their low number of open parameters, tuning multiple, coupled PID controllers can become tedious in practice. In this paper, we extend PILCO, a model-based policy search framework, to automatically tune multivariate PID controllers purely based on data observed on an otherwise unknown system. The system's state is extended appropriately to frame the PID policy as a static state feedback policy. This renders PID tuning possible as the solution of a finite horizon optimal control problem without further a priori knowledge. The framework is applied to the task of balancing an inverted pendulum on a seven degree-of-freedom robotic arm, thereby demonstrating its capabilities of fast and data-efficient policy learning, even on complex real world problems.

1502.02474 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Planning for robotic exploration based on forward simulation

基于正向模拟的机器人探索规划

Mikko Lauri, Risto Ritala

AI总结 本文提出基于POMDP的探索规划方法,结合信息论目标函数和正向模拟算法,通过改进的互信息近似方法提升机器人在部分已知环境中的探索效率。

Comments 19 pages, 11 figures in Robotics and Autonomous Systems (2016)

详情
Journal ref
Robotics and Autonomous Systems 83 (2016) 15-31
AI中文摘要

我们解决了控制移动机器人探索部分已知环境的问题。机器人的目标是最大化收集的环境信息量。我们将问题建模为部分可观测马尔可夫决策过程(POMDP)并采用正向模拟算法进行求解。我们提出了一种新的基于样本的互信息近似方法,适用于移动机器人。该近似方法可无缝集成到正向模拟规划算法中。我们研究了基于POMDP的规划在探索中的有效性,并通过与前沿探索结合来缓解其不足。在模拟和真实环境中实验结果表明,根据环境不同,基于POMDP的探索规划可比前沿探索表现更优。

英文摘要

We address the problem of controlling a mobile robot to explore a partially known environment. The robot's objective is the maximization of the amount of information collected about the environment. We formulate the problem as a partially observable Markov decision process (POMDP) with an information-theoretic objective function, and solve it applying forward simulation algorithms with an open-loop approximation. We present a new sample-based approximation for mutual information useful in mobile robotics. The approximation can be seamlessly integrated with forward simulation planning algorithms. We investigate the usefulness of POMDP based planning for exploration, and to alleviate some of its weaknesses propose a combination with frontier based exploration. Experimental results in simulated and real environments show that, depending on the environment, applying POMDP based planning for exploration can improve performance over frontier exploration.

1702.07335 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Approximately Optimal Continuous-Time Motion Planning and Control via Probabilistic Inference

近似最优连续时间运动规划与控制 via 概率推理

Mustafa Mukadam, Ching-An Cheng, Xinyan Yan, Byron Boots

AI总结 本文提出PIPC算法,通过概率推理和高斯过程轨迹表示,实现对非线性性能指标的近似最优控制,在仿真中展示了其在递推时间窗口内的多系统应用能力。

Comments minor fixes and typos

详情
AI中文摘要

最优运动规划和控制问题是机器人学中的基本问题。然而,对于连续时间随机系统而言,该问题一般难以求解,且当存在非即时非线性性能指标时,解法难以近似。本文提供了一个高效的算法PIPC(概率推理用于规划与控制),能够产生具有任意高阶非线性性能指标的近似最优策略。利用概率推理和高斯过程轨迹表示,PIPC利用问题的内在稀疏性,使其复杂度与非线性因素数量成线性关系。我们在仿真中展示了该算法在递推时间窗口内的多系统应用能力。

英文摘要

The problem of optimal motion planing and control is fundamental in robotics. However, this problem is intractable for continuous-time stochastic systems in general and the solution is difficult to approximate if non-instantaneous nonlinear performance indices are present. In this work, we provide an efficient algorithm, PIPC (Probabilistic Inference for Planning and Control), that yields approximately optimal policies with arbitrary higher-order nonlinear performance indices. Using probabilistic inference and a Gaussian process representation of trajectories, PIPC exploits the underlying sparsity of the problem such that its complexity scales linearly in the number of nonlinear factors. We demonstrate the capabilities of our algorithm in a receding horizon setting with multiple systems in simulation.

1702.05314 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Control of an Unmanned Surface Vehicle with Uncertain Displacement and Drag

无人水面舰艇的不确定位移和阻力控制

Wilhelm B. Klinger, Ivan R. Bertaska, Karl D. von Ellenrieder, Manhar R. Dhanak

AI总结 本文研究了在位移和阻力不确定时无人水面舰艇的控制问题,采用低层控制器进行实验测试,通过系统辨识确定水动力系数,并验证了改进的反步控制器在处理变量质量和阻力时的有效性。

详情
AI中文摘要

本文对无人水面舰艇(USV)进行了实验测试,以评估在位移和阻力随时间变化且不确定时两种低层控制器的性能。USV是一种4.3米长、150千克的波适应模块化舰艇(WAM-V),具有可膨胀双体配置和水jet推进。进行了开环机动测试以表征车辆的动力学。通过系统辨识 maneuvering 数据确定了车辆的水动力系数,并用于控制系统的模拟。所得到的控制器在水面上进行了实地测试。变量质量和阻力测试表明,车辆的最佳控制由模型参考自适应反步速度和航向控制器实现。Liao等人(2010)开发的反步控制器被修改以考虑必要控制动作的过度预测和电机饱和。研究表明,当在改进的反步控制器的冲量控制子系统中实现自适应算法时,变量质量和阻力的影响被减轻。

英文摘要

Experimental testing of an unmanned surface vehicle (USV) has been performed to evaluate the performance of two low-level controllers when displacement and drag properties are time-varying and uncertain. The USV is a 4.3 meter long, 150 kilogram wave adaptive modular vessel (WAM-V) with an inflatable twin hull configuration and waterjet propulsion. Open loop maneuvering tests were conducted to characterize the dynamics of the vehicle. The hydrodynamic coefficients of the vehicle were determined through system identification of the maneuvering data and were used for simulations during control system development. The resulting controllers were experimentally field tested on-water. Variable mass and drag tests show that the vehicle is best controlled by a model reference adaptive backstepping speed and heading controller. The backstepping controller developed by Liao et. al (2010) is modified to account for an overprediction of necessary control action and motor saturation. It is shown that when an adaptive algorithm is implemented for the surge control subsystem of the modified backstepping controller, the effects of variable mass and drag are mitigated.

1702.04941 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Station-keeping control of an unmanned surface vehicle exposed to current and wind disturbances

无人水面舰艇在洋流和风扰动下的驻留控制

Edoardo I. Sarda, Huajin Qu, Ivan R. Bertaska, Karl D. von Ellenrieder

AI总结 本文评估了无人水面舰艇在风和洋流扰动下的驻留控制性能,比较了非线性比例导数、反步法和滑模反馈控制器的效果,发现滑模控制器表现最佳,风前馈控制对部分控制器有显著提升作用。

详情
Journal ref
Journal of Ocean Engineering, Volume 127, Pages 305-324, 2016
AI中文摘要

为评估无人水面舰艇在户外海洋环境中的驻留控制性能,进行了4米长、180千克的无人水面舰艇(USV)实地试验。该USV采用双体结构和定制推进系统,包含两个可旋转推力器。非线性比例导数、反步法和滑模反馈控制器在4-5节风速下进行了测试,包括有无风前馈控制的情况。测试时USV的纵轴与平均风向一致或垂直。结果显示滑模控制器整体表现最佳,风前馈控制对滑模控制器效果提升不显著,但对比例导数和反步法控制器在纵轴垂直于风向时有显著提升。对风速波动功率谱中存在长度尺度的分析表明,单个风速仪足以表征作用于USV的风速和方向。

英文摘要

Field trials of a 4 meter long, 180 kilogram, unmanned surface vehicle (USV) have been conducted to evaluate the performance of station-keeping heading and position controllers in an outdoor marine environment disturbed by wind and current. The USV has a twin hull configuration and a custom-designed propulsion system, which consists of two azimuthing thrusters, one for each hull. Nonlinear proportional derivative, backstepping and sliding mode feedback controllers were tested in winds of about 4-5 knots, with and without wind feedforward control. The controllers were tested when the longitudinal axis of the USV was aligned with the mean wind direction and when the longitudinal axis was perpendicular to the mean wind direction. It was found that the sliding mode controller performed best overall and that the addition of wind feedforward control did not significantly improve its effectiveness. However, wind feedforward control did substantially improve the performance of the proportional derivative and backstepping controllers when the mean wind direction was perpendicular to the longitudinal axis of the USV. An analysis of the length scales present in the power spectrum of the turbulent speed fluctuations in the wind suggests that a single anemometer is sufficient to characterize the speed and direction of the wind acting on the USV.

1702.04940 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Supervisory Switching Control of an Unmanned Surface Vehicle

无人水面舰艇的监督切换控制

Ivan R. Bertaska, Karl D. von Ellenrieder

AI总结 本文提出了一种新的切换控制器方法以提升系统性能,通过三个控制器捕捉无人水面舰艇的行为,包括欠驱动非线性控制器、全驱动非线性控制器和线性反风速控制器,并通过实验验证监督切换控制系统的有效性。

详情
Journal ref
MTS/IEEE Oceans 2015
AI中文摘要

本文提出了一种新的方法,用于确定控制器的切换以提高系统性能。利用三个控制器捕捉无人水面舰艇(USVs)的三种行为。一个欠驱动非线性控制器用于将车辆从一个位置转移到另一个位置;一个全驱动非线性控制器用于将车辆保持在设定点;一个线性反风速控制器用于反向操作模式。给定一个要跟随的轨迹,基于性能的监督切换控制系统(PBSSC)决定在控制器之间切换以提高系统性能。实验结果表明,PBSSC系统能够比任何单独的控制器更好地减少姿态误差。

英文摘要

A novel method to determine the switching of controllers to increase the performance of a system is presented. Three controllers are utilized to capture three behaviors representative of unmanned surface vehicles (USVs). An underactuated nonlinear controller is derived to transit the vehicle between locations; a fully-actuated nonlinear controller is given to station-keep the vehicle at a setpoint; and a linear anti-windup controller is presented for the reversing mode of operation. Given a trajectory to follow, a performance-based supervisory switching control system (PBSSC) dictates the switching between controllers to improve system performance. Experimental results are shown that indicate that the PBSSC system is able to mitigate errors in pose better than any of the individual controllers.

1610.02797 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Guidance algorithm for smooth trajectory tracking of a fixed wing UAV flying in wind flows

固定翼无人机在风流中平滑轨迹跟踪的引导算法

Hector Garcia de Marina, Yuri A. Kapitanyuk, Murat Bronz, Gautier Hattenberger, Ming Cao

AI总结 本文提出一种算法,用于解决固定翼无人机在恒定空速和恒定风扰下跟踪平滑曲线的问题,通过构造隐函数描述的引导向量场实现轨迹跟踪,并能通过离线调整避免无人机物理约束。

Comments 6 pages

详情
Journal ref
Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA)
AI中文摘要

本文提出了一种算法,用于解决固定翼无人驾驶航空器在恒定空速和恒定风扰下跟踪平滑曲线的问题。该算法基于跟随由隐函数描述的期望(可能随时间变化)轨迹的引导向量场的想法。算法的输出可以直接用无人机的银行角表达,以实现协调转弯。此外,该算法可以离线调整,以确保在期望轨迹附近不违反无人机的物理约束,例如最大银行角。我们提供了相应的理论收敛分析和实际飞行的性能结果。

英文摘要

This paper presents an algorithm for solving the problem of tracking smooth curves by a fixed wing unmanned aerial vehicle travelling with a constant airspeed and under a constant wind disturbance. The algorithm is based on the idea of following a guiding vector field which is constructed from the implicit function that describes the desired (possibly time-varying) trajectory. The output of the algorithm can be directly expressed in terms of the bank angle of the UAV in order to achieve coordinated turns. Furthermore, the algorithm can be tuned offline such that physical constraints of the UAV, e.g. the maximum bank angle, will not be violated in a neighborhood of the desired trajectory. We provide the corresponding theoretical convergence analysis and performance results from actual flights.

1609.05264 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Asynchronous and Dynamic Coverage Control Scheme for Persistent Surveillance Missions

异步和动态覆盖控制方案用于持续监视任务

Jeffrey R. Peters, Sean J. Wang, Amit Surana, Francesco Bullo

AI总结 本文提出了一种基于分解的多智能体持续监视任务覆盖控制方案,通过模块化框架解耦高层任务分配与低层运动规划,利用中央基站管理覆盖分配和监视参数,并通过非计划和异步交换传输至移动代理,实现负载平衡和有效路径规划。

详情
AI中文摘要

本文提出了一种基于分解的覆盖控制方案,用于在通信受限和动态环境中运行的多智能体持续监视任务。所提出的方法在模块化框架中解耦高层任务分配与低层运动规划。覆盖分配和监视参数由中央基站管理,并通过非计划和异步交换传输至移动代理。覆盖更新促进负载平衡,同时保持几何和时间特征,允许与通用路径规划器有效配对。具体而言,所提出的方案保证了(i)覆盖区域连通且集体覆盖环境,(ii)子区域可能仅在有界时间内未被覆盖,(iii)碰撞(或传感重叠)被内在避免,(iv)在静态事件可能性下,集体覆盖区域收敛到帕累托最优配置。该管理方案随后与满足宽松假设的通用路径规划器配对。通过模拟监视任务进行了说明。

英文摘要

A decomposition-based coverage control scheme is proposed for multi-agent, persistent surveillance missions operating in a communication-constrained, dynamic environment. The proposed approach decouples high-level task assignment from low-level motion planning in a modular framework. Coverage assignments and surveillance parameters are managed by a central base station, and transmitted to mobile agents via unplanned and asynchronous exchanges. Coverage updates promote load balancing, while maintaining geometric and temporal characteristics that allow effective pairing with generic path planners. Namely, the proposed scheme guarantees that (i) coverage regions are connected and collectively cover the environment, (ii) subregions may only go uncovered for bounded periods of time, (iii) collisions (or sensing overlaps) are inherently avoided, and (iv) under static event likelihoods, the collective coverage regions converge to a Pareto-optimal configuration. This management scheme is then paired with a generic path planner satisfying loose assumptions. The scheme is illustrated through simulated surveillance missions.

1612.07850 2026-06-04 cs.RO cs.CV cs.SY eess.SY 版本更新

Automatic Interpretation of Unordered Point Cloud Data for UAV Navigation in Construction

无人机在建筑施工中无序点云数据的自动解释

M. D. Phung, C. H. Quach, D. T. Chu, N. Q. Nguyen, T. H. Dinh, Q. P. Ha

AI总结 本文提出了一种数据处理系统,用于自动为无人机生成航路点,以检查建筑和桥梁等结构表面。系统通过两个正交安装的2D激光扫描仪和惯性测量单元的数据,利用数据注册、表面检测和航路生成算法,实现结构点云重建和航路规划。

Comments In The 14th International Conference on Control, Automation, Robotics and Vision, ICARCV 2016

详情
AI中文摘要

本工作旨在开发一种数据处理系统,能够自动为无人驾驶航空器(UAV)生成航路点,以检查建筑物和桥梁等结构的表面。输入包括由两个正交安装在UAV上的2D激光扫描仪和惯性测量单元(IMU)记录的数据。为实现目标,开发了处理所收集数据的算法,分为三类:(i)数据注册和滤波以生成结构的3D模型并控制点云密度以提高数据完整性;(ii)表面和障碍物检测以协助UAV的监控任务;(iii)航路点生成以设置飞行路径。不同数据集的实验表明,所开发的系统能够重建结构的3D点云,提取其表面和物体,并为UAV生成航路点以完成检查任务。

英文摘要

The objective of this work is to develop a data processing system that can automatically generate waypoints for navigation of an unmanned aerial vehicle (UAV) to inspect surfaces of structures like buildings and bridges. The input includes data recorded by two 2D laser scanners, orthogonally mounted on the UAV, and an inertial measurement unit (IMU). To achieve the goal, algorithms are developed to process the data collected. They are separated into three major groups: (i) the data registration and filtering to generate a 3D model of the structure and control the density of point clouds for data completeness enhancement; (ii) the surface and obstacle detection to assist the UAV in monitoring tasks; and (iii) the waypoint generation to set the flight path. Experiments on different data sets show that the developed system is able to reconstruct a 3D point cloud of the structure, extract its surfaces and objects, and generate waypoints for the UAV to accomplish inspection tasks.

1503.04894 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

Biomimetic Algorithms for Coordinated Motion: Theory and Implementation

仿生算法协调运动:理论与实现

Udit Halder, Biswadip Dey

AI总结 本文基于生物飞行行为,提出两种覆盖与聚群策略,通过轮式机器人与Vicon系统验证了运动伪装理论,并展示了基于局部信息的拓扑速度对齐方法,证明了生物启发在多智能体机器人系统中的应用价值。

详情
AI中文摘要

本文受生物飞行行为启发,如蜻蜓领地战斗和星鸦群体飞行,提出两种覆盖与聚群策略。利用先前关于相互运动伪装的理论研究,实现了适用于实验室测试平台的区域覆盖控制律,该平台配备轮式移动机器人和Vicon高速运动捕捉系统。同一测试平台还用于演示另一种基于局部信息的策略,称为拓扑速度对齐,使智能体朝同一方向移动。本文展示了生物启发在多智能体机器人集体设计中的适用性。

英文摘要

Drawing inspiration from flight behavior in biological settings (e.g. territorial battles in dragonflies, and flocking in starlings), this paper demonstrates two strategies for coverage and flocking. Using earlier theoretical studies on mutual motion camouflage, an appropriate steering control law for area coverage has been implemented in a laboratory test-bed equipped with wheeled mobile robots and a Vicon high speed motion capture system. The same test-bed is also used to demonstrate another strategy (based on local information), termed topological velocity alignment, which serves to make agents move in the same direction. The present work illustrates the applicability of biological inspiration in the design of multi-agent robotic collectives.

1503.03388 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Station Keeping through Beacon-referenced Cyclic Pursuit

通过信标参考的循环追捕实现轨道维持

Kevin S. Galloway, Biswadip Dey

AI总结 本文研究了多智能体系统中循环恒定方位追捕的改进方法,通过信标和邻居的交互实现集体围绕信标旋转,分析了旋转半径和角度分离的参数影响,并验证了双智能体系统的稳定性。

详情
AI中文摘要

本文研究了多智能体系统中循环恒定方位(CB)追捕的改进方法,其中每个智能体关注一个邻居和一个信标。该问题允许集体围绕信标旋转的形状平衡,旋转半径和角度分离由反馈律中的参数选择决定。证明了双智能体系统的旋转形状平衡的稳定性,并通过运动捕捉系统验证了移动机器人集体的行为。

英文摘要

This paper investigates a modification of cyclic constant bearing (CB) pursuit in a multi-agent system in which each agent pays attention to a neighbor and a beacon. The problem admits shape equilibria with collective circling about the beacon, with the circling radius and angular separation of agents determined by choice of parameters in the feedback law. Stability of circling shape equilibria is shown for a 2-agent system, and the results are demonstrated on a collective of mobile robots tracked by a motion capture system.

1702.00325 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Hybrid Fuel Cells Power for Long Duration Robot Missions in Field Environments

混合燃料电池为野外环境中的长时间机器人任务提供动力

Jekan Thangavelautham, Danielle Gallardo, Daniel Strawser, Steven Dubowsky

AI总结 本文提出混合燃料电池系统用于提升机器人在野外长时间任务中的续航能力,通过燃料电池与电池的结合解决传统电源的局限性。

Comments 8 pages, 5 figures in Field Robotics - 14th International Conference on Climbing and Walking Robots and the Support Technologies for Mobile Machines

详情
AI中文摘要

移动机器人常需执行长时间任务,包括搜索救援、哨兵、维修、监视和娱乐。当前电源技术限制了机器人在许多任务中的行走和攀爬能力。内燃机噪音大且排放有毒废气,而可充电电池能量密度低且自放电率高。理论上,燃料电池没有这些限制。特别是质子交换膜(PEM)可提供极高的能量密度,清洁且安静。然而,PEM燃料电池因性能退化而不可靠。这可通过在燃料电池电池混合配置中保护燃料电池来缓解,使用过滤电子设备确保燃料电池远离电气噪声,并通过电池隔离它免受电压尖峰影响。针对HOAP 2仿人机器人展示了模拟结果,表明燃料电池混合电源优于传统电池。

英文摘要

Mobile robots are often needed for long duration missions. These include search and rescue, sentry, repair, surveillance and entertainment. Current power supply technology limit walking and climbing robots from many such missions. Internal combustion engines have high noise and emit toxic exhaust while rechargeable batteries have low energy densities and high rates of self-discharge. In theory, fuel cells do not have such limitations. In particular Proton Exchange Membrane (PEMs) can provide very high energy densities, are clean and quiet. However, PEM fuel cells are found to be unreliable due to performance degradation. This can be mitigated by protecting the fuel cell in a fuel-cell battery hybrid configuration using filtering electronics that ensure the fuel cell is isolated from electrical noise and a battery to isolate it from power surges. Simulation results are presented for a HOAP 2 humanoid robot that suggests a fuel cell powered hybrid power supply superior to conventional batteries.

1701.08915 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Accelerated Evaluation of Automated Vehicles Using Piecewise Mixture Models

利用分段混合模型加速自动驾驶车辆评估

Zhiyuan Huang, Ding Zhao, Henry Lam, David J. LeBlanc

AI总结 本文提出利用分段混合分布模型改进自动驾驶车辆评估方法,提升效率和准确性,实现评估过程加速四到五个数量级。

Comments 11 pages, 13 figures

详情
AI中文摘要

高度自动驾驶车辆的认证过程尚未由任何国家定义。目前,公司通过公共道路测试自动驾驶车辆,这耗时且低效。我们提出了加速评估概念,利用周围车辆的修改统计和重要抽样理论,将评估时间减少几个数量级,同时确保统计准确性。本文进一步改进该概念,使用分段混合分布模型代替单参数分布模型。我们开发并应用于向前碰撞控制系统,以应对进行变道的车辆。变道车辆的行为基于密歇根大学安全试点模型部署计划收集的超过403,581次变道数据建模。模拟结果证实,分段混合分布方法在准确性和效率上优于单参数分布方法,并将评估过程加速了几乎四个数量级。

英文摘要

The process to certify highly Automated Vehicles has not yet been defined by any country in the world. Currently, companies test Automated Vehicles on public roads, which is time-consuming and inefficient. We proposed the Accelerated Evaluation concept, which uses a modified statistics of the surrounding vehicles and the Importance Sampling theory to reduce the evaluation time by several orders of magnitude, while ensuring the evaluation results are statistically accurate. In this paper, we further improve the accelerated evaluation concept by using Piecewise Mixture Distribution models, instead of Single Parametric Distribution models. We developed and applied this idea to forward collision control system reacting to vehicles making cut-in lane changes. The behavior of the cut-in vehicles was modeled based on more than 403,581 lane changes collected by the University of Michigan Safety Pilot Model Deployment Program. Simulation results confirm that the accuracy and efficiency of the Piecewise Mixture Distribution method outperformed single parametric distribution methods in accuracy and efficiency, and accelerated the evaluation process by almost four orders of magnitude.

1609.02174 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Distributed sampled-data control of nonholonomic multi-robot systems with proximity networks

非holonomic多机器人系统的分布式采样数据控制与临近网络

Zhixin Liu, Lin Wang, Jinhuan Wang, Daoyi Dong, Xiaoming Hu

AI总结 本文研究了通过距离诱导临近网络连接的多机器人系统的分布式采样数据控制问题,设计了基于最近邻规则的分布式采样数据控制器,通过混合闭环系统实现同步,并定量确定领导者数量以跟踪常量或时变信号。

Comments 15 pages, 3 figures

详情
Journal ref
Automatica 77 (2017) 170-179
AI中文摘要

本文考虑了通过距离诱导临近网络连接的一组移动机器人系统的分布式采样数据控制问题。假设存在驻留时间以避免因位置突变导致的邻居关系振荡。基于最近邻规则设计了分布式采样数据控制律,结合连续时间动力学产生混合闭环系统。对于均匀独立初始状态,提供了保证无领导者的系统同步的充分条件。为了使所有机器人以期望的方位和速度移动,引入了领导者,并定量确定所需的领导者比例以跟踪常量或时变信号。所有条件仅依赖于邻居半径、最大初始移动速度和驻留时间,不假设邻居图的先验性质,这与现有文献不同。

英文摘要

This paper considers the distributed sampled-data control problem of a group of mobile robots connected via distance-induced proximity networks. A dwell time is assumed in order to avoid chattering in the neighbor relations that may be caused by abrupt changes of positions when updating information from neighbors. Distributed sampled-data control laws are designed based on nearest neighbour rules, which in conjunction with continuous-time dynamics results in hybrid closed-loop systems. For uniformly and independently initial states, a sufficient condition is provided to guarantee synchronization for the system without leaders. In order to steer all robots to move with the desired orientation and speed, we then introduce a number of leaders into the system, and quantitatively establish the proportion of leaders needed to track either constant or time-varying signals. All these conditions depend only on the neighborhood radius, the maximum initial moving speed and the dwell time, without assuming a prior properties of the neighbor graphs as are used in most of the existing literature.

1610.02849 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Automatic Gain Tuning of a Momentum Based Balancing Controller for Humanoid Robots

基于动量的平衡控制器自动增益调节技术

Daniele Pucci, Gabriele Nava, Francesco Nori

AI总结 本文提出了一种用于双足机器人基于动量的平衡控制器自动增益调节技术,通过线性化闭环约束关节空间动力学并设计增益矩阵以实现系统稳定性和正定性。

Comments Accepted at IEEE-RAS International Conference on Humanoid Robots (HUMANOIDS). 2016

详情
AI中文摘要

本文提出了一种用于双足机器人基于动量的平衡控制器自动增益调节技术。通过线性化闭环约束关节空间动力学并设计增益矩阵以实现系统稳定性和正定性。对称性和正定性约束通过提出对称正定矩阵的跟踪器来实现。在双足机器人iCub上进行了仿真结果。

英文摘要

This paper proposes a technique for automatic gain tuning of a momentum based balancing controller for humanoid robots. The controller ensures the stabilization of the centroidal dynamics and the associated zero dynamics. Then, the closed-loop, constrained joint space dynamics is linearized and the controller's gains are chosen so as to obtain desired properties of the linearized system. Symmetry and positive definiteness constraints of gain matrices are enforced by proposing a tracker for symmetric positive definite matrices. Simulation results are carried out on the humanoid robot iCub.

1511.01166 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A bi-criteria path planning algorithm for robotics applications

用于机器人应用的双标准路径规划算法

Zachary Clawson, Xuchu Ding, Brendan Englot, Thomas A. Frewen, William M. Sisson, Alexander Vladimirsky

AI总结 本文提出一种高效的双标准路径规划算法,通过扩展状态空间来跟踪剩余预算,以满足二次成本约束。该算法在概率道路图上测试,用于最小化旅行距离同时控制总体威胁暴露。

Comments 19 pages, 12 figures; submitted for publication to IEEE Transactions on Automation Science and Engineering

详情
AI中文摘要

现实中的路径规划应用通常需要同时优化多个标准。本文介绍了一种高效的图上双标准路径规划算法。我们的方法基于扩展状态空间以跟踪剩余预算,以满足对二次成本的约束。所得到的扩展图是无环的,然后可以通过简单的向上扫描预算级别来最小化主要成本。我们通过概率道路图测试了该算法的效率和准确性,以在受机器人总体威胁暴露约束下最小化旅行距离。我们还展示了在真实机器人系统上应用此方法的现场实验结果。

英文摘要

Realistic path planning applications often require optimizing with respect to several criteria simultaneously. Here we introduce an efficient algorithm for bi-criteria path planning on graphs. Our approach is based on augmenting the state space to keep track of the "budget" remaining to satisfy the constraints on secondary cost. The resulting augmented graph is acyclic and the primary cost can be then minimized by a simple upward sweep through budget levels. The efficiency and accuracy of our algorithm is tested on Probabilistic Roadmap graphs to minimize the distance of travel subject to a constraint on the overall threat exposure of the robot. We also present the results from field experiments illustrating the use of this approach on realistic robotic systems.

1611.07114 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Multi-sensor perceptual system for mobile robot and sensor fusion-based localization

多传感器感知系统用于移动机器人及基于传感器融合的定位

T. T. Hoang, P. M. Duong, N. T. T. Van, D. A. Viet, T. Q. Vinh

AI总结 本文提出一种扩展卡尔曼滤波方法,用于利用双四象限编码器、指南针、激光雷达和全方位摄像头对移动机器人进行定位,通过融合多种传感器数据实现高效定位。

Comments In 2012 International Conference on Control, Automation and Information Sciences (ICCAIS). arXiv admin note: substantial text overlap with arXiv:1611.07112

详情
AI中文摘要

本文提出了一种扩展卡尔曼滤波(EKF)方法,用于对配备双四象限编码器、指南针、激光雷达(LRF)和全方位摄像头的移动机器人进行定位。预测步骤通过利用机器人的运动学模型以及估计输入噪声协方差矩阵,该矩阵与轮子的角速度成比例。在修正步骤中,融合所有传感器的测量数据,包括编码器的增量脉冲、LRF的线段、指南针的机器人方向以及全方位摄像头的偏转角。在室内结构化环境中进行了实验,良好的定位结果证明了该算法的有效性和适用性。

英文摘要

This paper presents an Extended Kalman Filter (EKF) approach to localize a mobile robot with two quadrature encoders, a compass sensor, a laser range finder (LRF) and an omni-directional camera. The prediction step is performed by employing the kinematic model of the robot as well as estimating the input noise covariance matrix as being proportional to the wheel's angular speed. At the correction step, the measurements from all sensors including incremental pulses of the encoders, line segments of the LRF, robot orientation of the compass and deflection angular of the omni-directional camera are fused. Experiments in an indoor structured environment were implemented and the good localization results prove the effectiveness and applicability of the algorithm.

1611.07112 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Development of an EKF-based localization algorithm using compass sensor and LRF

基于指南针传感器和LRF的EKF定位算法开发

T. T. Hoang, P. M. Duong, N. T. T. Van, D. A. Viet, T. Q. Vinh

AI总结 本文提出一种移动机器人感知系统,结合指南针和激光雷达数据,通过扩展卡尔曼滤波实现高精度定位,提升导航性能。

Comments In 12th International Conference on Control Automation Robotics & Vision (ICARCV), 2012. arXiv admin note: substantial text overlap with arXiv:1611.07114

详情
AI中文摘要

本文提出了一种基于扩展卡尔曼滤波的移动机器人感知系统,结合指南针和激光雷达数据,通过传感器融合模型实现高精度定位,提升导航性能。

英文摘要

This paper presents the implementation of a perceptual system for a mobile robot. The system is designed and installed with modern sensors and multi-point communication channels. The goal is to equip the robot with a high level of perception to support a wide range of navigating applications including Internet-based telecontrol, semi-autonomy, and autonomy. Due to uncertainties of acquiring data, a sensor fusion model is developed, in which heterogeneous measured data including odometry, compass heading and laser range is combined to get an optimal estimate in a statistical sense. The combination is carried out by an extended Kalman filter. Experimental results indicate that based on the system, the robot localization is enhanced and sufficient for navigation tasks.

1612.06008 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Optimal Control-Based UAV Path Planning with Dynamically-Constrained TSP with Neighborhoods

基于最优控制的无人机路径规划与动态约束TSP带邻居问题

Dae-Sung Jang, Hyeok-Joo Chae, Han-Lim Choi

AI总结 本文提出一种基于采样 roadmap 算法的无人机路径规划方法,通过最优控制生成路径以减少计算时间并提升解的质量,解决动态约束 TSP 带邻居问题。

Comments 17 pages, 7 figures

详情
AI中文摘要

本文针对具备遥感能力的无人机路径规划问题,将其视为动态约束的旅行商问题带邻居,提出一种结合最优控制的采样 roadmap 算法,通过减少局部路径优化的数值计算和提取 roadmap 中的信息来提高计算效率。数值仿真验证了该算法在降低计算时间及提升解质量方面优于传统 roadmap 基路径规划方法。

英文摘要

This paper addresses path planning of an unmanned aerial vehicle (UAV) with remote sensing capabilities (or wireless communication capabilities). The goal of the path planning is to find a minimum-flight-time closed tour of the UAV visiting all executable areas of given remote sensing and communication tasks; in order to incorporate the nonlinear vehicle dynamics, this problem is regarded as a dynamically-constrained traveling salesman problem with neighborhoods. To obtain a close-to-optimal solution for the path planning in a tractable manner, a sampling-based roadmap algorithm that embeds an optimal control-based path generation process is proposed. The algorithm improves the computational efficiency by reducing numerical computations required for optimizing inefficient local paths, and by extracting additional information from a roadmap of a fixed number of samples. Comparative numerical simulations validate the efficiency of the presented algorithm in reducing computation time and improving the solution quality compared to previous roadmap-based planning methods.

1612.05594 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Importance sampling-based approximate optimal planning and control

基于重要性采样的近似最优规划与控制

Jie Fu

AI总结 本文提出了一种非线性系统在非可微约束下的采样规划与最优控制方法,通过重要性采样算法迭代计算最优权重参数,确保策略的最优性和鲁棒性。

Comments submitted to IEEE ACC 2017

详情
AI中文摘要

本文提出了一种基于采样的非线性系统规划与最优控制方法。受可扩展规划算法的启发,将最优运动计划视为反馈控制器,并通过加权基函数的线性组合进行近似。本文的主要贡献是引入重要性采样,特别是模型参考自适应搜索算法,以迭代计算最优权重参数。关键思想是通过迭代估计参数化分布,使其收敛于狄拉克δ函数,无限峰值于全局最优权重。然后,利用此直接策略搜索,结合轨迹验证,确保对于一类非线性系统,所获得的策略不仅最优,而且对有界扰动具有鲁棒性。通过数值实验,包括具有非线性成本函数的线性系统和双极车运动规划,验证了方法的正确性和效率。

英文摘要

In this paper, we propose a sampling-based planning and optimal control method of nonlinear systems under non-differentiable constraints. Motivated by developing scalable planning algorithms, we consider the optimal motion plan to be a feedback controller that can be approximated by a weighted sum of given bases. Given this approximate optimal control formulation, our main contribution is to introduce importance sampling, specifically, model-reference adaptive search algorithm, to iteratively compute the optimal weight parameters, i.e., the weights corresponding to the optimal policy function approximation given chosen bases. The key idea is to perform the search by iteratively estimating a parametrized distribution which converges to a Dirac's Delta that infinitely peaks on the global optimal weights. Then, using this direct policy search, we incorporated trajectory-based verification to ensure that, for a class of nonlinear systems, the obtained policy is not only optimal but robust to bounded disturbances. The correctness and efficiency of the methods are demonstrated through numerical experiments including linear systems with a nonlinear cost function and motion planning for a Dubins car.

1612.04915 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Collaborative Object Transportation Using MAVs via Passive Force Control

利用MAVs的被动力控进行协同物体运输

Andrea Tagliabue, Mina Kamel, Sebastian Verling, Roland Siegwart, Juan Nieto

AI总结 本文提出一种基于被动力控的MAVs协同运输策略,通过双六旋翼运输大型物体,无需通信链路、负载形状或抓取点位置信息。

Comments under review for the IEEE International Conference on Robotics and Automation (ICRA) 2017

详情
AI中文摘要

本文提出了一种基于被动力控的MAVs协同运输策略,旨在开发一种不依赖MAV之间通信链路、负载形状知识或抓取点位置的鲁棒方法。所提出的方法基于主从范式,其中从动代理通过顺应控制器保证对主代理施加于负载的外部力的顺应性。外部力作用于从动代理的估计是通过基于无迹卡尔曼滤波(UKF)的非线性估计器从视觉惯性导航系统提供的信息中进行估计。实验结果展示了力估计器的性能,并展示了1.2米长物体的协同运输。

英文摘要

This paper shows a strategy based on passive force control for collaborative object transportation using Micro Aerial Vehicles (MAVs), focusing on the transportation of a bulky object by two hexacopters. The goal is to develop a robust approach which does not rely on: (a) communication links between the MAVs, (b) the knowledge of the payload shape and (c) the position of grasping point. The proposed approach is based on the master-slave paradigm, in which the slave agent guarantees compliance to the external force applied by the master to the payload via an admittance controller. The external force acting on the slave is estimated using a non-linear estimator based on the Unscented Kalman Filter (UKF) from the information provided by a visual inertial navigation system. Experimental results demonstrate the performance of the force estimator and show the collaborative transportation of a 1.2 m long object.

1607.07797 2026-06-04 cs.RO cs.FL cs.SY eess.SY 版本更新

Combined Top-Down and Bottom-Up Approaches to Performance-guaranteed Integrated Task and Motion Planning of Cooperative Multi-agent Systems

结合自上而下与自下而上方法的性能保证协同多智能体系统集成任务与运动规划

Rafael Rodrigues da Silva, Bo Wu, Jin Dai, Hai Lin

AI总结 本文提出一种分层框架,通过结合自下而上的反应式运动控制器与自上而下的任务计划,实现协同多智能体系统的性能保证任务与运动规划,通过假设-保证范式验证局部任务与全局任务的一致性。

Comments Submitted to Automatica

详情
AI中文摘要

我们提出了一种分层设计框架,用于自动合成协同多智能体系统的协调方案和控制策略,以满足正式的性能要求。通过将自下而上的反应式运动控制器与自上而下的任务计划相结合。一方面,从一个全局任务开始,该任务以所有智能体任务能力的正则语言指定,任务规划层位于所提出框架的顶部,将全局任务分解为与每个智能体个体能力一致的局部任务,并通过假设-保证范式论证局部任务的完成是否意味着全局任务的满足。另一方面,每个智能体关联的自下而上运动计划通过组合基本运动原语生成,这些原语通过差分动态逻辑(d$\mathcal{L}$)验证安全,通过可满足性模理论(SMT)求解器在面对局部任务要求和环境描述所施加的约束时搜索可行解。研究表明,所提出的框架能够处理动态环境,因为运动原语具有反应特性,使运动计划能够适应局部环境变化。此外,当SMT求解器无法找到可行解时,任务重新配置可以由运动规划层触发。整体设计框架的有效性通过自动化仓库案例研究得到验证。

英文摘要

We propose a hierarchical design framework to automatically synthesize coordination schemes and control policies for cooperative multi-agent systems to fulfill formal performance requirements, by associating a bottom-up reactive motion controller with a top-down mission plan. On one hand, starting from a global mission that is specified as a regular language over all the agents' mission capabilities, a mission planning layer sits on the top of the proposed framework, decomposing the global mission into local tasks that are in consistency with each agent's individual capabilities, and compositionally justifying whether the achievement of local tasks implies the satisfaction of the global mission via an assume-guarantee paradigm. On the other hand, bottom-up motion plans associated with each agent are synthesized corresponding to the obtained local missions by composing basic motion primitives, which are verified safe by differential dynamic logic (d$\mathcal{L}$), through a Satisfiability Modulo Theories (SMT) solver that searches feasible solutions in face of constraints imposed by local task requirements and the environment description. It is shown that the proposed framework can handle dynamical environments as the motion primitives possess reactive features, making the motion plans adaptive to local environmental changes. Furthermore, on-line mission reconfiguration can be triggered by the motion planning layer once no feasible solutions can be found through the SMT solver. The effectiveness of the overall design framework is validated by an automated warehouse case study.

1609.05258 2026-06-04 cs.RO cs.AI cs.CV cs.SY eess.SY 版本更新

The ACRV Picking Benchmark (APB): A Robotic Shelf Picking Benchmark to Foster Reproducible Research

ACRV 摘取基准 (APB):一个促进可重复研究的机器人货架摘取基准

Jürgen Leitner, Adam W. Tow, Jake E. Dean, Niko Suenderhauf, Joseph W. Durham, Matthew Cooper, Markus Eich, Christopher Lehnert, Ruben Mangels, Christopher McCool, Peter Kujala, Lachlan Nicholson, Trung Pham, James Sergeant, Liao Wu, Fangyi Zhang, Ben Upcroft, Peter Corke

AI总结 本文提出ACRV摘取基准(APB),通过42个常见物品、广泛可用的货架和精确的物品排列指南,提供可重复的机器人摘取基准,支持完整机器人系统的比较。

Comments 8 pages, submitted to RA:Letters

详情
AI中文摘要

机器人挑战如亚马逊摘取挑战(APC)或DARPA挑战是推动科学进步的重要方式。它们使研究在明确的基准上进行比较,所有参与者享有相同的测试条件。然而,此类挑战事件仅偶尔举行,参赛人数有限,且测试条件难以在主事件后复制。我们提出一个新的物理基准挑战:ACRV摘取基准(APB)。该基准设计为可重复,包含42个常见物品、广泛可用的货架和精确的物品排列指南。明确的评估协议使完整机器人系统(包括感知和操作)的比较成为可能,而不仅仅是子系统。本文还描述并报告了基于Baxter机器人开放基线系统的实验结果。

英文摘要

Robotic challenges like the Amazon Picking Challenge (APC) or the DARPA Challenges are an established and important way to drive scientific progress. They make research comparable on a well-defined benchmark with equal test conditions for all participants. However, such challenge events occur only occasionally, are limited to a small number of contestants, and the test conditions are very difficult to replicate after the main event. We present a new physical benchmark challenge for robotic picking: the ACRV Picking Benchmark (APB). Designed to be reproducible, it consists of a set of 42 common objects, a widely available shelf, and exact guidelines for object arrangement using stencils. A well-defined evaluation protocol enables the comparison of \emph{complete} robotic systems -- including perception and manipulation -- instead of sub-systems only. Our paper also describes and reports results achieved by an open baseline system based on a Baxter robot.

1510.06496 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Synthesizing least-limiting guidelines for safety of semi-autonomous systems

为半自主系统安全合成最少限制的指南

Jana Tumova, Dimos V. Dimarogonas

AI总结 本文提出系统方法生成最少限制指南,以在最坏情况下最小限制人类操作员。

Comments Extended version of CDC 2016 paper

详情
AI中文摘要

我们考虑为半自主系统合成安全设计的控制策略的问题。我们的目标是处理当仅依靠自主可控部分无法保证安全,需要与不可控部分(如人类操作员)协作的情况。本文提出一种系统方法,生成最少限制指南,即在最坏情况下长期系统执行中尽可能减少对人类操作员的限制。算法借鉴了双玩家回合博弈的思想。

英文摘要

We consider the problem of synthesizing safe-by-design control strategies for semi-autonomous systems. Our aim is to address situations when safety cannot be guaranteed solely by the autonomous, controllable part of the system and a certain level of collaboration is needed from the uncontrollable part, such as the human operator. In this paper, we propose a systematic solution to generating least-limiting guidelines, i.e. the guidelines that restrict the human operator as little as possible in the worst-case long-term system executions. The algorithm leverages ideas from 2-player turn-based games.

1612.04324 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Stabilization and Trajectory Control of a Quadrotor with Uncertain Suspended Load

四旋翼载具带不确定悬挂负载的稳定与轨迹控制

Xu Zhou, Xiaoli Zhang, Jiucai Zhang, Rui Liu

AI总结 本文研究四旋翼搭载不确定悬挂负载时的稳定与轨迹控制问题,通过比较三种控制器性能,发现负载质量不确定性主要影响稳定而非轨迹跟踪,提出关键运动质量概念并验证鲁棒控制器的有效性。

Comments 56 pages, 12 figures, article submitted to ASME Journal of Dynamic Systems Measurement and Control, 2016 April

详情
AI中文摘要

近年来,四旋翼搭载固定已知质量悬挂负载的稳定与轨迹控制已被广泛研究。然而,负载质量不总是事先已知,或在实际运输中可能变化。这种质量不确定性会给四旋翼系统带来不确定扰动,导致现有控制器的稳定性和轨迹跟踪性能变差。为提高这种情况下四旋翼的稳定性与轨迹跟踪能力,本文全面研究了不确定负载质量对四旋翼的影响。通过比较三种不同控制器——比例导数(PD)控制器、滑模控制器(SMC)和模型预测控制器(MPC)的性能,证明稳定而非轨迹跟踪误差是负载质量不确定性的主要影响因素。存在一个关键运动质量,使四旋翼能够维持期望的运输性能。此外,仿真结果验证了具有强抗扰能力的控制器在实际应用中的有效性。

英文摘要

Stabilization and trajectory control of a quadrotor carrying a suspended load with a fixed known mass has been extensively studied in recent years. However, the load mass is not always known beforehand or may vary during the practical transportations. This mass uncertainty brings uncertain disturbances to the quadrotor system, causing existing controllers to have worse stability and trajectory tracking performance. To improve the quadrotor stability and trajectory tracking capability in this situation, we fully investigate the impacts of the uncertain load mass on the quadrotor. By comparing the performances of three different controllers -- the proportional-derivative (PD) controller, the sliding mode controller (SMC), and the model predictive controller (MPC) -- stabilization rather than trajectory tracking error is proved to be the main influence in the load mass uncertainty. A critical motion mass exists for the quadrotor to maintain a desired transportation performance. Moreover, simulation results verify that a controller with strong robustness against disturbances is a good choice for practical applications.

1612.04023 2026-06-04 eess.SY cs.AI cs.RO cs.SY 版本更新

Proceedings of the The First Workshop on Verification and Validation of Cyber-Physical Systems

第一届验证与验证网络物理系统研讨会会议记录

Mehdi Kargahi, Ashutosh Trivedi

发表机构 * Reykjavík, Iceland(冰岛雷克雅未克) MITL Specification Debugging for Monitoring of Cyber-Physical Systems(网络物理系统监控的MITL规格调试) Automatic Synthesis of Controllers from Specifications using Control Certificates(使用控制证书从规范自动合成控制器) A Compositional Framework for Preference-Aware Agents(偏好感知代理的组合框架) Output Feedback Controller Design with Symbolic Observers for Cyber-physical Systems(网络物理系统符号观测器输出反馈控制器设计) Towards an Approximate Conformance Relation for Hybrid I/O Automata(混合I/O自动机近似一致性关系) On Nonlinear Prices in Timed Automata(时序自动机中的非线性价格) Towards the Verification of Safety-critical Autonomous Systems in Dynamic Environments(动态环境中安全关键自主系统的验证)

AI总结 本文介绍了首届网络物理系统验证与验证研讨会,探讨了验证与验证方法,包括控制、模拟和形式化方法等,旨在解决复杂软件和算法的验证问题。

详情
Journal ref
EPTCS 232, 2016
AI中文摘要

第一届国际网络物理系统验证与验证研讨会(V2CPS-16)于冰岛雷克雅未克举行的第十二届国际形式化方法整合会议(iFM 2016)期间召开。该研讨会旨在汇集形式化验证和网络物理系统(CPS)领域的研究人员和专家,讨论涵盖广泛验证与验证方法的主题,包括但不限于控制、模拟、形式化方法等。网络物理系统(CPS)是网络化计算和物理过程的整合,具有有意义的相互作用;前者监控、控制并影响后者,而后者也影响前者。CPS在机器人、交通、通信、基础设施、能源和制造系统中有广泛应用。许多安全关键系统,如化学过程、医疗设备、飞机飞行控制系统和汽车系统,确实属于CPS。CPS的先进能力需要复杂的软件和合成算法,这些算法难以验证。事实上,该领域中的许多问题都是不可判定的。因此,一个重要的步骤是找到特定的抽象,这些抽象可能在特定属性上算法上可验证,描述CPS的部分或整体行为。

英文摘要

The first International Workshop on Verification and Validation of Cyber-Physical Systems (V2CPS-16) was held in conjunction with the 12th International Conference on integration of Formal Methods (iFM 2016) in Reykjavik, Iceland. The purpose of V2CPS-16 was to bring together researchers and experts of the fields of formal verification and cyber-physical systems (CPS) to cover the theme of this workshop, namely a wide spectrum of verification and validation methods including (but not limited to) control, simulation, formal methods, etc. A CPS is an integration of networked computational and physical processes with meaningful inter-effects; the former monitors, controls, and affects the latter, while the latter also impacts the former. CPSs have applications in a wide-range of systems spanning robotics, transportation, communication, infrastructure, energy, and manufacturing. Many safety-critical systems such as chemical processes, medical devices, aircraft flight control, and automotive systems, are indeed CPS. The advanced capabilities of CPS require complex software and synthesis algorithms, which are hard to verify. In fact, many problems in this area are undecidable. Thus, a major step is to find particular abstractions of such systems which might be algorithmically verifiable regarding specific properties of such systems, describing the partial/overall behaviors of CPSs.

1612.02739 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Controlling Robot Morphology from Incomplete Measurements

从不完整测量中控制机器人形态

Martin Pecka, Karel Zimmermann, Michal Reinštein, Tomáš Svoboda

AI总结 针对复杂形态机器人在城市搜索与救援任务中的地形穿越需求,提出通过自主控制处理不完整数据并确保安全性的方法。

Comments Accepted into IEEE Transactions to Industrial Electronics, Special Section on Motion Control for Novel Emerging Robotic Devices and Systems

详情
AI中文摘要

复杂形态的移动机器人对于在城市搜索与救援任务中穿越粗糙地形至关重要。由于远程操作复杂形态会增加操作员的认知负担,因此需要自主控制。自主控制会测量机器人状态和周围地形,通常只能部分观测,因此数据往往不完整。我们对缺失测量进行边缘化,并评估一个显式安全条件。如果安全条件被违反,身体安装的机械臂通过触觉探索收集缺失数据。

英文摘要

Mobile robots with complex morphology are essential for traversing rough terrains in Urban Search & Rescue missions (USAR). Since teleoperation of the complex morphology causes high cognitive load of the operator, the morphology is controlled autonomously. The autonomous control measures the robot state and surrounding terrain which is usually only partially observable, and thus the data are often incomplete. We marginalize the control over the missing measurements and evaluate an explicit safety condition. If the safety condition is violated, tactile terrain exploration by the body-mounted robotic arm gathers the missing data.

1612.01476 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Modeling and Control of an Autonomous Three Wheeled Mobile Robot with Front Steer

自主三轮移动机器人前轮转向的建模与控制

Ayush Pandey, Siddharth Jha, Debashish Chakravarty

AI总结 本文提出了一种自主三轮移动机器人的前轮转向建模与控制策略,设计了速度控制PID控制器并实现了数字控制框架,同时展示了轨迹控制框架,通过实验验证了系统模型并讨论了控制器性能与鲁棒性。

Comments IEEE International Conference on Robotic Computing 2017. (under review)

详情
AI中文摘要

本文提出了自主三轮移动机器人前轮转向的建模与控制策略。尽管三轮车前轮转向设计在公共运输车辆中很常见,但其在自主车辆导航和定位中的优势很少被利用。本文提出了此类机器人车辆的系统模型。为获得的模型设计了PID速度控制器,并在数字控制框架中实现。本文还提出了轨迹控制框架,这是此类三轮机器人的一大挑战。通过机器人车辆设计获得的实验结果验证了推导出的系统模型。还简要讨论了控制器性能和鲁棒性问题。

英文摘要

Modeling and control strategies for a design of an autonomous three wheeled mobile robot with front wheel steer is presented. Although, the three-wheel vehicle design with front wheel steer is common in automotive vehicles used often in public transport, but its advantages in navigation and localization of autonomous vehicles is seldom utilized. We present the system model for such a robotic vehicle. A PID controller for speed control is designed for the model obtained and has been implemented in a digital control framework. The trajectory control framework, which is a challenging task for such a three-wheeled robot has also been presented in the paper. The derived system model has been verified using experimental results obtained for the robot vehicle design. Controller performance and robustness issues have also been discussed briefly.

1612.01034 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Localization of networked robot systems subject to random delay and packet loss

网络机器人系统在随机延迟和丢包下的定位

Manh Duong Phung, Thi Thanh Van Nguyen, Thuan Hoang Tran, Quang Vinh Tran

AI总结 本文针对通信延迟和丢包影响下的移动机器人定位问题,提出统一的状态空间表示法和最优线性估计器,通过相关因子整合延迟测量,验证了方法在仿真和真实机器人系统中的有效性。

Comments In 2013 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM

详情
AI中文摘要

本文针对移动机器人在通信延迟和丢包下的定位问题,构建了统一的状态空间表示法以描述混合不确定性。基于该表示法,开发了最优线性估计器,通过相关因子整合延迟测量以提高估计精度。该估计器进一步扩展至非线性系统。在MATLAB仿真和真实机器人系统实验中验证了该方法的性能,良好的定位结果证明了其在网络化移动机器人定位中的有效性。

英文摘要

This paper deals with the localization problem of mobile robot subject to communication delay and packet loss. The delay and loss may appear in a random fashion in both control inputs and observation measurements. A unified state-space representation is constructed to describe these mixed uncertainties. Based on it, the optimal linear estimator is developed. The main idea is the derivation of a relevance factor to incorporate delayed measurements to the being estimate. The estimator is then extended for nonlinear systems. The performance of this method is tested within the simulations in MATLAB and the experiments in a real robot system. The good localization results prove the efficiency of the method for the purpose of localization of networked mobile robot.

1611.10007 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.CO math.DS 版本更新

Structural Controllability of Multi-Agent Networks: Robustness against Simultaneous Failures

多智能体网络的结构可控性:同时故障下的鲁棒性

M. Amin Rahimian, Amir G. Aghdam

AI总结 本文研究了多领导者多智能体系统的结构可控性,探讨了在通信链路和智能体同时失效时的保持性,提出联合(r,s)-可控性和联合t-可控性作为多智能体系统可靠性的量化指标。

详情
Journal ref
Automatica, Volume 49, Issue 11, 2013, Pages 3149-3157
AI中文摘要

本文从图论角度研究了具有多个领导者的领导者-跟随者多智能体系统的结构可控性。研究了在通信链路和智能体同时失效时结构可控性的保持问题。先前研究探讨了智能体和通信链路丢失对信息流图可控性的影响。本文利用相应结果,引入了一些有用的指数和重要性度量,帮助表征和量化单个链路和智能体在整体网络可控性中的作用。现有结果通过考虑链路和智能体同时丢失的影响进行了扩展。为此,引入了联合(r,s)-可控性和联合t-可控性作为多智能体系统可靠性的量化指标,并研究了其重要性质。最后,引入了联合临界有向图的概念,并指出如果一个有向图是联合临界,则联合t-可控性是任何一组链路和智能体失效(数量小于t)后保持可控的必要且充分条件。文中通过各种例子阐述了分析结果。

英文摘要

In this paper, structural controllability of a leader-follower multi-agent system with multiple leaders is studied from a graph-theoretic point of view. The problem of preservation of structural controllability under simultaneous failures in both the communication links and the agents is investigated. The effects of the loss of agents and communication links on the controllability of an information flow graph are previously studied. In this work, the corresponding results are exploited to introduce some useful indices and importance measures that help characterize and quantify the role of individual links and agents in the controllability of the overall network. Existing results are then extended by considering the effects of losses in both links and agents at the same time. To this end, the concepts of joint (r,s)-controllability and joint t-controllability are introduced as quantitative measures of reliability for a multi-agent system, and their important properties are investigated. Lastly, the class of jointly critical digraphs is introduced and it is stated that if a digraph is jointly critical, then joint t-controllability is a necessary and sufficient condition for remaining controllable following the failure of any set of links and agents, with cardinality less than t. Various examples are exploited throughout the paper to elaborate on the analytical findings.

1611.09987 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.DS 版本更新

Digraphs with Distinguishable Dynamics under the Multi-Agent Agreement Protocol

具有可区分动态的有向图在多智能体协议下的研究

M. Amin Rahimian, Amir Ajorlou, Amir G. Aghdam

AI总结 研究在多智能体网络中,通过观察代理的输出响应区分有向图的能力,提出基于图论的矩阵树定理扩展,揭示了多智能体网络健康与故障状态间响应区分能力与信息流图中节点路径的关系。

详情
Journal ref
Asian Journal of Control, Volume 16, Issue 5, 2014, Pages 1300-1311
AI中文摘要

本文研究了在多智能体网络中,通过观察代理的输出响应区分有向图的能力。给定一个固定观察点,旨在找到足够的图形条件,使得网络信息流有向图中一组边的失效可区分于另一组边。当后者为空时,这对应于给定观察代理响应下前者边集的可检测性。在结果开发过程中,证明了代数图论中所有主子式矩阵树定理的一个强大扩展,该定理将有向图转换后的拉普拉斯矩阵的主子式与图中顶点间路径的数量和最短路径长度相关联。结果揭示了区分健康和故障多智能体网络响应能力与信息流有向图中节点路径之间复杂的关系。结果对受多重链路失效影响的多智能体系统的操作和设计有直接影响。通过仿真和示例来说明分析发现。

英文摘要

In this work, the ability to distinguish digraphs from the output response of some observing agents in a multi-agent network under the agreement protocol has been studied. Given a fixed observation point, it is desired to find sufficient graphical conditions under which the failure of a set of edges in the network information flow digraph is distinguishable from another set. When the latter is empty, this corresponds to the detectability of the former link set given the response of the observing agent. In developing the results, a powerful extension of the all-minors matrix tree theorem in algebraic graph theory is proved which relates the minors of the transformed Laplacian of a directed graph to the number and length of the shortest paths between its vertices. The results reveal an intricate relationship between the ability to distinguish the responses of a healthy and a faulty multi-agent network and the inter-nodal paths in their information flow digraphs. The results have direct implications for the operation and design of multi-agent systems subject to multiple link losses. Simulations and examples are presented to illustrate the analytic findings.

1611.09436 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Proposal of algorithms for navigation and obstacles avoidance of autonomous mobile robot

自主移动机器人导航与障碍物避障算法的提出

T. T. Hoang, D. T. Hiep, P. M. Duong, N. T. T. Van, B. G. Duong, T. Q. Vinh

AI总结 本文提出算法用于室内自主移动机器人导航与避障,利用激光雷达获取3D环境图像,通过改进的向量场直方图算法生成2D地图并控制轨迹跟踪,实验结果良好。

Comments In 2013 8th IEEE Conference on Industrial Electronics and Applications (ICIEA)

详情
AI中文摘要

本文提出了用于室内自主移动机器人导航和避障的算法。使用激光雷达获取环境的3D图像,提出新的3D到2D图像压力和障碍物检测(IPaBD)算法,用于生成2D全局地图。该地图是设计轨迹的基础。开发了跟踪控制器以使机器人跟随轨迹。利用超声波传感器处理障碍物避障问题。提出了改进的向量场直方图(改进的VFH)算法,以克服原始VFH的一些限制。已进行实验,结果令人鼓舞。

英文摘要

This paper presents algorithms to navigate and avoid obstacles for an in-door autonomous mobile robot. A laser range finder is used to obtain 3D images of the environment. A new algorithm, namely 3D-to-2D image pressure and barriers detection (IPaBD), is proposed to create a 2D global map from the 3D images. This map is basic to design the trajectory. A tracking controller is developed to control the robot to follow the trajectory. The obstacle avoidance is addressed with the use of sonar sensors. An improved vector field histogram (Improved-VFH) algorithm is presented with improvements to overcome some limitations of the original VFH. Experiments have been conducted and the result is encouraged.

1611.09433 2026-06-04 cs.RO cs.HC cs.SY eess.SY 版本更新

A novel platform for internet-based mobile robot systems

一种基于互联网的移动机器人系统新平台

P. M. Duong, T. T. Hoang, N. T. T. Van, D. A. Viet, T. Q. Vinh

AI总结 本文提出一种基于互联网的移动机器人系统软硬件架构,通过3G网络连接多传感器智能机器人,采用客户端-服务器架构实现数据传输,并通过避障和安全点达成等自主机制确保安全,为远程控制算法等研究提供实验平台。

Comments In 2012 7th IEEE Conference on Industrial Electronics and Applications (ICIEA)

详情
AI中文摘要

本文介绍了一种用于在线移动机器人系统的软硬件结构。硬件部分主要由通过3G移动网络连接到互联网的多传感器智能机器人组成。系统采用客户端-服务器软件架构,客户端和服务器之间的数据传输通过不同的传输协议进行。自主机制如避障和安全点达成被实现以确保机器人安全。该架构已在真实互联网上投入使用,初步结果令人鼓舞。通过采用这种结构,将非常容易构建用于研究各种远程操作主题(如远程控制算法、界面设计、网络协议和应用等)的实验平台。

英文摘要

In this paper, we introduce a software and hardware structure for on-line mobile robotic systems. The hardware mainly consists of a Multi-Sensor Smart Robot connected to the Internet through 3G mobile network. The system employs a client-server software architecture in which the exchanged data between the client and the server is transmitted through different transport protocols. Autonomous mechanisms such as obstacle avoidance and safe-point achievement are implemented to ensure the robot safety. This architecture is put into operation on the real Internet and the preliminary result is promising. By adopting this structure, it will be very easy to construct an experimental platform for the research on diverse tele-operation topics such as remote control algorithms, interface designs, network protocols and applications etc.

1611.09431 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Localization of a unicycle-like mobile robot using LRF and omni-directional camera

使用LRF和 omnidirectional相机对类双轮车移动机器人进行定位

Tran Hiep Dinh, Manh Duong Phung, Thuan Hoang Tran, Quang Vinh Tran

AI总结 本文提出利用扩展卡尔曼滤波器对配备LRF和 omnidirectional相机的类双轮车移动机器人进行定位,通过改进的最小二乘二次方法提取环境线段并利用线匹配算法提高定位精度。

Comments In 2012 IEEE International Conference on Control System, Computing and Engineering (ICCSCE)

详情
AI中文摘要

本文针对定位问题,采用扩展卡尔曼滤波器(EKF)对配备激光雷达(LRF)传感器和 omnidirectional相机的类双轮车移动机器人进行定位。LRF用于扫描环境并用线段描述,线段通过改进的最小二乘二次方法提取,其中引入动态阈值。相机用于确定机器人方位。EKF的预测步骤通过提取机器人运动学模型和输入信号参数进行,修正步骤通过实现线匹配算法和比较局部与全局地图中线段参数进行。线匹配算法中引入转换矩阵以降低计算成本。在真实移动机器人系统中进行了实验,结果证明了该方法在定位中的适用性。

英文摘要

This paper addresses the localization problem. The extended Kalman filter (EKF) is employed to localize a unicycle-like mobile robot equipped with a laser range finder (LRF) sensor and an omni-directional camera. The LRF is used to scan the environment which is described with line segments. The segments are extracted by a modified least square quadratic method in which a dynamic threshold is injected. The camera is employed to determine the robot's orientation. The prediction step of the EKF is performed by extracting parameters from the kinematic model and input signal of the robot. The correction step is conducted with the implementation of a line matching algorithm and the comparison between line's parameters of the local and global maps. In the line matching algorithm, a conversion matrix is introduced to reduce the computation cost. Experiments have been carried out in a real mobile robot system and the results prove the applicability of the method for the purpose of localization.

1611.09424 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Development of a multi-sensor perceptual system for mobile robot and EKF-based localization

移动机器人多传感器感知系统的发展及基于EKF的定位方法

T. T. Hoang, P. M. Duong, N. T. T. Van, D. A. Viet, T. Q. Vinh

AI总结 本文提出一种基于现代传感器和多点通信通道的移动机器人感知系统,采用传感器融合模型处理数据以提高机器人定位与控制的准确性,通过扩展卡尔曼滤波器优化系统状态。

Comments In 2012 International Conference on Systems and Informatics (ICSAI). arXiv admin note: substantial text overlap with arXiv:1611.07112, arXiv:1611.07114

详情
AI中文摘要

本文介绍了移动机器人感知系统的设计与实现,利用现代传感器和多点通信通道获取数据,并通过传感器融合模型处理以获得对机器人定位和控制有意义的信息。由于数据获取的不确定性,应用扩展卡尔曼滤波器以获得系统的最优状态。已进行了若干实验,结果证实了感知系统的有效运行和卡尔曼滤波方法的高效性。

英文摘要

This paper presents the design and implementation of a perceptual system for the mobile robot using modern sensors and multi-point communication channels. The data extracted from the perceptual system is processed by a sensor fusion model to obtain meaningful information for the robot localization and control. Due to the uncertainties of acquiring data, an extended Kalman filter was applied to get optimal states of the system. Several experiments have been conducted and the results confirmed the functioning operation of the perceptual system and the efficiency of the Kalman filter approach.

1311.3979 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Precision improvement of MEMS gyros for indoor mobile robots with horizontal motion inspired by methods of TRIZ

为室内移动机器人改进MEMS陀螺仪精度的水平运动启发的TRIZ方法

Dongmyoung Shin, Sung Gil Park, Byung Soo Song, Eung Su Kim, Oleg Kupervasser, Denis Pivovartchuk, Ilya Gartseev, Oleg Antipov, Evgeniy Kruchenkov, Alexey Milovanov, Andrey Kochetov, Igor Sazonov, Igor Nogtev, Sun Woo Hyun

AI总结 本文利用TRIZ方法解决室内移动机器人中水平运动下MEMS陀螺仪精度提升问题,通过创新方法提高传感器性能。

Comments 6 pages, the paper is accepted to 9th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, Hawaii, USA (IEEE-NEMS 2014) as an oral presentation

详情
Journal ref
Proceedings of 9th IEEE International Conference on Nano/Micro Engineered and Molecular Systems (IEEE-NEMS 2014) April 13-16, 2014,Hawaii,USA, pp 102-107
AI中文摘要

在本文中,通过TRIZ(

英文摘要

In the paper, the problem of precision improvement for the MEMS gyrosensors on indoor robots with horizontal motion is solved by methods of TRIZ ("the theory of inventive problem solving").

1308.0037 2026-06-04 eess.SY cs.MA cs.NI cs.RO cs.SY math.OC 版本更新

Route Swarm: Wireless Network Optimization through Mobility

路由蜂群:通过移动性实现无线网络优化

Ryan K. Williams, Andrea Gasparri, Bhaskar Krishnamachari

AI总结 本文提出一种混合架构,用于协调网络化机器人在传感和信息路由中的应用。通过动态重构机器人网络,确保静态无线节点间的高质量路由,实现信息流与物理控制的解耦。

Comments 9 pages, 4 figures, submitted to the IEEE International Conference on Intelligent Robots and Systems (IROS) 2014

详情
AI中文摘要

在本文中,我们展示了一种新颖的混合架构,用于协调网络化机器人在传感和信息路由应用中的协同工作。所提出的信息和传感驱动的物理可重构机器人网络(INSPIRE)由物理控制平面(PCP)和信息控制平面(ICP)组成。PCP负责控制代理位置,ICP负责调节信息流向通信/传感目标。我们描述了一个实例,其中移动机器人网络动态重构以确保静态无线节点之间的高质量路由,这些节点作为信息流的源/目的地对。ICP指挥机器人向均匀分布的流分配配置,而流内的配置则最大化路由质量。PCP则通过基于势能的控制引导机器人,以根据ICP命令进行重构。这种称为路由蜂群的公式解耦了信息流和物理控制,从而在路由和传感需求以及机器人配置之间产生反馈。我们通过在现实无线网络环境下进行模拟来验证我们的提案。

英文摘要

In this paper, we demonstrate a novel hybrid architecture for coordinating networked robots in sensing and information routing applications. The proposed INformation and Sensing driven PhysIcally REconfigurable robotic network (INSPIRE), consists of a Physical Control Plane (PCP) which commands agent position, and an Information Control Plane (ICP) which regulates information flow towards communication/sensing objectives. We describe an instantiation where a mobile robotic network is dynamically reconfigured to ensure high quality routes between static wireless nodes, which act as source/destination pairs for information flow. The ICP commands the robots towards evenly distributed inter-flow allocations, with intra-flow configurations that maximize route quality. The PCP then guides the robots via potential-based control to reconfigure according to ICP commands. This formulation, deemed Route Swarm, decouples information flow and physical control, generating a feedback between routing and sensing needs and robotic configuration. We demonstrate our propositions through simulation under a realistic wireless network regime.

1511.02547 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Decentralized Algorithms for 3D Symmetric Formations in Robotic Networks: a Contraction Theory Approach

分布式算法用于机器人网络中的3D对称编队:一种收缩理论方法

Sumeet Singh, Edward Schmerling, Marco Pavone

AI总结 本文提出分布式算法用于多机器人三维对称编队控制,结合循环追捕数学性质和收缩理论,确保全局收敛至对称编队,并通过仿真验证。

Comments Submitted to IEEE Transactions in Robotics

详情
AI中文摘要

本文提出用于多机器人三维编队控制的分布式算法。我们利用循环追捕的数学性质以及收缩和部分收缩理论的结果,设计出确保全局收敛到对称编队的分布式控制算法。首先考虑正多边形编队作为基础案例,然后扩展到约翰逊固体和其他多边形网格编队。算法进一步增强以控制编队大小并避免编队内其他机器人碰撞。在存在有界加性扰动的情况下评估算法的鲁棒性,并量化其对编队质量的影响。最后,我们提出一种将控制律嵌入复杂动态系统(如四旋翼无人机)的一般方法,并通过仿真和四旋翼无人机车队实验验证该方法。

英文摘要

This paper presents decentralized algorithms for formation control of multiple robots in three dimensions. Specifically, we leverage the mathematical properties of cyclic pursuit along with results from contraction and partial contraction theory to design decentralized control algorithms that ensure global convergence to symmetric formations. We first consider regular polygon formations as a base case, and then extend the results to Johnson solid and other polygonal mesh formations. The algorithms are further augmented to allow control over formation size and avoid collisions with other robots in the formation. The robustness properties of the algorithms are assessed in the presence of bounded additive disturbances and their effect on the quality of the formation is quantified. Finally, we present a general methodology for embedding the control laws on complex dynamical systems, in this case, quadcopters, and validate this approach via simulations and experiments on a fleet of quadcopters.

1510.06469 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Optimal Temporal Logic Planning in Probabilistic Semantic Maps

在概率语义地图中进行最优时间逻辑规划

Jie Fu, Nikolay Atanasov, Ufuk Topcu, George J. Pappas

AI总结 本文研究了在概率语义地图中基于时间逻辑约束的机器人运动规划问题,提出通过引入置信度参数delta将随机控制问题转化为确定性最短路径问题,并设计启发函数以提高A*算法的效率和正确性。

Comments 8 pages, 6 figures. submitted to IEEE International Conference on Robotics and Automation 2016

详情
AI中文摘要

本文考虑了在通过语义同时定位与建图(SLAM)获得的概率地图中,机器人运动规划受时间逻辑约束的问题。地图分布的不确定性对保证线性时间逻辑(LTL)规范的正确性构成重大挑战。我们展示该问题可转化为一个最优控制问题,其中语义地图和逻辑公式评估都是随机的。我们的第一项贡献是通过引入置信度参数delta,将LTL子类的随机控制问题转化为确定性最短路径问题。从确定性问题获得的机器人轨迹在真实环境中具有最小成本并以概率delta满足逻辑规范。我们的第二项贡献是设计了一个可接受的启发函数,引导确定性问题的规划朝着满足时间逻辑规范的方向进行。这使我们能够使用A*算法获得最优且高效的解决方案。在模拟语义环境中使用差分驱动机器人验证了我们方法的性能和正确性。

英文摘要

This paper considers robot motion planning under temporal logic constraints in probabilistic maps obtained by semantic simultaneous localization and mapping (SLAM). The uncertainty in a map distribution presents a great challenge for obtaining correctness guarantees with respect to the linear temporal logic (LTL) specification. We show that the problem can be formulated as an optimal control problem in which both the semantic map and the logic formula evaluation are stochastic. Our first contribution is to reduce the stochastic control problem for a subclass of LTL to a deterministic shortest path problem by introducing a confidence parameter $delta$. A robot trajectory obtained from the deterministic problem is guaranteed to have minimum cost and to satisfy the logic specification in the true environment with probability $delta$. Our second contribution is to design an admissible heuristic function that guides the planning in the deterministic problem towards satisfying the temporal logic specification. This allows us to obtain an optimal and very efficient solution using the A* algorithm. The performance and correctness of our approach are demonstrated in a simulated semantic environment using a differential-drive robot.

1506.08762 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Passivity-Based Adaptive Control for Visually Servoed Robotic Systems

基于被动性的自适应控制用于视觉伺服机器人系统

Hanlei Wang

AI总结 本文研究了具有不确定运动学、动力学和相机参数的机器人系统视觉伺服问题,提出两种基于被动性的自适应控制方案,通过Lyapunov分析证明在无需假设估计深度可逆的情况下,图像空间跟踪误差收敛至零。

Comments 18 pages, 7 figures, revised for improving the presentation and correcting several typos based on the reviewers' and AE's comments from IEEE Transactions on Automatic Control

详情
AI中文摘要

本文研究了具有不确定运动学、动力学和相机参数的机器人系统视觉伺服问题。我们首先提出了与系统整体运动学相关的被动性性质,然后提出两种基于被动性的自适应控制方案以解决视觉跟踪问题。一种方案采用自适应逆雅可比反馈,另一种采用自适应转置雅可比反馈。通过Lyapunov分析方法,证明在所提出的任一控制方案下,图像空间跟踪误差收敛至零,而无需依赖估计深度的可逆性假设。数值模拟用于展示所提出的自适应控制器的跟踪性能。

英文摘要

This paper investigates the visual servoing problem for robotic systems with uncertain kinematic, dynamic, and camera parameters. We first present the passivity properties associated with the overall kinematics of the system, and then propose two passivity-based adaptive control schemes to resolve the visual tracking problem. One scheme employs the adaptive inverse-Jacobian-like feedback, and the other employs the adaptive transpose Jacobian feedback. With the Lyapunov analysis approach, it is shown that under either of the proposed control schemes, the image-space tracking errors converge to zero without relying on the assumption of the invertibility of the estimated depth. Numerical simulations are performed to show the tracking performance of the proposed adaptive controllers.

1611.05095 2026-06-04 cs.LG cs.RO cs.SY eess.SY 版本更新

Learning Dexterous Manipulation Policies from Experience and Imitation

从经验与模仿中学习灵巧操作策略

Vikash Kumar, Abhishek Gupta, Emanuel Todorov, Sergey Levine

AI总结 本文研究了通过经验与模仿学习反馈控制灵巧五指手非抓取操作的任务,提出基于轨迹优化的局部控制器,并通过深度学习和最近邻方法进行泛化,展示了小数据训练下的有效性和盲操作优势。

Comments Initial draft for a journal submission

详情
AI中文摘要

我们探索了基于学习的反馈控制方法,用于控制执行非抓取操作的灵巧五指手。首先,我们学习了能够从预定义初始状态开始执行任务的局部控制器。这些控制器是通过轨迹优化构建的,基于从传感器数据直接学习到的局部线性时变模型。在某些情况下,我们使用通过虚拟环境中的遥控收集的人类示范来初始化优化器。我们证明,这些控制器在模拟和物理平台上都能在初始条件的有限范围内稳健地执行任务。然后,我们考虑了两种泛化方法:深度学习和最近邻。我们发现最近邻方法性能更高。然而,神经网络也有其优势:它仅使用触觉和本体感觉反馈,而没有关于物体的视觉反馈(即盲操作),并且学习了一个时间不变的策略。相比之下,最近邻方法根据运动捕捉感知的初始物体状态切换时间变化的局部控制器。尽管两种泛化方法仍有改进空间,我们的工作表明(i)复杂的非抓取操作任务的局部轨迹控制器可以从惊人的少量训练数据中构建,(ii)此类控制器的集合可以插值形成更全局的控制器。结果总结在补充视频中:https://youtu.be/E0wmO6deqjo

英文摘要

We explore learning-based approaches for feedback control of a dexterous five-finger hand performing non-prehensile manipulation. First, we learn local controllers that are able to perform the task starting at a predefined initial state. These controllers are constructed using trajectory optimization with respect to locally-linear time-varying models learned directly from sensor data. In some cases, we initialize the optimizer with human demonstrations collected via teleoperation in a virtual environment. We demonstrate that such controllers can perform the task robustly, both in simulation and on the physical platform, for a limited range of initial conditions around the trained starting state. We then consider two interpolation methods for generalizing to a wider range of initial conditions: deep learning, and nearest neighbors. We find that nearest neighbors achieve higher performance. Nevertheless, the neural network has its advantages: it uses only tactile and proprioceptive feedback but no visual feedback about the object (i.e. it performs the task blind) and learns a time-invariant policy. In contrast, the nearest neighbors method switches between time-varying local controllers based on the proximity of initial object states sensed via motion capture. While both generalization methods leave room for improvement, our work shows that (i) local trajectory-based controllers for complex non-prehensile manipulation tasks can be constructed from surprisingly small amounts of training data, and (ii) collections of such controllers can be interpolated to form more global controllers. Results are summarized in the supplementary video: https://youtu.be/E0wmO6deqjo

1308.5133 2026-06-04 cs.RO cs.NE cs.SY eess.SY 版本更新

Performance Measurement Under Increasing Environmental Uncertainty In The Context of Interval Type-2 Fuzzy Logic Based Robotic Sailing

在区间型2模糊逻辑基础上的机器人航海中环境不确定性增加时的性能测量

Naisan Benatar, Uwe Aickelin, Jonathan M. Garibald

AI总结 本文探讨了在环境不确定性显著变化时,传统性能指标如RMSE的不足,提出更复杂的比较方法,应用于机器人控制问题,证明其比简单方法更稳健。

Comments International Conference on Fuzzy Systems 2013 (Fuzz-IEEE 2013)

详情
AI中文摘要

基于模糊逻辑的机器人控制器在不确定环境下的性能测量是一个被当前文献忽视的领域。本文指出标准指标如RMSE在环境不确定性显著变化时不合适。本文综述了其他作者应用的方法,设计了更复杂的比较方法,并将其应用于机器人控制问题,与单一指标进行比较。结果表明,所描述的技术比更简单的方法提供了更稳健的性能比较,允许更好的比较。

英文摘要

Performance measurement of robotic controllers based on fuzzy logic, operating under uncertainty, is a subject area which has been somewhat ignored in the current literature. In this paper standard measures such as RMSE are shown to be inappropriate for use under conditions where the environmental uncertainty changes significantly between experiments. An overview of current methods which have been applied by other authors is presented, followed by a design of a more sophisticated method of comparison. This method is then applied to a robotic control problem to observe its outcome compared with a single measure. Results show that the technique described provides a more robust method of performance comparison than less complex methods allowing better comparisons to be drawn.

1611.03372 2026-06-04 cs.RO cs.AI cs.SE cs.SY eess.SY 版本更新

A stochastically verifiable autonomous control architecture with reasoning

一种具有推理能力的随机可验证自主控制架构

Paolo Izzo, Hongyang Qu, Sandor M. Veres

AI总结 本文提出一种具有推理能力的随机可验证自主控制架构LISA,通过将系统抽象为DTMC和MDP模型,实现代理与环境的概率验证,提升设计与运行时的验证效率。

Comments Accepted at IEEE Conf. Decision and Control, 2016

详情
AI中文摘要

本文介绍了一种名为有限指令集代理(LISA)的新代理架构,用于自主控制。该架构基于先前的AgentSpeak实现,结构比其前身更简单,旨在促进设计时和运行时的验证方法。研究并展示了将LISA系统抽象为两种不同的离散概率模型(DTMC和MDP)的过程。LISA系统为代理和环境的完整建模提供了工具,用于概率验证。代理程序可以自动编译为DTMC或MDP模型进行验证,使用Prism工具。自动生成的Prism模型可用于设计时和运行时的验证。运行时验证在LISA系统中作为内部建模机制,用于预测未来的 outcomes。

英文摘要

A new agent architecture called Limited Instruction Set Agent (LISA) is introduced for autonomous control. The new architecture is based on previous implementations of AgentSpeak and it is structurally simpler than its predecessors with the aim of facilitating design-time and run-time verification methods. The process of abstracting the LISA system to two different types of discrete probabilistic models (DTMC and MDP) is investigated and illustrated. The LISA system provides a tool for complete modelling of the agent and the environment for probabilistic verification. The agent program can be automatically compiled into a DTMC or a MDP model for verification with Prism. The automatically generated Prism model can be used for both design-time and run-time verification. The run-time verification is investigated and illustrated in the LISA system as an internal modelling mechanism for prediction of future outcomes.

1609.07015 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Distributed Consistent Data Association

分布式一致数据关联

Spyridon Leonardos, Xiaowei Zhou, Kostas Daniilidis

AI总结 本文提出两种去中心化方法,通过一致的全局数据关联提升多传感器系统准确性,结合理论分析和实验验证方法有效性。

详情
AI中文摘要

数据关联是多传感器系统中的基本问题。当前技术依赖于成对数据关联,即使使用异常值拒绝方案,仍可能产生虚假关联。同时考虑多个成对关联显著提高准确性并导致一致性。本文提出两种完全去中心化的方法,用于从成对数据关联中实现一致的全局数据关联。第一种方法是在双重随机矩阵集上的共识算法。第二种方法是Pachauri等人提出的谱方法的去中心化版本。我们通过理论分析和实验评估证明了两种方法的有效性。

英文摘要

Data association is one of the fundamental problems in multi-sensor systems. Most current techniques rely on pairwise data associations which can be spurious even after the employment of outlier rejection schemes. Considering multiple pairwise associations at once significantly increases accuracy and leads to consistency. In this work, we propose two fully decentralized methods for consistent global data association from pairwise data associations. The first method is a consensus algorithm on the set of doubly stochastic matrices. The second method is a decentralization of the spectral method proposed by Pachauri et al.. We demonstrate the effectiveness of both methods using theoretical analysis and experimental evaluation.

1610.05863 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Learning Quadrotor Dynamics Using Neural Network for Flight Control

使用神经网络学习四旋翼动力学用于飞行控制

Somil Bansal, Anayo K. Akametalu, Frank J. Jiang, Forrest Laine, Claire J. Tomlin

AI总结 本文研究通过神经网络学习四旋翼动力学以合成不同轨迹的控制器,验证了仅使用平移和旋转轨迹训练模型的有效性。

详情
AI中文摘要

传统四旋翼或直升机控制学习方法聚焦于通过迭代改进名义控制器提升特定轨迹性能,如学习示范、迭代学习和强化学习。然而,这些方案中训练轨迹获取的信息如何用于合成更一般轨迹的控制器尚不明确。最近研究表明深度学习在推断直升机动力学方面的有效性。受深度学习泛化能力的启发,本文探讨是否可以利用基于神经网络的动力学模型来合成不同于训练轨迹的控制器。为此,我们仅使用平移和仅使用旋转训练轨迹分别学习四旋翼动力学模型,每种轨迹均可独立控制,然后用该模型同时控制四旋翼的偏航和位置,这因两种运动间的非线性耦合而具有挑战性。我们通过四旋翼测试平台实验验证了本方法。

英文摘要

Traditional learning approaches proposed for controlling quadrotors or helicopters have focused on improving performance for specific trajectories by iteratively improving upon a nominal controller, for example learning from demonstrations, iterative learning, and reinforcement learning. In these schemes, however, it is not clear how the information gathered from the training trajectories can be used to synthesize controllers for more general trajectories. Recently, the efficacy of deep learning in inferring helicopter dynamics has been shown. Motivated by the generalization capability of deep learning, this paper investigates whether a neural network based dynamics model can be employed to synthesize control for trajectories different than those used for training. To test this, we learn a quadrotor dynamics model using only translational and only rotational training trajectories, each of which can be controlled independently, and then use it to simultaneously control the yaw and position of a quadrotor, which is non-trivial because of nonlinear couplings between the two motions. We validate our approach in experiments on a quadrotor testbed.

1610.03518 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model

通过学习深度逆动力学模型实现仿真到现实世界的迁移

Paul Christiano, Zain Shah, Igor Mordatch, Jonas Schneider, Trevor Blackwell, Joshua Tobin, Pieter Abbeel, Wojciech Zaremba

AI总结 本文提出通过学习深度逆动力学模型,在仿真与现实世界之间实现控制策略的迁移,解决仿真与现实差异导致的性能下降问题。

详情
AI中文摘要

在仿真中开发控制策略通常比直接在现实世界中运行实验更实用和安全。这适用于通过规划和优化获得的策略,甚至更适用于通过强化学习获得的策略,后者通常非常数据密集。然而,仿真中成功的策略在部署到现实机器人时往往无法工作。然而,策略在仿真中执行的整体思路在现实世界中通常仍然有效。本文研究了此类场景,其中仿真中遍历的状态序列在现实世界中仍然合理,即使控制细节不同,例如摩擦、接触、质量和几何属性的差异。在执行过程中,我们的方法在每个时间步计算仿真基于的控制策略会做什么,但不执行这些控制在现实机器人上,而是计算仿真期望的下一个状态,并依赖于学习的深度逆动力学模型来决定最合适的现实世界动作以达到这些状态。深度模型只有在训练数据足够的情况下才有效,我们还提出了一种数据收集方法来(逐步)学习深度逆动力学模型。我们的实验表明,我们的方法在处理仿真到现实世界模型差异的各种基线方法中表现良好,包括输出误差控制和高斯动态适应。

英文摘要

Developing control policies in simulation is often more practical and safer than directly running experiments in the real world. This applies to policies obtained from planning and optimization, and even more so to policies obtained from reinforcement learning, which is often very data demanding. However, a policy that succeeds in simulation often doesn't work when deployed on a real robot. Nevertheless, often the overall gist of what the policy does in simulation remains valid in the real world. In this paper we investigate such settings, where the sequence of states traversed in simulation remains reasonable for the real world, even if the details of the controls are not, as could be the case when the key differences lie in detailed friction, contact, mass and geometry properties. During execution, at each time step our approach computes what the simulation-based control policy would do, but then, rather than executing these controls on the real robot, our approach computes what the simulation expects the resulting next state(s) will be, and then relies on a learned deep inverse dynamics model to decide which real-world action is most suitable to achieve those next states. Deep models are only as good as their training data, and we also propose an approach for data collection to (incrementally) learn the deep inverse dynamics model. Our experiments shows our approach compares favorably with various baselines that have been developed for dealing with simulation to real world model discrepancy, including output error control and Gaussian dynamics adaptation.

1610.03028 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Notes on geometry of locomotion of 3-dimensional version of the Purcell's swimmer

关于三维Purcell游泳者运动几何的笔记

Sudin Kadam, Ravi Banavar

AI总结 本文提出三维Purcell游泳者模型,结合Cox理论和阻力力理论,推导出纯运动学方程,用于低雷诺数下的运动分析。

Comments These are notes, and have not been submitted for any kind of publication

详情
AI中文摘要

我们提出了一种广义的三维Purcell游泳者模型,该模型是一个平面机制,在低雷诺数 regime 下进行运动。我们使用Cox理论和阻力力理论来推导系统作用的力。最终,我们得出系统方程的纯运动学形式。

英文摘要

We present a generalized, 3 dimensional version of the Purcell's swimmer which is a planar mechanism locomoting at low Reynlods number regime. We use Cox theory and resistive force theory to come up with the forces acting on the system. We finally come up with a purely kinematic form of the system's equations.

1610.01243 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

On the Construction of Safe Controllable Regions for Affine Systems with Applications to Robotics

关于仿射系统安全可控区域构造及其在机器人中的应用

Mohamed K. Helwa, Angela P. Schoellig

AI总结 本文研究了构造仿射系统在块内可控区域的问题,提出了一种计算高效的算法,并通过实验验证了其在机器人系统中的有效性。

Comments 17 pages, 18 figures, under review for publication in Automatica

详情
AI中文摘要

本文研究了构造仿射系统在块内可控区域的问题。即,我们关注构造仿射系统的状态空间中的区域,使得区域内所有状态通过区域内部应用均匀有界输入相互可达。我们首先表明现有结果在给定多边形区域上检查块内可控性无法轻易扩展到构造块内可控区域的问题。然后探索问题的几何特性,提供一种计算高效的构造块内可控区域的算法。我们还证明了该算法的正确性。随后,我们使用所提算法为不同类型的机器人系统构造安全速度剖面,包括完全驱动机器人、地面机器人(建模为具有加速度限制的双轮车)和无人驾驶航空器(UAVs)。最后,我们展示了UAVs上的几个实验结果以验证所提算法的有效性。例如,我们使用所提算法为UAVs实现实时避障。

英文摘要

This paper studies the problem of constructing in-block controllable (IBC) regions for affine systems. That is, we are concerned with constructing regions in the state space of affine systems such that all the states in the interior of the region are mutually accessible through the region's interior by applying uniformly bounded inputs. We first show that existing results for checking in-block controllability on given polytopic regions cannot be easily extended to address the question of constructing IBC regions. We then explore the geometry of the problem to provide a computationally efficient algorithm for constructing IBC regions. We also prove the soundness of the algorithm. We then use the proposed algorithm to construct safe speed profiles for different robotic systems, including fully-actuated robots, ground robots modeled as unicycles with acceleration limits, and unmanned aerial vehicles (UAVs). Finally, we present several experimental results on UAVs to verify the effectiveness of the proposed algorithm. For instance, we use the proposed algorithm for real-time collision avoidance for UAVs.

1610.01045 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

A Game-Theoretic Approach to Robust Fusion and Kalman Filtering Under Unknown Correlations

基于博弈论的鲁棒融合与卡尔曼滤波在未知相关性下的方法

Spyridon Leonardos, Kostas Daniilidis

AI总结 本文提出一种博弈论方法,用于在未知交叉相关性下融合两个随机向量,并在分布式状态估计中展示了其优于协方差交集的性能。

详情
AI中文摘要

本文解决了融合两个具有未知交叉相关性的随机向量的问题。我们提出了一个公式和数值方法,用于计算最小最大意义下的最优估计。我们扩展了该公式到依赖于两个具有未知交叉相关性的随机向量的线性测量模型。作为应用,我们考虑了多个智能体的分布式状态估计问题。所提出的估计器在考虑交叉相关性的同时比广泛使用的协方差交集方法更不保守。我们通过分布式状态估计中的相对位置测量的数值示例和模拟展示了所提方法的优越性。

英文摘要

This work addresses the problem of fusing two random vectors with unknown cross-correlations. We present a formulation and a numerical method for computing the optimal estimate in the minimax sense. We extend our formulation to linear measurement models that depend on two random vectors with unknown cross-correlations. As an application we consider the problem of decentralized state estimation for a group of agents. The proposed estimator takes cross-correlations into account while being less conservative than the widely used Covariance Intersection. We demonstrate the superiority of the proposed method compared to Covariance Intersection with numerical examples and simulations within the specific application of decentralized state estimation using relative position measurements.

1609.08536 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Scheduling Nonlinear Sensors for Stochastic Process Estimation

为随机过程估计调度非线性传感器

Vasileios Tzoumas, Nikolay A. Atanasov, Ali Jadbabaie, George J. Pappas

AI总结 本文研究在众多可用传感器中仅激活少数传感器以估计随机过程状态的问题,提出了一种时间复杂度与规划周期线性相关的算法,性能优于多项式时间算法,且在高斯过程等场景下具有与最优算法相当的效率。

Comments Corrected typos in proof of Theorem 1; submitted for conference publication. arXiv admin note: text overlap with arXiv:1608.07533

详情
AI中文摘要

本文聚焦于在众多可用传感器中仅激活少量传感器以估计感兴趣随机过程状态的问题。该问题在目标跟踪和同时定位与建图(SLAM)等应用中具有重要性。由于涉及演化未知的随机系统、非线性测量传感器以及限制的操作资源,该问题具有挑战性。我们提供了一种适用于一般随机过程和非线性测量的算法,其时间复杂度与规划周期线性相关,性能与最优性能相差一个乘法因子1/2。这值得注意,因为该算法在计算优势上显著优于达到最佳近似因子1/e的多项式时间算法。此外,对于重要的高斯过程和受高斯噪声污染的非线性测量,我们的算法具有与最先进线性系统和测量算法相同的时间复杂度。我们通过证明批量状态向量在测量条件下的熵的两个性质来实现我们的结果:a)在传感器选择上是超模的;b)具有稀疏模式(涉及块三对角矩阵)使其在每个传感器集合上易于评估。

英文摘要

In this paper, we focus on activating only a few sensors, among many available, to estimate the state of a stochastic process of interest. This problem is important in applications such as target tracking and simultaneous localization and mapping (SLAM). It is challenging since it involves stochastic systems whose evolution is largely unknown, sensors with nonlinear measurements, and limited operational resources that constrain the number of active sensors at each measurement step. We provide an algorithm applicable to general stochastic processes and nonlinear measurements whose time complexity is linear in the planning horizon and whose performance is a multiplicative factor 1/2 away from the optimal performance. This is notable because the algorithm offers a significant computational advantage over the polynomial-time algorithm that achieves the best approximation factor 1/e. In addition, for important classes of Gaussian processes and nonlinear measurements corrupted with Gaussian noise, our algorithm enjoys the same time complexity as even the state-of-the-art algorithms for linear systems and measurements. We achieve our results by proving two properties for the entropy of the batch state vector conditioned on the measurements: a) it is supermodular in the choice of the sensors; b) it has a sparsity pattern (involves block tri-diagonal matrices) that facilitates its evaluation at each sensor set.

1608.07533 2026-06-04 math.OC cs.RO cs.SY eess.SY math.DS 版本更新

Near-Optimal Sensor Scheduling for Batch State Estimation: Complexity, Algorithms, and Limits

近优传感器调度用于批量状态估计:复杂性、算法与极限

Vasileios Tzoumas, Ali Jadbabaie, George J. Pappas

AI总结 本文针对线性系统批量状态估计问题,提出一种在精度和时间复杂度上均优于现有算法的近优传感器调度算法,其解的质量接近最优解,并具有较低的计算复杂度。

Comments Correction of typos in proofs

详情
AI中文摘要

本文聚焦于线性系统的批量状态估计问题。该问题在环境场估计、机器人导航和目标跟踪等应用中具有重要意义。其难点在于传感器受限的操作资源(如共享的通信带宽或电池电量)限制了每个测量步骤中能同时工作的传感器数量。因此,必须采用传感器调度算法。然而,现有的批量状态估计传感器调度算法在系统规模和时间跨度上表现不佳。此外,尽管现有的卡尔曼滤波器传感器调度算法在扩展性上更好,但它们不提供性能保证或误差最小化的近似界限。本文的主要贡献之一是提供了一种算法,该算法在估计精度上具有批量状态调度算法的准确性,并在时间复杂度上具有卡尔曼滤波器调度算法的低复杂度。具体而言:1)我们的算法是近优的:它达到的解质量接近最优解的1/2倍,这一因子接近在多项式时间内可达到的最佳近似因子1/e;2)我们的算法的时间复杂度不仅低于现有的批量状态估计算法,而且也低于或与现有的卡尔曼滤波器调度算法的时间复杂度相当。我们通过证明两个关于我们的批量状态估计误差度量的性质来实现这些结果,该度量量化了批量状态向量的最小方差线性估计的平方误差:a)它在传感器选择上的超模性;b)它具有稀疏模式(涉及块三对角矩阵)的结构,这使得在每个传感器集上评估变得容易。

英文摘要

In this paper, we focus on batch state estimation for linear systems. This problem is important in applications such as environmental field estimation, robotic navigation, and target tracking. Its difficulty lies on that limited operational resources among the sensors, e.g., shared communication bandwidth or battery power, constrain the number of sensors that can be active at each measurement step. As a result, sensor scheduling algorithms must be employed. Notwithstanding, current sensor scheduling algorithms for batch state estimation scale poorly with the system size and the time horizon. In addition, current sensor scheduling algorithms for Kalman filtering, although they scale better, provide no performance guarantees or approximation bounds for the minimization of the batch state estimation error. In this paper, one of our main contributions is to provide an algorithm that enjoys both the estimation accuracy of the batch state scheduling algorithms and the low time complexity of the Kalman filtering scheduling algorithms. In particular: 1) our algorithm is near-optimal: it achieves a solution up to a multiplicative factor 1/2 from the optimal solution, and this factor is close to the best approximation factor 1/e one can achieve in polynomial time for this problem; 2) our algorithm has (polynomial) time complexity that is not only lower than that of the current algorithms for batch state estimation; it is also lower than, or similar to, that of the current algorithms for Kalman filtering. We achieve these results by proving two properties for our batch state estimation error metric, which quantifies the square error of the minimum variance linear estimator of the batch state vector: a) it is supermodular in the choice of the sensors; b) it has a sparsity pattern (it involves matrices that are block tri-diagonal) that facilitates its evaluation at each sensor set.

1512.01195 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Reachable Set Approach to Collision Avoidance for UAVs

基于可达集的无人机避撞方法

Yuchen Zhou, John S. Baras

AI总结 本文提出基于可达集的无人机避撞算法,通过凸优化求解控制约束集,解决多无人机避撞问题。

Comments CDC 2015 fixed-wing nonlinear dynamics extension. CDC 2015 DOI: 10.1109/CDC.2015.7403154

详情
AI中文摘要

本文提出了一种基于可达集的无人机避撞算法。无人机在农业研究、 surveillance 和灾害救援中广泛应用,因此必须具备 onboard 避撞能力以确保安全。不同于传统的轨迹避撞方法,本文提出基于可达集和管的避撞方案。我们将问题形式化为凸优化问题,以寻找适合参与飞行器的控制约束集。我们已将该方法应用于两个四旋翼和两个固定翼无人机的避撞场景研究。

英文摘要

In this paper, we propose a reachable set based collision avoidance algorithm for unmanned aerial vehicles (UAVs). UAVs have been deployed for agriculture research and management, surveillance and sensor coverage for threat detection and disaster search and rescue operations. It is essential for the aircraft to have on-board collision avoidance capability to guarantee safety. Instead of the traditional approach of collision avoidance between trajectories, we propose a collision avoidance scheme based on reachable sets and tubes. We then formulate the problem as a convex optimization problem seeking suitable control constraint sets for participating aircraft. We have applied the approach on a case study of two quadrotors and two fix-wing aircraft collision avoidance scenario.

1609.07437 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Distributed scaling control of rigid formations

分布式刚体编队的尺度控制

Hector Garcia de Marina, Bayu Jayawardhana, Ming Cao

AI总结 本文通过引入分布式参数操控刚体编队的稳态运动,结合图刚性理论和承载刚性理论实现编队的可控缩放。

Comments 6 pages In proceedings 55th Conference on Decision and Control, year 2016

详情
AI中文摘要

近年来,邻近代理间的偏置距离测量在梯度距离基编队控制中可能导致可预测的集体运动。本文利用这一效应,通过引入分布式参数到规定间距中,能够操控编队的稳态运动。这种操控表现为同时诱导恒定的平移和角速度组合以及可控的刚体编队缩放。翻译参数的计算基于已知的图刚性理论,而缩放参数则基于最近的承载刚性理论发现。我们对修改后的梯度系统进行了稳定性分析和仿真以验证主要结果。

英文摘要

Recently it has been reported that biased range-measurements among neighboring agents in the gradient distance-based formation control can lead to predictable collective motion. In this paper we take advantage of this effect and by introducing distributed parameters to the prescribed inter-distances we are able to manipulate the steady-state motion of the formation. This manipulation is in the form of inducing simultaneously the combination of constant translational and angular velocities and a controlled scaling of the rigid formation. While the computation of the distributed parameters for the translational and angular velocities is based on the well-known graph rigidity theory, the parameters responsible for the scaling are based on some recent findings in bearing rigidity theory. We carry out the stability analysis of the modified gradient system and simulations in order to validate the main result.

1609.07436 2026-06-04 cs.RO cs.SY eess.SY 版本更新

UAV attitude estimation using Unscented Kalman Filter and TRIAD

使用无迹卡尔曼滤波和TRIAD估计无人机姿态

Hector Garcia de Marina, Fernando J. Pereda, Jose Marina Giron-Sierra, Felipe Espinosa

AI总结 本文提出一种基于UKF和TRIAD算法的AHRS,通过仿真和实验证明其在微控制器上具有低计算成本和实时性能。

Comments 10 pages

详情
Journal ref
IEEE Transactions on Industrial Electronics, Volume 59, Issue 11, Pages 4465-4474, year 2012
AI中文摘要

自主车辆,尤其是无人机,面临确定姿态角的主要问题。本文提出一种使用现成组件估计这些角度的新方法。本文介绍了一种基于UKF的AHRS,使用TRIAD算法作为观测模型。通过仿真和与基于EKF的AHRS的比较,评估了该方法的性能。本文还展示了使用真实固定翼无人机的现场实验结果,结果表明在微控制器上具有良好的实时性能和低计算成本。

英文摘要

A main problem in autonomous vehicles in general, and in \acp{UAV} in particular, is the determination of the attitude angles. A novel method to estimate these angles using off-the-shelf components is presented. This paper introduces an \ac{AHRS} based on the \ac{UKF} using the \ac{TRIAD} algorithm as the observation model. The performance of the method is assessed through simulations and compared to an \ac{AHRS} based on the \ac{EKF}. The paper presents field experiment results using a real fixed-wing \ac{UAV}. The results show good real-time performance with low computational cost in a microcontroller.

1609.07038 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Simultaneous Intermittent Communication Control and Path Optimization in Networks of Mobile Robots

移动机器人网络中的间歇通信控制与路径优化同时进行

Yiannis Kantaros, Michael M. Zavlanos

AI总结 本文提出了一种间歇通信框架,通过移动机器人在连接移动图的边缘移动并仅在节点相遇时通信,实现网络的间歇连通性和路径优化,同时通过局部LTL公式分解和分布式冲突解决方案保证全局LTL公式的满足。

详情
AI中文摘要

本文提出了一种间歇通信框架用于移动机器人网络。具体而言,我们考虑机器人沿连接移动图的边移动,并仅在该图的节点相遇时通信,从而形成动态通信网络。我们提出的分布式控制器同时确保网络的间歇连通性和路径优化。我们展示间歇连通性需求可以被一个全局线性时间逻辑(LTL)公式封装。然后将其近似分解为局部LTL表达式并分配给机器人。为了避免由于这种近似分解可能产生的冲突行为,我们开发了一种分布式冲突解决方案,该方案基于分配的局部LTL表达式为每个机器人生成非冲突的离散运动计划,其组合满足全局LTL公式。通过适当引入生成运动计划执行中的延迟,我们还证明所提出的控制器可以异步执行。

英文摘要

In this paper, we propose an intermittent communication framework for mobile robot networks. Specifically, we consider robots that move along the edges of a connected mobility graph and communicate only when they meet at the nodes of that graph giving rise to a dynamic communication network. Our proposed distributed controllers ensure intermittent connectivity of the network and path optimization, simultaneously. We show that the intermittent connectivity requirement can be encapsulated by a global Linear Temporal Logic (LTL) formula. Then we approximately decompose it into local LTL expressions which are then assigned to the robots. To avoid conflicting robot behaviors that can occur due to this approximate decomposition, we develop a distributed conflict resolution scheme that generates non-conflicting discrete motion plans for every robot, based on the assigned local LTL expressions, whose composition satisfies the global LTL formula. By appropriately introducing delays in the execution of the generated motion plans we also show that the proposed controllers can be executed asynchronously.

1609.07006 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

SafeGuardPF: Safety Guaranteed Reactive Potential Fields for Mobile Robots in Unknown and Dynamic Environments

SafeGuardPF: 保障安全的反应势场用于未知和动态环境中的移动机器人

Rafael Rodrigues da Silva, Samuel Silva, Grigoriy Dubrovskiy, Hai Lin

AI总结 本文提出SafeGuardPF方法,通过将反应势场运动控制器建模为混合自动机,并利用微分动态逻辑形式化验证其安全性,确保机器人在动态环境中避免碰撞。

Comments 8 pages, 9 figures, Submitted for publication in 2017 American Control Conference (ACC2017)

详情
AI中文摘要

自主导航在未知和动态环境中实现可靠避障仍具挑战性,尤其当存在移动障碍物时。为应对这一问题,本文采用反应势场方法,该方法在每个周期内仅需机器人相对于最近障碍物点的当前状态即可计算势场,从而更高效且适用于多智能体场景。本文的主要贡献是将反应势场运动控制器建模为混合自动机,并通过微分动态逻辑形式化验证其安全性,确保机器人在碰撞时不会发生故障,即只有在机器人静止时才可能发生碰撞。所提出控制器及验证结果通过仿真实验和Pioneer P3-AT机器人实现进行验证。

英文摘要

An autonomous navigation with proven collision avoidance in unknown and dynamic environments is still a challenge, particularly when there are moving obstacles. A popular approach to collision avoidance in the face of moving obstacles is based on model predictive algorithms, which, however, may be computationally expensive. Hence, we adopt a reactive potential field approach here. At every cycle, the proposed approach requires only current robot states relative to the closest obstacle point to find the potential field in the current position; thus, it is more computationally efficient and more suitable to scale up for multiple agent scenarios. Our main contribution here is to write the reactive potential field based motion controller as a hybrid automaton, and then formally verify its safety using differential dynamic logic. In particular, we can guarantee a passive safety property, which means that collisions cannot occur if the robot is to blame, namely a collision can occur only if the robot is at rest. The proposed controller and verification results are demonstrated via simulations and implementation on a Pioneer P3-AT robot.

1609.06662 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

On Efficient Computation of Shortest Dubins Paths Through Three Consecutive Points

在三个连续点上高效计算最短Dubins路径

Armin Sadeghi, Stephen L. Smith

AI总结 本文研究了在三个连续点上计算曲率受限移动Dubins车辆最优路径的问题,提出新的几何分析方法,改进了Dubins TSP路线的计算效率和解的质量。

Comments Extended version of IEEE CDC 2016 paper

详情
AI中文摘要

在本文中,我们研究了在曲率受限的前向移动Dubins车辆上计算通过三个连续点的最优路径的问题。给定Dubins车辆的初始和最终配置,以及一个无约束航向的中点,目标是计算中点航向以最小化总Dubins路径长度。我们提供了最优路径的新几何分析,并建立了通过三个点的最优Dubins路径的新性质。然后,我们展示了我们的方法如何用于快速改进使用最新技术生成的Dubins TSP路线。我们还提供了广泛的模拟结果,显示所提出的方法在运行时间和解质量上优于传统方法,即对中点的航向进行均匀离散化,然后为每个离散航向求解最小Dubins路径。

英文摘要

In this paper, we address the problem of computing optimal paths through three consecutive points for the curvature-constrained forward moving Dubins vehicle. Given initial and final configurations of the Dubins vehicle, and a midpoint with an unconstrained heading, the objective is to compute the midpoint heading that minimizes the total Dubins path length. We provide a novel geometrical analysis of the optimal path, and establish new properties of the optimal Dubins' path through three points. We then show how our method can be used to quickly refine Dubins TSP tours produced using state-of-the-art techniques. We also provide extensive simulation results showing the improvement of the proposed approach in both runtime and solution quality over the conventional method of uniform discretization of the heading at the mid-point, followed by solving the minimum Dubins path for each discrete heading.

1609.06435 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Controlling rigid formations of mobile agents under inconsistent measurements

在不一致测量下控制移动代理的刚体编队

Hector Garcia de Marina, Ming Cao, Bayu Jayawardhana

AI总结 本文提出基于局部估计器的梯度控制器,以增强对不一致测量的鲁棒性,通过机器人实验和仿真验证了其在消除集体运动不一致轨道方面的有效性。

Comments 10 pages

详情
Journal ref
IEEE Transactions on Robotics, Volume 31, Issue 1, Feb. 2015
AI中文摘要

尽管近年来使用梯度控制器稳定自主代理的刚体编队取得了巨大成功,但最近在代理局部控制器中使用不一致测量时却报告了令人惊讶且引人入胜的不希望的集体运动。为了使现有的梯度控制对这种测量不一致具有鲁棒性,我们利用局部估计器,遵循著名的内部模型原理进行鲁棒输出调节控制。新的基于估计器的梯度控制仍然具有分布性质,并且即使在刚体编队中的代理数量增长时,也可以系统地构建。我们严格证明所提出控制能够保证指数收敛,然后通过机器人实验和计算机仿真证明所报告的由不一致引起的集体运动轨道得到有效消除。

英文摘要

Despite the great success of using gradient-based controllers to stabilize rigid formations of autonomous agents in the past years, surprising yet intriguing undesirable collective motions have been reported recently when inconsistent measurements are used in the agents' local controllers. To make the existing gradient control robust against such measurement inconsistency, we exploit local estimators following the well known internal model principle for robust output regulation control. The new estimator-based gradient control is still distributed in nature and can be constructed systematically even when the number of agents in a rigid formation grows. We prove rigorously that the proposed control is able to guarantee exponential convergence and then demonstrate through robotic experiments and computer simulations that the reported inconsistency-induced orbits of collective movements are effectively eliminated.

1609.06283 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robotic Swarm Control from Spatio-Temporal Specifications

二维机器人群的时空规范控制

Iman Haghighi, Sadra Sadraddini, Calin Belta

AI总结 本文研究如何通过丰富的时空逻辑控制二维机器人群以实现复杂时空模式,将规范编码为混合整数线性约束,并通过优化机器人移动总和或相似度指标来规划轨迹。

Comments A shorter version of this paper is going to be published at the proceedings of the 55th international conference on decision and control (CDC 2016)

详情
AI中文摘要

本文研究了如何通过丰富的时空逻辑控制二维机器人群以实现高阶和复杂的时空模式。我们使用能够描述广泛时间变化和复杂空间配置的丰富时空逻辑,并开发了一种方法,将此类形式规范编码为一组混合整数线性约束,这些约束被纳入混合整数线性规划问题中。我们为每个个体机器人规划轨迹,以确保整个群体满足时空要求,同时优化总机器人移动量或一个显示群体轨迹与给定时空行为相似度的指标。附有说明性案例研究。

英文摘要

In this paper, we study the problem of controlling a two-dimensional robotic swarm with the purpose of achieving high level and complex spatio-temporal patterns. We use a rich spatio-temporal logic that is capable of describing a wide range of time varying and complex spatial configurations, and develop a method to encode such formal specifications as a set of mixed integer linear constraints, which are incorporated into a mixed integer linear programming problem. We plan trajectories for each individual robot such that the whole swarm satisfies the spatio-temporal requirements, while optimizing total robot movement and/or a metric that shows how strongly the swarm trajectory resembles given spatio-temporal behaviors. An illustrative case study is included.

1609.05960 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Incremental Sampling-based Motion Planners Using Policy Iteration Methods

基于增量采样的运动规划器使用策略迭代方法

Oktay Arslan, Panagiotis Tsiotras

AI总结 本文提出了一种基于策略迭代的运动规划算法,利用动态规划思想在随机图中求解最短路径问题,通过改进策略加速计算过程,适用于大规模并行化。

详情
AI中文摘要

最近随机运动规划的进步导致了一类新的基于采样的算法发展,这些算法提供了渐近最优性保证,例如RRT*和PRM*算法。仔细分析发现,这些算法中的所谓'重 wiring'步骤可以被解释为局部策略迭代(PI)步骤(即局部策略评估步骤后跟局部策略改进步骤),因此随着样本数趋于无穷大,这两种算法几乎肯定收敛到最优路径(概率1)。策略迭代,与价值迭代(VI)一样,是解决动态规划(DP)问题的常用方法。基于这一观察,最近提出了RRT#算法,该算法在每次迭代中对那些可能成为最优路径部分的顶点(即'有希望'的顶点)执行Bellman更新(即'备份')。RRT#算法因此利用了动态规划思想,并在随机生成的图上逐步实现以获得高质量的解决方案。在本文中,基于这一关键洞察,我们探索了一类不同的动态规划算法来解决由迭代采样方法生成的随机图中的最短路径问题。这些算法利用策略迭代而不是价值迭代,因此更适合大规模并行化。与RRT*算法不同,策略改进在重 wiring步骤中不是仅在局部进行,而是在当前迭代中被分类为'有希望'的顶点集合上进行。这倾向于加快整个过程。所得到的算法,恰当地命名为策略迭代-RRT#(PI-RRT#),是第一种基于动态规划思想的随机运动规划新类算法,利用PI方法。

英文摘要

Recent progress in randomized motion planners has led to the development of a new class of sampling-based algorithms that provide asymptotic optimality guarantees, notably the RRT* and the PRM* algorithms. Careful analysis reveals that the so-called "rewiring" step in these algorithms can be interpreted as a local policy iteration (PI) step (i.e., a local policy evaluation step followed by a local policy improvement step) so that asymptotically, as the number of samples tend to infinity, both algorithms converge to the optimal path almost surely (with probability 1). Policy iteration, along with value iteration (VI) are common methods for solving dynamic programming (DP) problems. Based on this observation, recently, the RRT$^{\#}$ algorithm has been proposed, which performs, during each iteration, Bellman updates (aka "backups") on those vertices of the graph that have the potential of being part of the optimal path (i.e., the "promising" vertices). The RRT$^{\#}$ algorithm thus utilizes dynamic programming ideas and implements them incrementally on randomly generated graphs to obtain high quality solutions. In this work, and based on this key insight, we explore a different class of dynamic programming algorithms for solving shortest-path problems on random graphs generated by iterative sampling methods. These class of algorithms utilize policy iteration instead of value iteration, and thus are better suited for massive parallelization. Contrary to the RRT* algorithm, the policy improvement during the rewiring step is not performed only locally but rather on a set of vertices that are classified as "promising" during the current iteration. This tends to speed-up the whole process. The resulting algorithm, aptly named Policy Iteration-RRT$^{\#}$ (PI-RRT$^{\#}$) is the first of a new class of DP-inspired algorithms for randomized motion planning that utilize PI methods.

1510.07573 2026-06-04 cs.RO cs.CV cs.MA cs.SY eess.SY 版本更新

Generalized Regressive Motion: a Visual Cue to Collision

广义回归运动:碰撞的视觉线索

Krzysztof Chalupka, Michael Dickinson, Pietro Perona

AI总结 研究提出广义回归运动作为碰撞检测的视觉线索,通过几何分析证明其在同类碰撞中的可靠性,并通过基于代理的建模显示其比 looming 更有效。

详情
AI中文摘要

大脑和感觉系统进化以指导运动。关键任务是控制对静止障碍物的接近并检测移动生物。Looming 被提出为主要的单目视觉线索,用于检测其他动物的接近并避免与静止障碍物碰撞。在昆虫和脊椎动物大脑中发现了优雅的神经机制用于 looming 检测。然而,looming 未在两个移动动物碰撞的背景下进行分析。我们提出了一种替代策略,即广义回归运动(GRM),这与最近观察到的果蝇行为一致。几何分析证明 GRM 是同类碰撞的可靠线索,而基于代理的建模表明 GRM 比 looming 更有效用于检测接近、防止碰撞和维持移动性。

英文摘要

Brains and sensory systems evolved to guide motion. Central to this task is controlling the approach to stationary obstacles and detecting moving organisms. Looming has been proposed as the main monocular visual cue for detecting the approach of other animals and avoiding collisions with stationary obstacles. Elegant neural mechanisms for looming detection have been found in the brain of insects and vertebrates. However, looming has not been analyzed in the context of collisions between two moving animals. We propose an alternative strategy, Generalized Regressive Motion (GRM), which is consistent with recently observed behavior in fruit flies. Geometric analysis proves that GRM is a reliable cue to collision among conspecifics, whereas agent-based modeling suggests that GRM is a better cue than looming as a means to detect approach, prevent collisions and maintain mobility.

1609.05483 2026-06-04 eess.SY cs.CV cs.RO cs.SY 版本更新

Set-Point Regulation of Linear Continuous-Time Systems using Neuromorphic Vision Sensors

利用神经形态视觉传感器进行线性连续时间系统的设定点调节

Prince Singh, Sze Zheng Yong, Emilio Frazzoli

AI总结 本文提出基于神经形态视觉传感器的H∞控制器,用于调节线性时不变系统的设定点,并在不稳定系统上验证了方法的有效性。

Comments Submitted to IEEE Transactions on Automatic Control

详情
AI中文摘要

近年来发展出的神经形态视觉传感器因其高时间分辨率和低延迟成为敏捷和自主机器人应用的有前景候选者。每个像素在检测到光照场变化时独立地发出异步的

英文摘要

Recently developed neuromorphic vision sensors have become promising candidates for agile and autonomous robotic applications primarily due to, in particular, their high temporal resolution and low latency. Each pixel of this sensor independently fires an asynchronous stream of "retinal events" once a change in the light field is detected. Existing computer vision algorithms can only process periodic frames and so a new class of algorithms needs to be developed that can efficiently process these events for control tasks. In this paper, we investigate the problem of regulating a continuous-time linear time invariant (LTI) system to a desired point using measurements from a neuromorphic sensor. We present an $H_\infty$ controller that regulates the LTI system to a desired set-point and provide the set of neuromorphic sensor based cameras for the given system that fulfill the regulation task. The effectiveness of our approach is illustrated on an unstable system.

1609.05235 2026-06-04 cs.RO cs.SY eess.SY 版本更新

RFM-SLAM: Exploiting Relative Feature Measurements to Separate Orientation and Position Estimation in SLAM

RFM-SLAM:利用相对特征测量分离SLAM中的姿态和位置估计

Saurav Agarwal, Vikram Shree, Suman Chakravorty

AI总结 本文提出RFM-SLAM框架,通过相对特征测量将SLAM问题建模为线性最小二乘问题,减少非线性优化计算量,避免现有方法因初始猜测导致的灾难性失败,同时在噪声增加时保持精度。

Comments 9 pages, submitted to IEEE ICRA 2017

详情
AI中文摘要

SLAM问题具有一个特殊性质,当机器人姿态已知时,估计机器人姿态历史和特征位置可以转化为标准线性最小二乘问题。本文开发了一种SLAM框架,利用相对特征到特征测量来利用SLAM的结构特性。相对特征测量用于对姿态到姿态的方位约束提出线性估计问题,随后通过迭代非线性在流形上的优化问题计算在相对旋转约束下的机器人方位最大似然估计。一旦计算出机器人方位,就解决一个线性问题来计算机器人位置和地图估计。我们的方法通过将优化问题缩小到比标准基于特征的图方法更小的规模,减少了非线性优化的计算负担。进一步,实验证明我们的方法避免了现有方法因使用里程计作为非线性优化初始猜测而导致的灾难性失败,同时在传感器噪声增加时保持精度。我们通过广泛的模拟和与现有最先进求解器的比较展示了我们的方法。

英文摘要

The SLAM problem is known to have a special property that when robot orientation is known, estimating the history of robot poses and feature locations can be posed as a standard linear least squares problem. In this work, we develop a SLAM framework that uses relative feature-to-feature measurements to exploit this structural property of SLAM. Relative feature measurements are used to pose a linear estimation problem for pose-to-pose orientation constraints. This is followed by solving an iterative non-linear on-manifold optimization problem to compute the maximum likelihood estimate for robot orientation given relative rotation constraints. Once the robot orientation is computed, we solve a linear problem for robot position and map estimation. Our approach reduces the computational burden of non-linear optimization by posing a smaller optimization problem as compared to standard graph-based methods for feature-based SLAM. Further, empirical results show our method avoids catastrophic failures that arise in existing methods due to using odometery as an initial guess for non-linear optimization, while its accuracy degrades gracefully as sensor noise is increased. We demonstrate our method through extensive simulations and comparisons with an existing state-of-the-art solver.

1609.03628 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Co-active Learning to Adapt Humanoid Movement for Manipulation

协同学习以适应人形机器人的运动用于操作

Ren Mao, John S. Baras, Yezhou Yang, Cornelia Fermuller

AI总结 本文提出协同学习框架,通过人机交互适应机器人末端执行器的运动以应对不同约束环境,实验验证了方法的有效性。

详情
AI中文摘要

本文针对机器人在各种环境约束下运动适应问题,提出了一种协同学习框架,用于学习适应机器人末端执行器的运动以执行操作任务。该框架设计用于适应从演示中学习的原始模仿轨迹,以应对具有各种约束的新情况。框架还考虑了用户对适应轨迹的反馈,并通过人机交互学习适应运动。实现的系统能够将训练的运动原语泛化到具有不同约束的各种情况,考虑用户偏好。在人形平台上进行的实验验证了本文方法的有效性。

英文摘要

In this paper we address the problem of robot movement adaptation under various environmental constraints interactively. Motion primitives are generally adopted to generate target motion from demonstrations. However, their generalization capability is weak while facing novel environments. Additionally, traditional motion generation methods do not consider the versatile constraints from various users, tasks, and environments. In this work, we propose a co-active learning framework for learning to adapt robot end-effector's movement for manipulation tasks. It is designed to adapt the original imitation trajectories, which are learned from demonstrations, to novel situations with various constraints. The framework also considers user's feedback towards the adapted trajectories, and it learns to adapt movement through human-in-the-loop interactions. The implemented system generalizes trained motion primitives to various situations with different constraints considering user preferences. Experiments on a humanoid platform validate the effectiveness of our approach.

1608.02683 2026-06-04 math.OC cs.DS cs.RO cs.SY eess.SY 版本更新

System Identification and Control of Valkyrie through SVA--Based Regressor Computation

基于SVA的瓦尔基里人形机器人系统辨识与控制

Shishir Kolathaya, Benjamin J. Morris, Ryan W. Sinnet, Aaron D. Ames

AI总结 本文利用空间向量代数(SVA)实现瓦尔基里人形机器人的同时辨识与控制,通过空间惯性张量计算动力学参数,提出高效算法实现回归器评估,用于在线系统辨识与基于模型的控制。

Comments 8 pages, 15 figures

详情
AI中文摘要

本文演示了利用空间向量代数(SVA)对人形机器人Valkyrie进行的同时辨识与控制。特别是,通过空间惯性张量计算机器人动力学中的惯性、科里奥利-离心力和重力项。在假设链接长度或关节轴间距离准确已知的情况下,将证明机器人惯性属性可直接从惯量张量评估。提出了一种算法来评估回归器,其运行时间为O(n²)。该算法的效率为通过基于SVA的回归器实现在线系统辨识提供了手段,并作为副产品,提供了一种精确的基于模型的控制方法。通过在三个案例研究中的实现,提供了所提方法的实验验证:离线识别双摆和4自由度机械腿,以及在线识别和控制4自由度机械臂。

英文摘要

This paper demonstrates simultaneous identification and control of the humanoid robot, Valkyrie, utilizing Spatial Vector Algebra (SVA). In particular, the inertia, Coriolis-centrifugal and gravity terms for the dynamics of a robot are computed using spatial inertia tensors. With the assumption that the link lengths or the distance between the joint axes are accurately known, it will be shown that inertial properties of a robot can be directly evaluated from the inertia tensor. An algorithm is proposed to evaluate the regressor, yielding a run time of $O(n^2)$. The efficiency of this algorithm yields a means for online system identification via the SVA--based regressor and, as a byproduct, a method for accurate model-based control. Experimental validation of the proposed method is provided through its implementation in three case studies: offline identification of a double pendulum and a $4$-DOF robotic leg, and online identification and control of a $4$-DOF robotic arm.

1510.06263 2026-06-04 cs.RO cs.SY eess.SY 版本更新

An EKF-SLAM algorithm with consistency properties

具有一致性的EKF-SLAM算法

Axel Barrau, Silvere Bonnabel

AI总结 本文针对EKF-SLAM中由于全局参考系原点和方位不可观测导致的不一致性问题,提出基于不变EKF的方法以解决状态空间对称性问题,并通过蒙特卡洛实验验证了理论结果。

Comments Submitted

详情
AI中文摘要

本文针对EKF基于SLAM算法中由于全局参考系原点和方位不可观测导致的不一致性问题,证明在非线性二维问题中,使用最近引入的不变EKF变体可以解决此类不一致性。通过大量蒙特卡洛运行验证了理论结果。

英文摘要

In this paper we address the inconsistency of the EKF-based SLAM algorithm that stems from non-observability of the origin and orientation of the global reference frame. We prove on the non-linear two-dimensional problem with point landmarks observed that this type of inconsistency is remedied using the Invariant EKF, a recently introduced variant ot the EKF meant to account for the symmetries of the state space. Extensive Monte-Carlo runs illustrate the theoretical results.

1606.03736 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Enhancement of Low-cost GNSS Localization in Connected Vehicle Networks Using Rao-Blackwellized Particle Filters

利用 Rao-Blackwellized 粒子滤波器增强连接车辆网络中低成本 GNSS 定位

Macheng Shen, Ding Zhao, Jing Sun

AI总结 本文提出利用 Rao-Blackwellized 粒子滤波器融合连接车辆组的 GNSS 信息并匹配数字地图,以提高定位精度,实验显示算法将误差降低50%,方差减少两个数量级。

Comments 7 pages, 7 figures, IEEE ITSC

详情
AI中文摘要

自动驾驶技术中准确定位是关键功能。然而,由于伪距测量存在偏差噪声,低成本 GNSS 接收器难以实现车道级精度。差分 GNSS 可提高精度,但需要大量基站投资。新兴的连接车辆技术提供了替代方案。本文显示,通过融合连接车辆组的 GNSS 信息并匹配数字地图以消除定位的共同偏差,可提高定位精度。Rao-Blackwellized 粒子滤波器(RBPF)用于联合估计伪距的共同偏差和车辆位置。多径偏差通过多假设检测-拒绝方法缓解。通过预测-更新过程利用时间相关性。所提方法在交叉口场景中与现有静态和平滑静态方法进行比较。仿真结果表明,所提算法将估计误差减少50%,估计方差减少两个数量级。

英文摘要

An essential function for automated vehicle technologies is accurate localization. It is difficult, however, to achieve lane-level accuracy with low-cost Global Navigation Satellite System (GNSS) receivers due to the biased noisy pseudo-range measurements. Approaches such as Differential GNSS can improve the accuracy, but usually require an enormous amount of investment in base stations. The emerging connected vehicle technologies provide an alternative approach to improving the localization accuracy. It has been shown in this paper that localization accuracy can be enhanced by fusing GNSS information within a group of connected vehicles and matching the configuration of the group to a digital map to eliminate the common bias in localization. A Rao-Blackwellized particle filter (RBPF) was used to jointly estimate the common biases of the pseudo-ranges and the vehicles positions. Multipath biases, which are non-common to vehicles, were mitigated by a multi-hypothesis detection-rejection approach. The temporal correlation was exploited through the prediction-update process. The proposed approach was compared to the existing static and smoothed static methods in the intersection scenario. Simulation results show that the proposed algorithm reduced the estimation error by fifty percent and reduced the estimation variance by two orders of magnitude.

1601.04862 2026-06-04 cs.RO cs.DC cs.NE cs.SY eess.SY 版本更新

Scalability in Neural Control of Musculoskeletal Robots

神经控制的肌骨机器人可扩展性

Christoph Richter, Sören Jentzsch, Rafael Hostettler, Jesús A. Garrido, Eduardo Ros, Alois C. Knoll, Florian Röhrbein, Patrick van der Smagt, Jörg Conradt

AI总结 本文提出一种结合Myorobotics框架和SpiNNaker平台的系统,实现数十个神经控制的物理顺应性关节的可扩展性,通过闭环小脑模型实现低功耗高效神经控制。

Comments Accepted at IEEE Robotics and Automation Magazine on 2015-12-31

详情
AI中文摘要

类人机器人是能够感知、行为、交互和感觉像人类的机器人。根据这一定义,类人机器人需要人形的机械硬件和驱动,以及类脑的控制和感知。最明显的实现方式是具有类脑神经控制器的人形肌骨机器人。尽管肌骨机器人硬件和神经控制软件已存在数十年,但尚未有可扩展的方法用于构建和控制类人人类尺度机器人。通过结合Myorobotics框架和SpiNNaker神经形态计算平台,我们展示了能够扩展到数十个神经控制、物理顺应性关节的系统。其核心实现了一个闭环小脑模型,提供最低功耗和最大扩展性的实时低层神经控制:更高阶(如皮层)神经网络和神经形态传感器如硅视网膜或耳蜗可以自然地被整合进来。

英文摘要

Anthropomimetic robots are robots that sense, behave, interact and feel like humans. By this definition, anthropomimetic robots require human-like physical hardware and actuation, but also brain-like control and sensing. The most self-evident realization to meet those requirements would be a human-like musculoskeletal robot with a brain-like neural controller. While both musculoskeletal robotic hardware and neural control software have existed for decades, a scalable approach that could be used to build and control an anthropomimetic human-scale robot has not been demonstrated yet. Combining Myorobotics, a framework for musculoskeletal robot development, with SpiNNaker, a neuromorphic computing platform, we present the proof-of-principle of a system that can scale to dozens of neurally-controlled, physically compliant joints. At its core, it implements a closed-loop cerebellar model which provides real-time low-level neural control at minimal power consumption and maximal extensibility: higher-order (e.g., cortical) neural networks and neuromorphic sensors like silicon-retinae or -cochleae can naturally be incorporated.

1604.07849 2026-06-04 cs.RO cs.SY eess.SY math.OC 版本更新

Distributed rotational and translational maneuvering of rigid formations and their applications

分布式刚体编队的旋转和翻译 maneuvering 及其应用

Hector Garcia de Marina, Bayu Jayawardhana, Ming Cao

AI总结 本文提出一种分布式控制器,通过引入距离约束参数实现编队与运动控制,允许恒定集体翻译、旋转或组合,同时保证编队形状无畸变,应用于编队对齐和目标追踪。

Comments 14 pages

详情
Journal ref
Robotics, IEEE Transactions on, Volume 32, Issue 3, Pages 684 - 696, Year 2016
AI中文摘要

最近有报告指出,范围测量不一致或等效的预定 agent 间距离不匹配可能阻止流行的梯度控制器引导移动 agent 的刚体编队收敛到期望形状,甚至使编队停滞在任何位置。本文不将不匹配视为性能不佳的原因,而是将其作为设计参数,通过在每个距离约束中引入这样的参数对,设计出同时实现编队和运动控制的分布式控制器,不仅涵盖流行的梯度控制,更重要的是允许实现恒定集体翻译、旋转或其组合,同时保证渐近无编队形状畸变。此类运动控制结果随后应用于(a)编队方向对齐和(b)包围和跟踪移动目标。除了严谨的数学证明外,还通过移动机器人实验展示了所提出编队-运动分布式控制器的满意性能。

英文摘要

Recently it has been reported that range-measurement inconsistency, or equivalently mismatches in prescribed inter-agent distances, may prevent the popular gradient controllers from guiding rigid formations of mobile agents to converge to their desired shape, and even worse from standing still at any location. In this paper, instead of treating mismatches as the source of ill performance, we take them as design parameters and show that by introducing such a pair of parameters per distance constraint, distributed controller achieving simultaneously both formation and motion control can be designed that not only encompasses the popular gradient control, but more importantly allows us to achieve constant collective translation, rotation or their combination while guaranteeing asymptotically no distortion in the formation shape occurs. Such motion control results are then applied to (a) the alignment of formations orientations and (b) enclosing and tracking a moving target. Besides rigorous mathematical proof, experiments using mobile robots are demonstrated to show the satisfying performances of the proposed formation-motion distributed controller.

1608.06440 2026-06-04 cs.RO cs.CV cs.SY eess.SY 版本更新

A Delay-Tolerant Potential-Field-Based Network Implementation of an Integrated Navigation System

基于延迟容忍的势场网络的集成导航系统实现

Rachana Ashok Gupta, Ahmad A. Masoud, Mo-Yuen Chow

AI总结 本文提出一种基于网络控制器的集成导航系统,通过互联网实现摄像头、无人地面车和远程服务器的实时网络化,旨在简化无人车导航同时保持系统鲁棒性。

详情
Journal ref
The IEEE Transactions On Industrial Electronics, Vol. 57, No.2, February 2010, PP. 769-783
AI中文摘要

网络控制器(NCs)是能够将动态、空间扩展且功能专门化的模块转化为可执行目标导向组的设备,称为网络控制系统。本文探讨了设计和构建使用互联网作为通信介质的NC的实践方面。重点在于寻找兼容的控制器组件,这些组件可通过主机结构集成,使摄像头、无人地面车(UGV)、远程计算机服务器及必要的操作软件界面能够实时联网。目标是简化UGV的导航过程,同时保持系统性能的鲁棒性。本文描述了所提控制器的结构、其组件及其接口方式。提供了详尽的实验结果,包括性能评估和与之前实现的NC的比较。

英文摘要

Network controllers (NCs) are devices that are capable of converting dynamic, spatially extended, and functionally specialized modules into a taskable goal-oriented group called networked control system. This paper examines the practical aspects of designing and building an NC that uses the Internet as a communication medium. It focuses on finding compatible controller components that can be integrated via a host structure in a manner that makes it possible to network, in real-time, a webcam, an unmanned ground vehicle (UGV), and a remote computer server along with the necessary operator software interface. The aim is to deskill the UGV navigation process and yet maintain a robust performance. The structure of the suggested controller, its components, and the manner in which they are interfaced are described. Thorough experimental results along with performance assessment and comparisons to a previously implemented NC are provided.

1608.06420 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Harmonic Potential Field Approach for Joint Planning & Control of a Rigid, Separable Nonholonomic, Mobile Robot

一种基于谐振势场的方法用于刚性、可分离非完整移动机器人的联合规划与控制

Ahmad A. Masoud

AI总结 本文提出一种基于谐振势场的方法,用于在移动机器人伺服层面实现路径规划,通过同步控制信号使机器人速度与势场梯度一致,解决复杂环境下的规划问题。

详情
Journal ref
Robotics And Autonomous Systems, Vol. 61, No. 6, June 2013 Page 593,615
AI中文摘要

本文的主要目标是提供一种工具,用于在移动机器人伺服层面进行路径规划。通过在伺服层面以可证明正确的方式执行复杂任务,可以显著提高操作速度、降低能耗并提高响应质量。传统规划局限于机器人的高层控制器,该阶段的指导速度信号通常通过电子速度控制器(ESC)转换为控制信号。本文展示了谐振势场(HPF)方法生成可证明正确、受约束且行为良好的轨迹和控制信号的能力,用于刚性、非完整机器人在静止、杂乱环境中。证明了基于HPF的伺服层面规划器能够解决现实情况下的大量挑战。所提出的方法将HPF规划器解决方案轨迹的丰富且可证明正确性质迁移至机器人本身。这通过同步控制信号实现,其目的是使机器人在局部坐标系中的速度与HPF梯度一致。通过将机器人表示为论文中所谓的分离形式,使两者之间的联系成为可能。所使用的上下文敏感且目标导向的控制信号在存在执行器噪声、饱和和参数不确定性时表现出良好的行为和鲁棒性。该方法通过仿真结果进行了开发、证明了正确性,并展示了方案的能力。

英文摘要

The main objective of this paper is to provide a tool for performing path planning at the servo level of a mobile robot. The ability to perform, in a provably correct manner, such a complex task at the servo level can lead to a large increase in the speed of operation, low energy consumption and high quality of response. Planning has been traditionally limited to the high level controller of a robot. The guidance velocity signal from this stage is usually converted to a control signal using what is known as an electronic speed controller (ESC). This paper demonstrates the ability of the harmonic potential field (HPF) approach to generate a provably correct, constrained, well behaved trajectory and control signal for a rigid, nonholonomic robot in a stationary, cluttered environment. It is shown that the HPF based, servo level planner can address a large number of challenges facing planning in a realistic situation. The suggested approach migrates the rich and provably correct properties of the solution trajectories from an HPF planner to those of the robot. This is achieved using a synchronizing control signal whose aim is to align the velocity of the robot in its local coordinates, with that of the gradient of the HPF. The link between the two is made possible by representing the robot using what the paper terms separable form. The context-sensitive and goal-oriented control signal used to steer the robot is demonstrated to be well behaved and robust in the presence of actuator noise, saturation and uncertainty in the parameters. The approach is developed, proofs of correctness are provided and the capabilities of the scheme are demonstrated using simulation results.

1608.05786 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Design of a Trajectory Tracking Controller for a Nanoquadcopter

小型四旋翼飞行器轨迹跟踪控制器设计

Carlos Luis, Jérôme Le Ny

AI总结 本文设计了基于数学模型的四旋翼飞行器位置与轨迹控制算法,并通过仿真与实机测试验证了LQT控制器在轨迹跟踪中的优越性。

Comments Complete Technical Report

详情
AI中文摘要

本文旨在研究小型四旋翼飞行器的系统建模及设计位置与轨迹控制算法,最终目标是在仿真和真实平台上测试系统。所选平台为开源的Crazyflie 2.0。首先开发了描述四旋翼动力学的数学模型,其次创建了仿真环境以设计两种不同的控制架构:级联PID位置跟踪器和LQT轨迹跟踪器。最后阶段是在所选平台上测试控制器并比较其轨迹跟踪性能。仿真结果与实验结果一致,进一步的模型改进通过闭环模型识别技术提出。结果表明,LQT控制器在轨迹跟踪中表现更优,其位置RMS误差比PID控制器小四倍。尽管LQT控制努力更大,但消除了诱导电机饱和的高控制峰值。LQT控制器还通过超宽频双程测距系统进行了测试,与更精确的VICON系统比较表明,控制器在两种系统之间不同的噪声水平下均能跟踪轨迹。

英文摘要

The primary purpose of this study is to investigate the system modeling of a nanoquadcopter as well as designing position and trajectory control algorithms, with the ultimate goal of testing the system both in simulation and on a real platform. The open source nanoquadcopter platform named Crazyflie 2.0 was chosen for the project. The first phase consisted in the development of a mathematical model that describes the dynamics of the quadcopter. Secondly, a simulation environment was created to design two different control architectures: cascaded PID position tracker and LQT trajectory tracker. Finally, the implementation phase consisted in testing the controllers on the chosen platform and comparing their performance in trajectory tracking. Our simulations agreed with the experimental results, and further refinement of the model is proposed as future work through closed-loop model identification techniques. The results show that the LQT controller performed better at tracking trajectories, with RMS errors in position up to four times smaller than those obtained with the PID. LQT control effort was greater, but eliminated the high control peaks that induced motor saturation in the PID controller. The LQT controller was also tested using an ultra-wide band two-way ranging system, and comparisons with the more precise VICON system indicate that the controller could track a trajectory in both cases despise the difference in noise levels between the two systems.

1603.06443 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

A Scalable and Distributed Solution to the Inertial Motion Capture Problem

可扩展且分布式的惯性运动捕捉问题解决方案

Manon Kok, Sina Khoshfetrat Pakazad, Thomas B. Schön, Anders Hansson, Jeroen D. Hol

AI总结 本文提出一种可扩展且分布式的惯性运动捕捉解决方案,通过定制信息传递方法解决计算复杂性问题,并应用于下肢配置数据进行验证。

Comments 14 pages, 5 figures. In proceedings of the 19th International Conference on Information Fusion, pp. 1348-1355, Heidelberg, Germany, July 2016

详情
AI中文摘要

在惯性运动捕捉中,多个身体部分装备有惯性传感器,包括3D加速度计和3D陀螺仪。使用基于优化的方法解决运动捕捉问题,可以自然地包含生物力学约束,并建模身体部分在关节位置的连接。解决这个问题的计算复杂性随着数据集长度和考虑的传感器和身体部分数量的增加而增加。在本工作中,我们提出了一种利用问题固有结构的可扩展且分布式的解决方案,通过定制信息传递方法实现。作为概念验证,我们将算法应用于下肢配置的数据。

英文摘要

In inertial motion capture, a multitude of body segments are equipped with inertial sensors, consisting of 3D accelerometers and 3D gyroscopes. Using an optimization-based approach to solve the motion capture problem allows for natural inclusion of biomechanical constraints and for modeling the connection of the body segments at the joint locations. The computational complexity of solving this problem grows both with the length of the data set and with the number of sensors and body segments considered. In this work, we present a scalable and distributed solution to this problem using tailored message passing, capable of exploiting the structure that is inherent in the problem. As a proof-of-concept we apply our algorithm to data from a lower body configuration.

1311.5796 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Unscented Orientation Estimation Based on the Bingham Distribution

基于Bingham分布的无偏姿态估计

Igor Gilitschenski, Gerhard Kurz, Simon J. Julier, Uwe D. Hanebeck

AI总结 本文提出基于Bingham分布的滤波器,考虑姿态估计的周期性,利用超球面上的分布特性提升非线性姿态估计的精度。

详情
AI中文摘要

本文提出基于Bingham分布的滤波器,考虑姿态估计的周期性,利用超球面上的分布特性提升非线性姿态估计的精度。

英文摘要

Orientation estimation for 3D objects is a common problem that is usually tackled with traditional nonlinear filtering techniques such as the extended Kalman filter (EKF) or the unscented Kalman filter (UKF). Most of these techniques assume Gaussian distributions to account for system noise and uncertain measurements. This distributional assumption does not consider the periodic nature of pose and orientation uncertainty. We propose a filter that considers the periodicity of the orientation estimation problem in its distributional assumption. This is achieved by making use of the Bingham distribution, which is defined on the hypersphere and thus inherently more suitable to periodic problems. Furthermore, handling of non-trivial system functions is done using deterministic sampling in an efficient way. A deterministic sampling scheme reminiscent of the UKF is proposed for the nonlinear manifold of orientations. It is the first deterministic sampling scheme that truly reflects the nonlinear manifold of the orientation.

1608.02286 2026-06-04 cs.RO cs.MA cs.SY eess.SY math.DS 版本更新

Enforcing Biconnectivity in Multi-robot Systems

在多机器人系统中强制双连通性

Mehran Zareh, Lorenzo Sabattini, Cristian Secchi

AI总结 本文提出一种去中心化梯度协议,通过增加拉普拉斯矩阵第三小特征值来增强多机器人系统的双连通性,同时引入算法估计拉普拉斯矩阵特征向量以定义梯度,验证了理论成果的有效性。

Comments arXiv admin note: text overlap with arXiv:1608.02276

详情
AI中文摘要

在多机器人系统中,保持连通性是一项关键任务,近年来受到广泛关注。如果单个机器人失效,连接系统可能被分割成两个或更多子集。如果在单个机器人失效的情况下保证网络连通性,则可以实现更稳健的通信,这种网络被称为双连通的。在Zareh2016biconnectivitycheck中,我们提出了一个双连通性检查的标准,基本上确定了拉普拉斯矩阵第三小特征值的下界。在本文中,我们介绍了一种去中心化的基于梯度的协议,用于在双连通性检查失败时增加拉普拉斯矩阵第三小特征值的值。我们还介绍了一种去中心化的算法,用于估计拉普拉斯矩阵的特征向量,这些特征向量用于定义梯度。模拟显示了理论发现的有效性。

英文摘要

Connectivity maintenance is an essential task in multi-robot systems and it has received a considerable attention during the last years. A connected system can be broken into two or more subsets simply if a single robot fails. A more robust communication can be achieved if the network connectivity is guaranteed in the case of one-robot failures. The resulting network is called biconnected. In \cite{Zareh2016biconnectivitycheck}, we presented a criterion for biconnectivity check, which basically determines a lower bound on the third-smallest eigenvalue of the Laplacian matrix. In this paper, we introduce a decentralized gradient-based protocol to increase the value of the third-smallest eigenvalue of the Laplacian matrix, when the biconnectivity check fails. We also introduce a decentralized algorithm to estimate the eigenvectors of the Laplacian matrix, which are used for defining the gradient. Simulations show the effectiveness of the theoretical findings.

1603.09672 2026-06-04 eess.SY cs.RO cs.SY math.DS 版本更新

Lyapunov stability of a rigid body with two frictional contacts

具有两个摩擦接触的刚体的李雅普诺夫稳定性

Péter L. Várkonyi, Yizhar Or

AI总结 本文研究了具有两个摩擦性单向接触的平面刚体在非弹性碰撞下的李雅普诺夫稳定性分析,通过引入Poincare映射和不变关系,将系统动态行为编码为两个半解析标量函数,用于判断平衡状态的稳定性。

详情
AI中文摘要

机械系统李雅普诺夫稳定性意味着在位置和速度的小扰动下,动态响应保持在静态平衡配置的任意小邻域内有界。这种稳定性在涉及多个单向接触的机器人应用中非常受欢迎。然而,此类系统的李雅普诺夫稳定性分析极其困难,因为即使小扰动也可能导致混合动力学,其中解涉及许多非光滑转换的不同接触状态。本文研究了具有两个摩擦性单向接触的平面刚体在非弹性碰撞下的李雅普诺夫稳定性分析,针对一般类别的平衡配置和恒定外部载荷。系统在接触转换和碰撞下的混合动力学被公式化,并引入了两个接触状态的Poincare映射。利用不变关系,该Poincare映射被减少为两个半解析标量函数,完全编码了任何小初始扰动下的解的动态行为。这两个函数使几乎任何平衡状态的李雅普诺夫稳定性或不稳定性得以确定。结果通过模拟示例和在描述接触几何和外部载荷的二维参数空间中绘制稳定性与不稳定性区域来展示。

英文摘要

Lyapunov stability of a mechanical system means that the dynamic response stays bounded in an arbitrarily small neighborhood of a static equilibrium configuration under small perturbations in positions and velocities. This type of stability is highly desired in robotic applications that involve multiple unilateral contacts. Nevertheless, Lyapunov stability analysis of such systems is extremely difficult, because even small perturbations may result in hybrid dynamics where the solution involves many nonsmooth transitions between different contact states. This paper concerns with Lyapunov stability analysis of a planar rigid body with two frictional unilateral contacts under inelastic impacts, for a general class of equilibrium configurations under a constant external load. The hybrid dynamics of the system under contact transitions and impacts is formulated, and a \Poincare map at two-contact states is introduced. Using invariance relations, this \Poincare map is reduced into two semi-analytic scalar functions that entirely encode the dynamic behavior of solutions under any small initial perturbation. These two functions enable determination of Lyapunov stability or instability for almost any equilibrium state. The results are demonstrated via simulation examples and by plotting stability and instability regions in two-dimensional parameter spaces that describe the contact geometry and external load.

1608.02276 2026-06-04 cs.RO cs.MA cs.SY eess.SY math.DS 版本更新

Decentralized Biconnectivity Conditions in Multi-robot Systems

多机器人系统中的去中心化双连通性条件

Mehran Zareh, Lorenzo Sabattini, Cristian Secchi

AI总结 本文提出去中心化方法,通过拉普拉斯矩阵的第三小特征值提供双连通性条件,确保网络在单个机器人失效时仍保持连通。

详情
AI中文摘要

群体协作机器人网络的连通性容易因单个机器人与其余部分断开连接而破坏。在考虑对单个机器人失效具有鲁棒性的前提下,通信网络被称为双连通的。简而言之,要证明双连通网络图不存在割点,我们需要提出一种去中心化方法,提供网络双连通性的充分条件,并证明这些条件与拉普拉斯矩阵的第三小特征值相关。机器人间的数据交换假定为邻居间直接交换。

英文摘要

The network connectivity in a group of cooperative robots can be easily broken if one of them loses its connectivity with the rest of the group. In case of having robustness with respect to one-robot-fail, the communication network is termed biconnected. In simple words, to have a biconnected network graph, we need to prove that there exists no articulation point. We propose a decentralized approach that provides sufficient conditions for biconnectivity of the network, and we prove that these conditions are related to the third smallest eigenvalue of the Laplacian matrix. Data exchange among the robots is supposed to be neighbor-to-neighbor.

1608.00337 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Higher-Degree Stochastic Integration Filtering

高次随机积分滤波器

Syed Safwan Khalid, Naveed Ur Rehman, Shafayat Abrar

AI总结 本文提出一种高次随机积分滤波器,用于非线性滤波,通过比较与现有滤波器的性能,证明其优越性。

Comments Submitted to IEEE Signal Processing Letters on 17th July 2016

详情
AI中文摘要

我们获得了一类高次随机积分滤波器(SIF)用于非线性滤波应用。SIF基于随机球面-径向积分规则,能够实现高斯加权多元积分的渐近精确计算,这些积分出现在非线性贝叶斯滤波中。通过将所提出的五次SIF与现有随机、拟随机和立方体(卡尔曼)滤波器进行比较,证明所提滤波器在所有情况下均优于现有滤波器。

英文摘要

We obtain a class of higher-degree stochastic integration filters (SIF) for nonlinear filtering applications. SIF are based on stochastic spherical-radial integration rules that achieve asymptotically exact evaluations of Gaussian weighted multivariate integrals found in nonlinear Bayesian filtering. The superiority of the proposed scheme is demonstrated by comparing the performance of the proposed fifth-degree SIF against a number of existing stochastic, quasi-stochastic and cubature (Kalman) filters. The proposed filter is demonstrated to outperform existing filters in all cases.

1607.07848 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Towards Controllability of Wireless Network Quality using Mobile Robotic Routers

无线网络质量可控性研究:基于移动机器人中继节点

Pradipta Ghosh, Raktim Pal, Bhaskar Krishnamachari

AI总结 本文研究通过移动机器人中继节点的部署与移动控制,优化无线网络通信质量,提出集中式和分布式两种优化算法,通过仿真实验验证其性能。

详情
AI中文摘要

我们考虑了机器人中继节点部署与移动控制的问题,目标是形成和维护一组发射-接收对之间的最优通信网络。在该场景中,任意发射-接收对之间的通信路径包含一组预设的移动机器人中继节点。本文的目标是设计一个算法,优化机器人节点的位置以提高网络整体性能。我们将优化指标定义为所有链路的信号干扰加噪声比(SINR)的最小值。在本文中,我们提出两种优化算法分别解决该问题的集中式和分布式方式。我们还基于一组仿真实验展示了两种算法的性能。

英文摘要

We consider a problem of robotic router placement and mobility control with the objective of formation and maintenance of an optimal communication network between a set of transmitter-receiver pairs. In this scenario, the communication path between any transmitter-receiver pair contains a predetermined set of mobile robotic routers nodes. The goal of this work is to design an algorithm to optimize the positions of the robotic nodes to improve the overall performance of the network. We define the optimization metric to be the minimum of the Signal to Interference plus Noise Ratios (SINR) over all the links. In this manuscript, we propose two optimization algorithms to solve this problem in a centralized and a decentralized manner, respectively.We also demonstrate the performances of both algorithms based on a set of simulation experiments.

1607.04439 2026-06-04 cs.RO cs.NI cs.SY eess.SY 版本更新

A Networked Swarm Model for UAV Deployment in the Assessment of Forest Environments

用于森林环境评估的UAV部署网络化群体模型

Matthias R. Brust, Bogdan M. Strimbu

AI总结 本文提出一种新型UAV群体编队飞行方法,通过自适应网络结构保持群体连通性,实现高精度森林环境数据采集与融合。

详情
AI中文摘要

自主无人机(UAV)因其广泛的应用前景而受到青睐。伴随复杂传感器的出现,UAV可配备通信适配器实现机间通信。在UAV间形成群体通信时,如何管理其通信结构和移动性成为问题。本文考虑建立高效的群体运动模型和UAV之间的网络拓扑结构,专门用于高质量的森林制图场景。森林环境具有高度异质性的树和障碍物分布,对UAV群体构成极大挑战。群体需不断避免与树碰撞,自主改变轨迹,这可能导致与群体断开连接,需在通过障碍后重新连接,同时继续收集需高效融合和评估的环境数据。本文提出了一种新型解决方案,提供自适应且可靠的网络结构,维持群体连通性和可通信性。这些特性对于从UAV群体采集的数据实现详细准确的环境描述至关重要。本文方法的主要特点是群体中UAV数量的高可扩展性和群体内的自适应网络拓扑。

英文摘要

Autonomous Unmanned Aerial Vehicles (UAVs) have gained popularity due to their many potential application fields. Alongside sophisticated sensors, UAVs can be equipped with communication adaptors aimed for inter-UAV communication. Inter-communication of UAVs to form a UAV swarm raises questions on how to manage its communication structure and mobility. In this paper, we consider therefore the problem of establishing an efficient swarm movement model and a network topology between a collection of UAVs, which are specifically deployed for the scenario of high-quality forest-mapping. The forest environment with its highly heterogeneous distribution of trees and obstacles represents an extreme challenge for a UAV swarm. It requires the swarm to constantly avoid possible collisions with trees, to change autonomously the trajectory, which can lead to disconnection to the swarm, and to reconnect to the swarm after passing the obstacle, while continue collecting environmental data that needs to be fused and assessed efficiently. In this paper, we propose a novel solution to the formation flight problem for UAV swarms. The proposed method provides an adaptive and reliable network structure, which maintains swarm connectivity and communicability. These characteristics are needed to achieve a detailed and accurate description of the environment from the data acquired by the UAV swarm. The main characteristics of our approach are high scalability regarding the number of UAVs in the swarm and the adaptive network topology within the swarm.

1601.05257 2026-06-04 eess.SY cs.RO cs.SY stat.AP 版本更新

Magnetometer calibration using inertial sensors

使用惯性传感器的磁力计校准

Manon Kok, Thomas B. Schön

AI总结 本文提出一种实用算法,用于校准磁力计以应对磁干扰和传感器误差,并结合惯性测量进行姿态估计。

Comments 19 pages, 8 figures

详情
Journal ref
IEEE Sensors Journal, Volume 16, Issue 14, Pages 5679--5689, 2016
AI中文摘要

本文提出了一种实用算法,用于校准磁力计以应对磁干扰和传感器误差。该算法还校正磁力计与惯性传感器轴之间的偏移,将磁力计测量与惯性测量结合用于姿态估计。校准算法被建模为最大似然问题的解决方案,并在离线环境中进行计算。使用两个不同商用传感器单元的数据验证了该算法的有效性。将校准后的磁力计测量与惯性传感器结合用于确定传感器姿态,显著提高了航向估计的精度。

英文摘要

In this work we present a practical algorithm for calibrating a magnetometer for the presence of magnetic disturbances and for magnetometer sensor errors. To allow for combining the magnetometer measurements with inertial measurements for orientation estimation, the algorithm also corrects for misalignment between the magnetometer and the inertial sensor axes. The calibration algorithm is formulated as the solution to a maximum likelihood problem and the computations are performed offline. The algorithm is shown to give good results using data from two different commercially available sensor units. Using the calibrated magnetometer measurements in combination with the inertial sensors to determine the sensor's orientation is shown to lead to significantly improved heading estimates.

1606.09278 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Motion Planning With Gamma-Harmonic Potential Fields

基于伽马谐波势场的运动规划

Ahmad A. Masoud

AI总结 本文扩展了谐波势场方法,用于处理无法分割为独立区域的机器人工作空间。通过任务导向的概率描述符和目标点生成导航策略,能够适应存在矢量漂移场的复杂环境。

详情
Journal ref
IEEE Transactions on Aerospace and Electronic Systems, 48 (4), 2012, pp. 2786 - 2801
AI中文摘要

本文扩展了谐波势场(HPF)方法,以实现更广泛的规划能力。该方法适用于机器人工作空间无法分割为独立几何子区域的情况。所提出的方法将任务导向的概率工作空间描述符作为输入,与目标点结合,生成导航策略,以将智能体从工作空间中的任意点引导至目标。该方法易于适应包含矢量漂移场的复杂环境。HPF方法的扩展基于非均匀导电介质中电流的物理类比。所得到的势场称为伽马谐波势场(GHPF)。提供了修改方法能够避免零概率(确定威胁)区域并收敛到目标的证明。通过仿真展示了规划器的能力。

英文摘要

This paper extends the capabilities of the harmonic potential field (HPF) approach to planning. The extension covers the situation where the workspace of a robot cannot be segmented into geometrical subregions where each region has an attribute of its own. The suggested approach uses a task-centered, probabilistic descriptor of the workspace as an input to the planner. This descriptor is processed, along with a goal point, to yield the navigation policy needed to steer the agent from any point in its workspace to the target. The approach is easily adaptable to planning in a cluttered environment containing a vector drift field. The extension of the HPF approach is based on the physical analogy with an electric current flowing in a nonhomogeneous conducting medium. The resulting potential field is known as the gamma-harmonic potential (GHPF). Proofs of the ability of the modified approach to avoid zero-probability (definite threat) regions and to converge to the goal are provided. The capabilities of the planer are demonstrated using simulation.

1607.02632 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Decentralized, Self-organizing, Potential field-based Control for Individuallymotivated, Mobile Agents in a Cluttered Environment: A Vector-Harmonic Potential Field Approach

去中心化、自组织、基于势场的控制方法用于在复杂环境中具有个体动机的移动智能体:一种向量谐波势场方法

Ahmad A. Masoud

AI总结 本文提出了一种去中心化、自组织的势场控制方法,用于复杂环境中多个智能体的协同运动控制,通过向量谐波势场方法实现低计算成本和自适应性。

详情
Journal ref
IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, May 2007, Volume:37, Issue: 3, pp. 372-390
AI中文摘要

空间多代理系统正受到越来越多研究者的关注,旨在为涉及多个代理资源共享的应用开发理论基础。交通管理系统是其中之一。本文探讨了构建一个去中心化交通控制器,用于大量共享工作空间的智能体,其中包含静止禁区域。所提出的多代理运动控制器在满足工作空间几何条件时是完整的。其计算成本线性增长于智能体数量。控制器具有自组织特性,能够自主处理不完整信息和意外情况。此外,控制器具有开放结构,允许任何智能体加入或退出群体而不影响其他智能体的功能方式。为此,提出了一种去中心化的定义,将去中心化等同于在人工生命模式下智能体群体的自组织。该定义用于指导多代理控制器的构建。控制器采用势场方法实现。理论发展和仿真结果均被提供。

英文摘要

Spatial multi-agency has been receiving growing attention from researchers exploring many of the aspects and modalities of this phenomenon. The aim is to develop the theoretical background needed for a multitude of applications involving the sharing of resources by more than one agent. A traffic management system is one of these applications. Here, a large group of mobile robots that are operating in communication-limited, and sensory-limited modes are required to cope with each others presence as well as the contents of their environment while preserving their ability to reach their preset, independent goals. This work explores the construction of a decentralized traffic controller for a large group of agents sharing a workspace with stationary forbidden regions. The suggested multi-agent motion controller is complete provided that a lenient condition on the geometry of the workspace is upheld. It has a low computational effort that linearly increases with the number of agents. The controller is also self-organizing; therefore, it is able to deal, on its own, with incomplete information and unexpected situations. In addition to the above, the controller has an open structure to enable any agent to join or leave the group without the remaining agents having to adjust the manner in which they function. To meet these requirements, a definition of decentralization is suggested. This definition equates decentralization to self-organization in a group of agents operating in an artificial life mode. The definition is used to provide guidelines for the construction of the multi-agent controller. The controller is realized using the potential field approach. Theoretical developments, as well as simulation results, are provided.

1606.09270 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Kinodynamic Motion Planning: A Novel Type Of Nonlinear, Passive Damping Forces And Advantages

运动动力学路径规划:一种新型非线性、各向异性阻尼力及优势

Ahmad A. Masoud

AI总结 本文提出一种结合非线性各向异性阻尼力的新型运动动力学路径规划方法,有效管理机器人动态并抑制惯性瞬态,无需完整系统动力学知识。

详情
Journal ref
IEEE Robotics And Automation Magazine, March 2010, Pp. 85-99
AI中文摘要

本文扩展了谐振势场方法在路径规划中的能力,涵盖机器人运动的运动学和动力学方面。所提出的方法通过将梯度引导场从谐振势场转换为控制信号,加入了新型非线性、各向异性阻尼力。两种方法的结合提供了一个既能引导机器人又能有效管理其动态的信号。运动动力学规划信号继承了谐振梯度场的引导能力,同时可轻松配置以高效抑制机器人轨迹中的惯性瞬态,而不影响操作速度。该方法适用于耗散系统以及受外力作用的系统,无需完整系统动力学知识。本文提供了理论发展和仿真结果。

英文摘要

This article extends the capabilities of the harmonic potential field approach to planning to cover both the kinematic and dynamic aspects of a robot motion. The suggested approach converts the gradient guidance field from a harmonic potential to a control signal by augmenting it with a novel type of damping forces called nonlinear, anisotropic, damping forces. The combination of the two provides a signal that can both guide a robot and effectively manage its dynamics. The kinodynamic planning signal inherits the guidance capabilities of the harmonic gradient field. It can also be easily configured to efficiently suppress the inertia-induced transients in the robot trajectory without compromising the speed of operation. The approach works with dissipative systems as well as systems acted on by external forces without needing the full knowledge of the system dynamics. Theoretical developments and simulation results are provided in this article.

1607.01478 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Mixed Strategy for Constrained Stochastic Optimal Control

混合策略用于受约束的随机最优控制

Masahiro Ono, Mahmoud El Chamie, Marco Pavone, Behcet Acikmese

AI总结 本文提出混合策略用于受约束的随机最优控制,证明随机化控制输入在非凸优化问题中可降低成本,等于对偶间隙,并提出基于对偶优化的高效求解方法。

Comments 11 pages. 9 figures.Preliminary version of a working journal paper

详情
AI中文摘要

在具有随机约束的最优控制问题中,随机选择控制输入可以降低预期成本,例如随机模型预测控制(SMPC)。我们考虑具有初始随机化的控制器,即在开始时随机选择K+1个控制序列(称为K-随机化)。已知对于具有K个约束的有限状态、有限动作马尔可夫决策过程(MDP),K-随机化足以达到最小成本。我们发现,对于具有连续状态和动作空间的随机最优控制问题,相同结果也成立。进一步,我们证明当优化问题非凸时,控制输入的随机化可以导致成本降低,且该降低量等于对偶间隙。然后,我们提供随机解最优性的必要和充分条件,并开发基于对偶优化的高效求解方法。此外,在K=1的特殊情况(如联合概率约束问题)中,对偶优化可通过根查找更高效地解决。最后,我们在路径规划到未来火星任务的着陆、下降和着陆(EDL)规划等多个实际问题上测试理论并演示求解方法。

英文摘要

Choosing control inputs randomly can result in a reduced expected cost in optimal control problems with stochastic constraints, such as stochastic model predictive control (SMPC). We consider a controller with initial randomization, meaning that the controller randomly chooses from K+1 control sequences at the beginning (called K-randimization).It is known that, for a finite-state, finite-action Markov Decision Process (MDP) with K constraints, K-randimization is sufficient to achieve the minimum cost. We found that the same result holds for stochastic optimal control problems with continuous state and action spaces.Furthermore, we show the randomization of control input can result in reduced cost when the optimization problem is nonconvex, and the cost reduction is equal to the duality gap. We then provide the necessary and sufficient conditions for the optimality of a randomized solution, and develop an efficient solution method based on dual optimization. Furthermore, in a special case with K=1 such as a joint chance-constrained problem, the dual optimization can be solved even more efficiently by root finding. Finally, we test the theories and demonstrate the solution method on multiple practical problems ranging from path planning to the planning of entry, descent, and landing (EDL) for future Mars missions.

1607.00765 2026-06-04 cs.NE cs.RO cs.SY eess.SY 版本更新

Multi-Objective Design of State Feedback Controllers Using Reinforced Quantum-Behaved Particle Swarm Optimization

基于强化量子行为粒子群优化的多目标状态反馈控制器设计

Kaveh Hassani, Won-Sook Lee

AI总结 本文提出一种新颖的多目标设计方法,利用量子行为粒子群优化算法(QPSO)优化LQR控制器配置,结合模拟退火和高斯邻域选择机制改进初始化方案,引入动态加权准则生成Pareto最优解,通过实验验证其在控制性能上的优越性。

详情
Journal ref
Applied Soft Computing, 41, pp. 66-76, 2016
AI中文摘要

本文提出了一种新颖且通用的多目标设计范式,利用量子行为粒子群优化(QPSO)算法确定给定问题中LQR控制器的最佳配置,考虑一组竞争性目标。本文的主要贡献包括:(1)基于模拟退火算法和高斯邻域选择机制的改进初始化方案;(2)结合膜计算算法优势的局部搜索策略;(3)引入动态加权准则,将软约束和硬约束与控制目标动态结合,为设计者提供一组Pareto最优解,并根据实际偏好选择目标解。所提方法在两个控制基准上通过敏感性分析和全因子参数选择与梯度法、七种元启发式算法和试错法进行比较,实验结果表明,所提方法在控制器努力、暂态响应指标和稳态相关指标上均优于对手方法。

英文摘要

In this paper, a novel and generic multi-objective design paradigm is proposed which utilizes quantum-behaved PSO(QPSO) for deciding the optimal configuration of the LQR controller for a given problem considering a set of competing objectives. There are three main contributions introduced in this paper as follows. (1) The standard QPSO algorithm is reinforced with an informed initialization scheme based on the simulated annealing algorithm and Gaussian neighborhood selection mechanism. (2) It is also augmented with a local search strategy which integrates the advantages of memetic algorithm into conventional QPSO. (3) An aggregated dynamic weighting criterion is introduced that dynamically combines the soft and hard constraints with control objectives to provide the designer with a set of Pareto optimal solutions and lets her to decide the target solution based on practical preferences. The proposed method is compared against a gradient-based method, seven meta-heuristics, and the trial-and-error method on two control benchmarks using sensitivity analysis and full factorial parameter selection and the results are validated using one-tailed T-test. The experimental results suggest that the proposed method outperforms opponent methods in terms of controller effort, measures associated with transient response and criteria related to steady-state.

1607.00644 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Nearest Neighbor-based Rendezvous for Sparsely Connected Mobile Agents

基于最近邻的稀疏连接移动体 rendezvous

Ahmad A. Masoud

AI总结 本文提出一种收敛的最近邻控制协议,用于非平凡动力学的移动体。协议保证即使每个体仅与单个最近邻通信,也能收敛到共同点。最近邻需位于任意小的优先区外,协议由两层结构组成,第一层为一阶动力学提供 rendezvous 信号,第二层将信号转换为适合现实体的控制信号。

详情
Journal ref
ASME. J. Dyn. Sys., Meas., Control. 2015;137(12):121002-121002-18
AI中文摘要

本文提出了一种收敛的最近邻控制协议,用于具有非平凡动力学的移动体。该协议保证即使每个移动体仅能与单个最近邻通信,也能收敛到共同点。然而,最近邻必须位于任意小的优先区外。该协议由两层结构组成,这两层以可证明正确的方式相互连接。第一层假设移动体具有第一阶动力学,提供 rendezvous 信号。另一层以去中心化的方式将该信号转换为适合现实移动体(如UGVs、UAVs和具有第二阶动力学的全向移动体)的控制信号。

英文摘要

In this paper a convergent, nearest-neighbor, control protocol is suggested for agents with nontrivial dynamics. The protocol guarantees convergence to a common point in space even if each agent is restricted to communicate with a single nearest neighbor. The neighbor, however, is required to lie outside an arbitrarily small priority zone surrounding the agent. The control protocol consists of two layers interconnected in a provably-correct manner. The first layer provides the guidance signal to a rendezvous point assuming that the agents have first order dynamics. The other layer converts in a decentralized manner the guidance signal to a control signal that suits realistic agents such as UGVs, UAVs and holonomic agents with second order dynamics.

1606.09275 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Harmonic Potential Approach For Simultaneous Planning And Control Of A Generic UAV Platform

一种谐波势场方法用于通用无人机平台的同时规划与控制

Ahmad A. Masoud

AI总结 本文提出利用谐波势场方法实现多种无人机的同时规划与控制,通过生成密集参考速度场调节无人机速度,确保其向目标点移动并满足行为约束。

详情
Journal ref
Journal Of Intelligent & Robotic Systems: Volume 65, Issue 1 (2012), Page 153-173
AI中文摘要

本文采用谐波势场方法解决多种无人飞行器的同时规划与控制问题。通过生成参考速度场的梯度来调节无人机速度,使其向目标点移动并满足事先编码在参考场中的约束。调节过程使用了称为虚拟速度吸引器(VVA)的新概念。谐波势场梯度与VVA的综合作用能够产生高效、易于实现且具有证明正确性的上下文敏感控制动作,适用于多种无人机。该方法已开发并提供了正确性证明及仿真结果。

英文摘要

Simultaneous planning and control of a large variety of unmanned aerial vehicles (UAVs) is tackled using the harmonic potential field (HPF) approach. A dense reference velocity field generated from the gradient of an HPF is used to regulate the velocity of the UAV concerned in a manner that would propel the UAV to a target point while enforcing the constraints on behavior that were a priori encoded in the reference field. The regulation process is carried-out using a novel and simple concept called the: virtual velocity attractor (VVA). The combined effect of the HPF gradient and the VVA is found able to yield an efficient, easy to implement, well-behaved and provably-correct context-sensitive control action that suits a wide variety of UAVs. The approach is developed and basic proofs of correctness are provided along with simulation results.

1606.08323 2026-06-04 math.OC cs.RO cs.SY eess.SY math.DS 版本更新

Simultaneous Mode, Input and State Estimation for Switched Linear Stochastic Systems

切换线性随机系统的模式、输入和状态联合估计

Sze Zheng Yong, Minghui Zhu, Emilio Frazzoli

AI总结 本文提出了一种滤波算法,用于同时估计隐藏模式切换线性随机系统的模式、输入和状态。通过多模型方法,利用每个模式的线性输入和状态滤波器银行,通过识别关键性质实现最可能模型的估计。

Comments Submitted to SIAM Journal on Control and Optimization

详情
AI中文摘要

本文提出了一种滤波算法,用于同时估计隐藏模式切换线性随机系统的模式、输入和状态。使用多模型方法,结合每个模式的线性输入和状态滤波器银行,我们的算法依赖于能够找到最可能的模型作为模式估计的能力,这通过识别一个关键性质实现,即我们称为广义创新的残差信号是高斯白噪声。我们还提供了所提算法的渐近分析,并给出了渐近达到真实模型(一致性)或根据信息论度量达到'最近'模型(收敛)的充分条件。一个关于意图感知车辆在交叉口的仿真示例用于展示本方法的有效性。

英文摘要

In this paper, we propose a filtering algorithm for simultaneously estimating the mode, input and state of hidden mode switched linear stochastic systems with unknown inputs. Using a multiple-model approach with a bank of linear input and state filters for each mode, our algorithm relies on the ability to find the most probable model as a mode estimate, which we show is possible with input and state filters by identifying a key property, that a particular residual signal we call generalized innovation is a Gaussian white noise. We also provide an asymptotic analysis for the proposed algorithm and provide sufficient conditions for asymptotically achieving convergence to the true model (consistency), or to the 'closest' model according to an information-theoretic measure (convergence). A simulation example of intention-aware vehicles at an intersection is given to demonstrate the effectiveness of our approach.

1606.05124 2026-06-04 cs.RO cs.AI cs.SY eess.SY 版本更新

Robust Active Perception via Data-association aware Belief Space planning

通过数据关联意识的信念空间规划实现鲁棒的主动感知

Shashank Pathak, Antony Thomas, Asaf Feniger, Vadim Indelman

AI总结 本文提出一种结合数据关联推理的信念空间规划方法,以应对定位不确定性和感知模糊环境中的挑战,通过设计新的成本函数提升主动解歧能力。

详情
AI中文摘要

我们开发了一种信念空间规划(BSP)方法,通过在规划中整合数据关联(DA)推理,同时考虑额外的不确定性来源,从而推动了该领域的前沿。现有BSP方法通常假设数据关联已知且完美,但在存在定位不确定性、模糊和感知混叠环境时,这一假设更难成立。相反,我们的数据关联意识信念空间规划(DA-BSP)方法在信念演化中显式推理数据关联,因此能更好地应对这些具有挑战性的现实场景。特别是,我们展示了由于感知混叠,后验信念成为概率分布函数的混合,设计了衡量预期模糊程度和后验不确定性的成本函数。使用这些以及标准成本(如控制惩罚、距离目标)在目标函数中,得到一个能够可靠表示动作影响且特别擅长主动解歧的通用框架。我们的方法因此适用于感知混叠环境中的鲁棒主动感知和自主导航。我们通过基本和现实的模拟展示了关键方面。

英文摘要

We develop a belief space planning (BSP) approach that advances the state of the art by incorporating reasoning about data association (DA) within planning, while considering additional sources of uncertainty. Existing BSP approaches typically assume data association is given and perfect, an assumption that can be harder to justify while operating, in the presence of localization uncertainty, in ambiguous and perceptually aliased environments. In contrast, our data association aware belief space planning (DA-BSP) approach explicitly reasons about DA within belief evolution, and as such can better accommodate these challenging real world scenarios. In particular, we show that due to perceptual aliasing, the posterior belief becomes a mixture of probability distribution functions, and design cost functions that measure the expected level of ambiguity and posterior uncertainty. Using these and standard costs (e.g.~control penalty, distance to goal) within the objective function, yields a general framework that reliably represents action impact, and in particular, capable of active disambiguation. Our approach is thus applicable to robust active perception and autonomous navigation in perceptually aliased environments. We demonstrate key aspects in basic and realistic simulations.

1602.02990 2026-06-04 cs.RO cs.LG cs.SY eess.SY 版本更新

Self-organized control for musculoskeletal robots

肌骨机器人中的自组织控制

Ralf Der, Georg Martius

AI总结 本文提出了一种自组织控制方法,通过无功能控制器实现机器人与环境的动态交互,展示了其在肌肉驱动臂肩系统中的自组织行为及与物体动态的共振效应。

Comments 11 pages, 4 figures, 1 table

详情
AI中文摘要

随着机器人技术的快速发展,最优控制成为研究核心。传统方法中,控制器基于传感器历史数据和预设目标进行动作决策。然而,弹性驱动机器人面临严重挑战。本文提出自组织控制新范式,采用无自身功能的固定函数控制器,基于传感器历史数据。在Myorobotics工具包的肌肉驱动臂肩系统中,观察到多样化的自组织行为:当系统独处时,臂部产生伪随机姿态序列,也可被操控为确定性运动模式。最有趣的是,当附加物体后,控制器与物体内部动态产生共振:给半满瓶时,系统自发摇晃瓶身以产生最大水动态响应;附加摆锤时,控制器使其进入圆周模式。本文还讨论了该控制器范式在意图驱动行为生成中的应用前景。

英文摘要

With the accelerated development of robot technologies, optimal control becomes one of the central themes of research. In traditional approaches, the controller, by its internal functionality, finds appropriate actions on the basis of the history of sensor values, guided by the goals, intentions, objectives, learning schemes, and so on planted into it. The idea is that the controller controls the world---the body plus its environment---as reliably as possible. However, in elastically actuated robots this approach faces severe difficulties. This paper advocates for a new paradigm of self-organized control. The paper presents a solution with a controller that is devoid of any functionalities of its own, given by a fixed, explicit and context-free function of the recent history of the sensor values. When applying this controller to a muscle-tendon driven arm-shoulder system from the Myorobotics toolkit, we observe a vast variety of self-organized behavior patterns: when left alone, the arm realizes pseudo-random sequences of different poses but one can also manipulate the system into definite motion patterns. But most interestingly, after attaching an object, the controller gets in a functional resonance with the object's internal dynamics: when given a half-filled bottle, the system spontaneously starts shaking the bottle so that maximum response from the dynamics of the water is being generated. After attaching a pendulum to the arm, the controller drives the pendulum into a circular mode. In this way, the robot discovers dynamical affordances of objects its body is interacting with. We also discuss perspectives for using this controller paradigm for intention driven behavior generation.

1606.03754 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Towards Self-Calibrating Inertial Body Motion Capture

迈向自校准的惯性人体运动捕捉

Bertram Taetz, Gabriele Bleser, Markus Miezal

AI总结 本文提出了一种新的在线方法,用于同时估计人体运动的肢体方位和位置及传感器到肢体的校准参数,结合了先进的运动、测量和生物力学模型及新的随机方程和先验知识。

Comments 9 pages, 6 figures, accepted to the 19th International Conference on Information Fusion

详情
AI中文摘要

本文提出了一种新的在线方法,用于同时估计人体运动的肢体方位和位置及传感器到肢体的校准参数,结合了先进的运动、测量和生物力学模型及新的随机方程和先验知识。这些基于多体系统的运动学、解剖和身体形状信息,以及用于正则化的参数属性。这导致了一个受约束的加权最小二乘问题,以滑动窗口的方式求解。磁力计信息目前仅用于初始化,而估计本身无需磁力计。该方法在模拟数据和真实数据上进行了测试,数据来自下肢配置。

英文摘要

This paper presents a novel online capable method for simultaneous estimation of human motion in terms of segment orientations and positions along with sensor-to-segment calibration parameters from inertial sensors attached to the body. In order to solve this ill-posed estimation problem, state-of-the-art motion, measurement and biomechanical models are combined with new stochastic equations and priors. These are based on the kinematics of multi-body systems, anatomical and body shape information, as well as, parameter properties for regularisation. This leads to a constrained weighted least squares problem that is solved in a sliding window fashion. Magnetometer information is currently only used for initialisation, while the estimation itself works without magnetometers. The method was tested on simulated, as well as, on real data, captured from a lower body configuration.

1411.0181 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Restricted Discrete Invariance and Self-Synchronization For Stable Walking of Bipedal Robots

受限离散不变性与自同步用于双足机器人稳定行走

Hamed Razavi, Anthony M. Bloch, Christine Chevallereau, J. W. Grizzle

AI总结 本文研究了双足机器人稳定行走的低维子流形不变性,提出自同步概念,通过3D线性倒立摆模型分析,扩展至9自由度双足机器人,验证渐近稳定行走的可行性。

Comments Conference

详情
AI中文摘要

双足运动模型是混合系统,包含连续部分(由拉格朗日方程和执行器生成)和离散部分(腿部转移)。离散部分通常由连续状态空间中的局部嵌入共维一子流形(切换面)和重置映射组成。本文旨在识别切换面的低维子流形,使其在闭环动力学下不变,从而实现渐近稳定的周期性步态。本文首先研究经典的3D线性倒立摆(LIP)模型,通过分析结果更易获得。关键贡献是自同步概念,即摆动平面的周期趋于共同周期。通过3D LIP模型的不变性研究,将该概念扩展到9自由度3D双足机器人,并通过数值研究验证渐近稳定行走的可行性。

英文摘要

Models of bipedal locomotion are hybrid, with a continuous component often generated by a Lagrangian plus actuators, and a discrete component where leg transfer takes place. The discrete component typically consists of a locally embedded co-dimension one submanifold in the continuous state space of the robot, called the switching surface, and a reset map that provides a new initial condition when a solution of the continuous component intersects the switching surface. The aim of this paper is to identify a low-dimensional submanifold of the switching surface, which, when it can be rendered invariant by the closed-loop dynamics, leads to asymptotically stable periodic gaits. The paper begins this process by studying the well-known 3D Linear Inverted Pendulum (LIP) model, where analytical results are much easier to obtain. A key contribution here is the notion of \textit{self-synchronization}, which refers to the periods of the pendular motions in the sagittal and frontal planes tending to a common period. The notion of invariance resulting from the study of the 3D LIP model is then extended to a 9-DOF 3D biped. A numerical study is performed to illustrate that asymptotically stable walking may be obtained.

1606.02205 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Applying Gaussian distributed constraints to Gaussian distributed variables

将高斯分布约束应用于高斯分布变量

Andrew W. Palmer, Andrew J. Hill, Steven J. Scheding

AI总结 本文提出一种分析方法,用于截断由高斯分布描述的不等式约束高斯分布变量,通过基于矩的高斯近似改进了截断分布,应用于不确定约束的卡尔曼滤波问题,仿真显示效果优于无约束和硬约束卡尔曼滤波。

Comments 34 pages, 12 figures, accepted to Information Fusion. Updated to fix minor errors in equations (33), (34), and (B.12)

详情
AI中文摘要

本文开发了一种分析方法,用于截断由高斯分布描述的不等式约束高斯分布变量。现有截断方法要么假设硬约束,要么使用数值方法处理不确定约束。所提出的方法引入基于矩的高斯近似来近似截断分布。该方法可应用于众多问题,动机问题为具有不确定约束的卡尔曼滤波。在仿真示例中,所开发的方法在性能上优于无约束卡尔曼滤波超过40%,优于硬约束卡尔曼滤波超过17%。

英文摘要

This paper develops an analytical method of truncating inequality constrained Gaussian distributed variables where the constraints are themselves described by Gaussian distributions. Existing truncation methods either assume hard constraints, or use numerical methods to handle uncertain constraints. The proposed approach introduces moment-based Gaussian approximations of the truncated distribution. This method can be applied to numerous problems, with the motivating problem being Kalman filtering with uncertain constraints. In a simulation example, the developed method is shown to outperform unconstrained Kalman filtering by over 40% and hard-constrained Kalman filtering by over 17%.

1605.08542 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Distributed controller-estimator for target tracking of networked robotic systems under sampled interaction

分布式控制器-估计器用于网络化机器人系统在采样交互下的目标跟踪

Ming-Feng Ge, Zhi-Hong Guan, Bin Hu, Ding-Xin He, Rui-Quan Liao

AI总结 本文提出两种新型分布式控制器-估计器算法,用于网络化机器人系统在采样交互下的目标跟踪问题,通过小值范数和李雅普诺夫稳定性理论分析,确定交互拓扑、采样周期等参数以实现跟踪误差的实际稳定性。

Comments 8 pages, 4 figures, Published in Automatica

详情
Journal ref
Automatica, 2016, 69: 410-417
AI中文摘要

本文研究了在采样交互下网络化机器人系统(NRSs)的目标跟踪问题。目标被假设为时间变化的,并由二阶振荡器描述。提出了两种新颖的分布式控制器-估计器算法(DCEA),其包含连续和不连续信号。基于小值范数和李雅普诺夫稳定性理论的性质,给出了交互拓扑、采样周期和其他控制参数的条件,以实现跟踪误差的实际稳定性和稳定性区域的定量调节。通过与其他算法的比较和仿真示例,展示了所提DCEA的优势。

英文摘要

This paper investigates the target tracking problem for networked robotic systems (NRSs) under sampled interaction. The target is assumed to be time-varying and described by a second-order oscillator. Two novel distributed controller-estimator algorithms (DCEA), which consist of both continuous and discontinuous signals, are presented. Based on the properties of small-value norms and Lyapunov stability theory, the conditions on the interaction topology, the sampling period, and the other control parameters are given such that the practical stability of the tracking error is achieved and the stability region is regulated quantitatively. The advantages of the presented DCEA are illustrated by comparisons with each other and the existing coordination algorithms. Simulation examples are given to demonstrate the theoretical results.

1406.4047 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robot Impedance Control and Passivity Analysis with Inner Torque and Velocity Feedback Loops

机器人阻抗控制与内力和速度反馈环的被动性分析

Michele Focchi, Gustavo A. Medrano-Cerda, Thiago Boaventura, Marco Frigerio, Jonas Buchli, Darwin G. Caldwell, Claudio Semini

AI总结 本文研究了机器人阻抗控制中内力和速度反馈环对稳定性区域和系统被动性的影响,通过仿真和实验数据验证,并提出基于内力环和正速度反馈环的关节阻抗控制器设计方法。

Comments 14 pages in Control Theory and Technology (2016)

详情
AI中文摘要

阻抗控制是一种用于机器人交互力控制的成熟技术。然而,实际实现中内环的阻抗控制可能面临若干限制。尽管在设计嵌套控制系统时通常通过最大化内环带宽来提高跟踪性能,但当需要渲染特定范围的阻抗参数时,这种方法可能并不合适。特别是,稳定刚度和阻尼值的可行范围可能受到内控制环(例如力环)带宽以及滤波和采样频率的强烈影响。本文对这些方面如何影响阻抗参数的稳定性区域以及系统的被动性进行了深入分析,并通过仿真和实验数据予以支持。此外,本文还提出了一种基于内力环和正速度反馈环的关节阻抗控制器设计方法。速度反馈的目标是在保持稳定性的前提下,增加力环的带宽,而无需复杂的控制器。

英文摘要

Impedance control is a well-established technique to control interaction forces in robotics. However, real implementations of impedance control with an inner loop may suffer from several limitations. Although common practice in designing nested control systems is to maximize the bandwidth of the inner loop to improve tracking performance, it may not be the most suitable approach when a certain range of impedance parameters has to be rendered. In particular, it turns out that the viable range of stable stiffness and damping values can be strongly affected by the bandwidth of the inner control loops (e.g. a torque loop) as well as by the filtering and sampling frequency. This paper provides an extensive analysis on how these aspects influence the stability region of impedance parameters as well as the passivity of the system. This will be supported by both simulations and experimental data. Moreover, a methodology for designing joint impedance controllers based on an inner torque loop and a positive velocity feedback loop will be presented. The goal of the velocity feedback is to increase (given the constraints to preserve stability) the bandwidth of the torque loop without the need of a complex controller.

1307.7170 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.OC 版本更新

Decentralized Multi-Robot Encirclement of a 3D Target with Guaranteed Collision Avoidance

去中心化多机器人3D目标环绕控制

Antonio Franchi, Paolo Stegagno, Giuseppe Oriolo

AI总结 本文提出多机器人系统在三维空间中环绕移动目标的控制框架,提出三种控制策略变体并验证其有效性,同时讨论了保持安全机器人间距离的扩展方案,通过仿真实验验证了该框架的可行性。

Comments Accepted for Autonomous Robots - Springer

详情
Journal ref
Autonomous Robots February 2016, Volume 40, Issue 2, pp 245-265, First online: 11 July 2015
AI中文摘要

我们提出了一种控制框架,用于使用多机器人系统实现移动在三维空间中的目标环绕。三种基本控制策略的变体被提出以应对不同的环绕问题版本,并正式建立了其有效性。还讨论了确保维护安全机器人间距离的扩展。所提出的框架是完全去中心化的,仅需要机器人之间的本地通信;特别是,每个机器人本地估计所有相关的全局量。通过在运动学点机器人和四旋翼无人机上的仿真实验以及差分驱动轮式移动机器人上的实验验证了所提策略。

英文摘要

We present a control framework for achieving encirclement of a target moving in 3D using a multi-robot system. Three variations of a basic control strategy are proposed for different versions of the encirclement problem, and their effectiveness is formally established. An extension ensuring maintenance of a safe inter-robot distance is also discussed. The proposed framework is fully decentralized and only requires local communication among robots; in particular, each robot locally estimates all the relevant global quantities. We validate the proposed strategy through simulations on kinematic point robots and quadrotor UAVs, as well as experiments on differential-drive wheeled mobile robots.

1605.04368 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Decentralized Autonomous Navigation Strategies for Multi-Robot Search and Rescue

多机器人搜索救援中的去中心化自主导航策略

Ahmad Baranzadeh

AI总结 本文提出三种基于三角网格模式的算法,用于多机器人搜索任务,通过数学证明算法收敛性,并通过仿真和实验验证其有效性,同时探讨了去中心化编队形成与障碍物避让问题。

Comments arXiv admin note: substantial text overlap with arXiv:1402.5188 by other authors

详情
AI中文摘要

在本报告中,我们尝试改进现有多机器人搜索操作中的方法。我们提出了三种新的算法,利用三角网格模式,即机器人在搜索过程中必然经过三角网格的顶点。使用三角网格模式的主要优势是,在覆盖任意有界区域时,所需的最少机器人数量是渐近最优的。我们使用在搜索操作中由机器人制作并共享的新拓扑地图。我们考虑一个事先未知、具有任意形状且包含障碍物的区域。与许多现有启发式算法不同,我们为算法提供了数学证明的收敛性。我们使用真实机器人的模拟器和环境展示了所提出算法的计算机仿真结果。我们通过使用真实机器人进行实验来评估算法的性能。我们比较了我们自己的算法与三种其他研究人员提出的现有算法的性能。结果展示了我们所提方案的优势。本文还探讨了移动机器人团队的编队形成与障碍物避让问题。我们为一组移动机器人提出了一种去中心化的编队形成与障碍物避让算法,以移动到定义的几何构型。此外,我们考虑了一种更复杂的编队问题,其中一组匿名机器人;这些机器人在最终构型中并不知道自己的位置,需要在编队过程中达成共识。我们为这些匿名机器人提出了一种随机算法,以概率1收敛到期望的构型。我们还提出了一种新的障碍物避让规则,用于编队形成算法中。

英文摘要

In this report, we try to improve the performance of existing approaches for search operations in multi-robot context. We propose three novel algorithms that are using a triangular grid pattern, i.e., robots certainly go through the vertices of a triangular grid during the search procedure. The main advantage of using a triangular grid pattern is that it is asymptotically optimal in terms of the minimum number of robots required for the complete coverage of an arbitrary bounded area. We use a new topological map which is made and shared by robots during the search operation. We consider an area that is unknown to the robots a priori with an arbitrary shape, containing some obstacles. Unlike many current heuristic algorithms, we give mathematically proofs of convergence of the algorithms. The computer simulation results for the proposed algorithms are presented using a simulator of real robots and environment. We evaluate the performance of the algorithms via experiments with real robots. We compare the performance of our own algorithms with three existing algorithms from other researchers. The results demonstrate the merits of our proposed solution. A further study on formation building with obstacle avoidance for a team of mobile robots is presented in this report. We propose a decentralized formation building with obstacle avoidance algorithm for a group of mobile robots to move in a defined geometric configuration. Furthermore, we consider a more complicated formation problem with a group of anonymous robots; these robots are not aware of their position in the final configuration and need to reach a consensus during the formation process. We propose a randomized algorithm for the anonymous robots that achieves the convergence to a desired configuration with probability 1. We also propose a novel obstacle avoidance rule, used in the formation building algorithm.

1605.02196 2026-06-04 eess.SY cs.CV cs.LG cs.RO cs.SY 版本更新

All Weather Perception: Joint Data Association, Tracking, and Classification for Autonomous Ground Vehicles

全天候感知:面向自主地面车辆的数据关联、跟踪与分类的联合解决方案

Peter Radecki, Mark Campbell, Kevin Matzen

AI总结 本文提出一种新型概率感知算法,用于自主地面车辆在全天候条件下的数据关联、目标跟踪和分类。该算法扩展了原有的 Rao-Blackwellized 粒子滤波器,结合多模型跟踪进行分类,并通过升级 Cornell 的 AGV 实验证明了先进视觉算法在恶劣天气下的鲁棒性。

Comments 35 pages, 21 figures, 14 tables

详情
AI中文摘要

本文提出了一种新颖的概率感知算法,作为实时联合解决方案,用于自主地面车辆在全天候条件下的数据关联、目标跟踪和目标分类。该算法扩展了最初使用粒子滤波进行数据关联和卡尔曼滤波进行多目标跟踪的 Rao-Blackwellized 粒子滤波器(Miller 等,2011a),现已包含多模型跟踪用于分类。此外,还实现了一种最先进的视觉检测算法,该算法包含方向信息,适用于自主地面车辆(AGV)应用。Cornell 的 AGV 从 DARPA 城市挑战中被升级并用于实验,以检验先进视觉算法能否补充或替代激光雷达和雷达传感器。在恶劣天气和光照条件下,传感器和算法性能得到测试。实验评估显示,在联合概率感知算法中,摄像头、激光雷达和雷达传感器能够实现稳健的全天候数据关联、跟踪和分类。

英文摘要

A novel probabilistic perception algorithm is presented as a real-time joint solution to data association, object tracking, and object classification for an autonomous ground vehicle in all-weather conditions. The presented algorithm extends a Rao-Blackwellized Particle Filter originally built with a particle filter for data association and a Kalman filter for multi-object tracking (Miller et al. 2011a) to now also include multiple model tracking for classification. Additionally a state-of-the-art vision detection algorithm that includes heading information for autonomous ground vehicle (AGV) applications was implemented. Cornell's AGV from the DARPA Urban Challenge was upgraded and used to experimentally examine if and how state-of-the-art vision algorithms can complement or replace lidar and radar sensors. Sensor and algorithm performance in adverse weather and lighting conditions is tested. Experimental evaluation demonstrates robust all-weather data association, tracking, and classification where camera, lidar, and radar sensors complement each other inside the joint probabilistic perception algorithm.

1510.01261 2026-06-04 eess.SY cs.RO cs.SY math.LO 版本更新

Optimal Mission Planner with Timed Temporal Logic Constraints

带有时间时序逻辑约束的最优任务规划器

Yuchen Zhou, Dipankar Maity, John S. Baras

AI总结 本文提出一种基于优化的方法,用于在动态环境中规划移动机器人路径,满足时间受限的时序约束。通过度量时序逻辑(MTL)编码任务规范,并将其转化为混合整数线性约束,利用混合整数线性规划求解器求解优化问题。

Comments European Control Conference 2015

详情
AI中文摘要

在本文中,我们提出了一种基于优化的方法,用于在动态环境中规划移动机器人的路径,以满足时间受限的时序约束。时序逻辑(TL)可以处理非常复杂的任务规范,如安全、覆盖、运动序列等。我们使用度量时序逻辑(MTL)来编码具有时间约束的任务规范。然后,我们将MTL公式转换为混合整数线性约束,并使用混合整数线性规划求解器求解相关优化问题。这种方法不同于基于自动机的方法,后者生成环境和动态的有限抽象,并使用自动机理论方法正式生成满足TL的路径。我们已在多个复杂的动态环境中应用了我们的方法,这些环境受到时间时序规范的约束。

英文摘要

In this paper, we present an optimization based method for path planning of a mobile robot subject to time bounded temporal constraints, in a dynamic environment. Temporal logic (TL) can address very complex task specification such as safety, coverage, motion sequencing etc. We use metric temporal logic (MTL) to encode the task specifications with timing constraints. We then translate the MTL formulae into mixed integer linear constraints and solve the associated optimization problem using a mixed integer linear program solver. This approach is different from the automata based methods which generate a finite abstraction of the environment and dynamics, and use an automata theoretic approach to formally generate a path that satisfies the TL. We have applied our approach on several case studies in complex dynamical environments subjected to timed temporal specifications.

1604.06558 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Folding Assembly by Means of Dual-Arm Robotic Manipulation

通过双臂机械手实现折叠组装

Diogo Almeida, Yiannis Karayiannidis

AI总结 本文提出一种适用于双臂机械手的折叠组装基本操作,用于更高层次的组装策略。通过实验验证了该方法在两个部件接触时的可行性。

Comments 7 pages, accepted for ICRA 2016

详情
AI中文摘要

本文考虑将折叠组装作为一种适用于双臂机械手的组装基本操作,可整合到更高层次的组装策略中。由两个接触部件组成的系统被建模为一个具有转动-平动关节的连杆物体。为建模该系统考虑了不同的抓取场景,并提出了一种基于反馈线性化的简单控制器,利用力矩测量计算接触点运动学。折叠组装控制器已通过两个示例部件进行了实验测试,以展示折叠组装作为一种可行的组装基本操作。

英文摘要

In this paper, we consider folding assembly as an assembly primitive suitable for dual-arm robotic assembly, that can be integrated in a higher level assembly strategy. The system composed by two pieces in contact is modelled as an articulated object, connected by a prismatic-revolute joint. Different grasping scenarios were considered in order to model the system, and a simple controller based on feedback linearisation is proposed, using force torque measurements to compute the contact point kinematics. The folding assembly controller has been experimentally tested with two sample parts, in order to showcase folding assembly as a viable assembly primitive.

1604.05064 2026-06-04 cs.RO cs.SY eess.SY 版本更新

An Approximation Algorithm for a Shortest Dubins Path Problem

一种短路径问题的近似算法

Sivakumar Rathinam, Pramod Khargonekar

AI总结 本文提出了一种改进的近似算法,将短路径问题的解的质量保证从3.04降低到2.04,通过实验证明了该方法的有效性。

详情
AI中文摘要

在车辆必须遵循运动约束的情况下,寻找访问给定目标点序列的最短路径是一个重要的问题,出现在多个无人 aerial 车辆的监控和监视应用中。目前尚无算法能找到最优解,因此具有解质量保证的启发式方法很有用。当任意两个相邻目标点之间的距离至少等于车辆最小半径的两倍时,现有的最佳近似算法保证为3.04。本文提供了一种新的近似算法,将此保证提高到2.04。开发的算法还对涉及最多30个点的数百个典型实例进行了实现,以验证所提出方法的性能。

英文摘要

The problem of finding the shortest path for a vehicle visiting a given sequence of target points subject to the motion constraints of the vehicle is an important problem that arises in several monitoring and surveillance applications involving unmanned aerial vehicles. There is currently no algorithm that can find an optimal solution to this problem. Therefore, heuristics that can find approximate solutions with guarantees on the quality of the solutions are useful. The best approximation algorithm currently available for the case when the distance between any two adjacent target points in the sequence is at least equal to twice the minimum radius of the vehicle has a guarantee of 3.04. This article provides a new approximation algorithm which improves this guarantee to 2.04. The developed algorithm is also implemented for hundreds of typical instances involving at most 30 points to corroborate the performance of the proposed approach.

1604.02930 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Implementation of haptic communication in comanipulative tasks: a statistical state machine model

在协同操作任务中实现触觉通信:一种统计状态机模型

Lucas Roche, Ludovic Saint-Bauzel

AI总结 本文通过轻量化条件下的机械臂实验,探讨物理人机交互中的时间基通信机制,提出统计状态机模型并验证其与人类交互性能的接近性。

详情
AI中文摘要

本文提出了一种基于物理人机交互的统计状态机模型,用于评估轻量化条件下的人类-人类交互中的通信机制。该模型基于时间基通信假设,通过14名受试者的测试,结果显示其性能接近人类交互表现。

英文摘要

- This paper presents an experimental evaluation of physical human-human interaction in lightweight condition using a one degree of freedom robotized setup. It explores possible origins of Physical Human-Human communication, more precisely, the hypothesis of a time based communication. To explore if the communication is correlated to time a statistical state machine model based on physical Human-Human interaction is proposed. The model is tested with 14 subjects and presents results that are close to human-human performances.

1604.00975 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Combining Vision, Machine Learning and Automatic Control to Play the Labyrinth Game

将视觉、机器学习和自动控制结合以玩迷宫游戏

Kristoffer Öfjäll, Michael Felsberg

AI总结 本文提出结合确定性控制与学习控制的方法,利用视觉系统和PID控制器实现迷宫游戏的自动导航,通过学习提升控制性能。

Comments Presented at the SSBA Symposium 2012, Stockholm, Sweden

详情
AI中文摘要

迷宫游戏是一个对人类和控制算法都具有挑战性的平台。球的行为一般容易建模,但接近障碍物时存在严重非线性特性,且球滚动的非平坦表面导致动态变化。传统自动控制方法可处理一般动态,但考虑障碍物和不平表面需详细模型。本文结合简单确定性控制算法与学习控制方法,利用视觉系统估计球位置,并结合PID控制器和基于LWPR的学习控制器实现迷宫导航。

英文摘要

The labyrinth game is a simple yet challenging platform, not only for humans but also for control algorithms and systems. The game is easy to understand but still very hard to master. From a system point of view, the ball behaviour is in general easy to model but close to the obstacles there are severe non-linearities. Additionally, the far from flat surface on which the ball rolls provides for changing dynamics depending on the ball position. The general dynamics of the system can easliy be handled by traditional automatic control methods. Taking the obstacles and uneaven surface into accout would require very detailed models of the system. A simple deterministic control algorithm is combined with a learning control method. The simple control method provides initial training data. As the learning method is trained, the system can learn from the results of its own actions and the performance improves well beyond the performance of the initial controller. A vision system and image analysis is used to estimate the ball position while a combination of a PID controller and a learning controller based on LWPR is used to learn to navigate the ball through the maze.

1604.00602 2026-06-04 eess.SY cs.RO cs.SY math.DS math.OC 版本更新

Convex Computation of the Basin of Stability to Measure the Likelihood of Falling: A Case Study on the Sit-to-Stand Task

用凸优化计算稳定性盆地以衡量跌倒的可能性:站立-坐下的任务案例研究

Victor Shia, Talia Moore, Ruzena Bajcsy, Ram Vasudevan

AI总结 本文利用动力系统理论,开发出自动框架计算稳定性盆地,用于评估站立-坐下的稳定性策略。实验验证了该方法能区分不同稳定性策略。

Comments 11 pages, 9 figures

详情
AI中文摘要

现实世界中的运动涉及意外扰动,因此需要策略来维持稳定性以成功执行所需行为。确保运动系统的安全性需要定量的稳定性度量。由于确定导致失败的扰动集困难,研究人员使用各种特征作为稳定性的代理。本文利用动力系统理论的最新进展,开发出个性化、自动化的框架,计算系统能避免失败的扰动集,即稳定性盆地。该方法跟踪人类运动,合成控制输入进行分析以测量稳定性盆地。该分析的实用性在15名个体执行站立-坐下的任务上得到验证。实验表明,计算出的稳定性盆地可以区分不同稳定性策略。

英文摘要

Locomotion in the real world involves unexpected perturbations, and therefore requires strategies to maintain stability to successfully execute desired behaviours. Ensuring the safety of locomoting systems therefore necessitates a quantitative metric for stability. Due to the difficulty of determining the set of perturbations that induce failure, researchers have used a variety of features as a proxy to describe stability. This paper utilises recent advances in dynamical systems theory to develop a personalised, automated framework to compute the set of perturbations from which a system can avoid failure, which is known as the basin of stability. The approach tracks human motion to synthesise a control input that is analysed to measure the basin of stability. The utility of this analysis is verified on a Sit-to-Stand task performed by 15 individuals. The experiment illustrates that the computed basin of stability for each individual can successfully differentiate between less and more stable Sit-to-Stand strategies.

1603.08246 2026-06-04 eess.SY cs.LO cs.RO cs.SY 版本更新

Timed Automata Approach for Motion Planning Using Metric Interval Temporal Logic

基于度量区间时序逻辑的运动规划方法

Yuchen Zhou, Dipankar Maity, John S. Baras

AI总结 本文研究在给定时间约束的高层规范下机器人运动规划问题,采用度量区间时序逻辑表示任务规范,并构造时序自动机以生成可行的运动路径序列。

Comments Full Version for ECC 2016

详情
AI中文摘要

在本文中,我们考虑在给定的时间有界高层规范下机器人运动(或任务)规划问题。我们使用度量区间时序逻辑(MITL),作为时序逻辑家族的一员,来表示任务规范,然后提供一种构造时序自动机的方法,并提供在自动机上寻找接受运行的方法,以找到机器人完成任务的可行运动(或路径)序列。

英文摘要

In this paper, we consider the robot motion (or task) planning problem under some given time bounded high level specifications. We use metric interval temporal logic (MITL), a member of the temporal logic family, to represent the task specification and then we provide a constructive way to generate a timed automaton and methods to look for accepting runs on the automaton to find a feasible motion (or path) sequence for the robot to complete the task.

1505.01874 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Adaptive importance sampling for control and inference

自适应重要性采样用于控制与推断

Hilbert Johan Kappen, Hans Christian Ruiz

AI总结 本文提出了一种基于交叉熵方法的自适应重要性采样算法,用于解决非线性随机控制问题,展示了其在潜在状态模型后验分布估计中的应用。

Comments 23 pages, 4 figures

详情
AI中文摘要

路径积分(PI)控制问题是一类受限的非线性控制问题,可以形式化为费曼-卡茨路径积分并利用蒙特卡洛采样进行估计。本文回顾了有限时间 horizon 情况下的路径积分控制理论。随后,我们关注如何计算和表示控制解。在 PI 理论中,计算问题转化为重要性采样问题。高效的采样器是状态反馈控制器,其使用需要高效的表示。学习和表示有效的状态反馈控制器对于非线性随机控制问题是一个极具挑战性且大多未解决的问题。我们展示了如何利用交叉熵方法的思想来学习和表示此类控制器。我们推导出一种梯度下降方法,允许使用任意参数化来学习反馈控制器。我们将此方法称为路径积分交叉熵方法或 PICE。我们通过一些简单示例来说明此方法。路径积分控制方法可用于估计潜在状态模型的后验分布。在神经科学中,这些问题出现在使用 EM 方法从神经记录数据中估计连接性时。我们展示了路径积分控制方法作为粒子过滤的准确替代方案。

英文摘要

Path integral (PI) control problems are a restricted class of non-linear control problems that can be solved formally as a Feyman-Kac path integral and can be estimated using Monte Carlo sampling. In this contribution we review path integral control theory in the finite horizon case. We subsequently focus on the problem how to compute and represent control solutions. Within the PI theory, the question of how to compute becomes the question of importance sampling. Efficient importance samplers are state feedback controllers and the use of these requires an efficient representation. Learning and representing effective state-feedback controllers for non-linear stochastic control problems is a very challenging, and largely unsolved, problem. We show how to learn and represent such controllers using ideas from the cross entropy method. We derive a gradient descent method that allows to learn feed-back controllers using an arbitrary parametrisation. We refer to this method as the Path Integral Cross Entropy method or PICE. We illustrate this method for some simple examples. The path integral control methods can be used to estimate the posterior distribution in latent state models. In neuroscience these problems arise when estimating connectivity from neural recording data using EM. We demonstrate the path integral control method as an accurate alternative to particle filtering.

1410.7632 2026-06-04 cs.CV cs.RO cs.SY eess.SY 版本更新

On the Covariance of ICP-based Scan-matching Techniques

关于基于ICP的扫描匹配技术的协方差

Silvère Bonnabel, Martin Barczyk, François Goulette

AI总结 本文研究了ICP算法计算旋转变换协方差的问题,指出点到点版本的ICP应用会导致错误协方差,通过数学证明验证点到平面版本的正确性。

Comments Accepted at 2016 American Control Conference

详情
AI中文摘要

本文考虑了通过迭代最近点(ICP)算法计算旋转变换协方差的问题。该问题对于配备深度感应相机(如Kinect)或激光雷达(如Velodyne)的移动机器人和车辆的定位具有相关性。先前文献中提出的闭式公式通常基于ICP解是通过最小化线性最小二乘问题得到的假设。本文表明,这种做法需要谨慎,因为算法的重新匹配步骤未被显式考虑,应用于点到点版本的ICP会导致完全错误的协方差。随后,我们提供了一个形式化的数学证明,说明该方法在点到平面版本的ICP中是有效的,这验证了从业者直觉和实验结果。

英文摘要

This paper considers the problem of estimating the covariance of roto-translations computed by the Iterative Closest Point (ICP) algorithm. The problem is relevant for localization of mobile robots and vehicles equipped with depth-sensing cameras (e.g., Kinect) or Lidar (e.g., Velodyne). The closed-form formulas for covariance proposed in previous literature generally build upon the fact that the solution to ICP is obtained by minimizing a linear least-squares problem. In this paper, we show this approach needs caution because the rematching step of the algorithm is not explicitly accounted for, and applying it to the point-to-point version of ICP leads to completely erroneous covariances. We then provide a formal mathematical proof why the approach is valid in the point-to-plane version of ICP, which validates the intuition and experimental results of practitioners.

1505.07158 2026-06-04 eess.SY cs.DC cs.RO cs.SY 版本更新

Resilient and Decentralized Control of Multi-level Cooperative Mobile Networks to Maintain Connectivity under Adversarial Environment

多级协作移动网络在对抗环境下的鲁棒与去中心化控制以维持连通性

Juntao Chen, Quanyan Zhu

AI总结 本文提出基于博弈论的框架,设计去中心化算法以最大化全局网络的代数连通性,并通过案例研究验证算法的有效性和网络鲁棒性。

Comments 9 pages, 6 figures

详情
AI中文摘要

网络连通性在多级网络中不同代理间的信息交换中起着重要作用。本文建立了一个博弈论框架,以捕捉多级网络不同层次决策的非协调性质。具体而言,我们设计了一个去中心化算法,旨在迭代地最大化全局网络的代数连通性。此外,我们证明该算法渐近收敛到纳什均衡,并产生一个均衡网络。为了研究网络的鲁棒性,我们引入了三种对抗性攻击模型,并刻画了它们对网络性能的最坏影响。基于双层移动机器人网络的案例研究用于验证所提算法的有效性和鲁棒性,并展示了在恢复过程中不同层次之间的相互依赖性。

英文摘要

Network connectivity plays an important role in the information exchange between different agents in the multi-level networks. In this paper, we establish a game-theoretic framework to capture the uncoordinated nature of the decision-making at different layers of the multi-level networks. Specifically, we design a decentralized algorithm that aims to maximize the algebraic connectivity of the global network iteratively. In addition, we show that the designed algorithm converges to a Nash equilibrium asymptotically and yields an equilibrium network. To study the network resiliency, we introduce three adversarial attack models and characterize their worst-case impacts on the network performance. Case studies based on a two-layer mobile robotic network are used to corroborate the effectiveness and resiliency of the proposed algorithm and show the interdependency between different layers of the network during the recovery processes.

1603.04586 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Optimal Sensing via Multi-armed Bandit Relaxations in Mixed Observability Domains

通过混合可观测域中的多臂老虎机放松实现最优感知

Mikko Lauri, Risto Ritala

AI总结 研究在混合可观测域中不确定决策问题,通过放松约束推导最优价值函数上界,并利用多臂老虎机的可计算最优策略提升搜索空间剪枝效率,实验显示在目标监控领域有效。

Comments 6 pages, 2 figures

详情
Journal ref
Proc. IEEE Intl. Conf. on Robotics and Automation (ICRA), pp. 4807-4812, 2015
AI中文摘要

在混合可观测域中研究不确定决策问题,目标是在部分可观测随机过程中,在完全可观测内部状态约束下最大化获得的信息量。通过放松约束推导最优价值函数的上界,识别出在何种条件下放松问题可转化为多臂老虎机,其最优策略易于计算。将该上界应用于原始问题的搜索空间剪枝,并通过模拟实验评估对解质量的影响。实验结果表明,在目标监控领域有效剪枝了搜索空间。

英文摘要

Sequential decision making under uncertainty is studied in a mixed observability domain. The goal is to maximize the amount of information obtained on a partially observable stochastic process under constraints imposed by a fully observable internal state. An upper bound for the optimal value function is derived by relaxing constraints. We identify conditions under which the relaxed problem is a multi-armed bandit whose optimal policy is easily computable. The upper bound is applied to prune the search space in the original problem, and the effect on solution quality is assessed via simulation experiments. Empirical results show effective pruning of the search space in a target monitoring domain.

1510.05344 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A Topology-Guided Path Integral Approach for Stochastic Optimal Control

一种基于拓扑的路径积分方法用于随机最优控制

Jung-Su Ha, Han-Lim Choi

AI总结 本文提出一种高效的连续时间连续空间随机最优控制方法,通过拓扑运动规划算法缓解局部极小问题,生成动态可行且无碰撞轨迹。

Comments 8 pages, 4 figures, accepted to IEEE International Conference on Robotics and Automation (ICRA) 2016

详情
AI中文摘要

本文提出了一种高效的连续时间连续空间随机最优控制方法,该方法基于路径积分表示,通过采样和估计过程计算最优解。由于障碍场导致的状态空间高度非凸性,采样过程常导致局部极小,因此本文提出高效的拓扑运动规划算法来缓解这一问题。结合执行最优控制解的滚动时域方案,所提方法能生成动态可行且无碰撞的轨迹,同时减少对局部极值的担忧。通过示例数值验证了所提方法的有效性和适用性。

英文摘要

This work presents an efficient method to solve a class of continuous-time, continuous-space stochastic optimal control problems of robot motion in a cluttered environment. The method builds upon a path integral representation of the stochastic optimal control problem that allows computation of the optimal solution through sampling and estimation process. As this sampling process often leads to a local minimum especially when the state space is highly non-convex due to the obstacle field, we present an efficient method to alleviate this issue by devising a proposed topological motion planning algorithm. Combined with a receding-horizon scheme in execution of the optimal control solution, the proposed method can generate a dynamically feasible and collision-free trajectory while reducing concern about local optima. Illustrative numerical examples are presented to demonstrate the applicability and validity of the proposed approach.

1603.02650 2026-06-04 eess.SY cs.RO cs.SY 版本更新

An MILP Approach for Real-time Optimal Controller Synthesis with Metric Temporal Logic Specifications

基于混合整数线性规划的实时最优控制器综合方法:带有度量时序逻辑规范

Sayan Saha, A. Agung Julius

AI总结 本文提出一种混合整数线性规划方法,用于在动态环境中实时合成满足度量时序逻辑规范的控制器,通过优化性能指标确保系统行为正确。

Comments American Control Conference 2016 (extended version)

详情
AI中文摘要

本文的核心思想是合成反应控制器,使得系统闭环执行轨迹满足期望规范,以确保正确系统行为,同时优化期望性能标准。在我们的方法中,系统行为的正确性可以根据系统与环境的关系来定义,例如系统的输出轨迹终止在目标集而不进入不安全集。使用度量时序逻辑(MTL)规范可以进一步捕捉复杂系统行为和时间要求,例如输出轨迹必须在一定时间内经过若干路标后终止在目标集。给定一个混合逻辑动态(MLD)系统和以MTL公式或更简单的到达-避免规范表示的系统规范,我们的目标是在非确定性环境中找到满足规范的闭环轨迹。通过使用混合整数线性规划(MILP)框架,我们搜索输入信号空间以获得有效的轨迹,通过在必要时添加约束以满足MTL公式,以避免求解MILP问题的指数复杂性。我们还展示了在动态变化的环境中为移动机器人规划路径的实验结果,具有期望的任务规范。

英文摘要

The fundamental idea of this work is to synthesize reactive controllers such that closed-loop execution trajectories of the system satisfy desired specifications that ensure correct system behaviors, while optimizing a desired performance criteria. In our approach, the correctness of a system's behavior can be defined according to the system's relation to the environment, for example, the output trajectories of the system terminate in a goal set without entering an unsafe set. Using Metric Temporal Logic (MTL) specifications we can further capture complex system behaviors and timing requirements, such as the output trajectories must pass through a number of way-points within a certain time frame before terminating in the goal set. Given a Mixed Logical Dynamical (MLD) system and system specifications in terms of MTL formula or simpler reach-avoid specifications, our goal is to find a closed-loop trajectory that satisfies the specifications, in non-deterministic environments. Using an MILP framework we search over the space of input signals to obtain such valid trajectories of the system, by adding constraints to satisfy the MTL formula only when necessary, to avoid the exponential complexity of solving MILP problems. We also present experimental results for planning a path for a mobile robot through a dynamically changing environment with a desired task specification.

1603.02381 2026-06-04 cs.RO cs.SY eess.SY 版本更新

The Effect of Communication Topology on Scalar Field Estimation by Networked Robotic Swarms

网络化机器人群中通信拓扑对标量场估计的影响

Ragesh K Ramachandran, Spring Berman

AI总结 本文研究了利用网络化机器人群重构二维标量场的问题,通过链式或网格拓扑结构进行通信,采用优化方法结合梯度计算,推导了可观测性格拉姆矩阵的迹界,验证了链式和网格拓扑的估计能力与鲁棒性。

详情
AI中文摘要

本文研究了利用网络化机器人群重构二维标量场的问题,考虑机器人通信网络形成链式或网格拓扑结构。将重构问题建模为受一阶线性动力学约束的优化问题。采用基于优化的方法,利用梯度法并进行梯度分析计算。此外,推导了系统可观测性格拉姆矩阵的迹界,帮助量化和比较链式和网格网络的估计能力。基于与系统H2范数相关的性能度量进行比较,用于研究网络拓扑的鲁棒性。结果通过模拟标量场和实际海洋盐度数据进行验证。

英文摘要

This paper studies the problem of reconstructing a two-dimensional scalar field using a swarm of networked robots with local communication capabilities. We consider the communication network of the robots to form either a chain or a grid topology. We formulate the reconstruction problem as an optimization problem that is constrained by first-order linear dynamics on a large, interconnected system. To solve this problem, we employ an optimization-based scheme that uses a gradient-based method with an analytical computation of the gradient. In addition, we derive bounds on the trace of observability Gramian of the system, which helps us to quantify and compare the estimation capability of chain and grid networks. A comparison based on a performance measure related to the H2 norm of the system is also used to study robustness of the network topologies. Our resultsare validated using both simulated scalar fields and actual ocean salinity data.

1603.02038 2026-06-04 cs.RO cs.AI cs.LG cs.SY eess.SY 版本更新

Unscented Bayesian Optimization for Safe Robot Grasping

无迹贝叶斯优化用于安全机器人抓取

José Nogueira, Ruben Martinez-Cantin, Alexandre Bernardino, Lorenzo Jamone

AI总结 本文提出无迹贝叶斯优化算法,通过考虑输入噪声在安全区域寻找最优抓取策略,提升机器人抓取的安全性和效率。

Comments conference paper

详情
AI中文摘要

我们解决了在输入空间存在不确定性时的机器人抓取优化问题。通过试错探索策略实现抓取未知物体。贝叶斯优化是一种样本高效的优化算法,特别适合此设置,因为它能主动减少试验次数以学习待优化函数。事实上,这种主动对象探索策略与婴儿学习最佳抓取方式的策略相同。在学习抓取策略时,一些抓取参数配置可能对物体与机器人末端执行器之间相对姿态的误差非常敏感。我们称这些配置为不安全,因为抓取执行中的小误差可能将好的抓取变为坏的抓取。因此,为了降低抓取失败的风险,抓取应规划在安全区域。我们提出了一种新的算法,即无迹贝叶斯优化,能够在考虑输入噪声的情况下进行样本高效的优化以找到安全的极值。无迹贝叶斯优化的贡献是双方面的:一方面提供了一个新的决策过程,驱动探索到安全区域;另一方面提供了一个新的选择过程,选择在不进行额外分析或计算成本的情况下最优的抓取策略。这两个贡献都根植于无迹变换背后的强大理论,这是一种流行的非线性近似方法。我们在合成问题和现实的机器人抓取模拟中展示了其相对于经典贝叶斯优化的优势。结果表明,我们的方法在几次试验后就能获得最优且鲁棒的抓取策略,同时所选的抓取保持在安全区域。

英文摘要

We address the robot grasp optimization problem of unknown objects considering uncertainty in the input space. Grasping unknown objects can be achieved by using a trial and error exploration strategy. Bayesian optimization is a sample efficient optimization algorithm that is especially suitable for this setups as it actively reduces the number of trials for learning about the function to optimize. In fact, this active object exploration is the same strategy that infants do to learn optimal grasps. One problem that arises while learning grasping policies is that some configurations of grasp parameters may be very sensitive to error in the relative pose between the object and robot end-effector. We call these configurations unsafe because small errors during grasp execution may turn good grasps into bad grasps. Therefore, to reduce the risk of grasp failure, grasps should be planned in safe areas. We propose a new algorithm, Unscented Bayesian optimization that is able to perform sample efficient optimization while taking into consideration input noise to find safe optima. The contribution of Unscented Bayesian optimization is twofold as if provides a new decision process that drives exploration to safe regions and a new selection procedure that chooses the optimal in terms of its safety without extra analysis or computational cost. Both contributions are rooted on the strong theory behind the unscented transformation, a popular nonlinear approximation method. We show its advantages with respect to the classical Bayesian optimization both in synthetic problems and in realistic robot grasp simulations. The results highlights that our method achieves optimal and robust grasping policies after few trials while the selected grasps remain in safe regions.

1603.00955 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Decentralized State Estimation via a Hybrid of Consensus and Covariance intersection

通过一致性与协方差交叠的混合方法实现去中心化状态估计

Amirhossein Tamjidi, Suman Chakravorty, Dylan Shell

AI总结 本文提出了一种新的递归信息一致性滤波器,用于去中心化动态状态估计,通过协方差交叠处理先验一致性,利用Metropolis Hastings马尔可夫链处理新信息一致性,并在大气散射问题中验证了方法的性能。

详情
AI中文摘要

本文提出了一种新的递归信息一致性滤波器,用于去中心化动态状态估计。不假设网络拓扑结构,局部估计器仅能访问局部信息,网络无需始终连通。通过协方差交叠(CI)处理可能相关先验的一致性,通过基于Metropolis Hastings马尔可夫链的权重处理新信息一致性。建立了估计性能的界限,并证明所提方法产生无偏保守估计,优于CI。在大气散射问题中评估了所提方法的性能,并与竞争算法进行了比较。

英文摘要

This paper presents a new recursive information consensus filter for decentralized dynamic-state estimation. No structure is assumed about the topology of the network and local estimators are assumed to have access only to local information. The network need not be connected at all times. Consensus over priors which might become correlated is performed through Covariance Intersection (CI) and consensus over new information is handled using weights based on a Metropolis Hastings Markov Chains. We establish bounds for estimation performance and show that our method produces unbiased conservative estimates that are better than CI. The performance of the proposed method is evaluated and compared with competing algorithms on an atmospheric dispersion problem.

1603.00748 2026-06-04 cs.LG cs.AI cs.RO cs.SY eess.SY 版本更新

Continuous Deep Q-Learning with Model-based Acceleration

基于模型的连续深度Q学习加速

Shixiang Gu, Timothy Lillicrap, Ilya Sutskever, Sergey Levine

AI总结 本文提出连续深度Q学习算法NAF及基于模型的加速方法,用于提升连续控制任务的样本效率和学习速度。

详情
AI中文摘要

模型无关强化学习已成功应用于多种挑战性问题,并扩展到处理大规模神经网络策略和价值函数。然而,模型无关算法的样本复杂性,特别是使用高维函数近似器时,限制了其在物理系统中的应用。本文探索了减少深度强化学习样本复杂性的算法和表示方法。我们提出两种互补技术来提高此类算法的效率。首先,我们推导出Q学习的连续变种,称为归一化优势函数(NAF),作为替代更常用的策略梯度和actor-critic方法。NAF表示允许我们应用带有经验回放的Q学习来处理连续任务,并在一组模拟机器人控制任务上显著提高性能。为进一步提高我们的方法效率,我们探索了使用学习模型来加速模型无关强化学习。我们展示迭代重新拟合的局部线性模型在这一点上特别有效,并在适用此类模型的领域中展示了显著更快的学习速度。

英文摘要

Model-free reinforcement learning has been successfully applied to a range of challenging problems, and has recently been extended to handle large neural network policies and value functions. However, the sample complexity of model-free algorithms, particularly when using high-dimensional function approximators, tends to limit their applicability to physical systems. In this paper, we explore algorithms and representations to reduce the sample complexity of deep reinforcement learning for continuous control tasks. We propose two complementary techniques for improving the efficiency of such algorithms. First, we derive a continuous variant of the Q-learning algorithm, which we call normalized adantage functions (NAF), as an alternative to the more commonly used policy gradient and actor-critic methods. NAF representation allows us to apply Q-learning with experience replay to continuous tasks, and substantially improves performance on a set of simulated robotic control tasks. To further improve the efficiency of our approach, we explore the use of learned models for accelerating model-free reinforcement learning. We show that iteratively refitted local linear models are especially effective for this, and demonstrate substantially faster learning on domains where such models are applicable.

1511.04634 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Motion Planning for Global Localization in Non-Gaussian Belief Spaces

非高斯信念空间中的全局定位运动规划

Saurav Agarwal, Amirhossein Tamjidi, Suman Chakravorty

AI总结 本文提出了一种在不确定环境下进行运动规划的方法,用于处理模糊数据关联导致的多模态假设。通过递推方法逐步消除多模态信念,实现有限时间内精确定位。

Comments extends previous submission with updated figures, analysis and justifications. arXiv admin note: text overlap with arXiv:1506.01780

详情
AI中文摘要

本文提出了一种在不确定环境下进行运动规划的方法,用于处理模糊数据关联导致的多模态假设。在全局定位问题中,即所谓的“迷失或绑架机器人问题”,在缺乏先验姿态信息的情况下,定位算法应恢复移动机器人相对于全局参考系的正确姿态。我们提出了一种递推方法,用于规划按顺序消除多模态信念的动作,以在有限时间内实现精确定位,即收敛到单峰信念。实验结果展示了在人工迷宫样环境中运行的物理地面机器人。我们展示了两种运行情况,其中机器人没有关于初始姿态的先验信息,规划器的任务是定位机器人。

英文摘要

This paper presents a method for motion planning under uncertainty to deal with situations where ambiguous data associations result in a multimodal hypothesis on the robot state. In the global localization problem, sometimes referred to as the "lost or kidnapped robot problem", given little to no a priori pose information, the localization algorithm should recover the correct pose of a mobile robot with respect to a global reference frame. We present a Receding Horizon approach, to plan actions that sequentially disambiguate a multimodal belief to achieve tight localization on the correct pose in finite time, i.e., converge to a unimodal belief. Experimental results are presented using a physical ground robot operating in an artificial maze-like environment. We demonstrate two runs wherein the robot is given no a priori information about its initial pose and the planner is tasked to localize the robot.

1602.00646 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Autonomous Agent Behaviour Modelled in PRISM -- A Case Study

基于PRISM的自主代理行为建模——一个案例研究

Ruth Hoffmann, Murray Ireland, Alice Miller, Gethin Norman, Sandor Veres

AI总结 本文提出了一种抽象的自主性定义,用于建模自主场景,并通过小规模仿真模型推断定量数据,以验证无人飞行器在自主场景中的行为。

详情
AI中文摘要

由于自主系统需要被证明安全,对代表机器人行为的代理进行形式验证正成为一个重要领域。本文提出了一种抽象的自主性定义,可用于建模自主场景,并提出使用小规模仿真模型来表示抽象动作以推断定量数据。为了展示该方法的适用性,我们构建并验证了一个无人飞行器(UAV)在示例自主场景中的模型,利用该方法进行验证。

英文摘要

Formal verification of agents representing robot behaviour is a growing area due to the demand that autonomous systems have to be proven safe. In this paper we present an abstract definition of autonomy which can be used to model autonomous scenarios and propose the use of small-scale simulation models representing abstract actions to infer quantitative data. To demonstrate the applicability of the approach we build and verify a model of an unmanned aerial vehicle (UAV) in an exemplary autonomous scenario, utilising this approach.

1403.5986 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Controllability Analysis for Multirotor Helicopter Rotor Degradation and Failure

多旋翼直升机旋翼退化与失效的可控性分析

Guang-Xun Du, Quan Quan, Binxian Yang, Kai-Yuan Cai

AI总结 本文针对多旋翼系统在旋翼失效或磨损情况下的可控性分析问题,提出了一种易用的测量指标来评估控制权限,并推导出新的必要充分条件及可控性测试方法,验证了不同旋翼配置的多旋翼在故障容错能力上的差异。

Comments 21 pages, 4 figures

详情
Journal ref
AIAA Journal of Guidance, Control, and Dynamics, 2015, 38(5): 978-984
AI中文摘要

本文考虑了一类受旋翼失效或磨损影响的多旋翼系统的可控性分析问题。研究表明,传统的线性系统可控性理论不足以测试所考虑的多旋翼的可控性。为此,引入了一个易于使用的测量指标来评估可用的控制权限。基于此,推导出多旋翼的可控性的新必要充分条件。此外,提出了一种可控性测试程序。所提出的可控性测试方法应用于不同旋翼配置和不同旋翼效率参数的六旋翼类别,以展示其有效性。分析结果表明,不同旋翼配置的六旋翼具有不同的故障容错能力。因此,在采用任何故障容错控制策略之前,必须测试多旋翼的可控性。

英文摘要

This paper considers the controllability analysis problem for a class of multirotor systems subject to rotor failure/wear. It is shown that classical controllability theories of linear systems are not sufficient to test the controllability of the considered multirotors. Owing to this, an easy-to-use measurement index is introduced to assess the available control authority. Based on it, a new necessary and sufficient condition for the controllability of multirotors is derived. Furthermore, a controllability test procedure is approached. The proposed controllability test method is applied to a class of hexacopters with different rotor configurations and different rotor efficiency parameters to show its effectiveness. The analysis results show that hexacopters with different rotor configurations have different fault-tolerant capabilities. It is therefore necessary to test the controllability of the multirotors before any fault-tolerant control strategies are employed.

1310.2539 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Intrinsic filtering on Lie groups with applications to attitude estimation

在李群上的内在滤波及其在姿态估计中的应用

Axel Barrau, Silvere Bonnabel

AI总结 本文提出了一种基于概率的李群系统内在滤波方法,通过构建连续时间不变观测器理论,证明误差方程为马尔可夫链,且在噪声下分布收敛,同时提出离散时间不变扩展卡尔曼滤波及数值方法用于姿态估计。

Comments Submitted

详情
AI中文摘要

本文提出了一种概率方法,用于具有不变性质的矩阵李群系统内在滤波问题。将具有连续时间模型和离散时间测量的不变问题纳入严谨的随机和几何框架。基于连续时间不变观测器理论,证明误差方程为马尔可夫链,且不依赖于状态估计。当滤波器增益固定且在无噪声情况下具有近全球收敛性时,噪声误差的分布被证明收敛于平稳分布,为李群上滤波的数学理论提供见解。从工程角度,我们引入了离散时间不变扩展卡尔曼滤波,证明信任协方差矩阵渐近收敛,并提出一些更复杂的基于样本的计算方法来计算卡尔曼增益。这些方法应用于姿态估计,推导出该领域的新型理论结果,并通过合成数据的仿真进行说明。

英文摘要

This paper proposes a probabilistic approach to the problem of intrinsic filtering of a system on a matrix Lie group with invariance properties. The problem of an invariant continuous-time model with discrete-time measurements is cast into a rigorous stochastic and geometric framework. Building upon the theory of continuous-time invariant observers, we show that, as in the linear case, the error equation is a Markov chain that does not depend on the state estimate. Thus, when the filter's gains are held fixed, and the filter admits almost-global convergence properties with noise turned off, the noisy error's distribution is proved to converge to a stationary distribution, providing insight into the mathematical theory of filtering on Lie groups. For engineering purposes we also introduce the discrete-time Invariant Extended Kalman Filter, for which the trusted covariance matrix is shown to asymptotically converge, and some numerically more involved sample-based methods as well to compute the Kalman gains. The methods are applied to attitude estimation, allowing to derive novel theoretical results in this field, and illustrated through simulations on synthetic data.

1407.6836 2026-06-04 eess.SY cs.RO cs.SY math.PR 版本更新

A Theory of Cheap Control in Embodied Systems

具身系统中的廉价控制理论

Guido Montufar, Keyan Ghazi-Zahedi, Nihat Ay

AI总结 本文提出了一种设计具身代理廉价控制架构的框架,通过利用具身性实现更高效的通用逼近,对比传统非具身方法,展示通过条件受限玻尔兹曼机政策模型的定量案例研究,实验验证了理论的实用性。

Comments 27 pages, 10 figures

详情
AI中文摘要

我们提出了一种设计具身代理廉价控制架构的框架。我们的推导受经典通用逼近问题的启发,探索利用代理的具身性实现行为生成的新且更高效的通用逼近。该具身通用逼近与传统非具身通用逼近进行比较。为了说明我们的方法,我们展示了基于条件受限玻尔兹曼机政策模型的详细定量案例研究。与需要指数级参数的传统非具身通用逼近不同,在具身设置中,我们能够用大幅减小的模型生成所有可能的行为,从而获得廉价通用逼近。我们通过六足步行机实验测试并验证了理论。实验显示,理论预测的足够控制器复杂度是紧致的,这意味着该理论具有直接的实用意义。关键词:廉价设计,具身性,感觉运动环路,通用逼近,条件受限玻尔兹曼机

英文摘要

We present a framework for designing cheap control architectures for embodied agents. Our derivation is guided by the classical problem of universal approximation, whereby we explore the possibility of exploiting the agent's embodiment for a new and more efficient universal approximation of behaviors generated by sensorimotor control. This embodied universal approximation is compared with the classical non-embodied universal approximation. To exemplify our approach, we present a detailed quantitative case study for policy models defined in terms of conditional restricted Boltzmann machines. In contrast to non-embodied universal approximation, which requires an exponential number of parameters, in the embodied setting we are able to generate all possible behaviors with a drastically smaller model, thus obtaining cheap universal approximation. We test and corroborate the theory experimentally with a six-legged walking machine. The experiments show that the sufficient controller complexity predicted by our theory is tight, which means that the theory has direct practical implications. Keywords: cheap design, embodiment, sensorimotor loop, universal approximation, conditional restricted Boltzmann machine

1510.00771 2026-06-04 cs.CV cs.RO cs.SY eess.SY 版本更新

Design and Analysis of a Single-Camera Omnistereo Sensor for Quadrotor Micro Aerial Vehicles (MAVs)

单相机 omnistereo 传感器的设计与分析用于四旋翼微型飞行器(MAVs)

Carlos Jaramillo

AI总结 本文提出一种适用于低负载四旋翼微型飞行器的单相机 omnistereo 传感器设计,通过共轴超曲面镜实现立体视觉,分析其几何特性与3D感知性能。

Comments 49 pages, 22 figures, journal article draft

详情
Journal ref
Sensors 16 (2016) 217
AI中文摘要

我们描述了一种应用于微型飞行器(MAVs)的 omnistereo 系统的设计和3D感知性能。所提出的 omnistereo 模型采用一个单目相机,与一对双曲面镜(折叠catadioptric配置)共轴对齐。我们证明这种配置在安装在具有低负载的四旋翼MAV上进行立体视觉是可行的。理论上的单视角(SVP)约束帮助我们推导出传感器投影几何的解析解,并生成SVP兼容的全景图像,以从立体对应关系中计算3D信息(真正同步地)。我们对各种系统特性进行了广泛分析,如大小、catadioptric空间分辨率、视场。此外,我们提出了一种概率模型,用于估计从三角化中深度的不确定性,用于斜向后投影射线。我们期望通过我们的解决方案的可重复性来激励,因为它可以被适应(最优地)到其他基于catadioptric的omnistereo视觉应用。

英文摘要

We describe the design and 3D sensing performance of an omnidirectional stereo-vision system (omnistereo) as applied to Micro Aerial Vehicles (MAVs). The proposed omnistereo model employs a monocular camera that is co-axially aligned with a pair of hyperboloidal mirrors (folded catadioptric configuration). We show that this arrangement is practical for performing stereo-vision when mounted on top of propeller-based MAVs characterized by low payloads. The theoretical single viewpoint (SVP) constraint helps us derive analytical solutions for the sensor's projective geometry and generate SVP-compliant panoramic images to compute 3D information from stereo correspondences (in a truly synchronous fashion). We perform an extensive analysis on various system characteristics such as its size, catadioptric spatial resolution, field-of-view. In addition, we pose a probabilistic model for uncertainty estimation of the depth from triangulation for skew back-projection rays. We expect to motivate the reproducibility of our solution since it can be adapted (optimally) to other catadioptric-based omnistereo vision applications.

1505.06379 2026-06-04 eess.SY cs.GT cs.MA cs.RO cs.SY math.OC 版本更新

Communication-Free Distributed Coverage for Networked Systems

无通信分布式覆盖网络系统

A. Yasin Yazicioglu, Magnus Egerstedt, Jeff S. Shamma

AI总结 本文提出一种无通信算法,用于由多个具有本地感知能力的移动代理对任意网络进行分布式覆盖。通过游戏理论框架,代理利用感官输入优化位置,以最大化覆盖范围。

详情
AI中文摘要

在本文中,我们提出了一种无通信算法,用于由一组具有本地感知能力的移动代理对任意网络进行分布式覆盖。网络被表示为图,代理被任意部署在图的一些节点上。如果图中的任意节点处于至少一个代理的感知范围之内,则视为被覆盖。这些代理是移动设备,旨在探索图并通过仅依赖其感官输入来以去中心化的方式优化其位置。我们将这个问题建模为博弈论设定,并提出了一种无通信学习算法以最大化覆盖范围。

英文摘要

In this paper, we present a communication-free algorithm for distributed coverage of an arbitrary network by a group of mobile agents with local sensing capabilities. The network is represented as a graph, and the agents are arbitrarily deployed on some nodes of the graph. Any node of the graph is covered if it is within the sensing range of at least one agent. The agents are mobile devices that aim to explore the graph and to optimize their locations in a decentralized fashion by relying only on their sensory inputs. We formulate this problem in a game theoretic setting and propose a communication-free learning algorithm for maximizing the coverage.

1512.03351 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Adaptive Neural Control for Mobile Robots Autonomous Navigation

自适应神经控制用于自主导航的移动机器人

Monica Dragoicea, Ioan Dumitrache, Nicolae Constantin

AI总结 本文提出了一种自适应神经控制策略,用于非holonomic移动机器人的自主导航,通过同时学习运动学转向和速度动力学,实现稳定跟踪。

Comments in Proceedings of the 7th Int. Symposium on Automatic Control and Computer Science SACCS 2001, Iasi, Romania, CD ISBN 973-8292-11-5, 2001

详情
AI中文摘要

本文提出了一种结合策略,用于跟踪非holonomic移动机器人,该策略在系统参数和扰动条件下工作。该策略同时学习移动机器人系统的运动学转向和速度动力学。在学习控制器(基于神经网络的控制器)中,速度动力学学习控制通过学习机器人动力学的反函数来参与跟踪参考速度轨迹,而参考速度控制输入在不使用完美速度跟踪假设的情况下,用于将运动学转向系统稳定到期望的运动学系统模型。

英文摘要

This paper presents a combined strategy for tracking a non-holonomic mobile robot which works under certain operating conditions for system parameters and disturbances. The strategy includes kinematic steering and velocity dynamics learning of mobile robot system simultaneously. In the learning controller (neural network based controller) the velocity dynamics learning control takes part in tracking of the reference velocity trajectory by learning the inverse function of robot dynamics while the reference velocity control input plays a role in stabilizing the kinematic steering system to the desired reference model of kinematic system even without using the assumption of perfect velocity tracking.

1512.03345 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Mobile Robots Adaptive Control Using Neural Networks

移动机器人自适应控制的神经网络方法

Ioan Dumitrache, Monica Dragoicea

AI总结 本文提出了一种前馈控制策略,用于考虑移动机器人非线性模型及其输入输出交互,通过神经网络控制器补偿建模不确定性,实现智能控制策略。

详情
Journal ref
Proceedings of the 13th Int. Conference on Control Systems and Computer Science CSCS13, Bucuresti, Romania, pp:176-181, 2001
AI中文摘要

本文提出了一种前馈控制策略,用于移动机器人控制,该策略考虑了车辆的非线性模型及其输入输出交互。可以通过在动态模型中包含特定的模型不确定性来观察如何在考虑完整移动机器人动态模型的情况下解决控制问题。通过前馈神经网络控制器可以考虑实际的非线性数学模型。经典速度控制策略可以通过人工神经网络扩展,以补偿建模不确定性。可以开发出一种智能的移动机器人控制策略。

英文摘要

The paper proposes a feed-forward control strategy for mobile robot control that accounts for a non-linear model of the vehicle with interaction between inputs and outputs. It is possible to include specific model uncertainties in the dynamic model of the mobile robot in order to see how the control problem should be addressed taking into consideration the complete dynamic mobile robot model. By means of a neural network feed-forward controller a real non-linear mathematical model of the vehicle can be taken into consideration. The classical velocity control strategy can be extended using artificial neural networks in order to compensate for the modelling uncertainties. It is possible to develop an intelligent strategy for mobile robot control.

1504.06917 2026-06-04 cs.RO cs.SY eess.SY math.OC 版本更新

Spline Path Following for Redundant Mechanical Systems

样条路径跟随用于冗余机械系统

Rajan Gill, Dana Kulić, Christopher Nielsen

AI总结 本文提出了一种适用于由样条生成的框架曲线的路径跟随控制方法,通过求解约束二次优化问题解决冗余性,并通过实验验证了其在具有显著模型不确定性的4自由度机械臂上的有效性。

Comments Submitted to IEEE TRO (under review)

详情
Journal ref
Robotics, IEEE Transactions on (Volume:31 , Issue: 6 ) 02 December 2015
AI中文摘要

路径跟随控制器使控制系统的输出能够接近并沿预指定路径移动,无需事先时间参数化。本文提出了一种适用于冗余机械系统工作空间中由样条生成的框架曲线的路径跟随控制设计方法。可接受的路径类别包括自相交曲线。通过设计解决合适定义的约束二次优化问题来解决运动学冗余性。通过部分反馈线性化,所提出的路径跟随控制器具有明确的物理意义。该方法在具有旋转和线性执行器链接的4自由度机械臂上进行了实验验证,该机械臂具有显著的模型不确定性。

英文摘要

Path following controllers make the output of a control system approach and traverse a pre-specified path with no apriori time parametrization. In this paper we present a method for path following control design applicable to framed curves generated by splines in the workspace of kinematically redundant mechanical systems. The class of admissible paths includes self-intersecting curves. Kinematic redundancies are resolved by designing controllers that solve a suitably defined constrained quadratic optimization problem. By employing partial feedback linearization, the proposed path following controllers have a clear physical meaning. The approach is experimentally verified on a 4-degree-of-freedom (4-DOF) manipulator with a combination of revolute and linear actuated links and significant model uncertainty.

1511.05996 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Uncertainty-based Arbitration of Human-Machine Shared Control

基于不确定性的机人类共享控制仲裁

Parker Owan, Joseph Garbini, Santosh Devasia

AI总结 本文提出基于自动化不确定性的机人类共享控制仲裁方法,通过调整自主级别提升任务完成的容错能力,实验显示共享控制方法在任务完成误差容忍度上提升超5倍。

Comments 8 pages, 11 figures. Submitted to the 2016 American Control Conference

详情
AI中文摘要

制造业需要一致的生产率和任务成功率以实现可持续运行。一些制造任务需要半自动化方法,利用人类适应性和机器精度和速度的结合以降低成本。本文的主要贡献是提出一种基于自动化不确定性的新方法,用于确定机人类共享控制中的自主级别。此外,力反馈根据自主级别进行缩放,以向操作员指示机器的信心。实验结果表明,在人类-机器人铆钉孔测试平台上,共享控制方法在任务完成误差容忍度上比纯自主方法提高了超过5倍。

英文摘要

Manufacturing requires consistent production rate and task success for sustainable operation. Some manufacturing tasks require a semi-autonomous approach, exploiting the combination of human adaptability and machine precision and speed, to be cost effective. The main contribution of this paper is a new approach to determine the level of autonomy for human-machine shared control based on the automation uncertainty. Moreover, the haptic feedback is scaled by the level of autonomy to indicate machine confidence to the operator. Experimentation results, with a human-robot peg-in-a-hole testbed, show more than 5 times improvement in the error tolerance for task completion with the shared control approach when compared to a purely autonomous method.

1511.04628 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Framework for Planning and Controlling Non-Periodic Bipedal Locomotion

一种用于非周期性双足运动规划与控制的框架

Ye Zhao, Benito R. Fernandez, Luis Sentis

AI总结 本文提出了一种基于非周期性顶点状态鲁棒跟踪的双足运动规划与控制理论框架,通过混合相空间规划与控制方法实现非周期性步态生成及抗干扰能力。

Comments 33 pages, 18 figures, journal

详情
AI中文摘要

本文提出了一种基于鲁棒跟踪非周期性顶点状态的双足运动规划与控制理论框架。基于平面倒置摆模型,我们提出了一个混合相空间规划与控制框架,包含四个关键组件:(1) 一种步态转换求解器,能够在各种地形上动态跟踪非周期性顶点或关键帧状态;(2) 一种鲁棒混合自动机,有效制定规划与控制算法;(3) 一种相空间度量,用于测量到规划运动流形的距离;(4) 一种基于前度量的混合控制方法,以产生在外部干扰下的稳健动态运动。与其他运动框架相比,我们更关注非周期性步态生成和鲁棒性度量以处理干扰。这种关注使所提出的控制框架能够稳健地跟踪各种具有挑战性的地形上的非周期性顶点状态,并通过多个模拟示例进行了说明。此外,它还允许双足机器人在不连续地形上执行非周期性跳跃动作。

英文摘要

This study presents a theoretical framework for planning and controlling agile bipedal locomotion based on robustly tracking a set of non-periodic apex states. Based on the prismatic inverted pendulum model, we formulate a hybrid phase-space planning and control framework which includes the following key components: (1) a step transition solver that enables dynamically tracking non-periodic apex or keyframe states over various types of terrains, (2) a robust hybrid automaton to effectively formulate planning and control algorithms, (3) a phase-space metric to measure distance to the planned locomotion manifolds, and (4) a hybrid control method based on the previous distance metric to produce robust dynamic locomotion under external disturbances. Compared to other locomotion frameworks, we have a larger focus on non-periodic gait generation and robustness metrics to deal with disturbances. Such focus enables the proposed control framework to robustly track non-periodic apex states over various challenging terrains and under external disturbances as illustrated through several simulations. Additionally, it allows a bipedal robot to perform non-periodic bouncing maneuvers over disjointed terrains.

1510.08474 2026-06-04 eess.SY cs.LO cs.RO cs.SY 版本更新

Control with Probabilistic Signal Temporal Logic

基于概率信号时间逻辑的控制

Chanyeol Yoo, Calin Belta

AI总结 本文提出概率信号时间逻辑用于连续信念空间中的复杂任务控制,设计高效合成算法以最大化任务满足概率,并通过无人机仿真验证。

Comments 7 pages, submitted to the 2016 American Control Conference (ACC 2016) on September, 30, 2015 (under review)

详情
AI中文摘要

自主代理常在不确定环境中运作,其决策基于目标状态的信念。我们关注在信念空间上定义的复杂任务控制器合成。由于计算复杂性和现有规范语言的表达性不足,设计此类控制器具有挑战性。本文提出信号时间逻辑(STL)的概率扩展,用于表达连续信念空间上的任务。我们提出一种高效的合成算法,以找到最大化给定任务满足概率的控制输入。我们通过无人机用于监视和搜索任务的仿真验证了我们的算法。

英文摘要

Autonomous agents often operate in uncertain environments where their decisions are made based on beliefs over states of targets. We are interested in controller synthesis for complex tasks defined over belief spaces. Designing such controllers is challenging due to computational complexity and the lack of expressivity of existing specification languages. In this paper, we propose a probabilistic extension to signal temporal logic (STL) that expresses tasks over continuous belief spaces. We present an efficient synthesis algorithm to find a control input that maximises the probability of satisfying a given task. We validate our algorithm through simulations of an unmanned aerial vehicle deployed for surveillance and search missions.

1509.01149 2026-06-04 eess.SY cs.DC cs.RO cs.SY 版本更新

Model Predictive Path Integral Control using Covariance Variable Importance Sampling

基于协方差变量重要性采样的模型预测路径积分控制

Grady Williams, Andrew Aldrich, Evangelos Theodorou

AI总结 本文提出基于广义重要性采样方案的模型预测路径积分控制算法,利用GPU进行并行优化,改进了随机扩散过程的漂移和扩散项,通过仿真与差分动态规划的模型预测控制进行比较。

Comments 8 pages

详情
AI中文摘要

在本文中,我们开发了一种基于广义重要性采样方案的模型预测路径积分(MPPI)控制算法,并通过图形处理单元(GPU)进行采样实现并行优化。所提出的广义重要性采样方案允许随机扩散过程的漂移和扩散项发生变化,并在模型预测控制算法的性能中起重要作用。我们通过仿真将所提算法与模型预测控制的差分动态规划版本进行了比较。

英文摘要

In this paper we develop a Model Predictive Path Integral (MPPI) control algorithm based on a generalized importance sampling scheme and perform parallel optimization via sampling using a Graphics Processing Unit (GPU). The proposed generalized importance sampling scheme allows for changes in the drift and diffusion terms of stochastic diffusion processes and plays a significant role in the performance of the model predictive control algorithm. We compare the proposed algorithm in simulation with a model predictive control version of differential dynamic programming.

1411.2276 2026-06-04 cs.RO cs.NE cs.SY eess.SY 版本更新

Trade-Offs in Exploiting Body Morphology for Control: from Simple Bodies and Model-Based Control to Complex Bodies with Model-Free Distributed Control Schemes

在利用身体形态进行控制中权衡:从简单身体和基于模型的控制到复杂身体与基于模型的分布式控制方案

Matej Hoffmann, Vincent C. Müller

AI总结 本文探讨了在复杂身体设计中基于模型与无模型控制的权衡,分析了软体机器人自动接管控制的优缺点及模型构建的可行性。

详情
Journal ref
Helmut Hauser; Rudolf M. Füchslin & Rolf Pfeifer, ed., 'E-book on Opinions and Outlooks on Morphological Computation', 2014, pp. 185--194
AI中文摘要

为控制目的设计机器人身体的优化隐含在工程师的实践中,但缺乏系统的方法或工具,导致形态优化滞后于控制器的发展。随着柔软、变形或“软”身体的出现,其在控制中的潜力显著,有时被称为“形态计算”,即通过身体卸载控制所需计算。本文主张采用动态系统而非计算视角来审视该问题,并分析简单与复杂身体的优劣,批判性地回顾“软”身体自动接管控制任务的吸引力。同时,本文还探讨了设计空间中的关键维度——是否应使用基于模型的控制,以及在不同形态下开发忠实模型的可行性。

英文摘要

Tailoring the design of robot bodies for control purposes is implicitly performed by engineers, however, a methodology or set of tools is largely absent and optimization of morphology (shape, material properties of robot bodies, etc.) is lagging behind the development of controllers. This has become even more prominent with the advent of compliant, deformable or "soft" bodies. These carry substantial potential regarding their exploitation for control---sometimes referred to as "morphological computation" in the sense of offloading computation needed for control to the body. Here, we will argue in favor of a dynamical systems rather than computational perspective on the problem. Then, we will look at the pros and cons of simple vs. complex bodies, critically reviewing the attractive notion of "soft" bodies automatically taking over control tasks. We will address another key dimension of the design space---whether model-based control should be used and to what extent it is feasible to develop faithful models for different morphologies.

1510.07313 2026-06-04 eess.SY cs.AI cs.LO cs.RO cs.SY 版本更新

Safe Control under Uncertainty

在不确定性下的安全控制

Dorsa Sadigh, Ashish Kapoor

AI总结 本文提出Probabilistic Signal Temporal Logic(PrSTL)用于定义随机属性并确保概率保证,通过该逻辑合成安全控制器,应用于四旋翼和自动驾驶车辆等案例。

Comments 10 pages, 6 figures, Submitted to HSCC 2016

详情
AI中文摘要

本文提出Probabilistic Signal Temporal Logic(PrSTL)作为定义随机属性并确保概率保证的表达语言,通过该逻辑合成安全控制器,应用于四旋翼和自动驾驶车辆等案例。

英文摘要

Controller synthesis for hybrid systems that satisfy temporal specifications expressing various system properties is a challenging problem that has drawn the attention of many researchers. However, making the assumption that such temporal properties are deterministic is far from the reality. For example, many of the properties the controller has to satisfy are learned through machine learning techniques based on sensor input data. In this paper, we propose a new logic, Probabilistic Signal Temporal Logic (PrSTL), as an expressive language to define the stochastic properties, and enforce probabilistic guarantees on them. We further show how to synthesize safe controllers using this logic for cyber-physical systems under the assumption that the stochastic properties are based on a set of Gaussian random variables. One of the key distinguishing features of PrSTL is that the encoded logic is adaptive and changes as the system encounters additional data and updates its beliefs about the latent random variables that define the safety properties. We demonstrate our approach by synthesizing safe controllers under the PrSTL specifications for multiple case studies including control of quadrotors and autonomous vehicles in dynamic environments.

1510.06460 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Robust Satisfaction of Temporal Logic Specifications via Reinforcement Learning

通过强化学习实现时序逻辑规范的鲁棒满足

Austin Jones, Derya Aksaray, Zhaodan Kong, Mac Schwager, Calin Belta

AI总结 本文提出通过强化学习最大化满足信号时序逻辑规范的概率和鲁棒性,通过机器人导航仿真验证其在概率和鲁棒性上的优越性。

Comments 8 pages, 4 figures

详情
AI中文摘要

我们考虑将具有未知随机动态的系统引导以满足给定的丰富时序层任务,该任务以信号时序逻辑公式给出。我们将系统表示为马尔可夫决策过程,其中状态由状态空间的划分组成,转移概率未知。我们提出了能够证明收敛的强化学习算法,以最大化满足给定公式的概率,并最大化平均预期鲁棒性,即衡量公式满足程度的指标。我们通过一对机器人导航仿真案例研究证明,具有鲁棒性最大化的强化学习在满足概率和预期鲁棒性方面优于概率最大化。

英文摘要

We consider the problem of steering a system with unknown, stochastic dynamics to satisfy a rich, temporally layered task given as a signal temporal logic formula. We represent the system as a Markov decision process in which the states are built from a partition of the state space and the transition probabilities are unknown. We present provably convergent reinforcement learning algorithms to maximize the probability of satisfying a given formula and to maximize the average expected robustness, i.e., a measure of how strongly the formula is satisfied. We demonstrate via a pair of robot navigation simulation case studies that reinforcement learning with robustness maximization performs better than probability maximization in terms of both probability of satisfaction and expected robustness.

1405.7178 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Artificial Wrestling: A Dynamical Formulation of Autonomous Agents Fighting in a Coupled Inverted Pendula Framework

人工摔跤:一种自主代理在耦合倒立摆框架中的动态建模

Katsutoshi Yoshida, Shigeki Matsumoto, Yoichi Matsue

AI总结 本文提出基于耦合倒立摆框架的自主代理对抗模型,通过动态控制器存储状态对应关系并生成控制力,实验表明延迟元素和量化分辨率影响性能。

Comments The 12th International Conference on Motion and Vibration Control (MOVIC 2014), August 3-7, 2014, Sapporo, Japan. This article was selected as an article of Mechanical Engineering Journal after minor revisions; the final version is available at http://dx.doi.org/10.1299/mej.14-00518

详情
AI中文摘要

我们开发了自主代理相互对抗的系统,灵感来自人类摔跤。为此,我们提出耦合倒立摆(CIP)框架:1)两个倒立摆的顶端通过连接杆相连;2)每个摆主要通过PD控制器稳定;3)并额外配备智能控制器。基于此框架,我们动态建模智能控制器,用于存储CIP模型的初始状态到最终状态的动态对应关系,接收模型状态向量并输出脉冲控制力以产生期望的最终状态。开发了该控制器的量化和降阶设计,基于离线学习方法获得实用控制流程。随后进行数值模拟以研究智能控制器的个体性能,显示通过在智能控制器中添加延迟元件可提高性能。结果表明,性能不仅取决于学习数据的量化分辨率,还取决于延迟元件的延迟时间。最后,我们将智能控制器安装到所提框架中的两个摆上,以演示倒立摆之间的自主竞争行为。

英文摘要

We develop autonomous agents fighting with each other, inspired by human wrestling. For this purpose, we propose a coupled inverted pendula (CIP) framework in which: 1) tips of two inverted pendulums are linked by a connection rod, 2) each pendulum is primarily stabilized by a PD-controller, 3) and is additionally equipped with an intelligent controller. Based on this framework, we dynamically formulate an intelligent controller designed to store dynamical correspondence from initial states to final states of the CIP model, to receive state vectors of the model, and to output impulsive control forces to produce desired final states of the model. Developing a quantized and reduced order design of this controller, we have a practical control procedure based on an off-line learning method. We then conduct numerical simulations to investigate individual performance of the intelligent controller, showing that the performance can be improved by adding a delay element into the intelligent controller. The result shows that the performance depends not only on quantization resolutions of learning data but also on delay time of the delay element. Finally, we install the intelligent controllers into both pendulums in the proposed framework to demonstrate autonomous competitive behavior between inverted pendulums.

1505.05908 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Cooperative localization for mobile agents: a recursive decentralized algorithm based on Kalman filter decoupling

移动代理的协同定位:基于卡尔曼滤波解耦的递归分布式算法

Solmaz S. Kia, Stephen Rounds, Sonia Martinez

AI总结 本文提出一种递归分布式协同定位算法,通过解耦卡尔曼滤波实现分布式状态估计,提升移动代理的定位精度与效率。

详情
AI中文摘要

我们考虑具有通信和计算能力的移动代理的协同定位技术。首先概述了文献中不同的去中心化策略,特别关注这些算法如何维护团队成员状态估计之间的内在相关性。然后,我们提出了一种新的去中心化协同定位算法,它是集中式扩展卡尔曼滤波的分布式实现。在该算法中,每个代理传播新的中间局部变量,这些变量可用于更新阶段生成所需的传播交叉协方差项。每当网络中存在相对测量时,算法将进行该测量的代理声明为临时主代理。通过从临时地标获取信息,该代理可以计算并广播一组中间变量,每个机器人可以使用这些变量来更新其估计值,以匹配集中式扩展卡尔曼滤波器的估计结果。一旦完成更新,直到下一次相对测量之前不需要进一步通信。

英文摘要

We consider cooperative localization technique for mobile agents with communication and computation capabilities. We start by provide and overview of different decentralization strategies in the literature, with special focus on how these algorithms maintain an account of intrinsic correlations between state estimate of team members. Then, we present a novel decentralized cooperative localization algorithm that is a decentralized implementation of a centralized Extended Kalman Filter for cooperative localization. In this algorithm, instead of propagating cross-covariance terms, each agent propagates new intermediate local variables that can be used in an update stage to create the required propagated cross-covariance terms. Whenever there is a relative measurement in the network, the algorithm declares the agent making this measurement as the interim master. By acquiring information from the interim landmark, the agent the relative measurement is taken from, the interim master can calculate and broadcast a set of intermediate variables which each robot can then use to update its estimates to match that of a centralized Extended Kalman Filter for cooperative localization. Once an update is done, no further communication is needed until the next relative measurement.

1509.04826 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Inter-Robot Interactions in Multi-Robot Systems Using Braids

多机器人系统中使用编织群的机器人交互

Yancy Diaz-Mercado, Magnus Egerstedt

AI总结 本文提出利用编织群元素描述机器人交互模式,以实现安全的协作运动和传感器信息交换。

详情
AI中文摘要

本文描述了一个用于多机器人协调和运动规划的框架,重点在于代理间交互的建模。我们通过编织群元素来表征交互,以实现安全(即无碰撞)的交互模式。该方法允许执行整个交互类别的模式,结果是一个由符号输入驱动的混合系统,这些输入必须映射到实际路径上,以实现所需的交互水平并保持安全。

英文摘要

This paper describes a framework for multi-robot coordination and motion planning with emphasis on inter-agent interactions. We focus on the characterization of inter-agent interactions with sufficient level of abstraction so as to allow for the enforcement of desired interaction patterns in a provably safe (i.e., collision-free) manner, e.g., for achieving rich movement patterns in a shared space, or to exchange sensor information. We propose to specify interaction patterns through elements of the so-called braid group. This allows us to not focus on a particular pattern per se, but rather on the problem of being able to execute a whole class of patterns. The result from such a construction is a hybrid system driven by symbolic inputs that must be mapped onto actual paths that both realize the desired interaction levels and remain safe in the sense that collisions are avoided.

1509.02054 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Underwater Doppler Navigation with Self-calibration

水下多普勒导航与自校准

Xianfei Pan, Yuanxin Wu

AI总结 本文提出一种结合惯性测量单元和多普勒速度计的水下导航系统,通过分析证明在中等条件下系统可观测,且能无需外部GPS或声纳信标实现多普勒速度计参数的现场校准。

Comments To appear in Journal of Navigation

详情
AI中文摘要

精确的自主导航仍然是所有水下平台面临的重大挑战。惯性测量单元(IMU)和多普勒速度计(DVL)具有互补特性,是能够在未探索区域实现完全自主水下导航的有前途的传感器,而无需依赖额外的全球定位系统(GPS)或声纳信标。本文从可观测性角度探讨了结合IMU/DVL的导航系统。通过分析证明,在中等条件下,该组合系统是可观测的。具体而言,多普勒速度计的参数,包括标度因子和对齐角度,可以在不使用外部GPS或声纳信标的情况下现场校准。使用实际估计器的仿真结果验证了分析结论。

英文摘要

Precise autonomous navigation remains a substantial challenge to all underwater platforms. Inertial Measurement Units (IMU) and Doppler Velocity Logs (DVL) have complementary characteristics and are promising sensors that could enable fully autonomous underwater navigation in unexplored areas without relying on additional external Global Positioning System (GPS) or acoustic beacons. This paper addresses the combined IMU/DVL navigation system from the viewpoint of observability. We show by analysis that under moderate conditions the combined system is observable. Specifically, the DVL parameters, including the scale factor and misalignment angles, can be calibrated in-situ without using external GPS or acoustic beacon sensors. Simulation results using a practical estimator validate the analytic conclusions.

1508.07723 2026-06-04 eess.SY cs.RO cs.SY 版本更新

A survey on unmanned aerial vehicle collision avoidance systems

关于无人机避障系统的综述

Hung Pham, Scott A. Smolka, Scott D. Stoller, Dung Phan, Junxing Yang

AI总结 本文综述了最新无人机避障系统,分析了感知与检测、避障两部分的分类、优缺点及比较。

Comments This is only a draft

详情
AI中文摘要

碰撞避免是使无人机能够融入现实生活应用的关键因素,无论是军事还是民用领域。长期以来,有许多研究致力于解决这一问题;因此,对这些研究进行比较总结是有必要的。本文综述了截至最新出版物中开发的主要碰撞避免系统。每个碰撞避免系统包含两个主要部分:感知与检测,以及碰撞避免。基于其特性,每个部分被划分为不同的类别;本文对这些类别进行了解释、比较和讨论,阐述了各自的优势与不足。

英文摘要

Collision avoidance is a key factor in enabling the integration of unmanned aerial vehicle into real life use, whether it is in military or civil application. For a long time there have been a large number of works to address this problem; therefore a comparative summary of them would be desirable. This paper presents a survey on the major collision avoidance systems developed in up to date publications. Each collision avoidance system contains two main parts: sensing and detection, and collision avoidance. Based on their characteristics each part is divided into different categories; and those categories are explained, compared and discussed about advantages and disadvantages in this paper.

1508.05514 2026-06-04 stat.ML cs.CV cs.LG cs.RO cs.SY eess.SY 版本更新

Gaussian Mixture Reduction Using Reverse Kullback-Leibler Divergence

基于反向Kullback-Leibler散度的高斯混合减少

Tohid Ardeshiri, Umut Orguner, Emre Özkan

AI总结 本文提出一种贪心混合减少算法,基于Kullback-Leibler散度进行混合成分的剪枝与合并,通过分析近似方法提高计算效率,并在模拟和实际数据中验证其性能优于现有方法。

详情
AI中文摘要

我们提出了一种贪心的混合减少算法,能够基于Kullback-Leibler散度(KLD)剪枝和合并混合成分。该算法不同于已知的Runnalls基于KLD的方法,因为它不限于合并操作。剪枝能力(除合并外)使算法在减少过程中能够保留原始混合的峰值。通过分析近似方法来避免KLD的计算不可行性,从而得到一个计算高效的算法。所提出的算法在两个数值示例中与Runnalls和Williams的方法进行比较,使用模拟和实际数据。结果表明,所提出的方法在性能和计算复杂度方面使其成为现有混合减少方法的高效替代方案。

英文摘要

We propose a greedy mixture reduction algorithm which is capable of pruning mixture components as well as merging them based on the Kullback-Leibler divergence (KLD). The algorithm is distinct from the well-known Runnalls' KLD based method since it is not restricted to merging operations. The capability of pruning (in addition to merging) gives the algorithm the ability of preserving the peaks of the original mixture during the reduction. Analytical approximations are derived to circumvent the computational intractability of the KLD which results in a computationally efficient method. The proposed algorithm is compared with Runnalls' and Williams' methods in two numerical examples, using both simulated and real world data. The results indicate that the performance and computational complexity of the proposed approach make it an efficient alternative to existing mixture reduction methods.

1312.7602 2026-06-04 eess.SY cs.RO cs.SY math.DS math.PR 版本更新

A Martingale Approach and Time-Consistent Sampling-based Algorithms for Risk Management in Stochastic Optimal Control

一个基于鞅和时间一致采样算法的风险管理在随机优化控制中的方法

Vu Anh Huynh, Leonid Kogan, Emilio Frazzoli

AI总结 本文提出基于鞅和时间一致采样算法的方法,用于解决具有风险约束的随机优化控制问题,通过将风险约束转化为鞅来构建时间一致的控制策略,并在扩展的状态空间中采样以近似最优反馈策略。

详情
AI中文摘要

本文考虑了一类具有风险约束的随机优化控制问题,其中风险约束以特定初始状态的失败概率有界形式表达。我们提出了一种鞅方法,将风险约束扩散为鞅以构建时间一致的控制策略。鞅代表了时间上的风险容忍水平。通过将系统动态与受控的鞅相结合,原始的风险约束问题被转化为随机目标问题。我们扩展了增量马尔可夫决策过程(iMDP)算法,通过在扩展的状态空间中采样并计算适当的边界条件来近似原始问题的最优反馈策略。我们证明该算法在概率上是稳健的且渐近最优。所提出算法的性能在具有受限碰撞概率的不确定杂乱环境中的运动规划和控制问题中得到了验证。

英文摘要

In this paper, we consider a class of stochastic optimal control problems with risk constraints that are expressed as bounded probabilities of failure for particular initial states. We present here a martingale approach that diffuses a risk constraint into a martingale to construct time-consistent control policies. The martingale stands for the level of risk tolerance over time. By augmenting the system dynamics with the controlled martingale, the original risk-constrained problem is transformed into a stochastic target problem. We extend the incremental Markov Decision Process (iMDP) algorithm to approximate arbitrarily well an optimal feedback policy of the original problem by sampling in the augmented state space and computing proper boundary conditions for the reformulated problem. We show that the algorithm is both probabilistically sound and asymptotically optimal. The performance of the proposed algorithm is demonstrated on motion planning and control problems subject to bounded probability of collision in uncertain cluttered environments.

1507.00525 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A Reactive Robotized Interface for Lower Limb Rehabilitation: Clinical Results

一种用于下肢康复的反应式机器人接口:临床结果

Ludovic Saint-Bauzel, Viviane Pasqui, Isabelle Monteil

AI总结 本文展示了MONIMAD在治疗小脑疾病患者下肢康复中的临床效果,通过分析坐姿到站立的姿势,利用模糊控制调整机器人接口运动以提高稳定性。

详情
Journal ref
IEEE Transactions on Robotics, Institute of Electrical and Electronics Engineers (IEEE), 2009, pp.583 - 592
AI中文摘要

本文介绍了MONIMAD在下肢康复中的临床结果,该接口用于帮助小脑疾病患者康复。首先解决坐姿到站立动作的姿势分析,通过健康受试者实验确定了坐起动作分为预加速、加速、起始上升和上升几个阶段。通过分析压力中心、地面力和手柄的水平力分量,得出识别患者稳定性并调整机器人接口运动以适应人类自愿运动的规则。这些规则用于模糊控制器的实现,并在贝尔安医院对患者进行实验验证。

英文摘要

-This article presents clinical results from the use of MONIMAD, a reactive robotized interface for lower limb Rehabilitation of patients suffering from cerebellar disease. The first problem to be addressed is the postural analysis of sit-to-stand motion. Experiments with healthy subjects were performed for this purpose. Analysis of external forces shows that sit-to-stand transfer can be subdivided into several phases: preaccel-eration, acceleration, start rising, rising. Observation of Center of Pressure, ground forces and horizontal components force on handles yields rules to identify the stability of the patient and to adjust the robotic interface motion to the human voluntary movement. These rules are used in a fuzzy-based controller implementation. The controller is validated on experiments with diseased patients in Bellan Hospital.

1502.02251 2026-06-04 stat.ML cs.LG cs.RO cs.SY eess.SY 版本更新

From Pixels to Torques: Policy Learning with Deep Dynamical Models

从像素到扭矩:基于深度动态模型的策略学习

Niklas Wahlström, Thomas B. Schön, Marc Peter Deisenroth

AI总结 本文提出一种高效的数据驱动强化学习算法,通过深度动态模型直接从像素信息学习闭环控制策略,解决高维观测下的连续状态-动作空间数据高效学习问题。

Comments 9 pages

详情
AI中文摘要

在开发完全自主系统中,利用非常高的维数观测进行数据高效学习连续状态-动作空间仍是一个关键挑战。本文考虑这一挑战的一个实例,即像素到扭矩问题,其中智能体必须仅从像素信息学习闭环控制策略。我们引入了一种数据高效、基于模型的强化学习算法,该算法直接从像素信息学习此类闭环策略。关键成分是深度动态模型,该模型使用深度自编码器学习图像的低维嵌入,并在该低维特征空间中学习预测模型。联合学习确保不仅静态属性,而且动态属性都被考虑在内。这对于长期预测至关重要,而长期预测是适应性模型预测控制策略的核心。与最先进的连续状态和动作强化学习方法相比,我们的方法学习速度快,可扩展到高维状态空间,并是向完全自主学习从像素到扭矩的重要一步。

英文摘要

Data-efficient learning in continuous state-action spaces using very high-dimensional observations remains a key challenge in developing fully autonomous systems. In this paper, we consider one instance of this challenge, the pixels to torques problem, where an agent must learn a closed-loop control policy from pixel information only. We introduce a data-efficient, model-based reinforcement learning algorithm that learns such a closed-loop policy directly from pixel information. The key ingredient is a deep dynamical model that uses deep auto-encoders to learn a low-dimensional embedding of images jointly with a predictive model in this low-dimensional feature space. Joint learning ensures that not only static but also dynamic properties of the data are accounted for. This is crucial for long-term predictions, which lie at the core of the adaptive model predictive control strategy that we use for closed-loop control. Compared to state-of-the-art reinforcement learning methods for continuous states and actions, our approach learns quickly, scales to high-dimensional state spaces and is an important step toward fully autonomous learning from pixels to torques.

1401.6904 2026-06-04 cs.RO cs.SY eess.SY math.OC 版本更新

Adaptive Visual Tracking for Robotic Systems Without Image-Space Velocity Measurement

无图像空间速度测量的机器人系统自适应视觉跟踪

Hanlei Wang

AI总结 本文研究了无图像空间速度测量的机器人视觉跟踪问题,提出了一种新的图像空间观测器,通过未知动力学中的图像空间速度信息设计自适应控制器,解决了多不确定性分离问题,避免了过参数化。

Comments 21 pages, 3 figures, revised for making improvements based on the reviewers' and AE's comments from Automatica and for adding the journal reference

详情
Journal ref
Automatica, 55: 294-301, May 2015
AI中文摘要

本文研究了机器人系统在无图像空间速度测量情况下的视觉跟踪问题,同时考虑了相机模型和机械臂动力学的不确定性。我们提出了一种新的图像空间观测器,利用未知动力学中包含的图像空间速度信息,设计了不使用图像空间速度信号的自适应控制器,其中深度速率无关的动力学参数和深度参数的适应由图像空间跟踪误差和观测误差驱动。所提观测器-自适应控制器的主要优势在于其简单性和在视觉伺服机器人系统中处理多个不确定性的分离,从而避免了现有工作的过参数化问题。通过李雅普诺夫分析,我们证明了图像空间跟踪误差渐近收敛到零。所提自适应控制方案的性能通过数值模拟进行了验证。

英文摘要

In this paper, we investigate the visual tracking problem for robotic systems without image-space velocity measurement, simultaneously taking into account the uncertainties of the camera model and the manipulator kinematics and dynamics. We propose a new image-space observer that exploits the image-space velocity information contained in the unknown kinematics, upon which, we design an adaptive controller without using the image-space velocity signal where the adaptations of the depth-rate-independent kinematic parameter and depth parameter are driven by both the image-space tracking errors and observation errors. The major superiority of the proposed observer-based adaptive controller lies in its simplicity and the separation of the handling of multiple uncertainties in visually servoed robotic systems, thus avoiding the overparametrization problem of the existing work. Using Lyapunov analysis, we demonstrate that the image-space tracking errors converge to zero asymptotically. The performance of the proposed adaptive control scheme is illustrated by a numerical simulation.

1506.03771 2026-06-04 math.NA cs.NA cs.RO 版本更新

Fast Methods for Eikonal Equations: an Experimental Survey

快速求解eikonal方程的方法:实验调查

Javier V. Gomez, David Alvarez, Santiago Garrido, Luis Moreno

AI总结 本文通过实验对比了9种改进标准快速推进法的算法,重点研究单线程方法和各向同性环境下的计算效率提升。

详情
AI中文摘要

快速推进法是一种计算到达时间地图(以时间单位测量距离地图)非常流行的算法。自1995年提出以来,它已被应用于许多不同领域,如机器人学、医学计算机视觉、流体模拟等。许多替代方法被提出,主要目标是减少计算时间并提高精度。本文收集了主要改进标准快速推进法计算时间的方法,重点研究单线程方法和各向同性环境。9种不同方法在共同的数学框架下进行实验,在代表性环境中进行研究:具有二叉堆的快速推进法、具有斐波那契堆的快速推进法、简化快速推进法、Untidy快速推进法、快速迭代法、群体推进法、快速扫描法、锁定扫描法和双动态队列法。

英文摘要

The Fast Marching Method is a very popular algorithm to compute times-of-arrival maps (distances map measured in time units). Since their proposal in 1995, it has been applied to many different applications such as robotics, medical computer vision, fluid simulation, etc. Many alternatives have been proposed with two main objectives: to reduce its computational time and to improve its accuracy. In this paper, we collect the main approaches which improve the computational time of the standard Fast Marching Method, focusing on single-threaded methods and isotropic environments. 9 different methods are studied under a common mathematical framework and experimentally in representative environments: Fast Marching Method with binary heap, Fast Marching Method with Fibonacci Heap, Simplified Fast Marching Method, Untidy Fast Marching Method, Fast Iterative Method, Group Marching Method, Fast Sweeping Method, Lock Sweeping Method and Double Dynamic Queue Method.

1506.02312 2026-06-04 cs.AI cs.LG cs.RO cs.SY eess.SY 版本更新

A Framework for Constrained and Adaptive Behavior-Based Agents

一种用于约束和自适应行为基 agent 的框架

Renato de Pontes Pereira, Paulo Martins Engel

AI总结 本文提出一种框架,通过强化学习节点整合到行为树中,解决约束 agent 的学习能力问题,并展示其与分层强化学习选项的关系,确保嵌套学习节点的收敛性。

Comments 2015; 15 pages

详情
AI中文摘要

行为树常用于建模机器人和游戏中的 agent,其中必须由人类专家设计受约束的行为以确保 agent 在特定感知下执行特定动作链。在这些应用领域,学习是可取的,因为它能为 agent 提供适应和改进与人类和环境交互的能力,但往往被丢弃,因为其不可靠。本文提出一个框架,将强化学习节点作为行为树的一部分,以解决在受约束 agent 中添加学习能力的问题。我们展示了该框架与分层强化学习中选项的关系,确保嵌套学习节点的收敛性,并通过实验证明学习节点不会影响树中其他节点的执行。

英文摘要

Behavior Trees are commonly used to model agents for robotics and games, where constrained behaviors must be designed by human experts in order to guarantee that these agents will execute a specific chain of actions given a specific set of perceptions. In such application areas, learning is a desirable feature to provide agents with the ability to adapt and improve interactions with humans and environment, but often discarded due to its unreliability. In this paper, we propose a framework that uses Reinforcement Learning nodes as part of Behavior Trees to address the problem of adding learning capabilities in constrained agents. We show how this framework relates to Options in Hierarchical Reinforcement Learning, ensuring convergence of nested learning nodes, and we empirically show that the learning nodes do not affect the execution of other nodes in the tree.

1309.0535 2026-06-04 eess.SY cs.MA cs.RO cs.SY math.OC 版本更新

Decentralized Rigidity Maintenance Control with Range Measurements for Multi-Robot Systems

多机器人系统中基于距离测量的去中心化刚性维持控制

Daniel Zelazo, Antonio Franchi, Heinrich H. Bülthoff, Paolo Robuffo Giordano

AI总结 本文提出了一种基于距离测量的去中心化策略,用于维持多机器人系统的形成刚性,同时允许图拓扑自由变化。通过扩展刚性理论到加权框架并引入刚性特征值,设计了分布式算法以估计相对位置参考框架,并用于生成维持刚性并满足碰撞避免等约束的局部控制作用。

Comments Preprint submitted to The International Journal of Robotics Research

详情
AI中文摘要

本文提出了一种完全去中心化的策略,用于利用仅有的距离测量来维持多机器人系统的形成刚性,同时允许图拓扑随时间自由变化。首先,本文扩展了刚性理论到加权框架并引入了刚性特征值,当该值为正时确保框架的无穷小刚性。随后,提出了一种分布式算法,用于估计一组机器人之间的共同相对位置参考框架,该算法结合了仅有的距离测量以及一个具备测量两个其他机器人方位能力的代理。这一估计步骤被嵌入到随后的分布式算法中,用于估计加权框架的刚性特征值。最终,利用刚性特征值的估计值为每个代理生成局部控制作用,以维持刚性属性并施加额外约束,如碰撞避免和感知/通信范围限制及遮挡。此外,本文方法还允许机器人之间的通信和感知链接随时间自由变化,同时保持整个框架的刚性。所提出的方案通过由6个四旋翼无人机组成的机器人测试平台在复杂环境中进行了实验验证。

英文摘要

This work proposes a fully decentralized strategy for maintaining the formation rigidity of a multi-robot system using only range measurements, while still allowing the graph topology to change freely over time. In this direction, a first contribution of this work is an extension of rigidity theory to weighted frameworks and the rigidity eigenvalue, which when positive ensures the infinitesimal rigidity of the framework. We then propose a distributed algorithm for estimating a common relative position reference frame amongst a team of robots with only range measurements in addition to one agent endowed with the capability of measuring the bearing to two other agents. This first estimation step is embedded into a subsequent distributed algorithm for estimating the rigidity eigenvalue associated with the weighted framework. The estimate of the rigidity eigenvalue is finally used to generate a local control action for each agent that both maintains the rigidity property and enforces additional con- straints such as collision avoidance and sensing/communication range limits and occlusions. As an additional feature of our approach, the communication and sensing links among the robots are also left free to change over time while preserving rigidity of the whole framework. The proposed scheme is then experimentally validated with a robotic testbed consisting of 6 quadrotor UAVs operating in a cluttered environment.

1506.00547 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Differential Geometric SLAM

微分几何SLAM

David Evan Zlotnik, James Richard Forbes

AI总结 本文提出基于微分几何的SLAM算法,通过数学方法实现状态和地图估计,证明在无噪声情况下渐进稳定,仿真验证其鲁棒性。

详情
AI中文摘要

本文提出基于微分几何的SLAM算法,通过数学方法实现状态和地图估计,证明在无噪声情况下渐进稳定,仿真验证其鲁棒性。

英文摘要

The simultaneous localization and mapping (SLAM) problem is considered in three dimensions. The proposed algorithm, differential geometric SLAM (DG-SLAM), employs methods from differential geometry to propagate the state and map estimates. Unlike EKF SLAM, the proposed filter is provably asymptotically stable under the assumption of no measurement noise or biases. The robustness of the DG-SLAM algorithm is assessed in simulation with measurement noise. The simulation demonstrates successful localization and mapping.

1412.6164 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Formation of Multiple Groups of Mobile Robots Using Sliding Mode Control

利用滑模控制形成多个移动机器人类群

Soumic Sarkar, Indra Narayan Kar

AI总结 本文提出基于质心变换的滑模控制方法,用于多组移动机器人形成不同几何形状,通过设计子系统控制器并采用奇异摄动理论实现有限时间滑动表面控制,同时利用势函数梯度确保碰撞避免。

Comments 8 pages, 5 figures

详情
AI中文摘要

多组代理的形成控制在大范围导航和搬运大物体中具有应用。本文应用基于质心的变换(CBT)将轮式移动机器人(WMRs)的联合动力学分解为三个子系统:组内和组间形状动力学以及质心动力学。为每个子系统设计了独立控制器,控制器增益被选择使得整体系统成为奇异摄动系统。然后在奇异摄动系统上设计滑模控制器,以在有限时间内将子系统驱动到滑动表面上。为确保机器人之间的碰撞避免,在滑动面上添加了基于势函数梯度的负梯度。通过仿真结果验证了所提控制器的有效性。

英文摘要

Formation control of multiple groups of agents finds application in large area navigation by generating different geometric patterns and shapes, and also in carrying large objects. In this paper, Centroid Based Transformation (CBT) \cite{c39}, has been applied to decompose the combined dynamics of wheeled mobile robots (WMRs) into three subsystems: intra and inter group shape dynamics, and the dynamics of the centroid. Separate controllers have been designed for each subsystem. The gains of the controllers are such chosen that the overall system becomes singularly perturbed system. Then sliding mode controllers are designed on the singularly perturbed system to drive the subsystems on sliding surfaces in finite time. Negative gradient of a potential based function has been added to the sliding surface to ensure collision avoidance among the robots in finite time. The efficacy of the proposed controller is established through simulation results.

1504.06002 2026-06-04 math.OC cs.RO cs.SY eess.SY math.DS 版本更新

Some Applications of Polynomial Optimization in Operations Research and Real-Time Decision Making

多项式优化在运筹学和实时决策中的某些应用

Amir Ali Ahmadi, Anirudha Majumdar

AI总结 本文探讨了多项式优化在无线覆盖、无人机避障和四旋翼飞行控制中的应用,采用SOS和SDSOS优化方法解决相关问题。

详情
AI中文摘要

我们展示了将代数技术用于优化和认证多项式不等式的应用,以解决运筹学和交通运输工程领域的问题。三个问题被考虑:(i) 通过保证信号质量和最小传输功率覆盖目标地理区域;(ii) 计算简单无人机在复杂环境中的实时碰撞避免证书;(iii) 设计四旋翼无人机的非线性悬停控制器。在小型应用中,我们应用了求和平方(SOS)松弛并使用半正定规划求解。在大型或实时应用中,我们使用了最近引入的

英文摘要

We demonstrate applications of algebraic techniques that optimize and certify polynomial inequalities to problems of interest in the operations research and transportation engineering communities. Three problems are considered: (i) wireless coverage of targeted geographical regions with guaranteed signal quality and minimum transmission power, (ii) computing real-time certificates of collision avoidance for a simple model of an unmanned vehicle (UV) navigating through a cluttered environment, and (iii) designing a nonlinear hovering controller for a quadrotor UV, which has recently been used for load transportation. On our smaller-scale applications, we apply the sum of squares (SOS) relaxation and solve the underlying problems with semidefinite programming. On the larger-scale or real-time applications, we use our recently introduced "SDSOS Optimization" techniques which result in second order cone programs. To the best of our knowledge, this is the first study of real-time applications of sum of squares techniques in optimization and control. No knowledge in dynamics and control is assumed from the reader.

1504.05723 2026-06-04 stat.CO cs.RO cs.SY eess.SY q-fin.CP 版本更新

Noise Robust Online Inference for Linear Dynamic Systems

针对线性动态系统的噪声鲁棒在线推断

Saikat Saha

AI总结 本文提出一种新的噪声自适应 Rao-Blackwellized 粒子滤波器,通过分层高斯模型近似非高斯噪声密度,以提高鲁棒性和适应性,同时保持可扩展性和易实现性。

详情
AI中文摘要

我们重新审视了在非高斯环境下线性动态系统(LDS)的贝叶斯在线推断问题。噪声可以自然非高斯(偏斜和/或重尾)或为了处理虚假观测,噪声可以建模为重尾分布。然而,这种噪声鲁棒性可能在没有虚假观测时导致性能下降。因此,任何推断引擎不仅要鲁棒于噪声异常,还应适应潜在未知且随时间变化的噪声参数;同时应具有可扩展性和易实现性。为了解决这些问题,本文提出了一种新的噪声自适应 Rao-Blackwellized 粒子滤波器(RBPF),通过分层高斯模型作为任何非高斯(过程或测量)噪声密度的代理。这导致了可处理的条件线性高斯模型(CLGM)。然而,该框架需要一个有效的转移核作为粒子滤波(PF)的目标状态。这通常是未知的。我们概述了如何通过辅助潜在变量方法构建这样的核,至少对于包含许多常见非高斯噪声的某些类别。通过数值研究验证了该 RBPF 算法的有效性。

英文摘要

We revisit the Bayesian online inference problems for the linear dynamic systems (LDS) under non- Gaussian environment. The noises can naturally be non-Gaussian (skewed and/or heavy tailed) or to accommodate spurious observations, noises can be modeled as heavy tailed. However, at the cost of such noise robustness, the performance may degrade when such spurious observations are absent. Therefore, any inference engine should not only be robust to noise outlier, but also be adaptive to potentially unknown and time varying noise parameters; yet it should be scalable and easy to implement. To address them, we envisage here a new noise adaptive Rao-Blackwellized particle filter (RBPF), by leveraging a hierarchically Gaussian model as a proxy for any non-Gaussian (process or measurement) noise density. This leads to a conditionally linear Gaussian model (CLGM), that is tractable. However, this framework requires a valid transition kernel for the intractable state, targeted by the particle filter (PF). This is typically unknown. We outline how such kernel can be constructed provably, at least for certain classes encompassing many commonly occurring non-Gaussian noises, using auxiliary latent variable approach. The efficacy of this RBPF algorithm is demonstrated through numerical studies.

1503.07889 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Design and Implementation of an Inertial Navigation System for Pedestrians Based on a Low-Cost MEMS IMU

基于低成本MEMS IMU的行人惯性导航系统设计与实现

Francesco Montorsi, Fabrizio Pancaldi, Giorgio M. Vitetta

AI总结 本文提出一种低成本MEMS IMU的行人惯性导航系统,通过改进的卡尔曼滤波和步态检测算法,有效抑制位置漂移,提升导航精度。

详情
AI中文摘要

行人惯性导航系统无需基础设施,可在短/中期实现亚米级精度。然而,使用低成本IMU时,真实位置与估计位置之间会出现逐渐增长的漂移。本文提出一种新型解决方案:仅使用加速度计和陀螺仪数据(不需磁力计);将传感器误差模型参数纳入扩展卡尔曼滤波的状态向量;采用新的软启发式方法进行步态检测和零速更新。实验结果表明,所提出的仅惯性导航系统在性能上与相关研究中的行人死 reckoning系统相当或更优,尽管所用IMU的精度低于更昂贵的替代品。

英文摘要

Inertial navigation systems for pedestrians are infrastructure-less and can achieve sub-meter accuracy in the short/medium period. However, when low-cost inertial measurement units (IMU) are employed for their implementation, they suffer from a slowly growing drift between the true pedestrian position and the corresponding estimated position. In this paper we illustrate a novel solution to mitigate such a drift by: a) using only accelerometer and gyroscope measurements (no magnetometers required); b) including the sensor error model parameters in the state vector of an extended Kalman filter; c) adopting a novel soft heuristic for foot stance detection and for zero-velocity updates. Experimental results evidence that our inertial-only navigation system can achieve similar or better performance with respect to pedestrian dead-reckoning systems presented in related studies, although the adopted IMU is less accurate than more expensive counterparts.

1503.01407 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Invariant EKF Design for Scan Matching-aided Localization

用于扫描匹配辅助定位的不变扩展卡尔曼滤波设计

Martin Barczyk, Silvère Bonnabel, Jean-Emmanuel Deschaud, François Goulette

AI总结 本文提出基于IEKF和MEKF的室内定位方法,通过融合运动传感器数据与扫描匹配点云实现机器人位姿估计,实验验证了IEKF的优越性。

详情
AI中文摘要

室内定位技术通过融合车载运动传感器数据与环境读数来估计机器人的位姿,本文在低成本Kinect深度相机捕获的扫描匹配点云情况下,提出基于不变扩展卡尔曼滤波(IEKF)和乘法扩展卡尔曼滤波(MEKF)的解决方案。两种设计在实验中成功验证,证明了IEKF设计的优势。

英文摘要

Localization in indoor environments is a technique which estimates the robot's pose by fusing data from onboard motion sensors with readings of the environment, in our case obtained by scan matching point clouds captured by a low-cost Kinect depth camera. We develop both an Invariant Extended Kalman Filter (IEKF)-based and a Multiplicative Extended Kalman Filter (MEKF)-based solution to this problem. The two designs are successfully validated in experiments and demonstrate the advantage of the IEKF design.

1502.07424 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Mechanical Design, Modelling and Control of a Novel Aerial Manipulator

新型空中机械臂的机械设计、建模与控制

Alexandros Nikou, Georgios C. Gavridis, Kostas J. Kyriakopoulos

AI总结 本文提出了一种新型空中机械臂系统,通过技术优化问题确定机械结构、推力器数量及几何布局,设计了自适应反步控制器以实现末端执行器在笛卡尔空间中的精确位置和姿态控制,并通过仿真验证了系统性能。

Comments Comments: 8 Pages, 2015 IEEE International Conference on Robotics and Automation (ICRA '15), Seattle, WA, USA

详情
AI中文摘要

本文提出了一种新型空中机械臂系统。系统的机械结构、推力器数量及其几何布局将通过技术优化问题来确定。上述问题的定义考虑了施加在系统末端执行器上的期望力和力矩。所提出系统的框架在CAD软件中设计,以评估系统参数值。随后开发了运动学和动力学模型,并设计了自适应反步控制器,以控制末端执行器在笛卡尔空间中的精确位置和姿态。最后,通过仿真研究验证了系统性能,其中研究了一个机械操作任务场景。

英文摘要

In this paper a novel aerial manipulation system is proposed. The mechanical structure of the system, the number of thrusters and their geometry will be derived from technical optimization problems. The aforementioned problems are defined by taking into consideration the desired actuation forces and torques applied to the end-effector of the system. The framework of the proposed system is designed in a CAD Package in order to evaluate the system parameter values. Following this, the kinematic and dynamic models are developed and an adaptive backstepping controller is designed aiming to control the exact position and orientation of the end-effector in the Cartesian space. Finally, the performance of the system is demonstrated through a simulation study, where a manipulation task scenario is investigated.

1403.1202 2026-06-04 cond-mat.stat-mech cs.RO cs.SY eess.SY physics.bio-ph q-bio.PE 版本更新

Flocking and turning: a new model for self-organized collective motion

编队与转向:一种新的自组织集体运动模型

Andrea Cavagna, Lorenzo Del Castello, Irene Giardina, Tomas Grigera, Asja Jelic, Stefania Melillo, Thierry Mora, Leonardo Parisi, Edmondo Silvestri, Massimiliano Viale, Aleksandra M. Walczak

AI总结 本文提出新的动态方程,用于描述极化动物群体的集体运动,考虑了相关转向,仅通过社会力。该模型在适当极限下恢复Vicsek模型,但能描述更复杂的转向行为。

Comments Accepted for the Special Issue of the Journal of Statistical Physics: Collective Behavior in Biological Systems, 17 pages, 4 figures, 3 videos

详情
Journal ref
J.Stat.Phys. 158 (2015) 601-627
AI中文摘要

鸟群中的个体以相关方式移动,导致速度极化。对于线性运动,已有良好理解。然而,实际观察中,群体质心常转向,导致更复杂的动态,但仍保持强极化。本文提出新的动态方程,用于极化动物群体的集体运动,考虑相关转向,仅通过社会力。利用旋转对称性和守恒定律,用类似于旋转哈密顿形式的广义坐标来描述速度方向。显式推导该形式与个体速度动态的对应关系,从而获得新的集体运动模型。在适当的阻尼极限下恢复已知的Vicsek模型,该模型耗散旋转信息,不允许可极化的转向。尽管新模型在描述转向群体时最为成功,其动态在广泛动态范围内本质上不同于之前的模型,而在非常大的长度尺度上减少到Toner和Tu的流体动力学描述。因此,推导出的框架是通用的,可能描述任何强极化活性物质系统的集体运动。

英文摘要

Birds in a flock move in a correlated way, resulting in large polarization of velocities. A good understanding of this collective behavior exists for linear motion of the flock. Yet observing actual birds, the center of mass of the group often turns giving rise to more complicated dynamics, still keeping strong polarization of the flock. Here we propose novel dynamical equations for the collective motion of polarized animal groups that account for correlated turning including solely social forces. We exploit rotational symmetries and conservation laws of the problem to formulate a theory in terms of generalized coordinates of motion for the velocity directions akin to a Hamiltonian formulation for rotations. We explicitly derive the correspondence between this formulation and the dynamics of the individual velocities, thus obtaining a new model of collective motion. In the appropriate overdamped limit we recover the well-known Vicsek model, which dissipates rotational information and does not allow for polarized turns. Although the new model has its most vivid success in describing turning groups, its dynamics is intrinsically different from previous ones in a wide dynamical regime, while reducing to the hydrodynamic description of Toner and Tu at very large length-scales. The derived framework is therefore general and it may describe the collective motion of any strongly polarized active matter system.

1501.05628 2026-06-04 cs.RO cs.SY eess.SY math.DS 版本更新

Identification of a Hybrid Spring Mass Damper via Harmonic Transfer Functions as a Step Towards Data-Driven Models for Legged Locomotion

通过谐波传递函数识别混合弹簧质量阻尼器:迈向基于数据的腿部运动模型

İsmail Uyanık, Mustafa Mert Ankaralı, Noah J. Cowan, Ömer Morgül, Uluç Saranlı

AI总结 本文通过数据驱动的频域系统识别方法,利用谐波传递函数构建混合结构动力学模型,展示其在腿部运动建模中的应用潜力。

Comments Draft submitted to the 17th International Conference on Advanced Robotics

详情
AI中文摘要

在手动构建的数学模型中,难以充分捕捉腿部运动的相关方面。即使简单的跑步模型也涉及非可积动力学,需要在模型控制器设计中使用可能不准确的近似方法。本文展示如何利用数据驱动的频域系统识别方法,获取围绕极限环的动态系统输入-输出特性,其混合结构特性类似于腿部运动系统。在某些假设下,可以将此类系统的极限环混合动力学近似为分段光滑的线性时间周期系统(LTP),进一步近似为时间周期性、分段LTI系统以减少识别过程中的参数自由度。本文使用一个简单的单维混合模型,通过线性执行器的动作诱导极限环,以展示方法细节。我们首先推导了示例模型的理论谐波传递函数。然后用小的啁啾信号激励模型,引入围绕其极限环的扰动,并展示系统识别结果以估计该模型的谐波传递函数。数据驱动的HTF模型与理论预测的比较展示了此类经验识别方法在腿部运动中的潜在有效性。

英文摘要

There are limitations on the extent to which manually constructed mathematical models can capture relevant aspects of legged locomotion. Even simple models for basic behaviors such as running involve non-integrable dynamics, requiring the use of possibly inaccurate approximations in the design of model-based controllers. In this study, we show how data-driven frequency domain system identification methods can be used to obtain input--output characteristics for a class of dynamical systems around their limit cycles, with hybrid structural properties similar to those observed in legged locomotion systems. Under certain assumptions, we can approximate hybrid dynamics of such systems around their limit cycle as a piecewise smooth linear time periodic system (LTP), further approximated as a time-periodic, piecewise LTI system to reduce parametric degrees of freedom in the identification process. In this paper, we use a simple one-dimensional hybrid model in which a limit-cycle is induced through the actions of a linear actuator to illustrate the details of our method. We first derive theoretical harmonic transfer functions of our example model. We then excite the model with small chirp signals to introduce perturbations around its limit-cycle and present systematic identification results to estimate the harmonic transfer functions for this model. Comparison between the data-driven HTF model and its theoretical prediction illustrates the potential effectiveness of such empirical identification methods in legged locomotion.

1501.02855 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Assessing Whole-Body Operational Space Control in a Point-Foot Series Elastic Biped: Balance on Split Terrain and Undirected Walking

评估点足系列弹性双足机器人的全身操作空间控制:在高坡分地形平衡与无定向行走

Donghyun Kim, Ye Zhao, Gray Thomas, Luis Sentis

AI总结 本文提出双足机器人敏捷行为的控制与轨迹生成改进,展示全身操作空间控制在高坡分地形平衡和无定向行走中的应用,首次实现该控制在系列弹性双足机器人上的应用。

Comments 17 pages, 9 figures, 4 tables

详情
AI中文摘要

在本文中,我们提出了双足机器人敏捷行为的控制和轨迹生成的改进。我们证明了几年前开发的全身操作空间控制(WBOSC)适用于实现两种敏捷行为:在高坡分地形上平衡和在平坦地形上实现无定向行走。本文展示的是首次在双足机器人上实现WBOSC,特别是具有系列弹性执行器的双足机器人。我们提出并分析了一种新的算法,该算法通过选择足部放置位置动态平衡点足机器人。处理这些系统自然不稳定的动态是一个困难的问题,需要控制器和轨迹生成算法快速且高效地运行。我们提出了全面的开发和整合努力:双足系统的設計與建造以及实验基础设施,WBOSC的定制化以适应敏捷行为,以及新的轨迹生成算法。使用这种定制的控制器,我们首次进行了实验,使双足机器人在高坡分地形上平衡,展示了我们通过力反馈技术精确调节内部力的能力。最后,我们通过基于物理的仿真器和通过平面化运动设置的物理实验,展示了我们的在线轨迹生成算法的稳定能力。

英文摘要

In this paper we present advancements in control and trajectory generation for agile behavior in bipedal robots. We demonstrate that Whole-Body Operational Space Control (WBOSC), developed a few years ago, is well suited for achieving two types of agile behaviors, namely, balancing on a high pitch split terrain and achieving undirected walking on flat terrain. The work presented here is the first implementation of WBOSC on a biped robot, and more specifically a biped robot with series elastic actuators. We present and analyze a new algorithm that dynamically balances point foot robots by choosing footstep placements. Dealing with the naturally unstable dynamics of these type of systems is a difficult problem that requires both the controller and the trajectory generation algorithm to operate quickly and efficiently. We put forth a comprehensive development and integration effort: the design and construction of the biped system and experimental infrastructure, a customization of WBOSC for the agile behaviors, and new trajectory generation algorithms. Using this custom built controller, we conduct, for first time, an experiment in which a biped robot balances in a high pitch split terrain, demonstrating our ability to precisely regulate internal forces using force sensing feedback techniques. Finally, we demonstrate the stabilizing capabilities of our online trajectory generation algorithm in the physics-based simulator and through physical experiments with a planarized locomotion setup.

1501.02854 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Stability and Performance Limits of Latency-Prone Distributed Feedback Controllers

具有延迟的分布式反馈控制器的稳定性与性能极限

Ye Zhao, Nicholas Paine, Kwan Suk Kim, Luis Sentis

AI总结 本文研究了具有延迟的分布式反馈控制器的稳定性与性能极限,探讨了在高阻尼反馈和低阻尼反馈之间平衡的重要性,并通过实验验证了分布式阻抗控制器的稳定性影响因素。

Comments 13 pages, 10 figures, 2 tables, 31 reference

详情
AI中文摘要

机器人控制系统日益依赖分布式反馈控制器来解决复杂传感和决策问题,如在高灵活度的人为中心机器人中遇到的问题。这些需求带来了计算负担的增加,从而导致更大的控制器延迟。为了通过最大化控制反馈增益来提高对机械扰动的鲁棒性,本文强调了在分布式控制器中在高和低级反馈控制努力之间进行权衡的必要性。具体而言,研究了分布式阻抗控制器的影响,其中阻尼反馈努力在靠近控制装置的位置执行,而刚度反馈努力在具有延迟的集中控制过程中执行。一个中心观察是,高阻抗分布式控制器的稳定性对阻尼反馈延迟非常敏感,但对刚度反馈延迟影响较小。本研究深入分析了这一观察,从而获得了对差异的物理理解。然后推导出一个实用的控制器崩溃增益规则,以帮助控制设计师考虑以分布式方式实现其控制应用的好处。这些考虑通过在高性能执行器和 omnidirectional 移动基座上的分析、模拟和实验测试进一步验证。

英文摘要

Robotic control systems are increasingly relying on distributed feedback controllers to tackle complex sensing and decision problems such as those found in highly articulated human-centered robots. These demands come at the cost of a growing computational burden and, as a result, larger controller latencies. To maximize robustness to mechanical disturbances by maximizing control feedback gains, this paper emphasizes the necessity for compromise between high- and low-level feedback control effort in distributed controllers. Specifically, the effect of distributed impedance controllers is studied where damping feedback effort is executed in close proximity to the control plant and stiffness feedback effort is executed in a latency-prone centralized control process. A central observation is that the stability of high impedance distributed controllers is very sensitive to damping feedback delay but much less to stiffness feedback delay. This study pursues a detailed analysis of this observation that leads to a physical understanding of the disparity. Then a practical controller breakdown gain rule is derived to aim at enabling control designers to consider the benefits of implementing their control applications in a distributed fashion. These considerations are further validated through the analysis, simulation and experimental testing on high performance actuators and on an omnidirectional mobile base.

1501.00505 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Adaptive Control of 4-DoF Robot manipulator

四自由度机器人操作臂的自适应控制

P. Mironchyk

AI总结 本文提出了一种针对四自由度机器人操作臂动态参数不确定性的自适应控制策略,通过实验验证其在机器人仿真器中的有效性。

Comments 7 pages, 4(5) figures

详情
AI中文摘要

在机器人实验中,研究人员可能面临机器人操作臂参数不确定性的挑战,这可能由制造过程中的偏差或实验室中的实验更改引起。当机器人抓取物体时,动态和惯性参数也可能不确定。在所有这些情况下,都需要自适应控制策略来识别操作臂动态特性的变化并进行调整。本文提出了一种针对四自由度机器人操作臂动态参数不确定性的自适应控制策略,并展示了该策略在机器人仿真器中的测试结果。

英文摘要

In experimental robotics, researchers may face uncertainties in parameters of a robot manipulator that they are working with. This uncertainty may be caused by deviations in the manufacturing process of a manipulator, or changes applied to manipulator in the lab for sake of experiments. Another situation when dynamical and inertial parameters of a robot are uncertain arises, is the grasping of objects by a manipulator. In all these situations there is a need for adaptive control strategies that would identify changes in dynamical properties of manipulator and adjust for them. This article presents a work on designing of an adaptive control strategy for 4-DoF manipulator with uncertain dynamical properties, and outcomes of testing of this strategy applied to control of simulator of robot.

1412.7824 2026-06-04 eess.SY cs.MA cs.RO cs.SY 版本更新

Multi Time Scale Behaviour of The Formation of Multiple Groups of Nonholonomic Wheeled Mobile Robots

多时间尺度下非完整轮式移动机器人多组形成行为

Soumic Sarkar, Indra Narayan Kar

AI总结 本文提出基于质心的变换方法,将非完整轮式移动机器人的联合动力学分解为三个子系统,分别设计控制器以实现多组形变和质心动态的多时间尺度收敛,同时通过势函数梯度确保避障。

Comments arXiv admin note: text overlap with arXiv:1412.6164

详情
AI中文摘要

不同几何模式和形状通过群体代理生成,需要形成控制。本文应用基于质心的变换(CBT),将非完整轮式移动机器人(WMRs)的联合动力学分解为三个子系统:组内和组间形状动力学以及质心动力学。组内形状动力学可进一步分为每个组的形状动力学,从而形成多组概念。因此,为每个子系统设计了独立控制器。控制器的增益被选择使得整体系统成为奇异扰动系统,不同子系统在不同时间收敛到其期望值。本文进行了多时间尺度收敛分析。基于势函数梯度的负值被添加到控制器中,以确保机器人之间的避障。提供了仿真结果以证明所提控制器的有效性。

英文摘要

Different geometric patterns and shapes are generated using groups of agents, and this needs formation control. In this paper, Centroid Based Transformation (CBT), has been applied to decompose the combined dynamics of nonholonomic Wheeled Mobile Robots (WMRs) into three subsystems: intra and inter group shape dynamics, and the dynamics of the centroid. The intra group shape dynamics can further be partitioned into the shape dynamics of each group, giving the notion of multiple group. Thus separate controllers have been designed for each subsystem. The gains of the controllers are such chosen that the overall system becomes singularly perturbed system, and different subsystems converge to their desired values at different times. Then multi-time scale convergence analysis has been carried out in this paper. Negative gradient of a potential based function has been added to the controller to ensure collision avoidance among the robots. Simulation results have been provided to demonstrate the effectiveness of the proposed controller.

1412.6704 2026-06-04 eess.SY cs.RO cs.SY 版本更新

First Passage Value

首次通过值

Cenk Oguz Saglam, Katie Byl

AI总结 本文扩展了首次通过时间的概念,提出首次通过值,用于描述更广泛的利益相关值,如能量消耗、距离或时间,并为给定置信水平提供首次通过值的界限。

详情
AI中文摘要

对于许多随机动态系统,首次通过时间(MFPT)是一个有用的概念,它给出了感兴趣状态之前的预期时间。本文从多个方面扩展了MFPT。首先,我们证明对于某些系统,仅使用第二大特征值计算的系统级MFPT可以捕捉大部分基本动态,即使对于相当复杂、高维系统也是如此。其次,我们将MFPT推广为首次通过值(MFPV),它提供了更广泛的利益相关值,例如能量消耗、距离或时间。第三,我们为给定置信水平提供了首次通过值(FPV)的界限。本文的核心是强调,为了我们的目标,许多混合系统可以近似为马尔可夫决策过程。因此,许多系统可以使用此框架有效控制。然而,我们的框架特别适用于亚稳系统。此类系统表现出有趣的长寿命行为,它们注定会不可避免地逃逸(例如,最终到达一个不同的失败或成功状态)。我们的目标是根据应用,最小化或最大化该值,直到逃逸。

英文摘要

For many stochastic dynamic systems, the Mean First Passage Time (MFPT) is a useful concept, which gives expected time before a state of interest. This work is an extension of MFPT in several ways. (1) We show that for some systems the system-wide MFPT, calculated using the second largest eigenvalue only, captures most of the fundamental dynamics, even for quite complex, high-dimensional systems. (2) We generalize MFPT to Mean First Passage Value (MFPV), which gives a more general value of interest, e.g., energy expenditure, distance, or time. (3) We provide bounds on First Passage Value (FPV) for a given confidence level. At the heart of this work, we emphasize that for our goals, many hybrid systems can be approximated as Markov Decision Processes. So, many systems can be controlled effectively using this framework. However, our framework is particularly useful for metastable systems. Such systems exhibit interesting long-living behaviors from which they are guaranteed to inevitably escape (e.g., eventually arriving at a distinct failure or success state). Our goal is then either minimizing or maximizing the value until escape, depending on the application.

1412.1251 2026-06-04 cs.RO cs.HC cs.SY eess.SY 版本更新

From Human-Computer Interaction to Human-Robot Social Interaction

从人机交互到人机社会交互

Tarek Toumi, Abdelmadjid Zidani

AI总结 本文探讨了人机社会交互领域,结合人机交互与人机交互的研究,整合机器人的情感与能力概念,以提升人机交互效果,并讨论相关挑战。

详情
Journal ref
IJCSI International Journal of Computer Science Issues, Vol. 11, Issue 1, No 1, 2014 1694-0814
AI中文摘要

人机社会交互已成为一个活跃的研究领域,在此领域中,不同领域的研究者提出解决方案和指导方针,以帮助机器人提高与人类的互动。本文提出引入人机交互和人机交互两个领域的研究成果,并建立桥梁,即在人机模型中整合机器人的情感与能力概念,以适应人机交互,并讨论与所提模型相关的挑战。最后将通过现实案例来展示这一模型。

英文摘要

Human-Robot Social Interaction became one of active research fields in which researchers from different areas propose solutions and directives leading robots to improve their interactions with humans. In this paper we propose to introduce works in both human robot interaction and human computer interaction and to make a bridge between them, i.e. to integrate emotions and capabilities concepts of the robot in human computer model to become adequate for human robot interaction and discuss challenges related to the proposed model. Finally an illustration through real case of this model will be presented.

1412.6029 2026-06-04 cs.RO cs.HC cs.SY eess.SY 版本更新

Pareto efficiency in synthesizing shared autonomy policies with temporal logic constraints

在具有时序逻辑约束下的共享自主策略合成中的帕累托效率

Jie Fu, Ufuk Topcu

AI总结 本文提出一种两阶段策略合成算法,用于生成帕累托高效的协调与控制策略,通过整合Tchebychev标量化方法以优化人类努力与系统性能之间的平衡。

Comments 8 pages, 5 figures, submitted to ICRA 2015 conference

详情
AI中文摘要

在控制系统中,自主控制器和人类操作员共享控制权时,找到能实现期望系统性能且对人类操作员工作量合理的解决方案至关重要。我们提出了一种共享自主系统,能够捕捉自主控制器与人类操作员之间的交互和控制切换,以及操作员在控制执行期间的认知状态演变。为了在人类努力和性能水平之间进行权衡,例如通过满足底层时序逻辑规范的概率来衡量,我们提出了一种两阶段策略合成算法,以生成针对用户指定权重的帕累托高效协调和控制策略。我们整合了Tchebychev标量化方法用于多目标优化方法,以获得比线性标量化方法更好的帕累托有效解集覆盖。

英文摘要

In systems in which control authority is shared by an autonomous controller and a human operator, it is important to find solutions that achieve a desirable system performance with a reasonable workload for the human operator. We formulate a shared autonomy system capable of capturing the interaction and switching control between an autonomous controller and a human operator, as well as the evolution of the operator's cognitive state during control execution. To trade-off human's effort and the performance level, e.g., measured by the probability of satisfying the underlying temporal logic specification, a two-stage policy synthesis algorithm is proposed for generating Pareto efficient coordination and control policies with respect to user specified weights. We integrate the Tchebychev scalarization method for multi-objective optimization methods to obtain a better coverage of the set of Pareto efficient solutions than linear scalarization methods.

1404.2289 2026-06-04 eess.SY cs.RO cs.SY 版本更新

On the Minimal Revision Problem of Specification Automata

关于规范自动机的最小修正问题

Kangjin Kim, Georgios E. Fainekos, Sriram Sankaranarayanan

AI总结 本文研究了机器人任务规划中规范自动机的最小修正问题,证明其为NP完全问题,并提出一种多项式时间的启发式算法以近似求解。

Comments 23 pages, 16 figures, 2 tables, International Joural of Robotics Research 2014 Major Revision (submitted)

详情
AI中文摘要

随着机器人逐步融入日常生活,提供安全且可证明正确操作的保障变得必要。此类保障可通过自动机理论的任务和使命规划实现,其中要求以时序逻辑规范表达。然而,在现实场景中,机器人可能无法满足所有用户任务需求。在这种情况下,机器人必须向用户提供无法完成任务的原因,并指示尽可能接近初始用户意图的任务。本文证明后者问题,即最小规范修正问题,是NP完全的。提出了一种启发式算法,可在多项式时间内计算最小修正问题(MRP)的良好近似解。算法的实验研究显示,在大多数问题实例中,该启发式算法实际上返回最优解。最后,一些算法未返回最优解的情况也被呈现。

英文摘要

As robots are being integrated into our daily lives, it becomes necessary to provide guarantees on the safe and provably correct operation. Such guarantees can be provided using automata theoretic task and mission planning where the requirements are expressed as temporal logic specifications. However, in real-life scenarios, it is to be expected that not all user task requirements can be realized by the robot. In such cases, the robot must provide feedback to the user on why it cannot accomplish a given task. Moreover, the robot should indicate what tasks it can accomplish which are as "close" as possible to the initial user intent. This paper establishes that the latter problem, which is referred to as the minimal specification revision problem, is NP complete. A heuristic algorithm is presented that can compute good approximations to the Minimal Revision Problem (MRP) in polynomial time. The experimental study of the algorithm demonstrates that in most problem instances the heuristic algorithm actually returns the optimal solution. Finally, some cases where the algorithm does not return the optimal solution are presented.

1406.4905 2026-06-04 cs.LG cs.RO cs.SY eess.SY stat.ML 版本更新

Variational Gaussian Process State-Space Models

变分高斯过程状态空间模型

Roger Frigola, Yutian Chen, Carl E. Rasmussen

AI总结 本文提出基于稀疏高斯过程的变分贝叶斯学习方法,用于高效学习非线性状态空间模型,实现对非线性动力系统后验的可计算性,相比传统参数模型,能灵活平衡模型容量与计算成本,避免过拟合。

详情
Journal ref
R. Frigola, Y. Chen and C. E. Rasmussen. Variational Gaussian Process State-Space Models, in Advances in Neural Information Processing Systems (NIPS), 2014
AI中文摘要

状态空间模型在科学和工程的不同领域中已成功应用超过五十年。我们提出了一种基于稀疏高斯过程的高效变分贝叶斯学习非线性状态空间模型的程序。学习结果是对非线性动力系统可计算的后验。与传统参数模型相比,我们提供了在避免过拟合的同时,可以方便地权衡模型容量和计算成本的可能性。我们的主要算法使用了结合变分贝叶斯和顺序蒙特卡洛的混合推断方法。我们还提出了随机变分推断和在线学习方法,以实现对长时间序列的快速学习。

英文摘要

State-space models have been successfully used for more than fifty years in different areas of science and engineering. We present a procedure for efficient variational Bayesian learning of nonlinear state-space models based on sparse Gaussian processes. The result of learning is a tractable posterior over nonlinear dynamical systems. In comparison to conventional parametric models, we offer the possibility to straightforwardly trade off model capacity and computational cost whilst avoiding overfitting. Our main algorithm uses a hybrid inference approach combining variational Bayes and sequential Monte Carlo. We also present stochastic variational inference and online learning approaches for fast learning with long time series.

1410.4622 2026-06-04 cs.RO cs.SY eess.SY math.AT stat.ML 版本更新

Robust Topological Feature Extraction for Mapping of Environments using Bio-Inspired Sensor Networks

鲁棒拓扑特征提取用于生物启发式传感器网络环境映射

Alireza Dirafzoon, Edgar Lobaton

AI总结 本文利用生物启发式传感器网络收集的最小感知信息,通过概率运动模型提取弱接触信息,构建环境拓扑表示。采用拓扑数据分析提取主导特征,结合密度基子采样算法提高鲁棒性,并提出鲁棒尺度不变分类算法以量化特征。

Comments 14 pages, 7 figures

详情
AI中文摘要

本文提出了一种利用生物启发式传感器网络进行未知环境探索和映射的方法。通过受蟑螂运动特性启发的概率运动模型,提取弱接触信息以构建环境拓扑表示。利用节点间交互生成点云,代表环境流形的空间特征。通过拓扑数据分析生成持续区间用于拓扑映射。为提高采样数据对异常值的鲁棒性,采用密度基子采样算法。此外,提出一种鲁棒尺度不变分类算法用于持久图,以提供数据中所需特征的定量表示。同时,提出多种定义接触度量的策略,以提高拓扑方法的估计和分类性能。

英文摘要

In this paper, we exploit minimal sensing information gathered from biologically inspired sensor networks to perform exploration and mapping in an unknown environment. A probabilistic motion model of mobile sensing nodes, inspired by motion characteristics of cockroaches, is utilized to extract weak encounter information in order to build a topological representation of the environment. Neighbor to neighbor interactions among the nodes are exploited to build point clouds representing spatial features of the manifold characterizing the environment based on the sampled data. To extract dominant features from sampled data, topological data analysis is used to produce persistence intervals for features, to be used for topological mapping. In order to improve robustness characteristics of the sampled data with respect to outliers, density based subsampling algorithms are employed. Moreover, a robust scale-invariant classification algorithm for persistence diagrams is proposed to provide a quantitative representation of desired features in the data. Furthermore, various strategies for defining encounter metrics with different degrees of information regarding agents' motion are suggested to enhance the precision of the estimation and classification performance of the topological method.

1403.5195 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Experimental Implementation of an Invariant Extended Kalman Filter-based Scan Matching SLAM

基于不变扩展卡尔曼滤波的扫描匹配SLAM实验实现

Martin Barczyk, Silvère Bonnabel, Jean-Emmanuel Deschaud, François Goulette

AI总结 本文提出将不变扩展卡尔曼滤波方法应用于扫描匹配SLAM问题,通过实验验证了其在机器人平台上的鲁棒性。

详情
Journal ref
Proceedings of the 2014 American Control Conference, Portland, OR, June 2014, pp. 4121-4126
AI中文摘要

本文描述了将不变扩展卡尔曼滤波(IEKF)设计方法应用于扫描匹配SLAM问题的实验实现。我们回顾了IEKF的理论基础及其在保证对劣质状态估计鲁棒性的实际价值,然后在轮式机器人硬件平台上实现了该滤波器。所提出的设计在实验测试中得到了成功验证。

英文摘要

We describe an application of the Invariant Extended Kalman Filter (IEKF) design methodology to the scan matching SLAM problem. We review the theoretical foundations of the IEKF and its practical interest of guaranteeing robustness to poor state estimates, then implement the filter on a wheeled robot hardware platform. The proposed design is successfully validated in experimental testing.

1410.2792 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Convex Model Predictive Control for Vehicular Systems

凸优化预测控制用于车辆系统

Tiffany A. Huang, Matanya B. Horowitz, Joel W. Burdick

AI总结 本文提出一种基于旋转矩阵轨道ope的凸优化预测控制方法,适用于SO(n)状态空间的系统,无需局部线性化,避免传统方法的缺陷,适用于航空和车辆系统。

详情
AI中文摘要

本文提出了一种用于状态属于SO(n)(n=2,3)系统的模型预测控制(MPC)方法,无需坐标图或局部线性化,而是通过操作旋转矩阵的轨道ope实现。这种方法产生了一种新颖的MPC方案,无需传统线性化技术的缺点。除了通常用于线性系统MPC的QP方案外,仅需二次锥约束或半正定约束。特别强调了在航空和车辆系统中的应用,其中该方法消除了这些系统状态空间方程中许多超越三角函数项。此外,该方法已被证明与许多现有的MPC变体兼容,包括通过混合整数线性规划(MILP)实现的障碍规避。

英文摘要

In this work, we present a method to perform Model Predictive Control (MPC) over systems whose state is an element of $SO(n)$ for $n=2,3$. This is done without charts or any local linearization, and instead is performed by operating over the orbitope of rotation matrices. This results in a novel MPC scheme without the drawbacks associated with conventional linearization techniques. Instead, second order cone- or semidefinite-constraints on state variables are the only requirement beyond those of a QP-scheme typical for MPC of linear systems. Of particular emphasis is the application to aeronautical and vehicular systems, wherein the method removes many of the transcendental trigonometric terms associated with these systems' state space equations. Furthermore, the method is shown to be compatible with many existing variants of MPC, including obstacle avoidance via Mixed Integer Linear Programming (MILP).

1410.0879 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Priority-based coordination of mobile robots

基于优先级的移动机器人协调

Jean Gregoire

AI总结 本文研究了在交叉口协调多个移动机器人的问题,提出通过规划优先级而非具体轨迹来实现更鲁棒的协调系统,适用于自动驾驶车辆等共享道路的场景。

Comments PhD Thesis, 182 pages

详情
AI中文摘要

自20世纪80年代末以来,自动驾驶车辆的发展已成为主要工业国家的高强度研究领域。积极的社会经济效益包括减少事故、减少旅行时间、提高能效和减少对昂贵基础设施的需求。某些形式的车对车和/或车对基础设施协作是确保安全高效的全球交通系统所必需的。本文研究了在交叉口协调多个移动机器人的特定协作形式。大多数先前提出的协调系统涉及规划轨迹并沿规划轨迹控制机器人:即计划-编程范式,其中规划被视为动作生成机制。本文的方法是规划优先级——机器人通过交叉口的相对顺序,这比许多轨迹尊重相同的优先级要弱得多。更具体地说,优先级编码协调问题的解的同调类。优先级分配相当于选择某些同调类来解决协调问题,而不是特定轨迹。一旦分配了优先级,机器人将通过保持分配优先级的控制定律进行控制,即确保描述的轨迹属于所选同调类。这导致了一个更鲁棒的协调系统——能够以反应式方式处理大量意外事件——特别适用于自动驾驶车辆在交叉口的协调应用,其中汽车、公共交通和行人共享道路。

英文摘要

Since the end of the 1980's, the development of self-driven autonomous vehicles is an intensive research area in most major industrial countries. Positive socio-economic potential impacts include a decrease of crashes, a reduction of travel times, energy efficiency improvements, and a reduced need of costly physical infrastructure. Some form of vehicle-to-vehicle and/or vehicle-to-infrastructure cooperation is required to ensure a safe and efficient global transportation system. This thesis deals with a particular form of cooperation by studying the problem of coordinating multiple mobile robots at an intersection area. Most of coordination systems proposed in previous work consist in planning a trajectory and to control the robots along the planned trajectory: that is the plan-as-program paradigm where planning is considered as a generative mechanism of action. The approach of the thesis is to plan priorities -- the relative order of robots to go through the intersection -- which is much weaker as many trajectories respect the same priorities. More precisely, priorities encode the homotopy classes of solutions to the coordination problem. Priority assignment is equivalent to the choice of some homotopy class to solve the coordination problem instead of a particular trajectory. Once priorities are assigned, robots are controlled through a control law preserving the assigned priorities, i.e., ensuring the described trajectory belongs to the chosen homotopy class. It results in a more robust coordination system -- able to handle a large class of unexpected events in a reactive manner -- particularly well adapted for an application to the coordination of autonomous vehicles at intersections where cars, public transport and pedestrians share the road.

1410.0083 2026-06-04 eess.SY cs.AI cs.RO cs.SY 版本更新

Integrating active sensing into reactive synthesis with temporal logic constraints under partial observations

将主动感知整合到带有时序逻辑约束的反应合成中,在部分观察下

Jie Fu, Ufuk Topcu

AI总结 本文提出在部分可观测和动态环境中,利用感知动作进行在线反应规划,通过主动感知策略减少不确定性,确保时序逻辑规范以概率1满足。

Comments 7 pages, 2 figures, submitted to American Control Conference 2015

详情
AI中文摘要

我们引入了在部分可观测和动态环境中,具有时序逻辑约束的系统中利用感知动作进行在线反应规划的概念。在动态环境信息不完整的情况下,反应控制器合成相当于解决一个具有部分观察的双人游戏,计算复杂度 impractically 高。为减轻高计算负担,通过感知动作进行在线重规划,避免在部分观察下解决反应系统的策略。相反,我们只解决一个策略,确保给定的时序逻辑规范在系统拥有完整环境观察时可以满足。此类策略随后被转换为基于观察到的状态序列(交互系统及其环境)做出控制决策的策略。当系统遇到一个信念——包含所有可能的当前状态假设的集合——对于观察策略未定义时,触发一系列感知动作,由主动感知策略选择,以减少系统信念中的不确定性。我们证明,在满足系统传感器集合的 mild 技术假设下,通过交替使用基于观察的策略和主动感知策略,可以以概率1满足给定的时序逻辑规范。

英文摘要

We introduce the notion of online reactive planning with sensing actions for systems with temporal logic constraints in partially observable and dynamic environments. With incomplete information on the dynamic environment, reactive controller synthesis amounts to solving a two-player game with partial observations, which has impractically computational complexity. To alleviate the high computational burden, online replanning via sensing actions avoids solving the strategy in the reactive system under partial observations. Instead, we only solve for a strategy that ensures a given temporal logic specification can be satisfied had the system have complete observations of its environment. Such a strategy is then transformed into one which makes control decisions based on the observed sequence of states (of the interacting system and its environment). When the system encounters a belief---a set including all possible hypotheses the system has for the current state---for which the observation-based strategy is undefined, a sequence of sensing actions are triggered, chosen by an active sensing strategy, to reduce the uncertainty in the system's belief. We show that by alternating between the observation-based strategy and the active sensing strategy, under a mild technical assumption of the set of sensors in the system, the given temporal logic specification can be satisfied with probability 1.

1401.7612 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Mathematical Modelling of Turning Delays in Swarm Robotics

群体机器人转向延迟的数学建模

Jake P. Taylor-King, Benjamin Franz, Christian A. Yates, Radek Erban

AI总结 研究转向延迟对群体机器人行为的影响,通过传输方程建模并验证实验结果,证明延迟方程能更准确预测机器人寻找目标区域的时间。

Comments Submitted to the IMA Journal of Applied Mathematics

详情
AI中文摘要

研究转向延迟对群体差分轮式机器人行为的影响,并证明群体层面的行为可通过包含适当延迟的传输方程描述。我们的数学分析结果通过数值模拟和e-puck机器人的实验得到支持。我们比较的实验量是机器人在未知环境中寻找目标区域的平均时间。带有延迟的传输方程比不带延迟的标准传输方程更能预测找到目标的平均时间。

英文摘要

We investigate the effect of turning delays on the behaviour of groups of differential wheeled robots and show that the group-level behaviour can be described by a transport equation with a suitably incorporated delay. The results of our mathematical analysis are supported by numerical simulations and experiments with e-puck robots. The experimental quantity we compare to our revised model is the mean time for robots to find the target area in an unknown environment. The transport equation with delay better predicts the mean time to find the target than the standard transport equation without delay.

1407.5813 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Priority-based coordination of autonomous and legacy vehicles at intersection

基于优先级的自主车辆与传统车辆在交叉口的协调

Xiangjun Qian, Jean Gregoire, Fabien Moutarde, Arnaud De La Fortelle

AI总结 本文提出一种基于优先级的协调方法,支持传统车辆安全通过交叉口,同时保持无碰撞和死锁特性。

Comments put in other preprint server

详情
AI中文摘要

近年来,研究人员提出了多种自主交叉口管理技术,使自动驾驶车辆能够在没有交通灯或停止标志的情况下通过交叉口。特别是,一种具有可证明无碰撞和无死锁特性的优先级协调系统已被提出。在本文中,我们扩展了这种优先级方法,以支持传统车辆,而不会损害上述特性。我们假设传统车辆能够保持与前车的安全距离。然后,我们探讨了一些特殊的系统配置,以确保传统车辆的安全通过。我们将在现实交通模拟器SUMO中实现扩展系统。通过模拟来演示系统的安全性。

英文摘要

Recently, researchers have proposed various autonomous intersection management techniques that enable autonomous vehicles to cross the intersection without traffic lights or stop signs. In particular, a priority-based coordination system with provable collision-free and deadlock-free features has been presented. In this paper, we extend the priority-based approach to support legacy vehicles without compromising above-mentioned features. We make the hypothesis that legacy vehicles are able to keep a safe distance from their leading vehicles. Then we explore some special configurations of system that ensures the safe crossing of legacy vehicles. We implement the extended system in a realistic traffic simulator SUMO. Simulations are performed to demonstrate the safety of the system.

1406.0993 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Latent Kullback Leibler Control for Continuous-State Systems using Probabilistic Graphical Models

基于概率图模型的潜在Kullback-Leibler控制用于连续状态系统

Takamitsu Matsubara, Vicenç Gómez, Hilbert J. Kappen

AI总结 本文提出将KL控制问题嵌入概率图模型中,通过连续状态和离散表示进行控制,展示两种方法在机器人运动控制中的应用。

Comments 9 pages, 5 figures, accepted in Uncertainty in Artificial Intelligence (UAI '14)

详情
AI中文摘要

Kullback Leibler (KL) 控制问题允许通过求解主特征向量问题高效计算最优控制。然而,直接将此类框架应用于连续状态-动作系统受限。本文提出将KL控制问题嵌入概率图模型中,其中观测变量对应系统的连续(可能高维)状态,潜在变量对应状态的离散(低维)表示,适用于KL控制计算。我们展示了两种方法:第一种使用标准隐马尔可夫模型(HMMs)计算精确最优控制,但仅适用于低维系统;第二种使用因子HMMs,可扩展到高维问题,但控制计算为近似。我们通过多个机器人运动控制任务展示了这两种方法。

英文摘要

Kullback Leibler (KL) control problems allow for efficient computation of optimal control by solving a principal eigenvector problem. However, direct applicability of such framework to continuous state-action systems is limited. In this paper, we propose to embed a KL control problem in a probabilistic graphical model where observed variables correspond to the continuous (possibly high-dimensional) state of the system and latent variables correspond to a discrete (low-dimensional) representation of the state amenable for KL control computation. We present two examples of this approach. The first one uses standard hidden Markov models (HMMs) and computes exact optimal control, but is only applicable to low-dimensional systems. The second one uses factorial HMMs, it is scalable to higher dimensional problems, but control computation is approximate. We illustrate both examples in several robot motor control tasks.

1406.1619 2026-06-04 cs.RO cs.SY eess.SY 版本更新

An Invariant Linear Quadratic Gaussian controller for a simplified car

一种不变的线性二次高斯控制器用于简化汽车

Sébastien Diemer, Silvère Bonnabel

AI总结 本文提出一种不变的线性二次高斯控制器,用于跟踪简化汽车模型的参考轨迹,通过模拟验证其在高噪声或初始不确定性下的鲁棒性。

详情
AI中文摘要

本文考虑了基于单轮动力学模型的简化汽车跟踪参考轨迹的问题,其中仅测量位置,并且控制输入和测量值受到独立高斯噪声干扰。为解决此问题,我们设计了一种新型观测器-控制器:不变的线性二次高斯控制器(ILQG)。它基于线性二次高斯控制器,但方程稍作修改以利用问题的对称性。增益调节对估计轨迹的依赖性降低,因此对估计误差更不敏感。除了不变方法本身合理(无论参考轨迹朝西还是朝南,控制器性能不应依赖于此)外,我们通过仿真证明,ILQG在大噪声或大初始不确定性情况下优于传统LQG控制器。我们还表明,这些鲁棒性特性可能对运动规划应用也有用。

英文摘要

In this paper, we consider the problem of tracking a reference trajectory for a simplified car model based on unicycle kinematics, whose position only is measured, and where the control input and the measurements are corrupted by independent Gaussian noises. To tackle this problem we devise a novel observer-controller: the invariant Linear Quadratic Gaussian controller (ILQG). It is based on the Linear Quadratic Gaussian controller, but the equations are slightly modified to account for, and to exploit, the symmetries of the problem. The gain tuning exhibits a reduced dependency on the estimated trajectory, and is thus less sensitive to misestimates. Beyond the fact the invariant approach is sensible (there is no reason why the controller performance should depend on whether the reference trajectory is heading west or south), we show through simulations that the ILQG outperforms the conventional LQG controller in case of large noises or large initial uncertainties. We show that those robustness properties may also prove useful for motion planning applications.

1405.7392 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Information-Theoretic Stochastic Optimal Control via Incremental Sampling-based Algorithms

基于增量采样的信息论随机最优控制

Oktay Arslan, Evangelos Theodorou, Panagiotis Tsiotras

AI总结 本文提出基于增量采样的算法,将RRT与信息论随机最优控制相结合,解决大系统中非线性随机微分方程的最优控制问题。

Comments 18 pages

详情
AI中文摘要

本文考虑由非线性随机微分方程表示的动力系统最优控制问题。已知最优控制策略可通过满足非线性偏微分方程(即哈密顿-雅可比-贝尔曼方程)的价值函数获得。该非线性PDE必须逆向求解,但对于大规模系统来说计算不可行。在某些假设下,应用对数变换后,最优策略可表示为路径积分。基于路径积分的控制方法最近被证明能优雅地解决广泛类别的随机最优控制问题。其中一个实现挑战是计算成本函数在未受力动力学轨迹上的期望值。在均匀采样轨迹上计算此类期望可能导致数值不稳定性,因为成本的指数化。因此,采样低成本轨迹对PI方法的实用实现至关重要。本文使用基于增量采样的算法从未受力系统动力学中采样有用的轨迹,并建立了RRT与信息论随机最优控制之间的新联系。我们展示了所提出方法在多个示例中的数值实现结果。

英文摘要

This paper considers optimal control of dynamical systems which are represented by nonlinear stochastic differential equations. It is well-known that the optimal control policy for this problem can be obtained as a function of a value function that satisfies a nonlinear partial differential equation, namely, the Hamilton-Jacobi-Bellman equation. This nonlinear PDE must be solved backwards in time, and this computation is intractable for large scale systems. Under certain assumptions, and after applying a logarithmic transformation, an alternative characterization of the optimal policy can be given in terms of a path integral. Path Integral (PI) based control methods have recently been shown to provide elegant solutions to a broad class of stochastic optimal control problems. One of the implementation challenges with this formalism is the computation of the expectation of a cost functional over the trajectories of the unforced dynamics. Computing such expectation over trajectories that are sampled uniformly may induce numerical instabilities due to the exponentiation of the cost. Therefore, sampling of low-cost trajectories is essential for the practical implementation of PI-based methods. In this paper, we use incremental sampling-based algorithms to sample useful trajectories from the unforced system dynamics, and make a novel connection between Rapidly-exploring Random Trees (RRTs) and information-theoretic stochastic optimal control. We show the results from the numerical implementation of the proposed approach to several examples.

1405.3094 2026-06-04 cs.RO cs.SY eess.SY 版本更新

The inverted Pendulum: A fundamental Benchmark in Control Theory and Robotics

倒立摆:控制理论与机器人学中的基本基准

Olfa Boubaker

AI总结 本文探讨倒立摆作为控制理论与机器人学中重要基准的应用动机,分析其在实际设计中的实现,并通过150篇文献综述展示其在历史、现状及挑战中的发展。

Comments IEEE International Conference on Education and e-Learning Innovations (ICEELI), 1-3 July 2012, Sousse, Tunisia

详情
AI中文摘要

至少五十年来,倒立摆一直是控制理论和机器人学中教学和研究中最受欢迎的基准之一。本文提出了使用该系统的关键动机,并详细解释了倒立摆基准如何提供有效且高效的应用。将展示几种实际经验、虚拟模型和基于网络的远程控制实验室,重点在于该系统的实际设计实现。通过应用于倒立摆系统,将展示不同控制设计方法和流行机器人问题的文献综述。总共整理了150篇开放文献,从1960年至今,以提供历史、现状和挑战发展的整体图景。

英文摘要

For at least fifty years, the inverted pendulum has been the most popular benchmark, among others, for teaching and researches in control theory and robotics. This paper presents the key motivations for the use of that system and explains, in details, the main reflections on how the inverted pendulum benchmark gives an effective and efficient application. Several real experiences, virtual models and web-based remote control laboratories will be presented with emphasis on the practical design implementation of this system. A bibliographical survey of different design control approaches and trendy robotic problems will be presented through applications to the inverted pendulum system. In total, 150 references in the open literature, dating back to 1960, are compiled to provide an overall picture of historical, current and challenging developments.

1405.2363 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

A sampling-based approach to scalable constraint satisfaction in linear sampled-data systems---Part I: Computation

基于采样的可扩展约束满足方法在线性采样数据系统中的应用---第一部分:计算

Shahab Kaynama, Jeremy H. Gillula, Claire J. Tomlin

AI总结 本文提出一种新的基于采样的算法,用于高维线性采样数据系统中精确逼近可行性核,解决约束满足问题,并在十二维飞行包线保护问题中验证了方法的有效性。

详情
AI中文摘要

采样数据(SD)系统由离散和连续时间组件组成,是实际中最常见的混合物理系统之一;大多数现代控制器是在数字平台上实现的,而被控制的植物动力学在时间上是连续变化的。与所有混合物理系统一样,确保硬约束满足是SD系统安全运行的关键。一种强大的分析工具用于保证此类约束满足是可行性核:所有初始条件的集合,对于存在安全保持控制律(即满足所有输入和状态约束的控制律)而言。在本文中,我们提出了一种新的基于采样的算法,该算法紧密逼近高维线性采样数据时间不变(LTI)系统的可行性核。与该领域先前的工作不同,我们的算法正式处理了SD系统的离散和连续特性。我们证明了我们近似技术的正确性和收敛性,提供了关于如何通过启发式方法优化采样过程的讨论,并在十二维飞行包线保护问题中展示了结果。

英文摘要

Sampled-data (SD) systems, which are composed of both discrete- and continuous-time components, are arguably one of the most common classes of cyberphysical systems in practice; most modern controllers are implemented on digital platforms while the plant dynamics that are being controlled evolve continuously in time. As with all cyberphysical systems, ensuring hard constraint satisfaction is key in the safe operation of SD systems. A powerful analytical tool for guaranteeing such constraint satisfaction is the viability kernel: the set of all initial conditions for which a safety-preserving control law (that is, a control law that satisfies all input and state constraints) exists. In this paper we present a novel sampling-based algorithm that tightly approximates the viability kernel for high-dimensional sampled-data linear time-invariant (LTI) systems. Unlike prior work in this area, our algorithm formally handles both the discrete and continuous characteristics of SD systems. We prove the correctness and convergence of our approximation technique, provide discussions on heuristic methods to optimally bias the sampling process, and demonstrate the results on a twelve-dimensional flight envelope protection problem.

1404.7073 2026-06-04 eess.SY cs.LG cs.LO cs.RO cs.SY 版本更新

Probably Approximately Correct MDP Learning and Control With Temporal Logic Constraints

在时序逻辑约束下进行近似正确马尔可夫决策过程的学习与控制

Jie Fu, Ufuk Topcu

AI总结 本文提出在未知随机环境中,基于PAC-MDP方法学习满足时序逻辑规范的近优控制策略,通过多项式时间与空间复杂度实现高概率的近优策略生成。

Comments 9 pages, 5 figures, Accepted by 2014 Robotics: Science and Systems (RSS)

详情
AI中文摘要

我们考虑在未知随机环境中合成控制策略,以最大化满足给定时序逻辑规范的概率。我们将系统与环境的交互建模为具有初始未知转移概率的马尔可夫决策过程(MDP)。所开发的解决方案基于所谓的基于模型的近似正确马尔可夫决策过程(PAC-MDP)方法。该算法通过样本(即观测)、时间和空间,以多项式复杂度与MDP大小、自动机构造时序逻辑规范的大小、1/ε、1/δ和有限时间 horizon 相关,生成一个ε-近优策略,概率为1-δ。在此方法中,系统维护初始未知MDP的模型,并基于其学习模型和规范自动机构造产品MDP。在执行过程中,策略通过观察系统所采取的转移进行迭代更新。迭代在有限步骤内终止。以高概率,所生成的策略使得任何状态下,该策略满足规范的概率与最优策略之间的差异在预定义范围内。

英文摘要

We consider synthesis of control policies that maximize the probability of satisfying given temporal logic specifications in unknown, stochastic environments. We model the interaction between the system and its environment as a Markov decision process (MDP) with initially unknown transition probabilities. The solution we develop builds on the so-called model-based probably approximately correct Markov decision process (PAC-MDP) methodology. The algorithm attains an $\varepsilon$-approximately optimal policy with probability $1-δ$ using samples (i.e. observations), time and space that grow polynomially with the size of the MDP, the size of the automaton expressing the temporal logic specification, $\frac{1}{\varepsilon}$, $\frac{1}δ$ and a finite time horizon. In this approach, the system maintains a model of the initially unknown MDP, and constructs a product MDP based on its learned model and the specification automaton that expresses the temporal logic constraints. During execution, the policy is iteratively updated using observation of the transitions taken by the system. The iteration terminates in finitely many steps. With high probability, the resulting policy is such that, for any state, the difference between the probability of satisfying the specification under this policy and the optimal one is within a predefined bound.

1403.2174 2026-06-04 cs.RO cs.SY eess.SY 版本更新

A New Technique for INS/GNSS Attitude and Parameter Estimation Using Online Optimization

一种新的INS/GNSS姿态和参数估计技术:使用在线优化

Yuanxin Wu, Jinling Wang, Dewen Hu

AI总结 本文提出一种在线约束优化方法,用于同时估计INS/GNSS的姿态和其他相关参数,无需先验姿态或传感器噪声信息,通过数值结果验证其在高精度应用中的有效性。

Comments IEEE Trans. on Signal Processing, to appear

详情
Journal ref
IEEE Trans. on Signal Processing, 62 (10), 2642 - 2655, 2014
AI中文摘要

惯性导航系统(INS)和全球导航卫星系统(GNSS)的集成通常通过类似卡尔曼滤波的方式在工程应用中实现。这种INS/GNSS集成容易出现姿态初始化失败,尤其是当主机车辆自由移动时。本文提出了一种在线约束优化方法,用于同时估计姿态和其他相关参数,包括GNSS天线的杠杆臂和惯性传感器偏差。该新方法利用自我初始化,无需先验姿态或传感器测量噪声信息。数值结果用于验证其在高精度INS/GNSS应用中的有效性和前景。

英文摘要

Integration of inertial navigation system (INS) and global navigation satellite system (GNSS) is usually implemented in engineering applications by way of Kalman-like filtering. This form of INS/GNSS integration is prone to attitude initialization failure, especially when the host vehicle is moving freely. This paper proposes an online constrained-optimization method to simultaneously estimate the attitude and other related parameters including GNSS antenna's lever arm and inertial sensor biases. This new technique benefits from self-initialization in which no prior attitude or sensor measurement noise information is required. Numerical results are reported to validate its effectiveness and prospect in high accurate INS/GNSS applications.

1404.3580 2026-06-04 cs.MA cs.NI cs.RO cs.SY eess.SY 版本更新

Joint Estimation and Localization in Sensor Networks

传感器网络中的联合估计与定位

Nikolay A. Atanasov, Roberto Tron, Victor M. Preciado, George J. Pappas

AI总结 本文提出了一种分布式线性估计器用于无线传感器网络中动态目标的协同跟踪,证明了其在静态目标估计中的均方一致性,并通过分布式雅可比算法实现传感器定位,确保了联合定位与目标估计方法的强收敛性。

Comments 9 pages (two-column); 5 figures; Manuscript submitted to the 2014 IEEE Conference on Decision and Control (CDC)

详情
AI中文摘要

本文解决了无线传感器网络中动态目标协同跟踪的问题。推导出一种新型的分布式线性估计器,即分布式卡尔曼滤波器的一种变体。证明了在静态目标估计情况下,该滤波器在均方意义下是一致的。当大规模传感器网络部署时,传感器通常缺乏对其位置的良好知识,这会影响目标估计过程。与大多数现有目标跟踪方法不同,我们研究了当传感器姿态需通过辅助定位过程进行估计时,本滤波器的性能。传感器通过分布式雅可比算法从噪声相对测量中进行定位。我们证明了定位方法的强收敛性保证,并进而保证了联合定位与目标估计方法的强收敛性。我们的算法性能在环境监测和目标跟踪任务的仿真中得到了验证。

英文摘要

This paper addresses the problem of collaborative tracking of dynamic targets in wireless sensor networks. A novel distributed linear estimator, which is a version of a distributed Kalman filter, is derived. We prove that the filter is mean square consistent in the case of static target estimation. When large sensor networks are deployed, it is common that the sensors do not have good knowledge of their locations, which affects the target estimation procedure. Unlike most existing approaches for target tracking, we investigate the performance of our filter when the sensor poses need to be estimated by an auxiliary localization procedure. The sensors are localized via a distributed Jacobi algorithm from noisy relative measurements. We prove strong convergence guarantees for the localization method and in turn for the joint localization and target estimation approach. The performance of our algorithms is demonstrated in simulation on environmental monitoring and target tracking tasks.

1404.3316 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Embed System for Robotic Arm with 3 Degree of Freedom Controller using Computational Vision on Real-Time

具有3自由度控制器的机器人臂嵌入式系统使用计算视觉实现实时控制

Luiz Cortinhas, Patrick Monteiro, Amir Zahlan, Gabriel Vianna, Marcio Moscoso

AI总结 本文提出一种基于协议通信的分布式嵌入式控制系统,利用手套手势进行三维识别,通过模糊实现设置x、y、z坐标,实现机器人臂的实时控制。

Comments 8 pages,9 figures, published on AIFL 2014 conference (AIFL-2014 Submission 20)

详情
AI中文摘要

本文研究了具有3自由度控制器的机器人臂嵌入式控制系统,基于协议通信的分布式系统,通过一个支持多个点的服务器和通过套接字的移动应用进行通信。所提出的系统利用手套手势进行三维识别,通过模糊实现设置x、y、z坐标。该方法实现了两个基于Raspberry PI的臂基于计算机运行客户端程序,x64 PC运行服务器程序,以及由ATmega328p基板控制的机器人臂。

英文摘要

This Paper deals with robotic arm embed controller system, with distributed system based on protocol communication between one server supporting multiple points and mobile applications trough sockets .The proposed system utilizes hand with glove gesture in three-dimensional recognition using fuzzy implementation to set x,y,z coordinates. This approach present all implementation over: two raspberry PI arm based computer running client program, x64 PC running server program, and one robot arm controlled by ATmega328p based board.

1404.3221 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

UAV Circumnavigating an Unknown Target Under a GPS-denied Environment with Range-only Measurements

无人机在无GPS环境下通过仅距离测量绕行未知目标

Yongcan Cao

AI总结 本文研究无人机在无GPS环境下利用仅距离测量实现绕行未知目标的控制方法,通过两步控制算法设计,首先假设具备距离和距离变化率测量,其次用滑模估计器估计距离变化率替代实际测量,以实现绕行任务。

Comments A preliminary version of this work will be presented at the 2014 American Control Conference

详情
AI中文摘要

无人机的一种典型应用是情报、监视和侦察任务,其目标是通过信息采集提高态势感知。例如,一种高效获取目标信息的方法是将无人机以期望距离绕行目标。这种无人机运动称为绕行。本文的目标是设计一种无人机控制算法,使在无GPS环境下利用仅距离测量实现绕行任务。该控制算法分为两步。第一步是假设具备距离和距离变化率测量,其中关联的控制输入始终有界。第二步是进一步消除对距离变化率测量的依赖,通过滑模估计器使用距离测量获得估计的距离变化率来替代实际的距离变化率测量。这种控制器设计技术适用于在无GPS环境下其他无人机导航和控制任务的控制设计。

英文摘要

One typical application of unmanned aerial vehicles is the intelligence, surveillance, and reconnaissance mission, where the objective is to improve situation awareness through information acquisition. For examples, an efficient way to gather information regarding a target is to deploy UAV in such a way that it orbits around this target at a desired distance. Such a UAV motion is called circumnavigation. The objective of the paper is to design a UAV control algorithm such that this circumnavigation mission is achieved under a GPS-denied environment using range-only measurement. The control algorithm is constructed in two steps. The first step is to design a UAV control algorithm by assuming the availability of both range and range rate measurements, where the associated control input is always bounded. The second step is to further eliminate the use of range rate measurement by using an estimated range rate, obtained via a sliding-mode estimator using range measurement, to replace actual range rate measurement. Such a controller design technique is applicable in the control design of other UAV navigation and control missions under a GPS-denied environment.

1402.0051 2026-06-04 cs.MA cs.RO cs.SY eess.SY 版本更新

Distributed Algorithms for Stochastic Source Seeking with Mobile Robot Networks: Technical Report

分布式算法用于具有移动机器人网络的随机源定位:技术报告

Nikolay A. Atanasov, Jerome Le Ny, George J. Pappas

AI总结 本文提出分布式控制策略用于定位噪声信号源,针对有模型和无模型两种场景,采用随机梯度方法实现机器人网络在局部最大值区域的收敛。

Comments 13 pages (two-column); 3 figures; Manuscript submitted to the ASME Journal on Dynamic Systems, Measurement and Control (JDSMC); In version 2 typos in the text were corrected, the proofs were cleaned up, hyperlinks were added to the bibliography, several clarifications were added to the text, and some statements were made more precise

详情
AI中文摘要

自主机器人网络是监测大规模环境场的有效工具。本文提出分布式控制策略用于定位噪声信号源,该信号可能代表感兴趣的物理量如磁场、热量、无线电信号或化学浓度。我们为两种场景开发了特定算法:一种传感器具有精确的信号形成过程模型,另一种没有信号模型。在无模型场景中,传感器团队跟随信号场的随机梯度。我们的方法是分布式的,对群体几何变形具有鲁棒性,不需要全局定位,并保证传感器会进入场的局部最大值邻域。在有模型场景中,传感器以分布式方式跟随预期测量与源位置之间互信息的随机梯度。性能通过使用机器人传感器网络模拟无线无线电信号源的定位进行演示。

英文摘要

Autonomous robot networks are an effective tool for monitoring large-scale environmental fields. This paper proposes distributed control strategies for localizing the source of a noisy signal, which could represent a physical quantity of interest such as magnetic force, heat, radio signal, or chemical concentration. We develop algorithms specific to two scenarios: one in which the sensors have a precise model of the signal formation process and one in which a signal model is not available. In the model-free scenario, a team of sensors is used to follow a stochastic gradient of the signal field. Our approach is distributed, robust to deformations in the group geometry, does not necessitate global localization, and is guaranteed to lead the sensors to a neighborhood of a local maximum of the field. In the model-based scenario, the sensors follow the stochastic gradient of the mutual information between their expected measurements and the location of the source in a distributed manner. The performance is demonstrated in simulation using a robot sensor network to localize the source of a wireless radio signal.

1310.0063 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Online Approximate Optimal Station Keeping of an Autonomous Underwater Vehicle

自主水下机器人在线近似最优保持策略

Patrick Walters, Warren E. Dixon

AI总结 研究提出基于强化学习的 actor-critic 框架,用于在线近似自主水下机器人六自由度最优保持策略,实现状态和策略的统一有界收敛。

Comments 6 pages

详情
AI中文摘要

本文考虑了对完全驱动的六自由度自主水下机器人的最优保持策略的在线近似。所开发的控制器是两个玩家零和博弈解的近似,其中控制器是最小化玩家,外部扰动是最大化玩家。解通过基于强化学习的 actor-critic 框架进行近似。结果保证了状态和近似策略对最优策略的统一最终有界收敛,无需持续激励条件。

英文摘要

Online approximation of an optimal station keeping strategy for a fully actuated six degrees-of-freedom autonomous underwater vehicle is considered. The developed controller is an approximation of the solution to a two player zero-sum game where the controller is the minimizing player and an external disturbance is the maximizing player. The solution is approximated using a reinforcement learning-based actor-critic framework. The result guarantees uniformly ultimately bounded (UUB) convergence of the states and UUB convergence of the approximated policies to the optimal polices without the requirement of persistence of excitation.

1403.5374 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Transverse Contraction Criteria for Stability of Nonlinear Hybrid Limit Cycles

横向收缩准则用于非线性混合极限环的稳定性

Justin Z. Tang, Ian R. Manchester

AI总结 本文提出基于线性矩阵不等式序列的微分条件,用于证明非线性混合极限环的轨道稳定性,通过凸优化工具如sum-of-squares编程寻找稳定性证书,无需事先知道稳定极限环在状态空间中的精确位置。

Comments Conference submission

详情
AI中文摘要

本文推导了保证非线性混合极限环轨道稳定性的微分条件。这些条件表示为一系列点wise线性矩阵不等式(LMI),使得可以通过凸优化工具如sum-of-squares编程进行稳定性证书的搜索。与传统Lyapunov方法不同,本文提出的横向收缩框架使能够对混合系统进行稳定性证明,而无需事先知道稳定极限环在状态空间中的精确位置。该方法通过动态行走示例进行了说明。

英文摘要

In this paper, we derive differential conditions guaranteeing the orbital stability of nonlinear hybrid limit cycles. These conditions are represented as a series of pointwise linear matrix inequalities (LMI), enabling the search for stability certificates via convex optimization tools such as sum-of-squares programming. Unlike traditional Lyapunov-based methods, the transverse contraction framework developed in this paper enables proof of stability for hybrid systems, without prior knowledge of the exact location of the stable limit cycle in state space. This methodology is illustrated on a dynamic walking example.

1402.5639 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Decentralized Rendezvous of Nonholonomic Robots with Sensing and Connectivity Constraints

非holonomic机器人群体的去中心化会合

Zhen Kan, Justin Klotz, Eduardo L. Pasiliao, John M. Shea, Warren E. Dixon

AI总结 研究针对具有非holonomic约束的轮式机器人,在保持网络连通性和碰撞避免的前提下,实现群体在指定点集的会合。采用去中心化时变控制器基于导航函数框架,利用局部传感反馈实现导航。

Comments 9 pages, 5 figures, submitted to Automatica

详情
AI中文摘要

本文考虑一组具有非holonomic约束的轮式机器人在指定点集实现会合,同时保持网络连通性和碰撞避免。每个机器人存在通信和传感约束,只有部分机器人知晓全局目标,其余机器人需在连通性约束下移动,使知晓目标的机器人引导群体到达目标。机器人还需在会合点附近外避免碰撞。为实现会合控制目标,基于导航函数框架开发了去中心化时变控制器,利用局部传感反馈(包括邻居位置反馈和绝对方位测量)导航机器人,实现导航期间的无线电静默。仿真结果展示了所提出方法的性能。

英文摘要

A group of wheeled robots with nonholonomic constraints is considered to rendezvous at a common specified setpoint with a desired orientation while maintaining network connectivity and ensuring collision avoidance within the robots. Given communication and sensing constraints for each robot, only a subset of the robots are aware or informed of the global destination, and the remaining robots must move within the network connectivity constraint so that the informed robots can guide the group to the goal. The mobile robots are also required to avoid collisions with each other outside a neighborhood of the common rendezvous point. To achieve the rendezvous control objective, decentralized time-varying controllers are developed based on a navigation function framework to steer the robots to perform rendezvous while preserving network connectivity and ensuring collision avoidance. Only local sensing feedback, which includes position feedback from immediate neighbors and absolute orientation measurement, is used to navigate the robots and enables radio silence during navigation. Simulation results demonstrate the performance of the developed approach.

1402.4004 2026-06-04 cs.RO cs.ET cs.SY eess.SY 版本更新

Design of a Hybrid Robot Control System using Memristor-Model and Ant-Inspired Based Information Transfer Protocols

基于忆阻器模型和蚂蚁启发信息传输协议的混合机器人控制系统设计

Ella Gale, Ben de Lacy Costello, Andrew Adamatzky

AI总结 本文提出一种混合机器人控制方法,利用蚂蚁行为启发算法和忆阻器非线性特性,在传感器信息处理受限情况下实现快速有效的情报近似。

Comments Conference

详情
Journal ref
Workshop on Unconventional Approaches to Robotics, Automation and Control (UARACIN), at International Conference on Robotics and Automation (ICRA) 2013, Karlsruhe, Germany, Fr-Ws-09, pgs. 34-36
AI中文摘要

在无法及时处理所有传感器信息的情况下,需要快速且有效的机器人状态近似。本文在此限制下,利用受蚂蚁工蚁放置行为启发的算法和基于忆阻器非线性特性,设计了一种混合控制方法。

英文摘要

It is not always possible for a robot to process all the information from its sensors in a timely manner and thus quick and yet valid approximations of the robot's situation are needed. Here we design hybrid control for a robot within this limit using algorithms inspired by ant worker placement behaviour and based on memristor-based non-linearity.

1402.0289 2026-06-04 cs.CV cs.RO cs.SY eess.SY 版本更新

A Robust Framework for Moving-Object Detection and Vehicular Traffic Density Estimation

一种用于移动物体检测和车辆交通密度估计的稳健框架

Pranam Janney, Glenn Geers

AI总结 本文提出了一种基于纹理度量的移动物体检测方法,具有计算成本低、参数调整少和抗噪声能力强的特点,实验表明其性能优于现有方法,并提出车辆交通密度估计的框架及对比分析。

详情
AI中文摘要

智能机器需要从视频中获取基本信息,如移动物体检测,以推断更高层次的语义信息。本文提出了一种利用纹理度量检测视频中移动物体的方法。该方法计算成本低,参数调整少,且对噪声、光照变化、动态背景和低帧率具有鲁棒性。实验结果表明,所提方法的性能优于现有方法。我们还利用前景物体检测技术提出车辆交通密度估计的框架,并比较了基于前景物体检测的框架与基于经典密度状态建模的框架在车辆交通密度估计中的差异。

英文摘要

Intelligent machines require basic information such as moving-object detection from videos in order to deduce higher-level semantic information. In this paper, we propose a methodology that uses a texture measure to detect moving objects in video. The methodology is computationally inexpensive, requires minimal parameter fine-tuning and also is resilient to noise, illumination changes, dynamic background and low frame rate. Experimental results show that performance of the proposed approach is higher than those of state-of-the-art approaches. We also present a framework for vehicular traffic density estimation using the foreground object detection technique and present a comparison between the foreground object detection-based framework and the classical density state modelling-based framework for vehicular traffic density estimation.

1312.6573 2026-06-04 cs.RO cs.SY eess.SY 版本更新

Trackability with Imprecise Localization

不精确定位下的跟踪性

Kyle Klein, Subhash Suri

AI总结 研究在存在定位误差的情况下,跟踪器如何在同等速度下追击目标,分析最坏情况下的距离增长速率及保持目标距离所需的速度提升,以及障碍物对跟踪性能的影响。

Comments 17 pages, 9 figures

详情
AI中文摘要

在存在定位误差的情况下,研究跟踪器如何在同等速度下追击目标,分析最坏情况下的距离增长速率及保持目标距离所需的速度提升,以及障碍物对跟踪性能的影响。

英文摘要

Imagine a tracking agent $P$ who wants to follow a moving target $Q$ in $d$-dimensional Euclidean space. The tracker has access to a noisy location sensor that reports an estimate $\tilde{Q}(t)$ of the target's true location $Q(t)$ at time $t$, where $||Q(T) - \tilde{Q}(T)||$ represents the sensor's localization error. We study the limits of tracking performance under this kind of sensing imprecision. In particular, we investigate (1) what is $P$'s best strategy to follow $Q$ if both $P$ and $Q$ can move with equal speed, (2) at what rate does the distance $||Q(t) - P(t)||$ grow under worst-case localization noise, (3) if $P$ wants to keep $Q$ within a prescribed distance $L$, how much faster does it need to move, and (4) what is the effect of obstacles on the tracking performance, etc. Under a relative error model of noise, we are able to give upper and lower bounds for the worst-case tracking performance, both with or without obstacles.

1311.4769 2026-06-04 cs.RO cs.SY eess.SY 版本更新

On 'A Kalman Filter-Based Algorithm for IMU-Camera Calibration: Observability Analysis and Performance Evaluation'

关于基于卡尔曼滤波算法的IMU-相机标定:可观测性分析与性能评估

Yuanxin Wu

AI总结 本文分析了IMU-相机标定中可观测性问题,指出原始工作中的可观测性结论存在错误,影响了SLAM系统的发展。

Comments 3 pages. This work was done in 2009. Abstract revised and More refs added in this new version

详情
AI中文摘要

上述工作[1]在IEEE-TR'08中提出了一种扩展卡尔曼滤波器用于校准相机与IMU之间的偏移。作为主要贡献之一,使用李导数进行了局部弱可观测性分析。该开创性论文[1]无疑成为当前SLAM中可观测性研究的基石,许多实际SLAM系统都是基于该论文的可观测性结果开发的,如[2, 3]。然而,本文的主要可观测性结论建立在错误的证明基础上,实际上无法通过该局部可观测性技术获得,这一事实显然多年来未被SLAM社区所注意到。

英文摘要

The above-mentioned work [1] in IEEE-TR'08 presented an extended Kalman filter for calibrating the misalignment between a camera and an IMU. As one of the main contributions, the locally weakly observable analysis was carried out using Lie derivatives. The seminal paper [1] is undoubtedly the cornerstone of current observability work in SLAM and a number of real SLAM systems have been developed on the observability result of this paper, such as [2, 3]. However, the main observability result of this paper [1] is founded on an incorrect proof and actually cannot be acquired using the local observability technique therein, a fact that is apparently not noticed by the SLAM community over a number of years.

1304.3663 2026-06-04 cs.RO cs.MA cs.SY eess.SY 版本更新

Cooperative localization by dual foot-mounted inertial sensors and inter-agent ranging

通过双足安装惯性传感器和机间测距实现协同定位

John-Olof Nilsson, Dave Zachariah, Isaac Skog, Peter Händel

AI总结 本文提出一种部分去中心化的系统架构,利用步进惯性导航和步进死 reckoning 降低计算和通信成本,实现大规模群体的协同定位。

Comments 14 pages

详情
Journal ref
EURASIP Journal on Advances in Signal Processing 2013, 2013:164
AI中文摘要

本文讨论了通过双足安装惯性传感器和机间测距实现协同定位的实施挑战,并回顾了相关工作。系统架构和传感器融合被确定为关键挑战。提出了一种基于步进惯性导航和步进死 reckoning的部分去中心化系统架构,该架构可将计算成本和所需通信带宽降低两个数量级,同时与传统集中实现相比仅产生可忽略的信息损失。这使得联合全局状态估计对于多达编队规模的群体成为可能。此外,基于状态空间变换和边缘化的鲁棒且低成本的传感器融合方法被提出。变换和边缘化用于提供必要的灵活性,以支持所提出的基于采样更新的机间测距和无测距融合。最后,通过仿真和实时系统实现展示了所提实现的特点。

英文摘要

The implementation challenges of cooperative localization by dual foot-mounted inertial sensors and inter-agent ranging are discussed and work on the subject is reviewed. System architecture and sensor fusion are identified as key challenges. A partially decentralized system architecture based on step-wise inertial navigation and step-wise dead reckoning is presented. This architecture is argued to reduce the computational cost and required communication bandwidth by around two orders of magnitude while only giving negligible information loss in comparison with a naive centralized implementation. This makes a joint global state estimation feasible for up to a platoon-sized group of agents. Furthermore, robust and low-cost sensor fusion for the considered setup, based on state space transformation and marginalization, is presented. The transformation and marginalization are used to give the necessary flexibility for presented sampling based updates for the inter-agent ranging and ranging free fusion of the two feet of an individual agent. Finally, characteristics of the suggested implementation are demonstrated with simulations and a real-time system implementation.

1311.4625 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Control Contraction Metrics and Universal Stabilizability

控制收缩度量与通用可控性

Ian R. Manchester, Jean-Jacques E. Slotine

AI总结 本文提出通用可控性概念,通过求解点wise线性矩阵不等式得到控制收缩度量,为非线性系统全局稳定提供了充分条件,并与控制Lyapunov函数理论有联系。

Comments Conference submission

详情
AI中文摘要

本文引入了通用可控性的概念:非线性系统每一个解都能全局稳定化的条件。我们给出了以存在控制收缩度量为充分条件的充分条件,该度量可通过求解点wise线性矩阵不等式得到。对近似最优控制的扩展是直接的。我们给出的条件对于线性系统和某些非线性系统是必要且充分的,并与控制Lyapunov函数理论有有趣的联系。

英文摘要

In this paper we introduce the concept of universal stabilizability: the condition that every solution of a nonlinear system can be globally stabilized. We give sufficient conditions in terms of the existence of a control contraction metric, which can be found by solving a pointwise linear matrix inequality. Extensions to approximate optimal control are straightforward. The conditions we give are necessary and sufficient for linear systems and certain classes of nonlinear systems, and have interesting connections to the theory of control Lyapunov functions.

1311.4527 2026-06-04 cs.AI cs.DC cs.MA cs.RO cs.SY eess.SY 版本更新

A message-passing algorithm for multi-agent trajectory planning

多智能体轨迹规划的消息传递算法

Jose Bento, Nate Derbinsky, Javier Alonso-Mora, Jonathan Yedidia

AI总结 本文提出基于改进交替方向乘子法的新型算法,用于计算多智能体的无碰撞全局轨迹,具有自然并行化和易整合不同成本函数的优点。

Comments In Advances in Neural Information Processing Systems (NIPS), 2013. Demo video available at http://www.youtube.com/watch?v=yuGCkVT8Bew

详情
AI中文摘要

我们描述了一种新的方法,用于计算具有指定初始和最终配置的p个智能体的无碰撞全局轨迹,基于改进的交替方向乘子法(ADMM)。与现有方法相比,我们的方法具有自然并行化的能力,并且只需少量调整即可整合不同的成本函数。我们应用我们的方法到经典的具有挑战性的实例中,并观察到其计算需求在几个成本函数中随着p良好扩展。我们还展示了我们的算法的一种特化形式可用于通过在速度空间中解决联合优化问题进行局部运动规划。

英文摘要

We describe a novel approach for computing collision-free \emph{global} trajectories for $p$ agents with specified initial and final configurations, based on an improved version of the alternating direction method of multipliers (ADMM). Compared with existing methods, our approach is naturally parallelizable and allows for incorporating different cost functionals with only minor adjustments. We apply our method to classical challenging instances and observe that its computational requirements scale well with $p$ for several cost functionals. We also show that a specialization of our algorithm can be used for {\em local} motion planning by solving the problem of joint optimization in velocity space.

1311.4419 2026-06-04 eess.SY cs.RO cs.SY physics.bio-ph 版本更新

Perception and Steering Control in Paired Bat Flight

群体飞行中的感知与转向控制

Zhaodan Kong, Kayhan Ozcimder, Nathan W. Fuller, John Baillieul

AI总结 本文研究群体飞行中领导者与追随者的行为协调,提出基于虚拟目镜的转向法则,结合现有视觉转向法则生成轨迹,符合蝙蝠飞行数据统计特征。

Comments Submitted to the 19th World Congress of the International Federation of Automatic Control (IFAC)

详情
AI中文摘要

群体中的动物需要协调对环境特征和彼此的反应以安全移动。本文扩展了之前关于德克萨斯州约翰逊城附近Myotis velifer飞行模式的研究。这些蝙蝠每天傍晚以小群体形式从洞穴中出没,通常不超过三到四只,为研究领导者-追随者行为提供了理想对象。通过分析M. velifer群体的飞行路径,数据表明追随者蝙蝠的飞行行为受领导者蝙蝠影响,这种影响无法用现有追逐定律如经典追逐、恒定方位和运动伪装法充分解释。因此,我们提出了一种基于虚拟目镜的转向法则,该法则用于捕捉领导者-追随者对的几何配置。结果表明,该法则可与之前提出的视觉赋能转向法则结合,生成轨迹,其统计特征与数据集中蝙蝠的轨迹相符。结果表明,蝙蝠在导航时使用了环境和邻居的感知信息。

英文摘要

Animals within groups need to coordinate their reactions to perceived environmental features and to each other in order to safely move from one point to another. This paper extends our previously published work on the flight patterns of Myotis velifer that have been observed in a habitat near Johnson City, Texas. Each evening, these bats emerge from a cave in sequences of small groups that typically contain no more than three or four individuals, and they thus provide ideal subjects for studying leader-follower behaviors. By analyzing the flight paths of a group of M. velifer, the data show that the flight behavior of a follower bat is influenced by the flight behavior of a leader bat in a way that is not well explained by existing pursuit laws, such as classical pursuit, constant bearing and motion camouflage. Thus we propose an alternative steering law based on virtual loom, a concept we introduce to capture the geometrical configuration of the leader-follower pair. It is shown that this law may be integrated with our previously proposed vision-enabled steering laws to synthesize trajectories, the statistics of which fit with those of the bats in our data set. The results suggest that bats use perceived information of both the environment and their neighbors for navigation.

1311.4296 2026-06-04 cs.LG cs.NA cs.RO math.NA math.OC 版本更新

Reflection methods for user-friendly submodular optimization

用于用户友好的子模优化的反射方法

Stefanie Jegelka, Francis Bach, Suvrit Sra

AI总结 本文提出一种高效的子模函数优化方法,通过连续最佳逼近问题的反射序列求解,实现连续和离散问题的双重解决,应用于图像分割任务。

Comments Neural Information Processing Systems (NIPS), États-Unis (2013)

详情
AI中文摘要

最近研究表明,子模性自然捕捉了机器学习、信号处理和计算机视觉中广泛出现的概念。因此,需要高效的子模函数优化方法,尤其是最小化问题。尽管一般子模最小化具有挑战性,我们提出了一种新方法,利用子模函数的现有分解性。与以往方法不同,该方法既非近似也非不切实际,也不需要复杂的参数调优。此外,它易于实现和并行化。我们方法的关键组成部分是将离散子模最小化问题转化为连续最佳逼近问题,通过一系列反射求解,并可通过阈值处理获得最优离散解。该方法解决了连续和离散问题,因此在学习、推断和重建中有应用。在实验中,我们展示了该方法在两个图像分割任务中的优势。

英文摘要

Recently, it has become evident that submodularity naturally captures widely occurring concepts in machine learning, signal processing and computer vision. Consequently, there is need for efficient optimization procedures for submodular functions, especially for minimization problems. While general submodular minimization is challenging, we propose a new method that exploits existing decomposability of submodular functions. In contrast to previous approaches, our method is neither approximate, nor impractical, nor does it need any cumbersome parameter tuning. Moreover, it is easy to implement and parallelize. A key component of our method is a formulation of the discrete submodular minimization problem as a continuous best approximation problem that is solved through a sequence of reflections, and its solution can be easily thresholded to obtain an optimal discrete solution. This method solves both the continuous and discrete formulations of the problem, and therefore has applications in learning, inference, and reconstruction. In our experiments, we illustrate the benefits of our method on two image segmentation tasks.

1311.2796 2026-06-04 math.OC cs.RO cs.SY eess.SY 版本更新

Mixed Human-Robot Team Surveillance

混合人机团队监视

Vaibhav Srivastava, Amit Surana, Miguel P. Eckstein, Francesco Bullo

AI总结 本文研究了混合人机团队在监视任务中的系统理论设计,提出人机决策模型及高效注意力分配策略,通过异常检测算法和随机车辆路由策略实现高效监视。

Comments 16 pages

详情
AI中文摘要

我们从系统理论的角度研究混合人机团队设计,以监视任务为背景。混合团队设计的三个关键耦合组件是(i)人操作员的策略,(ii)补偿错误人决策的策略,(iii)控制自动机的策略。本文调研了人决策的要素,包括证据聚合、态势感知、疲劳和记忆效应。我们将这些要素的模型结合起来,开发出一个单一的、连贯的人决策模型,用于双替代选择任务。我们利用该模型设计高效的注意力分配策略给人操作员。我们提出了一种异常检测算法,利用操作员可能的错误决策来确定被监视区域中的异常区域。最后,我们提出了一种随机车辆路由策略,以高概率监视异常区域。我们的混合团队设计依赖于确定性等价滚动时域控制框架。

英文摘要

We study the mixed human-robot team design in a system theoretic setting using the context of a surveillance mission. The three key coupled components of a mixed team design are (i) policies for the human operator, (ii) policies to account for erroneous human decisions, and (iii) policies to control the automaton. In this paper, we survey elements of human decision-making, including evidence aggregation, situational awareness, fatigue, and memory effects. We bring together the models for these elements in human decision-making to develop a single coherent model for human decision-making in a two-alternative choice task. We utilize the developed model to design efficient attention allocation policies for the human operator. We propose an anomaly detection algorithm that utilizes potentially erroneous decision by the operator to ascertain an anomalous region among the set of regions surveilled. Finally, we propose a stochastic vehicle routing policy that surveils an anomalous region with high probability. Our mixed team design relies on the certainty-equivalent receding-horizon control framework.

1311.1761 2026-06-04 cs.LG cs.AI cs.NE cs.RO cs.SY eess.SY 版本更新

Exploring Deep and Recurrent Architectures for Optimal Control

探索深度和循环架构以实现最优控制

Sergey Levine

AI总结 本文探讨了将深度和循环神经网络应用于连续高维运动控制任务,通过强化学习算法训练控制器,比较不同架构的性能,并讨论深度学习在最优控制中的应用前景。

Comments Appears in the Neural Information Processing Systems (NIPS 2013) Workshop on Deep Learning

详情
AI中文摘要

复杂的多层神经网络在多个监督任务中取得了最先进的结果。然而,此类多层网络在控制领域的成功应用迄今为止主要局限于控制流水线的感知部分。本文探讨了将深度和循环神经网络应用于连续、高维运动任务,其中网络用于表示控制策略,将系统状态(由关节角度表示)直接映射到每个关节的扭矩。通过使用最近的强化学习算法guided policy search,可以成功训练具有数千参数的神经网络控制器,从而比较各种架构。我们讨论了运动控制任务与先前监督感知任务的区别,展示了比较各种架构的实验结果,并讨论了将深度学习技术应用于最优控制问题的未来方向。

英文摘要

Sophisticated multilayer neural networks have achieved state of the art results on multiple supervised tasks. However, successful applications of such multilayer networks to control have so far been limited largely to the perception portion of the control pipeline. In this paper, we explore the application of deep and recurrent neural networks to a continuous, high-dimensional locomotion task, where the network is used to represent a control policy that maps the state of the system (represented by joint angles) directly to the torques at each joint. By using a recent reinforcement learning algorithm called guided policy search, we can successfully train neural network controllers with thousands of parameters, allowing us to compare a variety of architectures. We discuss the differences between the locomotion control task and previous supervised perception tasks, present experimental results comparing various architectures, and discuss future directions in the application of techniques from deep learning to the problem of optimal control.

1310.7568 2026-06-04 q-bio.QM cs.RO cs.SY eess.SY 版本更新

Interlimb neural connection is not required for gait transition in quadruped locomotion

四肢神经连接在四足运动中步态转换中并非必需

Atsushi Tero, Masakazu Akiyama, Dai Owaki, Takeshi Kano, Akio Ishiguro, Ryo Kobayashi

AI总结 本文提出一种耦合振荡器模型,通过身体物理相互作用生成步态模式,发现改变肢体运动速度可自发产生类似真实四足动物的步态转换,且改变物理特性可使 pacing 替代 trotting,模型有助于理解生物运动原理及设计节能腿式机器人。

Comments 6 pages, 2figures

详情
AI中文摘要

四足动物会根据运动速度自发切换多种步态模式(如行走、跑步、踱步、疾跑)。长期以来,这些步态模式的生成机制存在争议。本文提出一种耦合振荡器模型,该模型与身体的物理相互作用耦合。研究结果表明,当根据肢体运动速度改变共振部分时,步态模式会自发地以类似于真实四足动物的方式切换为行走/跑步/踱步/跳跃。此外,通过改变物理特性,我们观察到踱步会取代跑步。除了有助于理解生物运动原理外,本文提出的耦合振荡器模型还预期能促进设计能够自主选择节能步态并进行转换的腿式机器人。

英文摘要

Quadrupeds transition spontaneously to various gait patterns (e.g., walk, trot, pace, gallop) in response to the locomotion speed. The generation of these gait patterns has been the subject of debate for a long time. We propose a coupled oscillator model that is coupled with the physical interactions of the body. The results of this study showed that the gait pattern transitions spontaneously to walking/trotting/pacing/bounding in manner similar to that of real quadruped animals when the resonating portion of the body is changed according to the speed of leg movement. We also observed that pacing is expressed exclusively instead of trotting by changing the physical characteristics. In addition to leading to understanding of the principles of locomotion in living things, the coupled oscillator model proposed in this study is expected to lead to the creation of a legged robot that can select an energy-efficient gait and transition to it spontaneously.

1310.7062 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Real-Time Planning with Primitives for Dynamic Walking over Uneven Terrain

动态不平地形下基于原始体的实时规划

Ian R. Manchester, Jack Umenberger

AI总结 本文提出一种基于有限运动原语的实时运动规划算法,利用虚拟holonomic约束构建闭式解和二叉搜索树,提升动态步行规划效率。

Comments Conference submission

详情
AI中文摘要

我们提出了一种使用有限运动原语的递推视界运动规划算法,用于欠驱动动态步行在不平地形上的运动规划。运动原语被定义为虚拟holonomic约束,利用欠驱动机械系统在虚拟约束下的特殊结构,构建闭式解和特殊的二叉搜索树,大幅加速运动规划。我们提出一种贪心深度优先搜索,并讨论使用能量基启发式方法的改进。所提出的算法可以在数秒内为compass-gait步行器和平面7自由度/五连杆步行器规划多个步态。

英文摘要

We present an algorithm for receding-horizon motion planning using a finite family of motion primitives for underactuated dynamic walking over uneven terrain. The motion primitives are defined as virtual holonomic constraints, and the special structure of underactuated mechanical systems operating subject to virtual constraints is used to construct closed-form solutions and a special binary search tree that dramatically speed up motion planning. We propose a greedy depth-first search and discuss improvement using energy-based heuristics. The resulting algorithm can plan several footsteps ahead in a fraction of a second for both the compass-gait walker and a planar 7-Degree-of-freedom/five-link walker.

1309.5401 2026-06-04 cs.RO cs.CV cs.SY eess.SY 版本更新

Nonmyopic View Planning for Active Object Detection

非我的视图规划用于主动物体检测

Nikolay Atanasov, Bharath Sankaran, Jerome Le Ny, George J. Pappas, Kostas Daniilidis

AI总结 本文提出通过控制移动深度相机视角进行主动物体检测,通过规划视图序列平衡移动能耗与识别正确假设的概率,实验表明优于贪心方法。

Comments 12 pages (two-column); 7 figures; 2 tables; Manuscript submitted to the IEEE Transactions on Robotics (TRO)

详情
AI中文摘要

计算机视觉中的核心问题之一是语义重要物体的检测以及其姿态的估计。大多数物体检测工作基于单张图像处理,其性能受限于遮挡和外观和几何的模糊性。本文提出了一种主动检测方法,通过控制移动深度相机的视角进行物体检测。当初始静态检测阶段识别出感兴趣的物体时,会针对其类别和方向提出多个假设。传感器随后规划一系列视角,平衡移动所消耗的能量与识别正确假设的概率。我们提出了一个包含传感器移动性的主动假设检验问题,并使用基于点的近似POMDP算法进行求解。通过仿真和实际世界实验验证了我们的方法的有效性,结果表明我们的方法优于广泛使用的贪心视角选择方法,并在静态物体检测上提供了显著改进。

英文摘要

One of the central problems in computer vision is the detection of semantically important objects and the estimation of their pose. Most of the work in object detection has been based on single image processing and its performance is limited by occlusions and ambiguity in appearance and geometry. This paper proposes an active approach to object detection by controlling the point of view of a mobile depth camera. When an initial static detection phase identifies an object of interest, several hypotheses are made about its class and orientation. The sensor then plans a sequence of views, which balances the amount of energy used to move with the chance of identifying the correct hypothesis. We formulate an active hypothesis testing problem, which includes sensor mobility, and solve it using a point-based approximate POMDP algorithm. The validity of our approach is verified through simulation and real-world experiments with the PR2 robot. The results suggest that our approach outperforms the widely-used greedy view point selection and provides a significant improvement over static object detection.

1309.5390 2026-06-04 eess.SY cs.RO cs.SY math.DS math.OC 版本更新

Information Acquisition with Sensing Robots: Algorithms and Error Bounds

利用传感机器人获取信息:算法与误差界

Nikolay Atanasov, Jerome Le Ny, Kostas Daniilidis, George J. Pappas

AI总结 本文提出一种非贪心算法,考虑传感器动态,提供子最优保证,优于传统贪心方法,适用于非线性传感模型。

Comments 9 pages (two-column); 2 figures; Manuscript submitted to the 2014 IEEE International Conference on Robotics and Automation

详情
AI中文摘要

利用可配置传感系统的功能需要解决困难的信息收集问题。对于无内部状态的传感系统,存在近优解方法。然而,当优化移动传感器轨迹时,解决方案通常是贪心的,很少提供性能保证。值得注意的是,在线性高斯假设下,问题变为确定性问题,可离线求解。基于子模性的方法通常忽略传感器动态,贪心选择环境中的信息位置。本文提出一种非贪心算法,不依赖子模性,考虑传感器动态。我们的方法在理论上优于广泛使用的贪心方法。结合线性化和模型预测控制,可用于生成适应性策略。应用包括气体浓度映射和目标跟踪。

英文摘要

Utilizing the capabilities of configurable sensing systems requires addressing difficult information gathering problems. Near-optimal approaches exist for sensing systems without internal states. However, when it comes to optimizing the trajectories of mobile sensors the solutions are often greedy and rarely provide performance guarantees. Notably, under linear Gaussian assumptions, the problem becomes deterministic and can be solved off-line. Approaches based on submodularity have been applied by ignoring the sensor dynamics and greedily selecting informative locations in the environment. This paper presents a non-greedy algorithm with suboptimality guarantees, which does not rely on submodularity and takes the sensor dynamics into account. Our method performs provably better than the widely used greedy one. Coupled with linearization and model predictive control, it can be used to generate adaptive policies for mobile sensors with non-linear sensing models. Applications in gas concentration mapping and target tracking are presented.

1309.1029 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Sensor Setups for State and Wind Estimation for Airborne Wind Energy Converters

用于空载风能转换器状态和风速估计的传感器配置

Maximilian Ranneberg

AI总结 本文提出了一种无迹卡尔曼滤波器,用于空载风能转换器的空气动力学、状态和风况估计,通过不同传感器配置验证,展示了低成本高可靠性的可行性。

详情
AI中文摘要

本文提出了一种联合状态和参数估计的无迹卡尔曼滤波器,用于空载风能转换器的空气动力学、状态和风况估计。所提出的估计器依赖于不同的测量配置。由于风能转换器的严格经济约束,传感器配置被选择为最低成本且可靠性问题最小。使用高保真系统模型的仿真数据和飞行数据实验测试,以及通过激光雷达系统获取的海拔风速测量数据进行验证。数据是在德国勃兰登堡进行的EnerKíte EK30空载风能转换器测试飞行中获得的。即使使用最简单的配置,也实现了可行的精度,展示了空载传感器的优势。此外,结果鼓励进一步研究所获得的风速估计用于选址评估。

英文摘要

An unscented Kalman filter with joint state and parameter estimation is proposed for aerodynamics, states and wind conditions for airborne wind energy converters. The proposed estimator relies on different measurement setups. Due to the strict economic constraints of wind energy converters, the sensor setups are chosen with minimal cost and reliability issues in mind. Simulation data with a high fidelity system model and experimental tests using flight data, together with wind measurements obtained using a lidar system for altitude wind measurements, are used for validation. The data was obtained during test flights of the EnerKíte EK30, an airborne wind energy converter currently in research operation in Brandenburg, Germany. Feasible accuracies were achieved even with the simplest of setups and illustrate the gain achievable by airborne sensors. Additionally, the results encourage further research into use of the obtained wind estimates for site assessment.

1309.1788 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Web Standards as Standard Pieces in Robotics

网络标准作为机器人中的标准组件

Sekou L. Remy

AI总结 本文探讨了网络技术在机器人控制中的应用,指出尽管存在性能担忧,但网络技术能有效提升系统集成,并展示了通过Web服务器和浏览器实现开环和闭环控制的实验结果。

Comments 9th Annual IEEE International Conference on Automation Science and Engineering (IEEE CASE 2013)

详情
AI中文摘要

现代机器人技术常常利用网络技术来应对设计和操作的复杂性。许多网络技术已被正式化为标准,但机器人和控制系统领域的人士往往因担心网络速度或不确定性而回避这些标准。本文认为,尽管网络技术不适用于所有控制场景,但不应被完全摒弃,因为它们能为系统集成提供关键帮助。网络技术在过去十年中取得了显著进步。我们展示了使用Web服务器在不同网络拓扑上实现开环和闭环控制(3Hz至1kHz)的应用细节。此外,我们还考虑了使用Web浏览器实现植物控制的影响。实验结果证实,网络技术可用于有意义的控制,并突显了限制其适用性的设计选择。

英文摘要

Modern robotics often involves the use of web technologies as a means to cope with the complexity of design and operation. Many of these technologies have been formalized into standards, which are often avoided by those in robotics and controls because of a sometimes warranted fear that "the web" is too slow, or too uncertain for meaningful control applications. In this work we argue that while web technologies may not be applicable for all control, they should not be dismissed outright because they can provide critical help with system integration. Web technologies have also advanced significantly over the past decade. We present the details of an application of a web server to perform open and close-loop control (between 3Hz and 1kHz) over a variety of different network topologies. In our study we also consider the impact of a web browser to implement the control of the plant. Our results confirm that meaningful control can be performed using web technologies, and also highlight design choices that can limit their applicability.

1308.6250 2026-06-04 eess.SY cs.RO cs.SY math.OC 版本更新

Circumnavigation of an Unknown Target Using UAVs with Range and Range Rate Measurements

利用UAV的测距和测距率测量绕未知目标 circumnavigation

Yongcan Cao, Jonathan Muse, David Casbeer, Derek Kingston

AI总结 本文提出两种控制算法,使UAV能绕未知目标飞行。算法通过测距和测距率测量,使UAV在轨道外或在轨道上时趋近预定轨道切线,在轨道内时施零控制。两种算法在平滑性和饱和性上有所区别,通过Lyapunov函数设计证明算法能实现任意初始状态下的预定轨道。

Comments To appear in IEEE Conference on Decision and Control, 2013

详情
AI中文摘要

本文提出两种控制算法,使无人机能够利用测距和测距率(即距离的导数)测量绕未知目标飞行。给定预定轨道半径,两种算法(i)使无人机在轨道外或在轨道上时趋近预定轨道切线,(ii)在无人机处于预定轨道内时施零控制输入。两种算法的区别在于,第一种算法是平滑且非饱和的,而第二种算法是非平滑且饱和的。通过分析无人机相对于目标的方位角属性,并通过适当设计Lyapunov函数,证明两种算法对于任意初始状态都能产生所需的轨道。三个示例作为概念验证提供。

英文摘要

This paper presents two control algorithms enabling a UAV to circumnavigate an unknown target using range and range rate (i.e., the derivative of range) measurements. Given a prescribed orbit radius, both control algorithms (i) tend to drive the UAV toward the tangent of prescribed orbit when the UAV is outside or on the orbit, and (ii) apply zero control input if the UAV is inside the desired orbit. The algorithms differ in that, the first algorithm is smooth and unsaturated while the second algorithm is non-smooth and saturated. By analyzing properties associated with the bearing angle of the UAV relative to the target and through proper design of Lyapunov functions, it is shown that both algorithms produce the desired orbit for an arbitrary initial state. Three examples are provided as a proof of concept.

1308.3015 2026-06-04 cs.RO cs.SY eess.SY stat.CO stat.ME 版本更新

On Generalized Bayesian Data Fusion with Complex Models in Large Scale Networks

在大规模网络中使用复杂模型的广义贝叶斯数据融合

Nisar Ahmed, Tsung-Lin Yang, Mark Campbell

AI总结 本文提出新的广义贝叶斯分布式数据融合算法,用于处理动态网络拓扑和复杂信念模型,通过混合pdf和条件因子提升多机器人目标搜索中的融合效果。

Comments Revised version of paper submitted to 2013 Workshop on Wireless Intelligent Sensor Networks (WISeNET 2013) at Duke University, June 5, 2013

详情
AI中文摘要

近年来,通信、移动计算和人工智能的进步大大扩展了智能分布式传感器网络的应用空间。这反过来推动了开发广义贝叶斯分布式数据融合(DDF)算法,以在自主代理之间实现鲁棒且高效的资源共享,使用概率信念模型。然而,DDF在需要使用动态/即需网络拓扑和复杂信念模型(如高斯混合或混合贝叶斯网络)的一般现实应用中显著难以实施。为了解决这些问题,我们首先讨论了关于精确DDF和保守近似DDF的一些新关键数学见解。这些见解随后被用来开发基于混合pdf和条件因子的新型广义DDF算法。受多机器人目标搜索启发的数值示例显示,我们的方法导致显著更好的融合结果,因此有潜力增强传感器网络中的分布式智能推理。

英文摘要

Recent advances in communications, mobile computing, and artificial intelligence have greatly expanded the application space of intelligent distributed sensor networks. This in turn motivates the development of generalized Bayesian decentralized data fusion (DDF) algorithms for robust and efficient information sharing among autonomous agents using probabilistic belief models. However, DDF is significantly challenging to implement for general real-world applications requiring the use of dynamic/ad hoc network topologies and complex belief models, such as Gaussian mixtures or hybrid Bayesian networks. To tackle these issues, we first discuss some new key mathematical insights about exact DDF and conservative approximations to DDF. These insights are then used to develop novel generalized DDF algorithms for complex beliefs based on mixture pdfs and conditional factors. Numerical examples motivated by multi-robot target search demonstrate that our methods lead to significantly better fusion results, and thus have great potential to enhance distributed intelligent reasoning in sensor networks.

1308.2923 2026-06-04 cs.NI cs.RO cs.SY eess.SY math.OC 版本更新

Robotic Message Ferrying for Wireless Networks using Coarse-Grained Backpressure Control

用于无线网络的机器人信息中继:基于粗粒度背压控制

Shangxing Wang, Andrea Gasparri, Bhaskar Krishnamachari

AI总结 本文研究了机器人在无线网络中中继信息的容量区域,提出粗粒度背压中继算法,通过优化匹配实现稳定传输,证明在可控移动下可实现常数容量并保证有限延迟。

详情
AI中文摘要

我们提出了机器人在静态放置的源-汇对之间无线通信中中继信息的问题。我们首先分析了在理想和现实条件下的容量区域。我们表明,如果机器人数量与源-汇对数量成比例增长,则网络容量按Θ(1)缩放,类似于Grossglauser和Tse对无控制移动的先前结果;然而,与该先前结果不同,我们发现通过可控移动,可以实现常数容量缩放并保证有限延迟。然后我们考虑了未知到达率的情况,提出了粗粒度背压信息中继算法(CBMF)。在CBMF中,机器人每经过一个周期与源和汇进行匹配,以最大化基于队列差的权重。匹配控制每个机器人的运动和传输:如果机器人匹配到源,它会向该源移动并收集数据;如果匹配到汇,它会向该汇移动并传输数据。我们通过分析和模拟展示了CBMF稳定网络的条件。我们证明,随着调度持续时间和机器人速度的增加,该策略的最大可实现稳定吞吐量趋于理想容量。

英文摘要

We formulate the problem of robots ferrying messages between statically-placed source and sink pairs that they can communicate with wirelessly. We first analyze the capacity region for this problem under both ideal (arbitrarily high velocity, long scheduling periods) and realistic conditions. We indicate how robots could be scheduled optimally to satisfy any arrival rate in the capacity region, given prior knowledge about arrival rates. We find that if the number of robots allocated grows proportionally with the number of source-sink pairs, then the capacity of the network scales as $Θ(1)$, similar to what was shown previously by Grossglauser and Tse for uncontrolled mobility; however, in contrast to that prior result, we also find that with controlled mobility this constant capacity scaling can be obtained while ensuring finite delay. We then consider the setting where the arrival rates are unknown and present a coarse-grained backpressure message ferrying algorithm (CBMF) for it. In CBMF, the robots are matched to sources and sinks once every epoch to maximize a queue-differential-based weight. The matching controls both motion and transmission for each robot: if a robot is matched to a source, it moves towards that source and collects data from it; and if it is matched to a sink, it moves towards that sink and transmits data to it. We show through analysis and simulations the conditions under which CBMF can stabilize the network. We show that the maximum achievable stable throughput with this policy tends to the ideal capacity as the schedule duration and robot velocity increase.

1305.7484 2026-06-04 cs.RO cs.SY eess.SY math.OC 版本更新

Technical Report: Convex Optimization of Nonlinear Feedback Controllers via Occupation Measures

技术报告:通过占据措施进行非线性反馈控制器的凸优化

Anirudha Majumdar, Ram Vasudevan, Mark M. Tobenkin, Russ Tedrake

AI总结 本文提出了一种设计多项式系统反馈控制器的方法,通过最大化时间限制的反向可达集大小,利用占据措施将合成问题转化为无限维线性规划,并通过半正定规划进行有限维近似。方法具有凸性且无需初始化。

详情
AI中文摘要

本文提出了一种设计多项式系统反馈控制器的方法,通过最大化时间限制的反向可达集(BRS)的大小。我们利用占据措施将合成问题转化为无限维线性规划(LP),并以半正定规划(SDPs)形式提供该LP的有限维近似。每个SDP的解产生一个多项式控制策略和BRS的最大可达到大小的外近似。与传统基于Lyapunov的方法不同,我们的方法是凸的,且不需要任何初始化。所得到的时间变化控制器和近似可达集适合用于轨迹库或反馈运动规划算法中。我们在五个非线性系统上展示了该方法的有效性和可扩展性。

英文摘要

In this paper, we present an approach for designing feedback controllers for polynomial systems that maximize the size of the time-limited backwards reachable set (BRS). We rely on the notion of occupation measures to pose the synthesis problem as an infinite dimensional linear program (LP) and provide finite dimensional approximations of this LP in terms of semidefinite programs (SDPs). The solution to each SDP yields a polynomial control policy and an outer approximation of the largest achievable BRS. In contrast to traditional Lyapunov based approaches which are non-convex and require feasible initialization, our approach is convex and does not require any form of initialization. The resulting time-varying controllers and approximated reachable sets are well-suited for use in a trajectory library or feedback motion planning algorithm. We demonstrate the efficacy and scalability of our approach on five nonlinear systems.

1305.5025 2026-06-04 cs.RO cs.CE cs.NA math.NA 版本更新

A Nonlinear Constrained Optimization Framework for Comfortable and Customizable Motion Planning of Nonholonomic Mobile Robots - Part II

一种非线性约束优化框架用于非完整移动机器人舒适且可定制的运动规划——第二部分

Shilpa Gulati, Chetan Jhurani, Benjamin Kuipers

AI总结 本文提出了一种非线性约束优化框架,用于非完整移动机器人的舒适和可定制运动规划,通过有限元方法离散化无限维优化问题,并讨论了如何处理局部最优解和高质量初始猜测。

详情
AI中文摘要

在本系列论文中,我们提出了一种运动规划框架,用于规划非完整移动机器人的舒适且可定制的运动,如智能轮椅和自动驾驶汽车。在第一部分中,我们建立了该框架的数学基础,将运动不适建模为加权成本泛函,并将舒适运动规划定义为非线性约束优化问题,即在适当的边界条件和约束下计算最小化不适的轨迹。在本文中,我们使用符合有限元对无限维优化问题进行离散化。我们描述了形状函数以处理不同类型的边界条件以及未知数的选择以获得稀疏的Hessian矩阵。我们还详细描述了任何轨迹计算问题可以有无限多个局部最优解以及我们的处理方法。此外,由于我们有一个非线性且受约束的问题,计算高质量的初始猜测对于高效求解至关重要。我们展示了如何计算这些初始猜测。

英文摘要

In this series of papers, we present a motion planning framework for planning comfortable and customizable motion of nonholonomic mobile robots such as intelligent wheelchairs and autonomous cars. In Part I, we presented the mathematical foundation of our framework, where we model motion discomfort as a weighted cost functional and define comfortable motion planning as a nonlinear constrained optimization problem of computing trajectories that minimize this discomfort given the appropriate boundary conditions and constraints. In this paper, we discretize the infinite-dimensional optimization problem using conforming finite elements. We describe shape functions to handle different kinds of boundary conditions and the choice of unknowns to obtain a sparse Hessian matrix. We also describe in detail how any trajectory computation problem can have infinitely many locally optimal solutions and our method of handling them. Additionally, since we have a nonlinear and constrained problem, computation of high quality initial guesses is crucial for efficient solution. We show how to compute them.

1305.5024 2026-06-04 cs.RO cs.CE cs.NA math.NA 版本更新

A Nonlinear Constrained Optimization Framework for Comfortable and Customizable Motion Planning of Nonholonomic Mobile Robots - Part I

一种非线性约束优化框架用于非完整移动机器人舒适且可定制的运动规划——第一部分

Shilpa Gulati, Chetan Jhurani, Benjamin Kuipers

AI总结 本文提出非完整移动机器人舒适可定制运动规划的非线性约束优化框架,基于轨迹舒适性建模,结合有限元离散化方法解决无限维优化问题。

详情
AI中文摘要

在本文中,我们提出了一种用于非完整移动机器人舒适且可定制运动规划的非线性约束优化框架。我们识别了轨迹应具备的舒适性属性,并将运动不适建模为加权成本函数,定义舒适运动规划为在适当边界条件和约束下计算最小化不适的轨迹的非线性约束优化问题。该优化问题为无限维问题,我们使用符合有限元进行离散化。我们还概述了不同用户如何通过定制运动来实现个人舒适的方法。现有文献中,我们首次将非完整移动机器人的运动规划作为非线性优化问题进行综合建模,包括边界条件分析、轨迹连续性要求、动态约束、障碍物规避约束以及鲁棒数值实现。本文介绍了运动规划框架的数学基础,并提出了完整的非线性约束优化问题。我们简要描述了使用有限元进行离散化的方法以及计算优化问题初始猜测的过程。上述两个细节将在系列的第二部分中详细阐述。

英文摘要

In this series of papers, we present a motion planning framework for planning comfortable and customizable motion of nonholonomic mobile robots such as intelligent wheelchairs and autonomous cars. In this first one we present the mathematical foundation of our framework. The motion of a mobile robot that transports a human should be comfortable and customizable. We identify several properties that a trajectory must have for comfort. We model motion discomfort as a weighted cost functional and define comfortable motion planning as a nonlinear constrained optimization problem of computing trajectories that minimize this discomfort given the appropriate boundary conditions and constraints. The optimization problem is infinite-dimensional and we discretize it using conforming finite elements. We also outline a method by which different users may customize the motion to achieve personal comfort. There exists significant past work in kinodynamic motion planning, to the best of our knowledge, our work is the first comprehensive formulation of kinodynamic motion planning for a nonholonomic mobile robot as a nonlinear optimization problem that includes all of the following - a careful analysis of boundary conditions, continuity requirements on trajectory, dynamic constraints, obstacle avoidance constraints, and a robust numerical implementation. In this paper, we present the mathematical foundation of the motion planning framework and formulate the full nonlinear constrained optimization problem. We describe, in brief, the discretization method using finite elements and the process of computing initial guesses for the optimization problem. Details of the above two are presented in Part II of the series.

1304.8019 2026-06-04 eess.SY cs.RO cs.SY 版本更新

Recursive Estimation of Orientation Based on the Bingham Distribution

基于Bingham分布的方位递归估计

Gerhard Kurz, Igor Gilitschenski, Simon Julier, Uwe D. Hanebeck

AI总结 本文提出基于Bingham分布的二维递归滤波器,用于解决具有180度对称性的圆形滤波问题,适用于实时应用,并可扩展至四元数以跟踪三维方位。

详情
AI中文摘要

方向估计是许多跟踪应用中的常见问题。传统滤波器如卡尔曼滤波因未考虑问题的周期性而表现不佳。本文提出一种基于Bingham分布的递归滤波器,用于处理二维方向数据。所提滤波器适用于具有180度对称性的圆形滤波问题,即180度旋转无法区分。它易于使用标准数值技术实现,适用于实时应用。所提方法可扩展至四元数,从而允许跟踪任意三维方位。我们在具有挑战性的场景中评估了所提滤波器,并将其与传统卡尔曼滤波方法进行了比较。

英文摘要

Directional estimation is a common problem in many tracking applications. Traditional filters such as the Kalman filter perform poorly because they fail to take the periodic nature of the problem into account. We present a recursive filter for directional data based on the Bingham distribution in two dimensions. The proposed filter can be applied to circular filtering problems with 180 degree symmetry, i.e., rotations by 180 degrees cannot be distinguished. It is easily implemented using standard numerical techniques and suitable for real-time applications. The presented approach is extensible to quaternions, which allow tracking arbitrary three-dimensional orientations. We evaluate our filter in a challenging scenario and compare it to a traditional Kalman filtering approach.

1304.3075 2026-06-04 cs.AI cs.RO cs.SY eess.SY 版本更新

Application of Evidential Reasoning to Helicopter Flight Path Control

证据推理在直升机飞行路径控制中的应用

Shoshana Abel

AI总结 本文提出了一种专家系统推理和知识表示方法,用于在实时车辆导航系统中处理不确定性。提出了一种创新的证据推理方法,即求和与格点方法,并进行了数学推导、并行环境实现及原型软件开发和测试。

Comments Appears in Proceedings of the Second Conference on Uncertainty in Artificial Intelligence (UAI1986)

详情
AI中文摘要

本文提出了一种方法,用于研究和开发专家系统推理和知识表示方面,以在实时车辆导航系统中处理不确定性。此类系统对非地形跟随低空飞行系统在敌对环境中具有重大好处,例如NOE直升机或类似任务飞行器可能遇到的环境。开发了一种创新的证据推理方法,称为求和与格点方法。本文的研究和开发工作包括该方法的数学形式化发展、在并行环境中的公式化和表示、在专家系统中的方法原型软件开发,以及在车辆导航系统内进行的初始测试。

英文摘要

This paper presents a methodology for research and development of the inferencing and knowledge representation aspects of an Expert System approach for performing reasoning under uncertainty in support of a real time vehicle guidance and navigation system. Such a system could be of major benefit for non-terrain following low altitude flight systems operating in foreign hostile environments such as might be experienced by NOE helicopter or similar mission craft. An innovative extension of the evidential reasoning methodology, termed the Sum-and-Lattice-Points Method, has been developed. The research and development effort presented in this paper consists of a formal mathematical development of the Sum-and-Lattice-Points Method, its formulation and representation in a parallel environment, prototype software development of the method within an expert system, and initial testing of the system within the confines of the vehicle guidance system.