
CLC number:
On-line Access: 2026-01-12
Received: 2025-04-23
Revision Accepted: 2025-07-29
Crosschecked: 2026-01-12
Cited: 0
Clicked: 1972
Citations: Bibtex RefMan EndNote GB/T7714
https://orcid.org/0000-0002-2603-2065
https://orcid.org/0000-0003-0236-7896
https://orcid.org/0009-0006-3435-5776
https://orcid.org/0009-0009-4467-7236
https://orcid.org/0009-0003-1613-5823
Junhui ZHANG, Pengyuan JI, Lizhou FANG, Jinyuan LIU, Dandan WANG, Jikun AI, Huaizhi ZONG, Bing XU. Stable and continuous vertical jumping control of hydraulic legged robots through reinforcement learning[J]. Journal of Zhejiang University Science A,in press.Frontiers of Information Technology & Electronic Engineering,in press.https://doi.org/10.1631/jzus.A2500142 @article{title="Stable and continuous vertical jumping control of hydraulic legged robots through reinforcement learning", %0 Journal Article TY - JOUR
基于强化学习的液压足式机器人稳定连续的垂直跳跃控制方法机构:浙江大学,流体动力基础件与机电系统全国重点实验室,中国杭州,310058 目的:液压式足式机器人由于具有较高的功率质量比,在实现高动态运动方面具有巨大潜力。然而,如何同时保证其运动的稳定性与连续性仍面临挑战。本文旨在提出针对液压足式机器人动态跳跃运动的控制方法,在解决控制器优化过程中参数耦合问题的同时实现运动性能的多方面提升。 创新点:1.建立准真实仿真模型,准确反映液压肢腿单元的动态特性;2.提出基于强化学习的液压足式机器人运动控制框架;3.在样机上实现强化学习策略的部署与验证。 方法:1.基于液压驱动系统的动力学分析,结合物理可行性约束,构建液压肢腿单元的准真实模型;2.运用近端策略优化(PPO)强化学习算法,同时优化轨迹生成器与运动跟踪控制器的参数,并在仿真环境中训练控制策略;3.将训练后的策略部署于样机,在竖直跳跃和前向跳跃的不同工况下验证控制策略的性能。 结论:1.所提出的准真实模型能够准确反映物理样机的性能;2.运用强化学习控制框架的液压肢腿单元能够在仿真中实现连续且稳定的跳跃;3.训练后的策略成功部署于物理样机并在高度跟踪和落地柔顺性方面取得显著提升。 关键词组: Darkslateblue:Affiliate; Royal Blue:Author; Turquoise:Article
Reference[1]AhnD, ChoBK, 2022. Online jumping motion generation via model predictive control. IEEE Transactions on Industrial Electronics, 69(5):4957-4965. ![]() [2]BaKX, ChenCH, MaGL, et al., 2024. A compensation strategy of end-effector pose precision based on the virtual constraints for serial robots with RDOFs. Fundamental Research, in press. ![]() [3]BaKX, SheJB, XuB, et al., 2025. Matrix sensitivity-based adaptive iterative feedback control of leg hydraulic drive system of legged robot. Control Engineering Practice, 165:106557. ![]() [4]BjelonicM, SankarPK, BellicosoCD, et al., 2020. Rolling in the deep-hybrid locomotion for wheeled-legged robots using online trajectory optimization. IEEE Robotics and Automation Letters, 5(2):3626-3633. ![]() [5]BoaventuraT, Medrano-CerdaGA, SeminiC, et al., 2013. Stability and performance of the compliance controller of the quadruped robot HyQ. Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, p.1458-1464. ![]() [6]ChoiS, JiG, ParkJ, et al., 2023. Learning quadrupedal locomotion on deformable terrain. Science Robotics, 8(74):eade2256. ![]() [7]DuanP, YuZN, GaoKZ, et al., 2024. Solving the multi-objective path planning problem for mobile robot using an improved NSGA-II algorithm. Swarm and Evolutionary Computation, 87:101576. ![]() [8]EgliP, HutterM, 2022. A general approach for the automation of hydraulic excavator arms using reinforcement learning. IEEE Robotics and Automation Letters, 7(2):5679-5686. ![]() [9]ExarchosI, JiangYF, YuWH, et al., 2021. Policy transfer via kinematic domain randomization and adaptation. Proceedings of the IEEE International Conference on Robotics and Automation, p.45-51. ![]() [10]FangLZ, ZhangK, LuZY, et al., 2025. Adaptive robust joint force control for rapid motion of hydraulic limb leg units. Control Engineering Practice, 165:106596. ![]() [11]GaoHB, LiuYF, DingL, et al., 2019. Low impact force and energy consumption motion planning for hexapod robot with passive compliant ankles. Journal of Intelligent & Robotic Systems, 94(2):349-370. ![]() [12]GuY, YuanCZ, 2020. Adaptive robust trajectory tracking control of fully actuated bipedal robotic walking. Proceedings of the IEEE/ASME International Conference on Advanced Intelligent Mechatronics, p.1310-1315. ![]() [13]HanL, ZhuQX, ShengJP, et al., 2024. Lifelike agility and play in quadrupedal robots using reinforcement learning and generative pre-trained models. Nature Machine Intelligence, 6(7):787-798. ![]() [14]HanYY, LiuGP, LuZY, et al., 2023. A stability locomotion-control strategy for quadruped robots with center-of-mass dynamic planning. Journal of Zhejiang University-SCIENCE A, 24(6):516-530. ![]() [15]HoellerD, RudinN, SakoD, et al., 2024. Anymal parkour: learning agile navigation for quadrupedal robots. Science Robotics, 9(88):eadi7566. ![]() [16]HwangboJ, LeeJ, DosovitskiyA, et al., 2019. Learning agile and dynamic motor skills for legged robots. Science Robotics, 4(26):eaau5872. ![]() [17]JaiswalS, SopanenJ, MikkolaA, 2021. Efficiency comparison of various friction models of a hydraulic cylinder in the framework of multibody system dynamics. Nonlinear Dynamics, 104(4):3497-3515. ![]() [18]LiX, YuHY, ZongHZ, et al., 2024. Light weight design and integrated method for manufacturing hydraulic wheel-legged robots. Journal of Zhejiang University-SCIENCE A, 25(9):701-715. ![]() [19]MaHP, ZhangYJ, SunSY, et al., 2023. A comprehensive survey on NSGA-II for multi-objective optimization and applications. Artificial Intelligence Review, 56(12):15217-15270. ![]() [20]NansaiS, RojasN, ElaraMR, et al., 2015. A novel approach to gait synchronization and transition for reconfigurable walking platforms. Digital Communications and Networks, 1(2):141-151. ![]() [21]SchulmanJ, WolskiF, DhariwalP, et al., 2017. Proximal policy optimization algorithms. arXiv:1707.06347. ![]() [22]SeminiC, BarasuolV, GoldsmithJ, et al., 2017. Design of the hydraulically actuated, torque-controlled quadruped robot HyQ2Max. IEEE/ASME Transactions on Mechatronics, 22(2):635-646. ![]() [23]ShaoX, FanYQ, ShaoJP, et al., 2023. Improved active disturbance rejection control with the optimization algorithm for the leg joint control of a hydraulic quadruped robot. Measurement and Control, 56(7-8):1359-1376. ![]() [24]SpinelliFA, EgliP, NubertJ, et al., 2024. Reinforcement learning control for autonomous hydraulic material handling machines with underactuated tools. Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, p.12694-12701. ![]() [25]SunYR, HuaZS, LiYB, et al., 2021. Modeling and analysis on low energy consumption foot trajectory for hydraulic actuated quadruped robot. International Journal of Advanced Robotic Systems, 2021(11):1-12. ![]() [26]XiangPJ, YanL, LiuXS, et al., 2025. Structural topology design for electromagnetic performance enhancement of permanent-magnet machines. Chinese Journal of Mechanical Engineering, 38(1):26. ![]() [27]XieZM, DaXY, van de PanneM, et al., 2021. Dynamics randomization revisited: a case study for quadrupedal locomotion. Proceedings of the IEEE International Conference on Robotics and Automation, p.4955-4961. ![]() [28]YaoB, 2009. Desired compensation adaptive robust control. Journal of Dynamic Systems, Measurement, and Control, 131(6):061001. ![]() [29]YaoZK, XuFY, JiangGP, et al., 2024. Data-driven control of hydraulic manipulators by reinforcement learning. IEEE/ASME Transactions on Mechatronics, 29(4):2673-2684. ![]() [30]ZhangJH, LiuJY, ZongHZ, et al., 2025. Bridging the gap to bionic motion: challenges in legged robot limb unit design, modeling, and control. Cyborg and Bionic Systems, 6:0365. ![]() [31]ZhangK, ZhangJH, ZongHZ, et al., 2025. High dynamic position control for a typical hydraulic quadruped robot leg based on virtual decomposition control. IEEE/ASME Transactions on Mechatronics, 30(4):2473-2484. ![]() [32]ZhuJ, PayneJJ, JohnsonAM, 2024. Convergent iLQR for safe trajectory planning and control of legged robots. Proceedings of the IEEE International Conference on Robotics and Automation, p.8051-8057. ![]() [33]ZongHZ, ZhangJH, JiangL, et al., 2024. Bionic lightweight design of limb leg units for hydraulic quadruped robots by additive manufacturing and topology optimization. Bio-Design and Manufacturing, 7(1):1-13. ![]() [34]ZongHZ, LouB, YuanHH, et al., 2025. Integrating kinematic and dynamic factors with generative design for high-performance additive manufacturing structures. Virtual and Physical Prototyping, 20(1):e2501383. ![]() Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou
310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn Copyright © 2000 - 2026 Journal of Zhejiang University-SCIENCE | ||||||||||||||


ORCID:
Open peer comments: Debate/Discuss/Question/Opinion
<1>