Dynamic Fuzzy Q-Learning Control of Humanoid Robots for Automatic Gait Synthesis

This paper introduces a novel automatic gait synthesis approach for Humanoid Robots (HRs) by Dynamic Fuzzy Q-Learning (DFQL). The DFQL method is capable of tuning Fuzzy Inference Systems (FISs) online. A salient feature of the proposed approach is that the DFQL controller can automatically generate fuzzy rules without a priori knowledge and it is capable of dealing with highly complex dynamic systems. The challenge for automatic gait synthesis of an HR is to define gait trajectories for hips and ankles so that motions of other joints can be regulate simultaneously. Because stability is one of the most common concerns for HRs, a self-learning control strategy of improving dynamic stability based on the Zero Moment Point (ZMP) criterion is developed. A Dynamic Fuzzy Q-Learning (DFQL) controller is proposed to automatically generate the hip motion trajectory, as hip motion plays the most important role in dynamic stability. Simulation results show that the DFQL controller is capable of improving dynamic stability as the actual ZMP trajectory becomes very close to the ideal case. Comparison studies between the DFQL method and conventional FQL approaches demonstrate that the DFQL method is superior.

並列關鍵字

Fuzzy neural networks ； humanoid robots ； reinforcement learning

被引用紀錄

陳至傑（2008）。運用干擾觀測器於移動式機器人的運動控制〔碩士論文，國立臺北科技大學〕。華藝線上圖書館。https://doi.org/10.6841/NTUT.2008.00619

國際替代計量

Dynamic Fuzzy Q-Learning Control of Humanoid Robots for Automatic Gait Synthesis

全文下載

主題瀏覽