Arrow Research search

Author name cluster

Zhao Guo

Possible papers associated with this exact author name in Arrow. This page groups case-insensitive exact name matches and is not a full identity disambiguation profile.

10 papers
2 author rows

Possible papers

10

AAAI Conference 2026 Conference Paper

WenetSpeech-Yue: A Large-Scale Cantonese Speech Corpus with Multi-dimensional Annotation

  • Longhao Li
  • Zhao Guo
  • Hongjie Chen
  • Yuhang Dai
  • Ziyu Zhang
  • Hongfei Xue
  • Tianlun Zuo
  • Chengyou Wang

The development of speech understanding and generation has been significantly accelerated by the availability of large-scale, high-quality speech datasets. Among these, ASR and TTS are regarded as the most established and fundamental tasks. However, for Cantonese (Yue Chinese), spoken by approximately 84.9 million native speakers worldwide, limited annotated resources have hindered progress and resulted in suboptimal ASR and TTS performance. To address this challenge, we propose WenetSpeech-Pipe, an integrated pipeline for building large-scale speech corpus with multi-dimensional annotation tailored for speech understanding and generation. Based on this pipeline, we release WenetSpeech-Yue, the first large-scale Cantonese speech corpus with multi-dimensional annotation for ASR and TTS, covering 21,800 hours across 10 domains with annotations including ASR transcription, text confidence, speaker identity, age, gender, speech quality scores, among other annotations. We also release WSYue-eval, a comprehensive Cantonese benchmark with two components: WSYue-ASR-eval, a manually annotated set for evaluating ASR on short and long utterances, code-switching, and diverse acoustic conditions, and WSYue-TTS-eval, with base and coverage subsets for standard and generalization testing. Experimental results show that models trained on WenetSpeech-Yue achieve competitive results against state-of-the-art (SOTA) Cantonese ASR and TTS systems, including commercial and LLM-based models, highlighting the value of our dataset and pipeline.

ICRA Conference 2025 Conference Paper

Bird-Inspired Tendon Coupling Improves Paddling Efficiency by Shortening Phase Transition Times

  • Jianfeng Lin 0002
  • Zhao Guo
  • Alexander Badri-Spröwitz

Drag-based swimming using rowing appendages, fins, and webbed feet is a widely adopted mode of locomotion in aquatic animals. To develop efficient underwater and swimming vehicles, various bioinspired drag-based paddle designs have been proposed, often facing a trade-off between propulsive efficiency and versatility. Webbed feet generate effective propulsive force during the power phase, while being lightweight, robust, and partially foldable during the recovery phase. However, the time-consuming process of mechanically folding and unfolding webbed feet extends the transition periods between the recovery and power phases, which in turn increased drag, and reduces overall paddling efficiency. In this study, we draw inspiration from the coupling tendons of aquatic birds. We implement tendon coupling mechanisms to minimize the transition time between the recovery and power phases. Hardware experiments demonstrate that our proposed mechanism improves propulsive efficiency by factors of $\mathbf{2. 0}$ and $\mathbf{2. 4}$ compared to designs without extensor tendons and based on passive paddles, respectively. Additionally, we find that distal leg joint clutching-previously shown to enhance efficiency in terrestrial walking-plays a negligible role in swimming locomotion. In sum, we present a novel principle for efficient drag-based leg and paddle design, with implications for understanding the swimming mechanics of aquatic birds and advancing bioinspired aquatic propulsion systems.

ICRA Conference 2024 Conference Paper

Design and Modeling of A Compact Serial Variable Stiffness Actuator (SVSA-III) with Linear Stiffness Profile

  • Shuowen Yi
  • Siyu Liu
  • Junbei Liao
  • Zhao Guo

Variable stiffness actuator (VSA) can imitate natural muscles in their compliance capbility, which can provide flexible adaptability for robots, improving the safety of robots interacting with the environment or human. This paper presents a new compact serial variable stiffness actuator ((SVSA-III)) with linear stiffness profile based on symmetrical variable lever arm mechanism. The stiffness motor is used to regulate the position of the pivot located on the Archimedean Spiral Relocation Mechanism (ASRM), so that the stiffness of the actuator can be adjusted (softening or hardening). By designing the lever length, the range of stiffness adjustment can change from 0. 3Nm/degree to therotical infinity. Moreover, the continuous linear stiffness profile of the actuator can be customized by solving the transcendental equation of the relationship between the actuator stiffness and the rotation angle of the stiffness motor. SVSA-III has the advantages of compact structure, wide-range stiffness regulation, reduced control difficulty, and linear stiffness profile. Two experiments of step response and stiffness tracking have proved the high accuracy and fast response for both theoretical stiffness and position adjustment.

IROS Conference 2023 Conference Paper

Design and Stiffness Analysis of a Bio-Inspired Soft Actuator with Bi-Direction Tunable Stiffness Property

  • Jianfeng Lin 0002
  • Ruikang Xiao
  • Zhao Guo

Modulating the stiffness of soft actuators is crucial for improving the efficiency of interaction with the environment. However, current stiffness modulation mechanisms are hard to achieve high lateral stiffness and a wide range of bending stiffness simultaneously. Here, we draw inspiration from the anatomical structure of the finger and propose a bi-directional tunable stiffness actuator (BTSA). BTSA is a soft-rigid hybrid structure that combines air-tendon hybrid actuation (ATA) and bone-like structures (BLS). We develop a corresponding fabrication method and a stiffness analysis model to support the design of BLS. The results show that the influence of the BLS on bending deformation is negligible, with a distal point distance error of less than 1. 5 mm. Moreover, the bi-directional tunable stiffness is proved to be functional. The bending stiffness can be tuned by ATA from 0. 23 N/mm to 0. 70 N/mm, with a magnification of 3 times. The addition of BLS improves lateral stiffness up to 4. 2 times compared with the one without BLS, and the lateral stiffness can be tuned decoupling within 1. 2 to 2. 1 times (e. g. from 0. 35 N/mm to 0. 46 N/mm when the bending angle is 45 deg). Finally, a four-BTSA gripper is developed to conduct horizontal lifting and grasping tasks to demonstrate the advantages of BTSA.

ICRA Conference 2019 Conference Paper

Versatile Reactive Bipedal Locomotion Planning Through Hierarchical Optimization

  • Jiatao Ding
  • Chengxu Zhou
  • Zhao Guo
  • Xiaohui Xiao
  • Nikos G. Tsagarakis

When experiencing disturbances during locomotion, human beings use several strategies to maintain balance, e. g. changing posture, modulating step frequency and location. However, when it comes to the gait generation for humanoid robots, modifying step time or body posture in real time introduces nonlinearities in the walking dynamics, thus increases the complexity of the planning. In this paper, we propose a two-layer hierarchical optimization framework to address this issue and provide the humanoids with the abilities of step time and step location adjustment, Center of Mass (CoM) height variation and angular momentum adaptation. In the first layer, times and locations of consecutive two steps are modulated online based on the current CoM state using the Linear Inverted Pendulum Model. By introducing new optimization variables to substitute the hyperbolic functions of step time, the derivatives of the objective function and feasibility constraints are analytically derived, thus reduces the computational cost. Then, taking the generated horizontal CoM trajectory, step times and step locations as inputs, CoM height and angular momentum changes are optimized by the second layer nonlinear model predictive control. This whole procedure will be repeated until the termination condition is met. The improved recovery capability under external disturbances is validated in simulation studies.

JBHI Journal 2018 Journal Article

Design and Evaluation of a Motorized Robotic Bed Mover With Omnidirectional Mobility for Patient Transportation

  • Zhao Guo
  • Xiaohui Xiao
  • Haoyong Yu

Patient transportation in hospitals faces many challenges, including the limited manpower, work-related injuries, and low efficiency of current bed pushing methods. This paper presents a new motorized robotic bed mover with omnidirectional mobility to address this problem. This device is composed of an omnidirectional mobility unit, a force sensing-based human-machine interface, and control hardware with batteries and electronics. The proposed bed mover can be attached to the bottom of a manual hospital stretcher, transforming it into a powered omnidirectional bed (OmniBed) that can be used only by one person. The function of the OmniBed is compared with that of a conventional powered bed, which only provides forward assistance with a fifth powered wheel. We perform a pilot study with 14 subjects to evaluate the performance of this OmniBed and benefits for hospital application. The experimental results show that the OmniBed can half the manpower while decreasing back muscle activities, revealing the potential health benefits for older staffs. The OmniBed also shows the promising signs of high precision and handling in small spaces with its one-step “parallel-parking” ability. This device is more ergonomic, more effective, and safer than the conventional powered bed.

IJCAI Conference 2017 Conference Paper

Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning

  • Jingkuan Song
  • Lianli Gao
  • Zhao Guo
  • Wu Liu
  • Dongxiang Zhang
  • Heng Tao Shen

Recent progress has been made in using attention based encoder-decoder framework for video captioning. However, most existing decoders apply the attention mechanism to every generated words including both visual words (e. g. , “gun” and "shooting“) and non-visual words (e. g. "the“, "a”). However, these non-visual words can be easily predicted using natural language model without considering visual signals or attention. Imposing attention mechanism on non-visual words could mislead and decrease the overall performance of video captioning. To address this issue, we propose a hierarchical LSTM with adjusted temporal attention (hLSTMat) approach for video captioning. Specifically, the proposed framework utilizes the temporal attention for selecting specific frames to predict related words, while the adjusted temporal attention is for deciding whether to depend on the visual information or the language context information. Also, a hierarchical LSTMs is designed to simultaneously consider both low-level visual information and deep semantic information to support the video caption generation. To demonstrate the effectiveness of our proposed framework, we test our method on two prevalent datasets: MSVD and MSR-VTT, and experimental results show that our approach outperforms the state-of-the-art methods on both two datasets.

ICRA Conference 2017 Conference Paper

Mechanical design of a compact Serial Variable Stiffness Actuator (SVSA) based on lever mechanism

  • Jiantao Sun
  • Yubing Zhang
  • Cong Zhang
  • Zhao Guo
  • Xiaohui Xiao

Compliant actuator is widely accepted for physical human-robot interaction due to its safety aspect, dynamic performance improvements and energy saving abilities. In this paper, based on the variable ratio lever mechanism, a new kind of Serial Variable Stiffness Actuator (SVSA) is proposed by using an Archimedean Spiral Relocation Mechanism (ASRM) to change the position of the pivot, implementing large range of adjustable stiffness. The ASRM introduced here makes the SVSA design has continuous stiffness adjustment ability and simply mechanical structure. Within the Variable Stiffness Mechanism (VSM), two linear springs are assembled antagonistically on a spring shaft. Their displacements are perpendicular to the output link to transmit the spring force more efficiently. Stiffness modeling and analysis of the SVSA are carried out to cover large deflection angle. The physical implementation of the SVSA shows that the output stiffness of the VSM is changed from 1. 72 to 150. 56 Nm/rad using a linear spring with stiffness 1882 N/m, working range covered from 0 to 360°. Control experiments also proved the wide range of stiffness adjustment ability of the SVSA.

ICRA Conference 2015 Conference Paper

Power analysis of a series elastic actuator for ankle joint gait rehabilitation

  • Oussama Ben Farah
  • Zhao Guo
  • Chen Gong
  • Chi Zhu 0001
  • Haoyong Yu

Series elastic actuator (SEA) has been widely used in rehabilitation robotics, where human-robot interaction is required. Due to its intrinsic compliance, SEA can improve the usage of power for its motor, which leads to a compact and lightweight SEA design. The aim of this paper is to reduce the energy consumption and the power requirements of the motor of the SEA by optimizing the stiffness of its spring. This study is inspired by the biomechanics of a human ankle joint, which stores elastic energy during the first phases of the walking process and releases the stored energy in the next gait phases to propel the human body forward. Power analysis and optimization procedure are conducted on complete SEA models with different complexity, including inertia, damping and stiffness, and with open loop and closed loop control strategies. Simulation results demonstrate that a reduction of 56. 6% of the peak motor power can be achieved with the optimized spring stiffness.

IROS Conference 2013 Conference Paper

Design of a novel compliant differential Shape Memory Alloy actuator

  • Zhao Guo
  • Haoyong Yu
  • Liang-Boon Wee

This paper presents a novel compliant differential (CD) Shape Memory Alloy (SMA) actuator with improved performance compared to traditional SMA actuators. This actuator is composed of two antagonistic SMA wires and a mechanical joint coupled with a torsion spring. The torsion spring is employed to reduce the total stiffness of SMA actuator and improve the range of motion. The antagonistic wires increase the response time as one wire can be heated up while the other wire is still in the cooling process. Dynamic model of this actuator was established for control design. Experimental results proved that this new actuator can provide larger output range of motion and faster response speed than traditional SMA actuators under the same conditions. Sine wave tracking with 0. 05 Hz, 0. 08 Hz and 0. 1 Hz were performed and our results demonstrated that this compliant actuator has good tracking performance under simple PID control.