Jiabao Yu's research while affiliated with Beijing Institute of Technology and other places

What is this page?


This page lists the scientific contributions of an author, who either does not have a ResearchGate profile, or has not yet added these contributions to their profile.

It was automatically created by ResearchGate to create a record of this author's body of work. We create such pages to advance our goal of creating and maintaining the most comprehensive scientific repository possible. In doing so, we process publicly available (personal) data relating to the author as a member of the scientific community.

If you're a ResearchGate member, you can follow this page to keep up with this author's work.

If you are this author, and you don't want us to display this page anymore, please let us know.

Publications (2)


Figure 6. Comparison of parameter performance curves.
Figure 10. USV model.
Figure 11. Platform training result.
Collision avoidance success rates of each algorithm.
Double Broad Reinforcement Learning Based on Hindsight Experience Replay for Collision Avoidance of Unmanned Surface Vehicles
  • Article
  • Full-text available

December 2022

·

33 Reads

·

2 Citations

Journal of Marine Science and Engineering

Jiabao Yu

·

Jiawei Chen

·

Ying Chen

·

[...]

·

Junwei Duan

Although broad reinforcement learning (BRL) provides a more intelligent autonomous decision-making method for the collision avoidance problem of unmanned surface vehicles (USVs), the algorithm still has the problem of over-estimation and has difficulty converging quickly due to the sparse reward problem in a large area of sea. To overcome the dilemma, we propose a double broad reinforcement learning based on hindsight experience replay (DBRL-HER) for the collision avoidance system of USVs to improve the efficiency and accuracy of decision-making. The algorithm decouples the two steps of target action selection and target Q value calculation to form the double broad reinforcement learning method and then adopts hindsight experience replay to allow the agent to learn from the experience of failure in order to greatly improve the sample utilization efficiency. Through training in a grid environment, the collision avoidance success rate of the proposed algorithm was found to be 31.9 percentage points higher than that in the deep Q network (DQN) and 24.4 percentage points higher than that in BRL. A Unity 3D simulation platform with high fidelity was also designed to simulate the movement of USVs. An experiment on the platform fully verified the effectiveness of the proposed algorithm.

Download
Share

An Image-Based Benchmark Dataset and a Novel Object Detector for Water Surface Object Detection

September 2021

·

360 Reads

·

26 Citations

Frontiers in Neurorobotics

Water surface object detection is one of the most significant tasks in autonomous driving and water surface vision applications. To date, existing public large-scale datasets collected from websites do not focus on specific scenarios. As a characteristic of these datasets, the quantity of the images and instances is also still at a low level. To accelerate the development of water surface autonomous driving, this paper proposes a large-scale, high-quality annotated benchmark dataset, named Water Surface Object Detection Dataset (WSODD), to benchmark different water surface object detection algorithms. The proposed dataset consists of 7,467 water surface images in different water environments, climate conditions, and shooting times. In addition, the dataset comprises a total of 14 common object categories and 21,911 instances. Simultaneously, more specific scenarios are focused on in WSODD. In order to find a straightforward architecture to provide good performance on WSODD, a new object detector, named CRB-Net, is proposed to serve as a baseline. In experiments, CRB-Net was compared with 16 state-of-the-art object detection methods and outperformed all of them in terms of detection precision. In this paper, we further discuss the effect of the dataset diversity (e.g., instance size, lighting conditions), training set size, and dataset details (e.g., method of categorization). Cross-dataset validation shows that WSODD significantly outperforms other relevant datasets and that the adaptability of CRB-Net is excellent.

Citations (2)


... In addition, the DQN employs a target network, which is a secondary network that updates at a slower rate compared to the primary network. This target network is utilized to compute target Q-values by applying the Bellman equation [8]. The integration of a comprehensive methodology, along with the exceptional capabilities of deep learning in managing extensive state spaces, distinguishes DQN as a prominent technique in the field of reinforcement learning. ...

Reference:

Comparison of Deep Q-Learning Network and Double Deep Q-Learning Network for Trading Strategy
Double Broad Reinforcement Learning Based on Hindsight Experience Replay for Collision Avoidance of Unmanned Surface Vehicles

Journal of Marine Science and Engineering

... Unfortunately, environmental awareness for canal waterways has not been fully explored, primarily due to the lack of a diverse and publicly available dataset. Although some databases contain elements of inland rivers, they are mainly designed for the marine environment, such as SMD [16], MID [17], WSSOD [18], MODD [19], Mastr1325 [20], USVInland [21], Tampere WaterSeg [22], Flow [23], ROSEBUD [24], Waterline [25] and LaRS [26], etc. Moreover, multi-task learning methods have received widespread attention, especially in the field of unmanned vehicles in recent years [27], [28]. ...

An Image-Based Benchmark Dataset and a Novel Object Detector for Water Surface Object Detection

Frontiers in Neurorobotics