Tsuguhisa Thoma's research works

2P1-G12 Efficiency Improvement of Reinforcement Learning Using Parallel Processing for Combination Value Function

Article

January 2010

The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec)

This paper addresses improving the efficiency of reinforcement learning by using parallel processing for a combination value function. We propose a method that periodically composes the Q tables of local learning clusters into a global Q table. We apply this method to two applications: one is a maze problem and the other is a behavior rule detection problem for a modular robot. The Q-learning method and the Monte Carlo method are compared with the profit sharing method, which learns robot behaviors. We present computer experiments with 40 PC clusters. The convergence time and learning times are evaluated and discussed.
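
The abstract does not say how the local Q tables are composed into the global one, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a hypothetical 4x4 maze with no inner walls, tabular Q-learning in each local learner, and a visit-count-weighted average as the composition rule (other rules, such as an element-wise maximum, would fit the description equally well). The learners run one after another here to keep the example self-contained; on a real PC cluster each would run on its own machine. All names (`step`, `run_episode`, `merge`, `train`) and all parameter values are illustrative assumptions.

```python
import random

# Hypothetical 4x4 grid maze: start at (0, 0), goal at (3, 3), no inner walls.
SIZE = 4
GOAL = (SIZE - 1, SIZE - 1)
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # right, left, down, up


def step(state, action):
    """Move within the grid; reward 1.0 on reaching the goal, else 0.0."""
    r = min(max(state[0] + ACTIONS[action][0], 0), SIZE - 1)
    c = min(max(state[1] + ACTIONS[action][1], 0), SIZE - 1)
    nxt = (r, c)
    return nxt, (1.0 if nxt == GOAL else 0.0), nxt == GOAL


def new_table(fill=0.0):
    """Table with one list of per-action entries for every grid cell."""
    return {(r, c): [fill] * len(ACTIONS) for r in range(SIZE) for c in range(SIZE)}


def run_episode(q, visits, rng, alpha=0.5, gamma=0.9, epsilon=0.2, max_steps=100):
    """One episode of epsilon-greedy tabular Q-learning, updating q in place."""
    state = (0, 0)
    for _ in range(max_steps):
        if rng.random() < epsilon:
            action = rng.randrange(len(ACTIONS))
        else:
            best = max(q[state])
            action = rng.choice([a for a, v in enumerate(q[state]) if v == best])
        nxt, reward, done = step(state, action)
        target = reward + (0.0 if done else gamma * max(q[nxt]))
        q[state][action] += alpha * (target - q[state][action])
        visits[state][action] += 1
        state = nxt
        if done:
            break


def merge(local_qs, local_visits):
    """Compose local tables into a global one.

    Assumption: visit-count-weighted average; the source does not give the rule.
    """
    global_q = new_table()
    for s in global_q:
        for a in range(len(ACTIONS)):
            total = sum(v[s][a] for v in local_visits)
            if total:
                weighted = sum(q[s][a] * v[s][a] for q, v in zip(local_qs, local_visits))
                global_q[s][a] = weighted / total
    return global_q


def train(n_workers=4, periods=20, episodes_per_period=10, seed=0):
    """Alternate local learning with periodic composition into a global table."""
    rngs = [random.Random(seed + i) for i in range(n_workers)]
    local_qs = [new_table() for _ in range(n_workers)]
    local_visits = [new_table(0) for _ in range(n_workers)]  # cumulative counts
    global_q = new_table()
    for _ in range(periods):
        for q, visits, rng in zip(local_qs, local_visits, rngs):
            for _ in range(episodes_per_period):
                run_episode(q, visits, rng)
        global_q = merge(local_qs, local_visits)
        # Broadcast the global table back so every learner resumes from it.
        local_qs = [{s: list(vals) for s, vals in global_q.items()} for _ in range(n_workers)]
    return global_q
```

Calling `train()` returns the composed global table as a dictionary from grid cell to a list of four action values. The length of each period (`episodes_per_period`) controls the trade-off between communication cost and how far the local tables drift apart between compositions.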
