Research and Implementation of Imitation Learning Algorithm

doi:10.15966/j.cnki.dnydx.2023.05.020

Computer & Telecommunication

2023, Vol. 1

Issue (5): 38- DOI: 10.15966/j.cnki.dnydx.2023.05.020

Current Issue | Archive | Adv Search

Research and Implementation of Imitation Learning Algorithm

Nanjing University of Science and Technology Zijin College

Download:
Export: BibTeX | EndNote (RIS)

Abstract In order to optimize reinforcement learning for the great errors causing by the unclear reward function, this paper deeply studies and implements the behavior cloning algorithm and data aggregation algorithm in the imitation learning algorithm. The algorithm flow is modeled by activity diagram, the relationship between classes is modeled by class diagram, and the core interaction process is modeled by sequence diagram. According to the experimental results, this paper compares the advantages and disadvantages of the behavior cloning algorithm and the data aggregation algorithm, and discovers that the behavior cloning algorithm offline training can avoid interaction with the real environment, but error accumulation will lead to error results; data aggregation algorithms must interact with the environment online, and select the corresponding state of the observation value according to the strategy to solve the problem of error accumulation.

Key words： reinforcement learning imitation learning behavior cloning algorithm data aggregation algorithm

Published: 24 January 2024

	Service

	E-mail this article
	Add to my bookshelf
	Add to citation manager
	E-mail Alert
	RSS
	Articles by authors

Cite this article:

ZHANG Yu-meng JI Xiao-jun. Research and Implementation of Imitation Learning Algorithm. Computer & Telecommunication, 2023, 1(5): 38-.

URL:

https://www.computertelecom.com.cn/EN/10.15966/j.cnki.dnydx.2023.05.020 OR https://www.computertelecom.com.cn/EN/Y2023/V1/I5/38

No related articles found!

Viewed

Full text

Abstract

Cited

Shared

Discussed