DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Dong-Hyun | ko |
dc.contributor.author | Quang, Vo Van | ko |
dc.contributor.author | Jo, Sungho | ko |
dc.contributor.author | Lee, Ju-Jang | ko |
dc.date.accessioned | 2013-03-27T23:30:17Z | - |
dc.date.available | 2013-03-27T23:30:17Z | - |
dc.date.created | 2012-02-06 | - |
dc.date.created | 2012-02-06 | - |
dc.date.issued | 2009-07-05 | - |
dc.identifier.citation | IEEE International Symposium on Industrial Electronics, IEEE ISIE 2009, pp.449 - 454 | - |
dc.identifier.uri | http://hdl.handle.net/10203/162242 | - |
dc.description.abstract | This paper proposes the online Support Vector Regression (SVR) based value function approximation method for Reinforcement Learning (RL). This approach conserves the Support Vector Machine (SVM)'s good property, the generalization which is a key issue of function approximation. Online SVR can do incremental learning and automatically track variation of environment with time-varying characteristics. Using the online SVR, we can obtain the fast and good estimation of value function and achieve RL objective efficiently. Throughout simulation tests, the feasibility and usefulness of the proposed approach is demonstrated by comparison with SARSA and Q-learning. | - |
dc.language | English | - |
dc.publisher | IEEE | - |
dc.title | Online Support Vector Regression based Value Function Approximation for Reinforcement Learning | - |
dc.type | Conference | - |
dc.identifier.wosid | 000276815500083 | - |
dc.identifier.scopusid | 2-s2.0-77950137292 | - |
dc.type.rims | CONF | - |
dc.citation.beginningpage | 449 | - |
dc.citation.endingpage | 454 | - |
dc.citation.publicationname | IEEE International Symposium on Industrial Electronics, IEEE ISIE 2009 | - |
dc.identifier.conferencecountry | KO | - |
dc.identifier.conferencelocation | Seoul | - |
dc.contributor.localauthor | Jo, Sungho | - |
dc.contributor.localauthor | Lee, Ju-Jang | - |
dc.contributor.nonIdAuthor | Lee, Dong-Hyun | - |
dc.contributor.nonIdAuthor | Quang, Vo Van | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.