A reinforcement learning-based scheme for direct adaptive optimal control of linear stochastic systems

Cited 11 time in webofscience Cited 0 time in scopus
  • Hit : 381
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorWong, Wee Chinko
dc.contributor.authorLee, JayHyungko
dc.date.accessioned2013-03-12T03:21:41Z-
dc.date.available2013-03-12T03:21:41Z-
dc.date.created2012-02-06-
dc.date.created2012-02-06-
dc.date.issued2010-
dc.identifier.citationOPTIMAL CONTROL APPLICATIONS METHODS, v.31, no.4, pp.365 - 374-
dc.identifier.issn0143-2087-
dc.identifier.urihttp://hdl.handle.net/10203/101192-
dc.description.abstractReinforcement learning where decision-making agents learn optimal policies through environmental interactions is an attractive paradigm for model-free, adaptive controller design. However, results for systems with continuous state and action variables are rare. In this paper, we present convergence results for optimal linear quadratic control of discrete-time linear stochastic systems. This work can be viewed as a generalization of a previous work on deterministic linear systems. Key differences between the algorithms for deterministic and stochastic systems are highlighted through examples. The usefulness of the algorithm is demonstrated through a nonlinear chemostat bioreactor case study Copyright (C) 2009 John Wiley & Sons, Ltd.-
dc.languageEnglish-
dc.publisherJOHN WILEY SONS LTD-
dc.subjectIDENTIFICATION-
dc.titleA reinforcement learning-based scheme for direct adaptive optimal control of linear stochastic systems-
dc.typeArticle-
dc.identifier.wosid000280687600006-
dc.identifier.scopusid2-s2.0-77955678039-
dc.type.rimsART-
dc.citation.volume31-
dc.citation.issue4-
dc.citation.beginningpage365-
dc.citation.endingpage374-
dc.citation.publicationnameOPTIMAL CONTROL APPLICATIONS METHODS-
dc.identifier.doi10.1002/oca.915-
dc.contributor.localauthorLee, JayHyung-
dc.contributor.nonIdAuthorWong, Wee Chin-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorreinforcement learning-
dc.subject.keywordAuthorlinear systems-
dc.subject.keywordAuthorstochastic adaptive optimal control-
dc.subject.keywordPlusIDENTIFICATION-
Appears in Collection
CBE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 11 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0