![]() |
CiteULike | ![]() |
Cavadini's CiteULike | ![]() |
![]() |
|
![]() |
Register | ![]() |
Log in | ![]() |
Reinforcement Distribution in Continuous State Action Space Fuzzy Q–Learning: A Novel Approach |
Reviews
[Write a review of this article]
Notes for this articleL'articolo propone un nuovo approccio al Fuzzy-qlearning utilizzando una particolare fuzzificazione della funzione di valore delle coppie stato azione. Lo scopo rimane cmq quello di poter trattare l'apprendimento per rinforzo anche in casi con spazi degli stati e delle azioni continui, utilizzando i metodi fuzzy per costruire un approssimatore di funzione.
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
Posting History
AbstractFuzzy Q–learning extends the Q–learning algorithm to work in presence of continuous state and action spaces. A Takagi–Sugeno Fuzzy Inference System (FIS) is used to infer the continuous executed action and its action–value, by means of cooperation of several rules. Different kinds of evolution of the parameters of the FIS are possible, depending on different strategies of distribution of the reinforcement signal. In this paper, we compare two strategies: the classical one, focusing on rewarding the rules that have proposed the actions composed to produce the actual action, and a new one we are introducing, where reward goes to the rules proposing actions closest the ones actually executed.
BibTeX record
RIS record