CNS Tech Lab

Browse Bar: Browse by Author | Browse by Category | Browse by Citation | Advanced Search

Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback

Author(s): Lu, N. | Tan, A. | Xiao, D. |

Year: 2008

Citation: IEEE TRANSACTIONS ON NEURAL NETWORKS Volume: 19 Issue: 2 Pages: 230-244

Abstract: This paper presents a neural architecture for learning category nodes encoding mappings across multimodal patterns involving sensory inputs, actions, and rewards. By integrating adaptive resonance theory (ART) and temporal difference (TD) methods, the proposed neural model, called TD fusion architecture for learning, cognition, and navigation (TD-FALCON), enables an autonomous agent to adapt and function in a dynamic environment with immediate as well as delayed evaluative feedback (reinforcement) signals. TD-FALCON learns the value functions of the state-action space estimated through on-policy and off-policy TD learning methods, specifically state-action-reward-state-action (SARSA) and Q-learning. The learned value functions are then used to determine the optimal actions based on an action selection policy. We have developed TD-FALCON systems using various TD learning strategies and compared their performance in terms of task completion, learning speed, as well as time and space efficiency. Experiments based on a minefield navigation task have shown that TD-FALCON systems are able to learn effectively with both immediate and delayed reinforcement and achieve a stable performance in a pace much faster than those of standard gradient-descent-based reinforcement learning systems.

Topics: Machine Learning, Applications: Other, Models: Fuzzy ARTMAP,

Cross References

A massively parallel architecture for a self organizing neural pattern recognition machine
A neural network architecture for the learning of recognition categories is derived. Real-time network dynamics are completely characterized through mathematical analysis and computer simulations. The architecture ... Article Details
Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps
A new neural network architecture is introduced for incremental supervised learning of recognition categories and multidimensional maps in response to arbitrary sequences of analog or binary input vectors, which may ... Article Details
Adaptive resonance associative map
This article introduces a neural architecture termed Adaptive Resonance Associative Map (ARAM) that extends unsupervised Adaptive Resonance Theory (ART) systems for rapid, yet stable, heteroassociative learning. ARAM can be ... Article Details
Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system
A Fuzzy Adaptive Resonance Theory (ART) model capable of rapid stable learning of recognition categories in response to arbitrary sequences of analog or binary input patterns is described. Fuzzy ART incorporates computations ... Article Details
Rule extraction: From neural architecture to symbolic representation
This paper shows how knowledge, in the form of fuzzy rules, can be derived from a supervised learning neural network called furry ARTMAP. Rule extraction proceeds in two stages: pruning, which simplifies the network ... Article Details

Back to Articles

Resources Menu

Cross References