Free online reference management for clinicians and scientists

Sign up now

Recent "reinforcement" articles

  • These articles and links have been posted by Connotea users using the tag "reinforcement".
  • To add to this collection, or to start your own library:

Learn more

Watch a short video (2m 41s)

EXPORT LIST RSS ?
Bookmarks matching tag reinforcement
 
Number of articles per page:
10 | 25 | 50 | 100
 
Reinforcement-driven dimensionality reduction--a model for information processing in the basal ganglia.
I Bar-Gad et al.
Journal of basic and clinical physiology and pharmacology 11 (4), 305-20 (2000)
Posted by butterchicken to BG Reinforcement on Mon Mar 10 2008 at 01:52 UTC | info | related
 
Actor-critic models of the basal ganglia: new anatomical and computational perspectives.
Daphna Joel, Yael Niv, and Eytan Ruppin
Neural networks : the official journal of the International Neural Network Society 15 (4-6), 535-47
Posted by butterchicken to BG Reinforcement on Mon Mar 10 2008 at 01:49 UTC | info | related
 
Dopamine, uncertainty and TD learning.
Behavioral and brain functions : BBF 1 (1), 6 (04 May 2005)
 
Short-term memory traces for action bias in human reinforcement learning.
Rafal Bogacz et al.
Brain research 1153, 111-21 (11 Jun 2007)
Posted by butterchicken and 3 others to Reinforcement on Tue Dec 11 2007 at 08:45 UTC | info | related
 
Neural signature of fictive learning signals in a sequential investment task.
Terry Lohrenz et al.
Proceedings of the National Academy of Sciences of the United States of America 104 (22), 9493-8 (29 May 2007)
Posted by butterchicken and 1 other to Reinforcement fmri on Tue Dec 11 2007 at 07:37 UTC | info | related
 
Hold Your Horses: Impulsivity, Deep Brain Stimulation, and Medication in Parkinsonism
Michael Frank et al.
Science 318 (5854), 1309-12 (23 Nov 2007)
 
AIDS risk behavior in opioid dependent patients treated with community reinforcement approach and relationships with psychiatric disorders
P J Abbott et al.
Journal of addictive diseases : the official journal of the ASAM, American Society of Addiction Medicine 17 (4), 33-48 (1998)
 
Compulsive drug-seeking behavior and relapse. Neuroadaptation, stress, and conditioning factors
F Weiss et al.
Annals of the New York Academy of Sciences 937, 1-26 (Jun 2001)
 
The role of delta-opioid receptor subtypes in cocaine- and methamphetamine-induced place preferences.
T Suzuki et al.
Life sciences 55 (17), L339-44 (1994)
The effects of delta-receptor antagonists on cocaine- and methamphetamine-induced place preferences were examined in rats. Cocaine- and methamphetamine-induced place preferences were significantly attenuated by naltrindole (NTI: a non-selective delta-opioid receptor antagonist). Furthermore, naltriben (NTB: a selective delta 2-opioid receptor antagonist), but not 7-benzylidenenaltrexone (BNTX: a selective delta 1-opioid receptor antagonist), attenuated the cocaine- and methamphetamine-induced place preferences. These results suggest that delta-opioid receptors, particularly delta 2-opioid receptors, may be involved in the reinforcing effects of cocaine and methamphetamine.
 
Multiagent learning using a variable learning rate
Artificial Intelligence 136 (2), 215 (2002)
Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on the policies of the other agents. This creates a situation of learning a moving target. Previous learning algorithms have one of two shortcomings depending on their approach. They either converge to a policy that may not be optimal against the specific opponents' policies, or they may not converge at all. In this article we examine this learning problem in the framework of stochastic games. We look at a number of previous learning algorithms showing how they fail at one of the above criteria. We then contribute a new reinforcement learning technique using a variable learning rate to overcome these shortcomings. Specifically, we introduce the WoLF principle, “Win or Learn Fast”, for varying the learning rate. We examine this technique theoretically, proving convergence in self-play on a restricted class of iterated matrix games. We also present empirical results on a variety of more general stochastic games, in situations of self-play and otherwise, demonstrating the wide applicability of this method. Author Keywords: Multiagent learning; Reinforcement learning; Game theory

<< Prev 0      Showing entries 1 to 10 of 36 total      Next 10 >>