Topics Learned
- Cognitive Science
- Artificial Intelligence
- Introduction to MATLAB
- Reinforcement Learning
- Evaluation Feedback
- The Reinforcement Learning Problem
- Dynamic Programming
- Monte Carlo Methods
- Temporal-Difference Learning
- Generalization, Approximation and Unified View of Reinforcement Learning
- Neural Networks: Perceptron, Back-Propagation, Recurrent Network
- Adaptive Resonance Theory (ART1) Model
Project Abstract (paper available on request)
- A model was created to simulate an interaction with a repeated two choice decision-making task presented in a simple gambling situation. In the gambling task, two decks of cards of different monetary value are presented to a player. The goal is to maximize the total earnings over the entire experiment (1000 plays). Unknown to the agent is that a contingency is set up between the recent selection of card decks and the money received on the next trial. One deck of cards has a higher long-term reward while the other has a higher short-term reward. Specifically, the instant reward for the next trial is based on the history of the past four trials, that is, the number of choices made on the long-term deck. The short-term deck generates more reward for each individual trial, but over a period of four trials, repeatedly selecting the long-term goal will exceed that from repeatedly selecting the short-term goal. Obviously the optimal strategy is to select the long-term goal. As in Gureckis and Love [2009], we expect that small amounts of noise will make the agent more likely to converge to the optimal (that is, long-term deck selection) strategy, whereas large amounts of noise are likely to lead to instability which makes convergence to any strategy less optimal. The reinforcement learning model that was created explored ten levels of sigma while varying though seven levels of epsilon and found that more noise helped the model to optimize the long term goal.
Readings
- Thagard, P. (2005). Mind: Introduction to Cognitive Science, 2nd edition, The MIT Press, Cambridge, Massachusetts.
- Sutton, R. S. & Barto, A. G. (1998). Reinforcement Learning: An Introduction, The MIT Press, Cambridge, Massachusetts.
- Gurney, K. (1997). An Introduction to Neural Networks, CRC Press.
- Gureckis, T. M., & Love, B. C. (2009). Learning in noise: Dynamic decision-making in a variable environment. Journal of Mathematical Psychology, 53, 180-193
