In my quest in discovering if temporal difference methods can be used in games, I discovered the article Learning to Play Games Using Temporal Difference Methods (Wiering, Patist, Mannen 2005). The authors used TD methods and Neural Network for function approximation to evaluate Backgammon, Chess, and Draughts. In addition, the paper demonstrated three methods of function valuation from either self play, learning from expert, and database of human master games. In the end, the authors concluded that learning from an expert or self-play the agent was able reach its maximum evaluation function in the neural net compared to the agent that learned from observing games stored in a database. In a way this article was a demonstration of Transfer Learning. Finally, the authors were aware of other function approximation methodologies such as support vector machines and gradient descent.
Pages
Categories
- Advice Language
- Agents
- AI Related References
- AIMA
- Algorithms
- Artificial Intelligence
- Bayesian Inference
- Blocks world
- CLIPS
- Education
- Evolutionary Computing
- Expert Systems
- Explanation Based Learning
- Games and Puzzles
- Inductive Logic Programming
- Java Code
- LISP Code
- Logic
- Logic Programming
- Machine Learning
- MIT OCW
- Pattern Matching
- Predicate Calculus
- Probability Theory
- Prolog Code
- Propositional Calculus
- Reinforcement Learning
- Relational Reinforcement Learning
- Stanford University
- Technology and Society
- Transfer Learning
- Uncategorized
- Wumpus World
Archives
- January 2012
- December 2011
- May 2011
- April 2011
- February 2011
- November 2010
- June 2010
- May 2010
- April 2010
- March 2010
- January 2010
- December 2009
- November 2009
- October 2009
- September 2009
- August 2009
- July 2009
- June 2009
- April 2009
- March 2009
- February 2009
- January 2009
- December 2008
- November 2008
- October 2008
- September 2008
- August 2008
- July 2008
- June 2008
- May 2008
- April 2008