A Teaching Strategy for Memory-Based Control

John W. Sheppard; Steven L. Salzberg

doi:10.1007/978-94-017-2053-3_13

A Teaching Strategy for Memory-Based Control

John W. Sheppard, Steven L. Salzberg

Research output: Contribution to journal › Article › peer-review

24 Scopus citations

Abstract

Combining different machine learning algorithms in the same system can produce benefits above and beyond what either method could achieve alone. This paper demonstrates that genetic algorithms can be used in conjunction with lazy learning to solve examples of a difficult class of delayed reinforcement learning problems better than either method alone. This class, the class of differential games, includes numerous important control problems that arise in robotics, planning, game playing, and other areas, and solutions for differential games suggest solution strategies for the general class of planning and control problems. We conducted a series of experiments applying three learning approaches - lazy Q-learning, k-nearest neighbor (k-NN), and a genetic algorithm - to a particular differential game called a pursuit game. Our experiments demonstrate that k-NN had great difficulty solving the problem, while a lazy version of Q-learning performed moderately well and the genetic algorithm performed even better. These results motivated the next step in the experiments, where we hypothesized k-NN was having difficulty because it did not have good examples - a common source of difficulty for lazy learning. Therefore, we used the genetic algorithm as a bootstrapping method for k-NN to create a system to provide these examples. Our experiments demonstrate that the resulting joint system learned to solve the pursuit games with a high degree of accuracy - outperforming either method alone - and with relatively small memory requirements.

Original language	English (US)
Pages (from-to)	343-370
Number of pages	28
Journal	Artificial Intelligence Review
Volume	11
Issue number	1-5
DOIs	https://doi.org/10.1007/978-94-017-2053-3_13
State	Published - 1997
Externally published	Yes

Keywords

Differential games
Genetic algorithms
Lazy learning
Nearest neighbor
Pursuit games
Reinforcement learning
Teaching

ASJC Scopus subject areas

Language and Linguistics
Linguistics and Language
Artificial Intelligence

Access to Document

10.1007/978-94-017-2053-3_13

Cite this

@article{8a045a37c46a4a18b07d7743cf803439,

title = "A Teaching Strategy for Memory-Based Control",

abstract = "Combining different machine learning algorithms in the same system can produce benefits above and beyond what either method could achieve alone. This paper demonstrates that genetic algorithms can be used in conjunction with lazy learning to solve examples of a difficult class of delayed reinforcement learning problems better than either method alone. This class, the class of differential games, includes numerous important control problems that arise in robotics, planning, game playing, and other areas, and solutions for differential games suggest solution strategies for the general class of planning and control problems. We conducted a series of experiments applying three learning approaches - lazy Q-learning, k-nearest neighbor (k-NN), and a genetic algorithm - to a particular differential game called a pursuit game. Our experiments demonstrate that k-NN had great difficulty solving the problem, while a lazy version of Q-learning performed moderately well and the genetic algorithm performed even better. These results motivated the next step in the experiments, where we hypothesized k-NN was having difficulty because it did not have good examples - a common source of difficulty for lazy learning. Therefore, we used the genetic algorithm as a bootstrapping method for k-NN to create a system to provide these examples. Our experiments demonstrate that the resulting joint system learned to solve the pursuit games with a high degree of accuracy - outperforming either method alone - and with relatively small memory requirements.",

keywords = "Differential games, Genetic algorithms, Lazy learning, Nearest neighbor, Pursuit games, Reinforcement learning, Teaching",

author = "Sheppard, {John W.} and Salzberg, {Steven L.}",

year = "1997",

doi = "10.1007/978-94-017-2053-3_13",

language = "English (US)",

volume = "11",

pages = "343--370",

journal = "Artificial Intelligence Review",

issn = "0269-2821",

publisher = "Springer Netherlands",

number = "1-5",

}

TY - JOUR

T1 - A Teaching Strategy for Memory-Based Control

AU - Sheppard, John W.

AU - Salzberg, Steven L.

PY - 1997

Y1 - 1997

N2 - Combining different machine learning algorithms in the same system can produce benefits above and beyond what either method could achieve alone. This paper demonstrates that genetic algorithms can be used in conjunction with lazy learning to solve examples of a difficult class of delayed reinforcement learning problems better than either method alone. This class, the class of differential games, includes numerous important control problems that arise in robotics, planning, game playing, and other areas, and solutions for differential games suggest solution strategies for the general class of planning and control problems. We conducted a series of experiments applying three learning approaches - lazy Q-learning, k-nearest neighbor (k-NN), and a genetic algorithm - to a particular differential game called a pursuit game. Our experiments demonstrate that k-NN had great difficulty solving the problem, while a lazy version of Q-learning performed moderately well and the genetic algorithm performed even better. These results motivated the next step in the experiments, where we hypothesized k-NN was having difficulty because it did not have good examples - a common source of difficulty for lazy learning. Therefore, we used the genetic algorithm as a bootstrapping method for k-NN to create a system to provide these examples. Our experiments demonstrate that the resulting joint system learned to solve the pursuit games with a high degree of accuracy - outperforming either method alone - and with relatively small memory requirements.

AB - Combining different machine learning algorithms in the same system can produce benefits above and beyond what either method could achieve alone. This paper demonstrates that genetic algorithms can be used in conjunction with lazy learning to solve examples of a difficult class of delayed reinforcement learning problems better than either method alone. This class, the class of differential games, includes numerous important control problems that arise in robotics, planning, game playing, and other areas, and solutions for differential games suggest solution strategies for the general class of planning and control problems. We conducted a series of experiments applying three learning approaches - lazy Q-learning, k-nearest neighbor (k-NN), and a genetic algorithm - to a particular differential game called a pursuit game. Our experiments demonstrate that k-NN had great difficulty solving the problem, while a lazy version of Q-learning performed moderately well and the genetic algorithm performed even better. These results motivated the next step in the experiments, where we hypothesized k-NN was having difficulty because it did not have good examples - a common source of difficulty for lazy learning. Therefore, we used the genetic algorithm as a bootstrapping method for k-NN to create a system to provide these examples. Our experiments demonstrate that the resulting joint system learned to solve the pursuit games with a high degree of accuracy - outperforming either method alone - and with relatively small memory requirements.

KW - Differential games

KW - Genetic algorithms

KW - Lazy learning

KW - Nearest neighbor

KW - Pursuit games

KW - Reinforcement learning

KW - Teaching

UR - http://www.scopus.com/inward/record.url?scp=0031071704&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0031071704&partnerID=8YFLogxK

U2 - 10.1007/978-94-017-2053-3_13

DO - 10.1007/978-94-017-2053-3_13

M3 - Article

AN - SCOPUS:0031071704

SN - 0269-2821

VL - 11

SP - 343

EP - 370

JO - Artificial Intelligence Review

JF - Artificial Intelligence Review

IS - 1-5

ER -

A Teaching Strategy for Memory-Based Control

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this