Area of machine learning concerned with how intelligent agents (artificial intelligence) should take actions to maximize the notion of cumulative reward (e.g., Markov decision process).
« Back to Glossary IndexReinforcement Learning
« Back to Glossary Index