-
Automatically Learning HTN Methods from Landmarks
Authors:
Ruoxi Li,
Dana Nau,
Mark Roberts,
Morgan Fine-Morris
Abstract:
Hierarchical Task Network (HTN) planning usually requires a domain engineer to provide manual input about how to decompose a planning problem. Even HTN-MAKER, a well-known method-learning algorithm, requires a domain engineer to annotate the tasks with information about what to learn. We introduce CURRICULAMA, an HTN method learning algorithm that completely automates the learning process. It uses landmark analysis to compose annotated tasks and leverages curriculum learning to order the learning of methods from simpler to more complex. This eliminates the need for manual input, resolving a core issue with HTN-MAKER. We prove CURRICULAMA's soundness, and show experimentally that it has a substantially similar convergence rate in learning a complete set of methods to HTN-MAKER.
Submitted 9 April, 2024;
originally announced April 2024.
-
Deliberative Acting, Online Planning and Learning with Hierarchical Operational Models
Authors:
Sunandita Patra,
James Mason,
Malik Ghallab,
Dana Nau,
Paolo Traverso
Abstract:
In AI research, synthesizing a plan of action has typically used descriptive models of the actions that abstractly specify what might happen as a result of an action, and are tailored for efficiently computing state transitions. However, executing the planned actions has needed operational models, in which rich computational control structures and closed-loop online decision-making are used to specify how to perform an action in a nondeterministic execution context, react to events and adapt to an unfolding situation. Deliberative actors, which integrate acting and planning, have typically needed to use both of these models together -- which causes problems when attempting to develop the different models, verify their consistency, and smoothly interleave acting and planning.
As an alternative, we define and implement an integrated acting and planning system in which both planning and acting use the same operational models. These rely on hierarchical task-oriented refinement methods offering rich control structures. The acting component, called Reactive Acting Engine (RAE), is inspired by the well-known PRS system. At each decision step, RAE can get advice from a planner for a near-optimal choice with respect to a utility function. The anytime planner uses a UCT-like Monte Carlo Tree Search procedure, called UPOM, whose rollouts are simulations of the actor's operational models. We also present learning strategies for use with RAE and UPOM that acquire, from online acting experiences and/or simulated planning results, a mapping from decision contexts to method instances as well as a heuristic function to guide UPOM. We demonstrate the asymptotic convergence of UPOM towards optimal methods in static domains, and show experimentally that UPOM and the learning strategies significantly improve the acting efficiency and robustness.
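For readers unfamiliar with UCT, the selection step it is named after can be sketched as follows. This is the generic UCB1 rule from the bandit literature, not UPOM itself: the function names `ucb1` and `select`, the `(total_reward, visits)` pair layout, and the default exploration constant are all assumptions made for illustration.

```python
import math


def ucb1(total_reward, visits, parent_visits, c=math.sqrt(2)):
    """Generic UCB1 score: mean reward plus an exploration bonus.

    Unvisited options score +inf so that each one is tried at least once.
    """
    if visits == 0:
        return math.inf
    mean = total_reward / visits
    bonus = c * math.sqrt(math.log(parent_visits) / visits)
    return mean + bonus


def select(children, c=math.sqrt(2)):
    """Return the index of the child with the highest UCB1 score.

    `children` is a list of (total_reward, visits) pairs; this layout is
    hypothetical and chosen only to keep the sketch short.
    """
    parent_visits = sum(visits for _, visits in children)
    return max(
        range(len(children)),
        key=lambda i: ucb1(children[i][0], children[i][1], parent_visits, c),
    )
```

In a UCT-style search, a rule like this would be applied at every decision node to balance trying under-sampled options against exploiting the ones that have performed well so far.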
Submitted 15 November, 2021; v1 submitted 2 October, 2020;
originally announced October 2020.
-
Integrating Acting, Planning and Learning in Hierarchical Operational Models
Authors:
Sunandita Patra,
James Mason,
Amit Kumar,
Malik Ghallab,
Paolo Traverso,
Dana Nau
Abstract:
We present new planning and learning algorithms for RAE, the Refinement Acting Engine. RAE uses hierarchical operational models to perform tasks in dynamically changing environments. Our planning procedure, UPOM, does a UCT-like search in the space of operational models in order to find a near-optimal method to use for the task and context at hand. Our learning strategies acquire, from online acting experiences and/or simulated planning results, a mapping from decision contexts to method instances as well as a heuristic function to guide UPOM. Our experimental results show that UPOM and our learning strategies significantly improve RAE's performance in four test domains using two different metrics: efficiency and success ratio.
Submitted 9 March, 2020;
originally announced March 2020.
-
Tipping Points for Norm Change in Human Cultures
Authors:
Soham De,
Dana S. Nau,
Xinyue Pan,
Michele J. Gelfand
Abstract:
Humans interact with each other on a daily basis by developing and maintaining various social norms, and it is critical to form a deeper understanding of how such norms develop, how they change, and how fast they change. In this work, we develop an evolutionary game-theoretic model based on research in cultural psychology that shows that humans in various cultures differ in their tendencies to conform with those around them. Using this model, we analyze the evolutionary relationships between the tendency to conform and how quickly a population reacts when conditions make a change in norm desirable. Our analysis identifies conditions when a tipping point is reached in a population, causing norms to change rapidly.
Submitted 1 July, 2018; v1 submitted 19 April, 2018;
originally announced April 2018.
-
Understanding Norm Change: An Evolutionary Game-Theoretic Approach (Extended Version)
Authors:
Soham De,
Dana S. Nau,
Michele J. Gelfand
Abstract:
Human societies around the world interact with each other by developing and maintaining social norms, and it is critically important to understand how such norms emerge and change. In this work, we define an evolutionary game-theoretic model to study how norms change in a society, based on the idea that different strengths of norms in societies translate to different game-theoretic interaction structures and incentives. We use this model to study, both analytically and with extensive agent-based simulations, the evolutionary relationships of the need for coordination in a society (which is related to its norm strength) with two key aspects of norm change: cultural inertia (whether or how quickly the population responds when faced with conditions that make a norm change desirable), and exploration rate (the willingness of agents to try out new strategies). Our results show that a high need for coordination leads to both high cultural inertia and a low exploration rate, while a low need for coordination leads to low cultural inertia and a high exploration rate. This is the first work, to our knowledge, on understanding the evolutionary causal relationships among these factors.
Submitted 16 April, 2017;
originally announced April 2017.
-
Using Game Theory to Study the Evolution of Cultural Norms
Authors:
Soham De,
Dana S. Nau,
Michele J. Gelfand
Abstract:
We discuss how to use evolutionary game theory (EGT) as a framework for studying how cultural dynamics and structural properties can influence the evolution of norms and behaviors within a society. We provide a brief tutorial on how EGT works, and discuss what kinds of insights it can provide. We then describe three published studies in which we have developed EGT models that help explain how structural and external conditions in a society affect the emergence of social norms.
Submitted 16 March, 2017; v1 submitted 6 June, 2016;
originally announced June 2016.
-
An Evaluation of Two Alternatives to Minimax
Authors:
Dana Nau,
Paul Purdom,
Chun-Hung Tzeng
Abstract:
In the field of Artificial Intelligence, traditional approaches to choosing moves in games involve the use of the minimax algorithm. However, recent research results indicate that minimaxing may not always be the best approach. In this paper we summarize the results of some measurements on several model games with several different evaluation functions. These measurements, which are presented in detail in [NPT], show that there are some new algorithms that can make significantly better use of evaluation function values than the minimax algorithm does.
Submitted 27 March, 2013;
originally announced April 2013.
-
Predicting The Performance of Minimax and Product in Game-Tree
Authors:
Ping-Chung Chi,
Dana Nau
Abstract:
The discovery that the minimax decision rule performs poorly in some games has sparked interest in possible alternatives to minimax. Until recently, the only games in which minimax was known to perform poorly were games which were mainly of theoretical interest. However, this paper reports results showing poor performance of minimax in a more common game called kalah. For the kalah games tested, a non-minimax decision rule called the product rule performs significantly better than minimax.
This paper also discusses a possible way to predict whether or not minimax will perform well in a game when compared to product. A parameter called the rate of heuristic flaw (rhf) has been found to correlate positively with the performance of product against minimax. Both analytical and experimental results are given that appear to support the predictive power of rhf.
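To make the contrast between the two decision rules concrete, here is a minimal sketch of both backups on a toy game tree. It uses one common formulation of the product rule, which assumes that leaf values are independent estimates of the probability that Max wins; the abstract does not spell out the rule, so this formulation, the nested-list tree encoding, and the helper names (`minimax`, `product`, `best_move`) are illustrative assumptions rather than the paper's implementation.

```python
from math import prod


def minimax(node, maximizing=True):
    """Back up leaf values with max at Max's nodes and min at Min's nodes."""
    if not isinstance(node, list):  # a leaf holds a heuristic value in [0, 1]
        return node
    values = [minimax(child, not maximizing) for child in node]
    return max(values) if maximizing else min(values)


def product(node, maximizing=True):
    """Back up leaf values as independent probabilities of a win for Max.

    At a Min node, Max wins only if every child is a win: multiply.
    At a Max node, Max wins if at least one child is a win.
    """
    if not isinstance(node, list):
        return node
    values = [product(child, not maximizing) for child in node]
    if maximizing:
        return 1.0 - prod(1.0 - v for v in values)
    return prod(values)


def best_move(root, backup):
    """Index of the root move (for Max) with the highest backed-up value."""
    scores = [backup(child, maximizing=False) for child in root]
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical two-move position: each move leads to a Min node with two leaves.
tree = [[0.6, 0.6], [0.55, 0.95]]
```

On this toy tree the two rules disagree: minimax prefers the first move because its worst case is higher (0.6 versus 0.55), while the product rule prefers the second because the product of its leaf values is larger (about 0.52 versus 0.36). Which choice is actually better in play depends on how reliable the evaluation function is, which is the kind of question the rhf parameter is meant to address.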
Submitted 27 March, 2013;
originally announced April 2013.
-
SHOP2: An HTN Planning System
Authors:
T. C. Au,
O. Ilghami,
U. Kuter,
J. W. Murdock,
D. S. Nau,
D. Wu,
F. Yaman
Abstract:
The SHOP2 planning system received one of the awards for distinguished performance in the 2002 International Planning Competition. This paper describes the features of SHOP2 which enabled it to excel in the competition, especially those aspects of SHOP2 that deal with temporal and metric planning domains.
Submitted 23 June, 2011;
originally announced June 2011.