Automatically Learning HTN Methods from Landmarks

Authors: Ruoxi Li, Dana Nau, Mark Roberts, Morgan Fine-Morris

Abstract: Hierarchical Task Network (HTN) planning usually requires a domain engineer to provide manual input about how to decompose a planning problem. Even HTN-MAKER, a well-known method-learning algorithm, requires a domain engineer to annotate the tasks with information about what to learn. We introduce CURRICULAMA, an HTN method learning algorithm that completely automates the learning process. It uses landmark analysis to compose annotated tasks and leverages curriculum learning to order the learning of methods from simpler to more complex. This eliminates the need for manual input, resolving a core issue with HTN-MAKER. We prove CURRICULAMA's soundness, and show experimentally that it has a substantially similar convergence rate in learning a complete set of methods to HTN-MAKER.

Submitted 9 April, 2024; originally announced April 2024.

Comments: This work has been submitted to FLAIRS-24

arXiv:2010.01909 [pdf, other]

doi 10.1016/j.artint.2021.103523

Deliberative Acting, Online Planning and Learning with Hierarchical Operational Models

Authors: Sunandita Patra, James Mason, Malik Ghallab, Dana Nau, Paolo Traverso

Abstract: In AI research, synthesizing a plan of action has typically used descriptive models of the actions that abstractly specify what might happen as a result of an action, and are tailored for efficiently computing state transitions. However, executing the planned actions has needed operational models, in which rich computational control structures and closed-loop online decision-making are used to specify how to perform an action in a nondeterministic execution context, react to events and adapt to an unfolding situation. Deliberative actors, which integrate acting and planning, have typically needed to use both of these models together -- which causes problems when attempting to develop the different models, verify their consistency, and smoothly interleave acting and planning. As an alternative, we define and implement an integrated acting and planning system in which both planning and acting use the same operational models. These rely on hierarchical task-oriented refinement methods offering rich control structures. The acting component, called Reactive Acting Engine (RAE), is inspired by the well-known PRS system. At each decision step, RAE can get advice from a planner for a near-optimal choice with respect to a utility function. The anytime planner uses a UCT-like Monte Carlo Tree Search procedure, called UPOM, whose rollouts are simulations of the actor's operational models.
We also present learning strategies for use with RAE and UPOM that acquire, from online acting experiences and/or simulated planning results, a mapping from decision contexts to method instances as well as a heuristic function to guide UPOM. We demonstrate the asymptotic convergence of UPOM towards optimal methods in static domains, and show experimentally that UPOM and the learning strategies significantly improve the acting efficiency and robustness.

Submitted 15 November, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

Comments: Published in Artificial Intelligence (AIJ). Please cite as: Sunandita Patra, James Mason, Malik Ghallab, Dana Nau, Paolo Traverso. Deliberative Acting, Planning and Learning with Hierarchical Operational Models. Artificial Intelligence, Elsevier, 2021, 299, pp.103523. 10.1016/j.artint.2021.103523. arXiv admin note: text overlap with arXiv:2003.03932

Journal ref: Artificial Intelligence, Elsevier, 2021, 299, pp.103523

arXiv:2003.03932 [pdf, other]

Integrating Acting, Planning and Learning in Hierarchical Operational Models

Authors: Sunandita Patra, James Mason, Amit Kumar, Malik Ghallab, Paolo Traverso, Dana Nau

Abstract: We present new planning and learning algorithms for RAE, the Refinement Acting Engine. RAE uses hierarchical operational models to perform tasks in dynamically changing environments. Our planning procedure, UPOM, does a UCT-like search in the space of operational models in order to find a near-optimal method to use for the task and context at hand. Our learning strategies acquire, from online acting experiences and/or simulated planning results, a mapping from decision contexts to method instances as well as a heuristic function to guide UPOM. Our experimental results show that UPOM and our learning strategies significantly improve RAE's performance in four test domains using two different metrics: efficiency and success ratio.

Submitted 9 March, 2020; originally announced March 2020.

Comments: Accepted in ICAPS 2020 (30th International Conference on Automated Planning and Scheduling)

arXiv:1804.07406 [pdf, other]

doi 10.1007/978-3-319-93372-6_7

Tipping Points for Norm Change in Human Cultures

Authors: Soham De, Dana S. Nau, Xinyue Pan, Michele J. Gelfand

Abstract: Humans interact with each other on a daily basis by developing and maintaining various social norms and it is critical to form a deeper understanding of how such norms develop, how they change, and how fast they change. In this work, we develop an evolutionary game-theoretic model based on research in cultural psychology that shows that humans in various cultures differ in their tendencies to conform with those around them. Using this model, we analyze the evolutionary relationships between the tendency to conform and how quickly a population reacts when conditions make a change in norm desirable. Our analysis identifies conditions when a tipping point is reached in a population, causing norms to change rapidly.

Submitted 1 July, 2018; v1 submitted 19 April, 2018; originally announced April 2018.

Comments: SBP-BRiMS 2018

arXiv:1704.04720 [pdf, other]

Understanding Norm Change: An Evolutionary Game-Theoretic Approach (Extended Version)

Authors: Soham De, Dana S. Nau, Michele J. Gelfand

Abstract: Human societies around the world interact with each other by developing and maintaining social norms, and it is critically important to understand how such norms emerge and change. In this work, we define an evolutionary game-theoretic model to study how norms change in a society, based on the idea that different strengths of norms in societies translate to different game-theoretic interaction structures and incentives. We use this model to study, both analytically and with extensive agent-based simulations, the evolutionary relationships of the need for coordination in a society (which is related to its norm strength) with two key aspects of norm change: cultural inertia (whether or how quickly the population responds when faced with conditions that make a norm change desirable), and exploration rate (the willingness of agents to try out new strategies). Our results show that a high need for coordination leads to both high cultural inertia and a low exploration rate, while a low need for coordination leads to low cultural inertia and high exploration rate. This is the first work, to our knowledge, on understanding the evolutionary causal relationships among these factors.

Submitted 16 April, 2017; originally announced April 2017.

Comments: In 2017 International Conference on Autonomous Agents & Multiagent Systems (AAMAS)

arXiv:1606.02570 [pdf, other]

Using Game Theory to Study the Evolution of Cultural Norms

Authors: Soham De, Dana S. Nau, Michele J. Gelfand

Abstract: We discuss how to use evolutionary game theory (EGT) as a framework for studying how cultural dynamics and structural properties can influence the evolution of norms and behaviors within a society. We provide a brief tutorial on how EGT works, and discuss what kinds of insights it can provide. We then describe three published studies in which we have developed EGT models that help explain how structural and external conditions in a society affect the emergence of social norms.

Submitted 16 March, 2017; v1 submitted 6 June, 2016; originally announced June 2016.

Comments: 16 pages

arXiv:1304.3445 [pdf]

An Evaluation of Two Alternatives to Minimax

Authors: Dana Nau, Paul Purdom, Chun-Hung Tzeng

Abstract: In the field of Artificial Intelligence, traditional approaches to choosing moves in games involve the use of the minimax algorithm. However, recent research results indicate that minimaxing may not always be the best approach. In this paper we summarize the results of some measurements on several model games with several different evaluation functions. These measurements, which are presented in detail in [NPT], show that there are some new algorithms that can make significantly better use of evaluation function values than the minimax algorithm does.

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the First Conference on Uncertainty in Artificial Intelligence (UAI1985)

Report number: UAI-P-1985-PG-232-236

arXiv:1304.3081 [pdf]

Predicting The Performance of Minimax and Product in Game-Tree

Authors: Ping-Chung Chi, Dana Nau

Abstract: The discovery that the minimax decision rule performs poorly in some games has sparked interest in possible alternatives to minimax. Until recently, the only games in which minimax was known to perform poorly were games which were mainly of theoretical interest. However, this paper reports results showing poor performance of minimax in a more common game called kalah. For the kalah games tested, a non-minimax decision rule called the product rule performs significantly better than minimax. This paper also discusses a possible way to predict whether or not minimax will perform well in a game when compared to product. A parameter called the rate of heuristic flaw (rhf) has been found to correlate positively with the performance of product against minimax. Both analytical and experimental results are given that appear to support the predictive power of rhf.

Submitted 27 March, 2013; originally announced April 2013.

Comments: Appears in Proceedings of the Second Conference on Uncertainty in Artificial Intelligence (UAI1986)

Report number: UAI-P-1986-PG-49-56

arXiv:1106.4869 [pdf, ps]

doi 10.1613/jair.1141

SHOP2: An HTN Planning System

Authors: T. C. Au, O. Ilghami, U. Kuter, J. W. Murdock, D. S. Nau, D. Wu, F. Yaman

Abstract: The SHOP2 planning system received one of the awards for distinguished performance in the 2002 International Planning Competition. This paper describes the features of SHOP2 which enabled it to excel in the competition, especially those aspects of SHOP2 that deal with temporal and metric planning domains.

Submitted 23 June, 2011; originally announced June 2011.

Journal ref: Journal Of Artificial Intelligence Research, Volume 20, pages 379-404, 2003

Showing 1–9 of 9 results for author: Nau, D