Sven Koenig: Fast Replanning (Incremental Heuristic Search)

Project "Fast Replanning (Incremental Heuristic Search)"
(scroll down for publications)

Agents often operate in domains that are only incompletely known or change dynamically. One way of dealing with incomplete information is to interleave search with action execution. In this case, the agents need to replan quickly. To make search fast, I studied both heuristic search methods with limited lookahead (agent-centered search, such as real-time heuristic search) and heuristic search methods that reuse information from previous searches (incremental heuristic search).

Incremental heuristic search methods are often faster than A* searches from scratch yet differ from other replanning methods (such as planning by analogy) in that their solution quality is as good as the solution quality of A* searches from scratch. My collaborators and I developed many incremental heuristic search methods, including ones that work according to different principles than the few previously existing ones, analyzed their properties and applied them in a variety of settings to demonstrate their advantages, including for goal-directed navigation in initially unknown terrain and novel settings such as STRIPS-type symbolic planning, mapping, reinforcement learning, moving-target search and any-angle search. For example, we analyzed goal-directed navigation with the freespace assumption (the common-sense navigation strategy behind Stentz' D* and other systems) and showed that its worst-case travel distance is close to optimal. We later applied the analysis also to other navigation strategies, such as Greedy Mapping. We showed that there are four classes of incremental heuristic search algorithms, characterized their advantages and disadvantages, developed a variety of algorithms for them and explored their applications. For example, Minimax LPA* seems to be the first incremental heuristic minimax search method and is able to speed up the parti-game algorithm substantially. Similarly, the PINCH system is able to speed up one-time planning with HSP substantially, the SHERPA system is able to speed up repeated optimal planning with HSP substantially, RTAA* is able to update the heuristics much faster than the standard real-time heuristic search algorithm LRTA* (resulting in smaller search times), LPA* is able to speed up multi-agent path-finding solvers substantially, and variants/ideas of LPA* and its goal-directed navigation variant D* Lite have been widely used by others (including the developers of D*) in robotics, for example, on the winning DARPA Urban Challenge entry and a Mars rover as well as for sampling-based motion planning (BIT* and RRT#). Finally, our Planning on Demand architecture demonstrated that incremental heuristic search allows the path-planning module of mobile robot architectures to handle initially unknown obstacles rather than having to delegate this task to the reactive-navigation module.

Our D* Lite (which is different from D*) received a AAAI Classic Paper (= "Test of Time") Honorable Mention. It and our other incremental heuristic search methods seem to have become popular incremental heuristic search methods with a variety of research groups (without any involvement by us). Here are some examples:

David Wooden writes in his dissertation about our LPA* that it "has led to a series of algorithms, including D* lite, ... (The name "lite" is perhaps an unfortunate choice, as the algorithm is indeed not a diluted or handicapped version of D*.) ... In fact, Stentz's lab (where D* was introduced) uses D* Lite in many of its current implementations in place of D* ...", citing "Ferguson, D. personal communication, May 2006", see D. Wooden, Graph-based Path Planning for Mobile Robots, Dissertation, School of Electrical and Computer Engineering, Georgia Institute of Technology, December 2006, page 11.
Dave Ferguson and Anthony Stentz used elements of D* Lite in a highly optimized version of their Field D* system, see D. Ferguson and A. Stentz, Using Interpolation to Improve Path Planning: The Field D* Algorithm, Journal of Field Robotics, Volume 23(2), 2006, pages 79-101. The article also describes how to create an efficient implementation of Field D* for the Mars Exploration Rovers.
Maxim Likhachev and Dave Ferguson used a combination of ARA* and D* Lite, called Anytime D*, to plan long motions involving complex maneuvers (such as traversal of parking lots and complex u-turns) for the winning DARPA Urban Challenge entry from CMU, see M. Likhachev, D. Ferguson, G. Gordon, A. Stentz, and S. Thrun, Anytime Dynamic A*: An Anytime, Replanning Algorithm, Proceedings of the International Conference on Autonomous Planning and Scheduling, 2005, pages 262-271 and M. Likhachev and D. Ferguson, Planning Long Dynamically-Feasible Maneuvers for Autonomous Vehicles, International Journal of Robotics Research, Volume 28(8), 2009, pages 933-945.
A team of the University of Sherbrooke (Canada) and the Canadian Space Agency (including Khaled Belghith, Froduald Kabanza and Leo Hartman) used elements of Generalized Adaptive A* in a novel randomized path planning approach that was incorporated into a system for improving group support operations on the Space Station Remote Manipulator, see K. Belghith, F. Kabanza and L. Hartman, Using a Randomized Path Planner to Generate 3D Task Demonstrations of Robot Operations, Proceedings of the International Conference on Autonomous and Intelligence Systems, 2010, pages 1-6.
Oktay Arslan and Panagiotis Tsiotras used ideas of LPA* to develop incremental sampling-based motion planning algorithm. They state that "[s]everal variants of incremental sampling-based algorithms have been recently proposed in order to optimally solve motion planning problems. ... However, the convergence rate to the optimal solution may still be slow. Borrowing from ideas used in the well-known LPA* algorithm, in this paper we present a new incremental sampling-based motion planning algorithm based on Rapidly-exploring Random Graphs (RRG), denoted by RRT# (RRT 'sharp'), which also guarantees asymptotic optimality, but, in addition, it also ensures that the constructed spanning tree rooted at the initial state contains lowest-cost path information for vertices which have the potential to be part of the optimal solution", see the abstract of O. Arslan and P. Tsiotras, Use of Relaxation Methods in Sampling-based Algorithms for Optimal Motion Planning, Proceedings of the International Conference on Robotics and Automation, 2013, pages 2421-2428. (The citation is from a preliminary version that the authors made available to me.)
Jonathan Gammell, Siddhartha Srinivasa and Timothy Barfoot developed Batch Informed Trees (BIT*). They state that "BIT* uses a heuristic to efficiently search a series of increasingly dense implicit RGGs while reusing previous information. It can be viewed as an extension of incremental graph-search techniques, such as Lifelong Planning A* (LPA*) ..." and "BIT* can be viewed as an extension of LPA* ...", see J. Gammell, S. Srinivasa and T. Barfoot, Batch Informed Trees (BIT*): Sampling-based Optimal Planning via the Heuristically Guided Search of Implicit Random Geometric Graphs, Proceedings of the International Conference on Robotics and Automation, 2015, pages 3067-3074.
Abhijeet Anand implemented MT-D* Lite as part of the RACT-GOLD system of the RMIT Agent Contest Team of the Royal Melbourne Institute of Technology - for participating in the gold-mining game of the International Multi-Agent Contest. He concludes that MT-D* Lite "does provide a substantial improvement over A* to agents trying to reach their destination more often from across the grid" in his Master's thesis on "Path Planning in Agents with Incomplete Information in Dynamic Environments," see A. Anand, Path Planning in Agents with Incomplete Information in Dynamic Environments, M.S. Thesis, School of Computer Science and Information Technology, Royal Melbourne Institute of Technology, August 2011, page 48.
Xiaowu Liu, Riqiang Deng, Jinwen Wang and Xunzhang Wang used D* Lite to develop a search algorithm for codon optimization. They write "finding an optimal sequence from the massive number of possible combinations of synonymous codons that can code for the same amino acid sequence is a challenging task. In this article, we introduce COStar, a D-star Lite-based dynamic search algorithm for codon optimization. The algorithm first maps the codon optimization problem into a weighted directed acyclic graph using a sliding window approach. Then, the D-star Lite algorithm is used to compute the shortest path from the start site to the target site in the resulting graph. Optimizing a gene is thus converted to a search in real-time for a shortest path in a generated graph. Using in silico experiments, the performance of the algorithm was shown by optimizing the different genes including the human genome", see the abstract of X. Liu, R. Deng, J. Wang and X. Wang, "COStar: A D-star Lite-based dynamic search algorithm for codon optimization, Journal of Theoretical Biology, Volume 344, pages 19-30. (Unfortunately, I have not seen the whole article myself nor am I an expert on computational biology.)

Most of our publications include a version of the following disclaimer: "It is important to realize that experimental results, such as the runtimes of the search algorithms, depend on a variety of factors, including implementation details (such as the data structures, tie-breaking strategies, and coding tricks used) and experimental setups (such as whether the gridworlds are four-neighbor or eight-neighbor gridworlds). We do not know of any better method for evaluating search algorithms than to implement them as best as possible, publish their runtimes, and let other researchers experiment with their own and thus potentially different implementations and experimental setups." The reason for this disclaimer is that it would be important to have good computational models that can predict the performance of incremental heuristic search methods as a function of the search problems, implementation details and experimental setups. For example, D* Lite runs much faster than A* in some situations but also much slower in other situations. Its runtime advantage depends on the connectivity of the grid (for example, whether it is a four-neighbor or eight-neighbor grid) and its implementation of the priority queue (for example, whether the priority queue is implemented with a binary heap or buckets). In general, runtime proxies, such as the number of expansions, cannot be used to evaluate incremental heuristic search methods since they all need different amounts of time per expansion. Furthermore, they typically operate on search problems that fit into memory. Big O-analyses do not make sense for small search problems and implementation details really do matter and can have a large effect on the runtime. We definitely need to understand better which heuristic search method to use in which situation.

Tutorial

Tutorial Webpage

Applet

Code

The following code should be correct but is somewhat out of date. In particular, Lifelong Planning A* and D* Lite can now use buckets as priority queue or, if they use a heap as priority queue, break ties towards larger g-values. These improvements are not yet reflected in the code.

Lifelong Planning A* on Grids with Random Costs (clean code)
Lifelong Planning A* on Mazes with Random Obstacles
D* Lite with ASCII Interface (mostly untested)

Class Project

Class Project on Fast Trajectory Replanning

Representative Overview Publications

S. Koenig. Creating a Uniform Framework for Task and Motion Planning: A Case for Incremental Heuristic Search? [Overview Paper]. In ICAPS-10 Workshop on Combining Action and Motion Planning, 2010. [downloadable]
S. Koenig, M. Likhachev, Y. Liu and D. Furcy. Incremental Heuristic Search in Artificial Intelligence. Artificial Intelligence Magazine, 25, (2), 99-112, 2004. [downloadable]

Representative Publications on Heuristic Search

C. Hernandez, T. Uras, S. Koenig, J. Baier, X. Sun and P. Meseguer. Reusing Cost-Minimal Paths for Goal-Directed Navigation in Partially Known Terrains. Journal of Autonomous Agents and Multi-Agent Systems, 29, (5), 850-895, 2015. [downloadable]
X. Sun, T. Uras, S. Koenig and W. Yeoh. Incremental ARA*: An Incremental Anytime Search Algorithm for Moving-Target Search. In International Conference on Automated Planning and Scheduling (ICAPS), 2012. [downloadable]
C. Hernandez, J. Baier, T. Uras and S. Koenig. Time-Bounded Adaptive A*. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 997-1006, 2012. [downloadable]
C. Hernandez, J. Baier, T. Uras and S. Koenig. Position Paper: Incremental Search Algorithms Considered Poorly Understood. In Symposium on Combinatorial Search (SoCS), 2012. [downloadable]
C. Hernandez, X. Sun, S. Koenig and P. Meseguer. Tree Adaptive A*. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 123-130, 2011. [downloadable]
X. Sun, W. Yeoh and S. Koenig. Generalized Fringe-Retrieving A*: Faster Moving Target Search on State Lattices. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 1081-1088, 2010. [downloadable]
X. Sun, W. Yeoh and S. Koenig. Moving Target D* Lite. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 67-74, 2010. [downloadable]
S. Koenig and X. Sun. Comparing Real-Time and Incremental Heuristic Search for Real-Time Situated Agents. Journal of Autonomous Agents and Multi-Agent Systems, 18, (3), 313-341, 2009. [downloadable]
A. Nash, S. Koenig and M. Likhachev. Incremental Phi*: Incremental Any-Angle Path Planning on Grids. In International Joint Conference on Artificial Intelligence (IJCAI), 1824-1830, 2009. [downloadable]
X. Sun, W. Yeoh and S. Koenig. Efficient Incremental Search for Moving Target Search. In International Joint Conference on Artificial Intelligence (IJCAI), 615-620, 2009. [downloadable]
X. Sun, W. Yeoh, P. Chen and S. Koenig. Simple Optimization Techniques for A*-Based Search. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 931-936, 2009. [downloadable]
C. Hernandez, P. Meseguer, X. Sun and S. Koenig. Path-Adaptive A* for Incremental Heuristic Search in Unknown Terrain. In International Conference on Automated Planning and Scheduling (ICAPS), 358-361, 2009. [downloadable]
X. Sun, S. Koenig and W. Yeoh. Generalized Adaptive A*. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 469-476, 2008. [downloadable]
X. Sun and S. Koenig. The Fringe-Saving A* Search Algorithm - A Feasibility Study. In International Joint Conference on Artificial Intelligence (IJCAI), 2391-2397, 2007. [downloadable]
S. Koenig, M. Likhachev and X. Sun. Speeding up Moving-Target Search. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2007. [downloadable]
S. Koenig and M. Likhachev. Real-Time Adaptive A*. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 281-288, 2006. [downloadable]
M. Likhachev and S. Koenig. Incremental Heuristic Search in Games: The Quest for Speed [Short Paper]. In Artificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), 118-120, 2006. [downloadable]
S. Koenig and M. Likhachev. A New Principle for Incremental Heuristic Search: Theoretical Results [Short Paper]. In International Conference on Automated Planning and Scheduling (ICAPS), 402-405, 2006. [downloadable]
M. Likhachev and S. Koenig. A Generalized Framework for Lifelong Planning A*. In International Conference on Automated Planning and Scheduling (ICAPS), 99-108, 2005. [downloadable]
S. Koenig and M. Likhachev. Adaptive A* [Short Paper]. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 1311-1312, 2005. [downloadable]
S. Koenig, M. Likhachev and D. Furcy. Lifelong Planning A*. Artificial Intelligence Journal, 155, (1-2), 93-146, 2004. [downloadable]
S. Koenig. A Comparison of Fast Search Methods for Real-Time Situated Agents. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 864-871, 2004. [downloadable]
S. Koenig and M. Likhachev. Incremental A*. In Advances in Neural Information Processing Systems (NeurIPS), 1539-1546, 2002. [downloadable]

Representative Publications on Moving Target Search

X. Sun, W. Yeoh and S. Koenig. Generalized Fringe-Retrieving A*: Faster Moving Target Search on State Lattices. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 1081-1088, 2010. [downloadable]
X. Sun, W. Yeoh and S. Koenig. Moving Target D* Lite. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 67-74, 2010. [downloadable]
X. Sun, W. Yeoh and S. Koenig. Efficient Incremental Search for Moving Target Search. In International Joint Conference on Artificial Intelligence (IJCAI), 615-620, 2009. [downloadable]
S. Koenig, M. Likhachev and X. Sun. Speeding up Moving-Target Search. In International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2007. [downloadable]

Representative Publications on Symbolic Planning

S. Koenig. Topics for Future Planning Competitions [Position Paper]. In ICAPS-03 Workshop on the Competition: Impact, Organization, Evaluation, Benchmarks, 2003. [downloadable]
Y. Liu, S. Koenig and D. Furcy. Speeding Up the Calculation of Heuristics for Heuristic Search-Based Planning. In AAAI Conference on Artificial Intelligence (AAAI), 484-491, 2002. [downloadable]
S. Koenig, D. Furcy and C. Bauer. Heuristic Search-Based Replanning. In International Conference on Artificial Intelligence Planning and Scheduling (AIPS), 294-301, 2002. [downloadable]

Representative Publications on Robotics

S. Koenig and M. Likhachev. Fast Replanning for Navigation in Unknown Terrain. Transactions on Robotics, 21, (3), 354-363, 2005. [downloadable]
A. Ranganathan and S. Koenig. A Reactive Robot Architecture with Planning on Demand. In IEEE International Conference on Intelligent Robots and Systems (IROS), 1462-1468, 2003. [downloadable]
S. Koenig and M. Likhachev. D* Lite. In AAAI Conference of Artificial Intelligence (AAAI), 476-483, 2002. [downloadable]
M. Likhachev and S. Koenig. Incremental Replanning for Mapping. In IEEE International Conference on Intelligent Robots and Systems (IROS), 667-672, 2002. [downloadable]

Representative Publications on Reinforcement Learning

M. Likhachev and S. Koenig. Speeding Up the Parti-Game Algorithm. In Advances in Neural Information Processing Systems (NeurIPS), 1563-1570, MIT Press, 2003. [downloadable]

Some of this material is based upon work supported by the National Science Foundation under Grant No. 0350584. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Many publishers do not want authors to make their papers available electronically after the papers have been published. Please use the electronic versions provided here only if hardcopies are not yet available. If you have comments on any of these papers, please send me an email! Also, please send me your papers if we have common interests.

This page was automatically created by a bibliography maintenance system that was developed as part of an undergraduate research project, advised by Sven Koenig.

Home Page of Sven Koenig