Hongyuan Su

Understanding World or Predicting Future? A Comprehensive Survey of World Models

Nov 21, 2024

Jingtao Ding, Yunke Zhang, Yu Shang, Yuheng Zhang, Zefang Zong, Jie Feng, Yuan Yuan, Hongyuan Su, Nian Li, Nicholas Sukiennik(+2 more)

Abstract: The concept of world models has garnered significant attention due to advancements in multimodal large language models such as GPT-4 and video generation models such as Sora, which are central to the pursuit of artificial general intelligence. This survey offers a comprehensive review of the literature on world models. Generally, world models are regarded as tools for either understanding the present state of the world or predicting its future dynamics. This review presents a systematic categorization of world models, emphasizing two primary functions: (1) constructing internal representations to understand the mechanisms of the world, and (2) predicting future states to simulate and guide decision-making. We first examine the current progress in these two categories. We then explore the application of world models in key domains, including autonomous driving, robotics, and social simulacra, with a focus on how each domain utilizes these two functions. Finally, we outline key challenges and provide insights into potential future research directions.

Large-scale Urban Facility Location Selection with Knowledge-informed Reinforcement Learning

Sep 03, 2024

Hongyuan Su, Yu Zheng, Jingtao Ding, Depeng Jin, Yong Li

Abstract: The facility location problem (FLP) is a classical combinatorial optimization challenge aimed at strategically laying out facilities to maximize their accessibility. In this paper, we propose a reinforcement learning method tailored to solve large-scale urban FLP, capable of producing near-optimal solutions at very fast inference speed. We distill the essential swap operation from local search and simulate it by selecting edges on a graph of urban regions, guided by a knowledge-informed graph neural network, thus sidestepping the heavy computation of local search. Extensive experiments on four US cities with different geospatial conditions demonstrate that our approach can achieve performance comparable to commercial solvers, with less than 5% accessibility loss and a speedup of up to 1000 times. We deploy our model as an online geospatial application at https://huggingface.co/spaces/randommmm/MFLP.

* 4 pages
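
For readers unfamiliar with the swap operation that the abstract above refers to, the following is a minimal sketch of classical swap-based local search, the baseline procedure that the paper says it avoids running in full. It is not the authors' method. The objective (sum of distances from each demand point to its nearest open facility, as in the p-median problem) and the names `total_cost` and `swap_local_search` are assumptions made for illustration.

```python
from itertools import product


def total_cost(dist, facilities):
    """Sum over demand points of the distance to the nearest open facility.

    dist[i][j] is the distance from demand point i to candidate site j.
    This p-median style objective is an assumption, not taken from the paper.
    """
    return sum(min(row[f] for f in facilities) for row in dist)


def swap_local_search(dist, facilities):
    """Apply the best improving swap (close one site, open another) until none helps."""
    current = set(facilities)
    n_sites = len(dist[0])
    best = total_cost(dist, current)
    improved = True
    while improved:
        improved = False
        best_move = None
        # Every swap is evaluated in each round, which is the expensive part.
        for out_site, in_site in product(sorted(current), range(n_sites)):
            if in_site in current:
                continue
            candidate = (current - {out_site}) | {in_site}
            cost = total_cost(dist, candidate)
            if cost < best:
                best, best_move, improved = cost, candidate, True
        if improved:
            current = best_move
    return sorted(current), best


# Four points on a line; each point is both a demand point and a candidate site.
points = [0, 1, 10, 11]
dist = [[abs(a - b) for b in points] for a in points]
sites, cost = swap_local_search(dist, [0, 1])
```

Each round of this sketch evaluates every (closed, opened) pair, so the cost grows quickly with the number of sites, which is the kind of computation a learned selection policy would aim to skip.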

Road Planning for Slums via Deep Reinforcement Learning

May 22, 2023

Yu Zheng, Hongyuan Su, Jingtao Ding, Depeng Jin, Yong Li

Abstract: Millions of slum dwellers suffer from poor accessibility to urban services due to inadequate road infrastructure within slums, and road planning for slums is critical to the sustainable development of cities. Existing re-blocking or heuristic methods are either time-consuming and unable to generalize to different slums, or yield sub-optimal road plans in terms of accessibility and construction costs. In this paper, we present a deep reinforcement learning based approach to automatically lay out roads for slums. We propose a generic graph model to capture the topological structure of a slum, and devise a novel graph neural network to select locations for the planned roads. Through masked policy optimization, our model can generate road plans that connect places in a slum at minimal construction costs. Extensive experiments on real-world slums in different countries verify the effectiveness of our model, which improves accessibility by 14.3% over existing baseline methods. Further investigations on transferring across different tasks demonstrate that our model can master road planning skills in simple scenarios and adapt them to much more complicated ones, indicating the potential of applying our model in real-world slum upgrading.

* KDD'23
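
The abstract above mentions masked policy optimization without giving details. One common reading of a "masked" policy is action masking, where invalid actions are excluded before the policy's probabilities are normalized. The sketch below shows that general technique only. It is an assumption about what the term means here, not the paper's implementation, and `masked_softmax` is a hypothetical helper name.

```python
import math


def masked_softmax(logits, mask):
    """Softmax restricted to valid actions; masked-out actions get probability 0.

    logits: raw scores, one per candidate action (e.g. per candidate road segment).
    mask:   booleans, True where the action is currently allowed.
    """
    if not any(mask):
        raise ValueError("at least one action must be valid")
    # Subtract the largest valid logit for numerical stability.
    peak = max(score for score, ok in zip(logits, mask) if ok)
    exps = [math.exp(score - peak) if ok else 0.0 for score, ok in zip(logits, mask)]
    total = sum(exps)
    return [e / total for e in exps]


# The second action is disallowed, so it can never be sampled.
probs = masked_softmax([1.0, 2.0, 3.0], [True, False, True])
```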

Adaptive Mimic: Deep Reinforcement Learning of Parameterized Bipedal Walking from Infeasible References

Dec 13, 2021

Chong Zhang, Qi Wu, Liqian Ma, Hongyuan Su

Abstract: Only recently has robust robot locomotion been achieved by deep reinforcement learning (DRL). However, for efficient learning of parameterized bipedal walking, developed references are usually required, limiting the performance to that of the references. In this paper, we propose to design an adaptive reward function for imitation learning from the references. The agent is encouraged to mimic the references when its performance is low, and to pursue higher performance once it reaches the limit of the references. We further demonstrate that developed references can be replaced by low-quality references that are generated without laborious tuning and are infeasible to deploy by themselves, as long as they can provide a priori knowledge to expedite the learning process.

* 12 pages, 9 figures (one added in v2). Submitted to L4DC 2022
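
The adaptive reward idea in the abstract above (imitate when performance is low, pursue the task once the references are matched) can be illustrated with a simple blending rule. The abstract does not give the paper's actual formula, so the linear weighting and all names below (`adaptive_reward`, `reference_limit`) are hypothetical and serve only to show the general shape of such a reward.

```python
def adaptive_reward(imitation_reward, task_reward, performance, reference_limit):
    """Blend imitation and task rewards according to current performance.

    performance:     a scalar measure of how well the agent currently does.
    reference_limit: the performance level the references can reach (must be > 0).
    The linear schedule here is an illustrative assumption, not the paper's rule.
    """
    ratio = performance / reference_limit
    # Weight on imitation: 1 at zero performance, 0 at or beyond the reference limit.
    w = min(1.0, max(0.0, 1.0 - ratio))
    return w * imitation_reward + (1.0 - w) * task_reward


# Far below the references the reward is pure imitation;
# at the reference limit it is pure task reward.
early = adaptive_reward(1.0, 0.0, performance=0.0, reference_limit=1.0)
late = adaptive_reward(1.0, 0.0, performance=1.0, reference_limit=1.0)
```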
