Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Michael Xu

PARC: Physics-based Augmentation with Reinforcement Learning for Character Controllers

May 06, 2025

Michael Xu, Yi Shi, KangKang Yin, Xue Bin Peng

Abstract:Humans excel in navigating diverse, complex environments with agile motor skills, exemplified by parkour practitioners performing dynamic maneuvers, such as climbing up walls and jumping across gaps. Reproducing these agile movements with simulated characters remains challenging, in part due to the scarcity of motion capture data for agile terrain traversal behaviors and the high cost of acquiring such data. In this work, we introduce PARC (Physics-based Augmentation with Reinforcement Learning for Character Controllers), a framework that leverages machine learning and physics-based simulation to iteratively augment motion datasets and expand the capabilities of terrain traversal controllers. PARC begins by training a motion generator on a small dataset consisting of core terrain traversal skills. The motion generator is then used to produce synthetic data for traversing new terrains. However, these generated motions often exhibit artifacts, such as incorrect contacts or discontinuities. To correct these artifacts, we train a physics-based tracking controller to imitate the motions in simulation. The corrected motions are then added to the dataset, which is used to continue training the motion generator in the next iteration. PARC's iterative process jointly expands the capabilities of the motion generator and tracker, creating agile and versatile models for interacting with complex environments. PARC provides an effective approach to develop controllers for agile terrain traversal, which bridges the gap between the scarcity of motion data and the need for versatile character controllers.

* SIGGRAPH Conference Papers 2025

Via

Access Paper or Ask Questions

Performance Assessment of Feature Detection Methods for 2-D FS Sonar Imagery

Sep 11, 2024

Hitesh Kyatham, Shahriar Negahdaripour, Michael Xu, Xiaomin Lin, Miao Yu, Yiannis Aloimonos

Figure 1 for Performance Assessment of Feature Detection Methods for 2-D FS Sonar Imagery

Figure 2 for Performance Assessment of Feature Detection Methods for 2-D FS Sonar Imagery

Figure 3 for Performance Assessment of Feature Detection Methods for 2-D FS Sonar Imagery

Figure 4 for Performance Assessment of Feature Detection Methods for 2-D FS Sonar Imagery

Abstract:Underwater robot perception is crucial in scientific subsea exploration and commercial operations. The key challenges include non-uniform lighting and poor visibility in turbid environments. High-frequency forward-look sonar cameras address these issues, by providing high-resolution imagery at maximum range of tens of meters, despite complexities posed by high degree of speckle noise, and lack of color and texture. In particular, robust feature detection is an essential initial step for automated object recognition, localization, navigation, and 3-D mapping. Various local feature detectors developed for RGB images are not well-suited for sonar data. To assess their performances, we evaluate a number of feature detectors using real sonar images from five different sonar devices. Performance metrics such as detection accuracy, false positives, and robustness to variations in target characteristics and sonar devices are applied to analyze the experimental results. The study would provide a deeper insight into the bottlenecks of feature detection for sonar data, and developing more effective methods

Via

Access Paper or Ask Questions

Collaborative Quest Completion with LLM-driven Non-Player Characters in Minecraft

Jul 03, 2024

Sudha Rao, Weijia Xu, Michael Xu, Jorge Leandro, Ken Lobb, Gabriel DesGarennes, Chris Brockett, Bill Dolan

Figure 1 for Collaborative Quest Completion with LLM-driven Non-Player Characters in Minecraft

Figure 2 for Collaborative Quest Completion with LLM-driven Non-Player Characters in Minecraft

Figure 3 for Collaborative Quest Completion with LLM-driven Non-Player Characters in Minecraft

Figure 4 for Collaborative Quest Completion with LLM-driven Non-Player Characters in Minecraft

Abstract:The use of generative AI in video game development is on the rise, and as the conversational and other capabilities of large language models continue to improve, we expect LLM-driven non-player characters (NPCs) to become widely deployed. In this paper, we seek to understand how human players collaborate with LLM-driven NPCs to accomplish in-game goals. We design a minigame within Minecraft where a player works with two GPT4-driven NPCs to complete a quest. We perform a user study in which 28 Minecraft players play this minigame and share their feedback. On analyzing the game logs and recordings, we find that several patterns of collaborative behavior emerge from the NPCs and the human players. We also report on the current limitations of language-only models that do not have rich game-state or visual understanding. We believe that this preliminary study and analysis will inform future game developers on how to better exploit these rapidly improving generative AI models for collaborative roles in games.

* ACL 2024
* Accepted at Wordplay workshop at ACL 2024

Via

Access Paper or Ask Questions

Player-Driven Emergence in LLM-Driven Game Narrative

Apr 25, 2024

Xiangyu Peng, Jessica Quaye, Weijia Xu, Chris Brockett, Bill Dolan, Nebojsa Jojic, Gabriel DesGarennes, Ken Lobb, Michael Xu, Jorge Leandro(+2 more)

Figure 1 for Player-Driven Emergence in LLM-Driven Game Narrative

Figure 2 for Player-Driven Emergence in LLM-Driven Game Narrative

Figure 3 for Player-Driven Emergence in LLM-Driven Game Narrative

Figure 4 for Player-Driven Emergence in LLM-Driven Game Narrative

Abstract:We explore how interaction with large language models (LLMs) can give rise to emergent behaviors, empowering players to participate in the evolution of game narratives. Our testbed is a text-adventure game in which players attempt to solve a mystery under a fixed narrative premise, but can freely interact with non-player characters generated by GPT-4, a large language model. We recruit 28 gamers to play the game and use GPT-4 to automatically convert the game logs into a node-graph representing the narrative in the player's gameplay. We find that through their interactions with the non-deterministic behavior of the LLM, players are able to discover interesting new emergent nodes that were not a part of the original narrative but have potential for being fun and engaging. Players that created the most emergent nodes tended to be those that often enjoy games that facilitate discovery, exploration and experimentation.

* IEEE Conference on Games 2024

Via

Access Paper or Ask Questions

GRIM: GRaph-based Interactive narrative visualization for gaMes

Nov 15, 2023

Jorge Leandro, Sudha Rao, Michael Xu, Weijia Xu, Nebosja Jojic, Chris Brockett, Bill Dolan

Abstract:Dialogue-based Role Playing Games (RPGs) require powerful storytelling. The narratives of these may take years to write and typically involve a large creative team. In this work, we demonstrate the potential of large generative text models to assist this process. \textbf{GRIM}, a prototype \textbf{GR}aph-based \textbf{I}nteractive narrative visualization system for ga\textbf{M}es, generates a rich narrative graph with branching storylines that match a high-level narrative description and constraints provided by the designer. Game designers can interactively edit the graph by automatically generating new sub-graphs that fit the edits within the original narrative and constraints. We illustrate the use of \textbf{GRIM} in conjunction with GPT-4, generating branching narratives for four well-known stories with different contextual constraints.

Via

Access Paper or Ask Questions

Towards Augmented Microscopy with Reinforcement Learning-Enhanced Workflows

Aug 04, 2022

Michael Xu, Abinash Kumar, James M. LeBeau

Figure 1 for Towards Augmented Microscopy with Reinforcement Learning-Enhanced Workflows

Figure 2 for Towards Augmented Microscopy with Reinforcement Learning-Enhanced Workflows

Figure 3 for Towards Augmented Microscopy with Reinforcement Learning-Enhanced Workflows

Figure 4 for Towards Augmented Microscopy with Reinforcement Learning-Enhanced Workflows

Abstract:Here, we report a case study implementation of reinforcement learning (RL) to automate operations in the scanning transmission electron microscopy (STEM) workflow. To do so, we design a virtual, prototypical RL environment to test and develop a network to autonomously align the electron beam without prior knowledge. Using this simulator, we evaluate the impact of environment design and algorithm hyperparameters on alignment accuracy and learning convergence, showing robust convergence across a wide hyperparameter space. Additionally, we deploy a successful model on the microscope to validate the approach and demonstrate the value of designing appropriate virtual environments. Consistent with simulated results, the on-microscope RL model achieves convergence to the goal alignment after minimal training. Overall, the results highlight that by taking advantage of RL, microscope operations can be automated without the need for extensive algorithm design, taking another step towards augmenting electron microscopy with machine learning methods.

Via

Access Paper or Ask Questions