Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yoshiaki Inoue

Evaluation of Large Language Models for Decision Making in Autonomous Driving

Dec 11, 2023

Kotaro Tanahashi, Yuichi Inoue, Yu Yamaguchi, Hidetatsu Yaginuma, Daiki Shiotsuka, Hiroyuki Shimatani, Kohei Iwamasa, Yoshiaki Inoue, Takafumi Yamaguchi, Koki Igari(+4 more)

Figure 1 for Evaluation of Large Language Models for Decision Making in Autonomous Driving

Figure 2 for Evaluation of Large Language Models for Decision Making in Autonomous Driving

Figure 3 for Evaluation of Large Language Models for Decision Making in Autonomous Driving

Figure 4 for Evaluation of Large Language Models for Decision Making in Autonomous Driving

Abstract:Various methods have been proposed for utilizing Large Language Models (LLMs) in autonomous driving. One strategy of using LLMs for autonomous driving involves inputting surrounding objects as text prompts to the LLMs, along with their coordinate and velocity information, and then outputting the subsequent movements of the vehicle. When using LLMs for such purposes, capabilities such as spatial recognition and planning are essential. In particular, two foundational capabilities are required: (1) spatial-aware decision making, which is the ability to recognize space from coordinate information and make decisions to avoid collisions, and (2) the ability to adhere to traffic rules. However, quantitative research has not been conducted on how accurately different types of LLMs can handle these problems. In this study, we quantitatively evaluated these two abilities of LLMs in the context of autonomous driving. Furthermore, to conduct a Proof of Concept (POC) for the feasibility of implementing these abilities in actual vehicles, we developed a system that uses LLMs to drive a vehicle.

* Accepted at the 2023 Symposium on Machine Learning for Autonomous Driving collocated with NeurIPS

Via

Access Paper or Ask Questions

Queueing Analysis of GPU-Based Inference Servers with Dynamic Batching: A Closed-Form Characterization

Dec 13, 2019

Yoshiaki Inoue

Figure 1 for Queueing Analysis of GPU-Based Inference Servers with Dynamic Batching: A Closed-Form Characterization

Figure 2 for Queueing Analysis of GPU-Based Inference Servers with Dynamic Batching: A Closed-Form Characterization

Figure 3 for Queueing Analysis of GPU-Based Inference Servers with Dynamic Batching: A Closed-Form Characterization

Figure 4 for Queueing Analysis of GPU-Based Inference Servers with Dynamic Batching: A Closed-Form Characterization

Abstract:GPU-accelerated computing is a key technology to realize high-speed inference servers using deep neural networks (DNNs). An important characteristic of GPU-based inference is that the computational efficiency, in terms of the processing speed and energy consumption, drastically increases by processing multiple jobs together in a batch. In this paper, we formulate GPU-based inference servers as a batch service queueing model with batch-size dependent processing times. We first show that the energy efficiency of the server monotonically increases with the arrival rate of inference jobs, which suggests that it is energy-efficient to operate the inference server under a utilization level as high as possible within a latency requirement of inference jobs. We then derive a closed-form upper bound for the mean latency, which provides a simple characterization of the latency performance. Through simulation and numerical experiments, we show that the exact value of the mean latency is well approximated by this upper bound.

Via

Access Paper or Ask Questions