Title: SensorBench: Benchmarking LLMs in Coding-Based Sensor Processing

Oct 14, 2024

Pengrui Quan, Xiaomin Ouyang, Jeya Vikranth Jeyakumar, Ziqi Wang, Yang Xing, Mani Srivastava

Abstract: Effective processing, interpretation, and management of sensor data have emerged as a critical component of cyber-physical systems. Traditionally, processing sensor data requires profound theoretical knowledge and proficiency in signal-processing tools. However, recent works show that Large Language Models (LLMs) have promising capabilities in processing sensory data, suggesting their potential as copilots for developing sensing systems. To explore this potential, we construct a comprehensive benchmark, SensorBench, to establish a quantifiable objective. The benchmark incorporates diverse real-world sensor datasets for various tasks. The results show that while LLMs exhibit considerable proficiency in simpler tasks, they face inherent challenges in processing compositional tasks with parameter selections compared to engineering experts. Additionally, we investigate four prompting strategies for sensor processing and show that self-verification can outperform all other baselines in 48% of tasks. Our study provides a comprehensive benchmark and prompting analysis for future developments, paving the way toward an LLM-based sensor processing copilot.
