Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yancheng Pan

Adaptive Rectification Sampling for Test-Time Compute Scaling

Apr 02, 2025

Zhendong Tan, Xingjun Zhang, Chaoyi Hu, Yancheng Pan, Shaoxun Wang

Abstract:The newly released OpenAI-o1 and DeepSeek-R1 have demonstrated that test-time scaling can significantly improve model performance, especially in complex tasks such as logical reasoning. Common test-time scaling methods involve generating more chain of thoughts (CoTs) or longer CoTs with self-correction. However, while self-correction can improve performance, it may lead to significant token waste and reduce readability of the CoT if the reasoning steps are already correct. To demonstrate that large language models (LLMs) can rectify errors at a more fine-grained level, we propose Adaptive Rectification Sampling (AR-Sampling), which can guide the LLMs to self-correction at the appropriate step. AR-Sampling leverages a process-supervised reward model (PRM) as a verifier and constructed trigger sentences to guide the model in adaptive step-level rethinking. Through the experiments on GSM8K and MATH500, it indicate that our approach enables the models to rethink in more fine-grained level, improving the accuracy of solutions, while generating a reasonable number of additional tokens.

Via

Access Paper or Ask Questions

Understanding the Challenges When 3D Semantic Segmentation Faces Class Imbalanced and OOD Data

Mar 01, 2022

Yancheng Pan, Fan Xie, Huijing Zhao

Figure 1 for Understanding the Challenges When 3D Semantic Segmentation Faces Class Imbalanced and OOD Data

Figure 2 for Understanding the Challenges When 3D Semantic Segmentation Faces Class Imbalanced and OOD Data

Figure 3 for Understanding the Challenges When 3D Semantic Segmentation Faces Class Imbalanced and OOD Data

Figure 4 for Understanding the Challenges When 3D Semantic Segmentation Faces Class Imbalanced and OOD Data

Abstract:3D semantic segmentation (3DSS) is an essential process in the creation of a safe autonomous driving system. However, deep learning models for 3D semantic segmentation often suffer from the class imbalance problem and out-of-distribution (OOD) data. In this study, we explore how the class imbalance problem affects 3DSS performance and whether the model can detect the category prediction correctness, or whether data is ID (in-distribution) or OOD. For these purposes, we conduct two experiments using three representative 3DSS models and five trust scoring methods, and conduct both a confusion and feature analysis of each class. Furthermore, a data augmentation method for the 3D LiDAR dataset is proposed to create a new dataset based on SemanticKITTI and SemanticPOSS, called AugKITTI. We propose the wPre metric and TSD for a more in-depth analysis of the results, and follow are proposals with an insightful discussion. Based on the experimental results, we find that: (1) the classes are not only imbalanced in their data size but also in the basic properties of each semantic category. (2) The intraclass diversity and interclass ambiguity make class learning difficult and greatly limit the models' performance, creating the challenges of semantic and data gaps. (3) The trust scores are unreliable for classes whose features are confused with other classes. For 3DSS models, those misclassified ID classes and OODs may also be given high trust scores, making the 3DSS predictions unreliable, and leading to the challenges in judging 3DSS result trustworthiness. All of these outcomes point to several research directions for improving the performance and reliability of the 3DSS models used for real-world applications.

Via

Access Paper or Ask Questions

Are We Hungry for 3D LiDAR Data for Semantic Segmentation?

Jun 08, 2020

Biao Gao, Yancheng Pan, Chengkun Li, Sibo Geng, Huijing Zhao

Figure 1 for Are We Hungry for 3D LiDAR Data for Semantic Segmentation?

Figure 2 for Are We Hungry for 3D LiDAR Data for Semantic Segmentation?

Figure 3 for Are We Hungry for 3D LiDAR Data for Semantic Segmentation?

Figure 4 for Are We Hungry for 3D LiDAR Data for Semantic Segmentation?

Abstract:3D LiDAR semantic segmentation is a pivotal task that is widely involved in many applications, such as autonomous driving and robotics. Studies of 3D LiDAR semantic segmentation have recently achieved considerable development, especially in terms of deep learning strategies. However, these studies usually rely heavily on considerable fine annotated data, while point-wise 3D LiDAR datasets are extremely insufficient and expensive to label. The performance limitation caused by the lack of training data is called the data hungry effect. This survey aims to explore whether and how we are hungry for 3D LiDAR data for semantic segmentation. Thus, we first provide an organized review of existing 3D datasets and 3D semantic segmentation methods. Then, we provide an in-depth analysis of three representative datasets and several experiments to evaluate the data hungry effects in different aspects. Efforts to solve data hungry problems are summarized for both 3D LiDAR-focused methods and general-purpose methods. Finally, insightful topics are discussed for future research on data hungry problems and open questions.

* 26 pages, 18 figures

Via

Access Paper or Ask Questions

Off-Road Drivable Area Extraction Using 3D LiDAR Data

Mar 10, 2020

Biao Gao, Anran Xu, Yancheng Pan, Xijun Zhao, Wen Yao, Huijing Zhao

Figure 1 for Off-Road Drivable Area Extraction Using 3D LiDAR Data

Figure 2 for Off-Road Drivable Area Extraction Using 3D LiDAR Data

Figure 3 for Off-Road Drivable Area Extraction Using 3D LiDAR Data

Figure 4 for Off-Road Drivable Area Extraction Using 3D LiDAR Data

Abstract:We propose a method for off-road drivable area extraction using 3D LiDAR data with the goal of autonomous driving application. A specific deep learning framework is designed to deal with the ambiguous area, which is one of the main challenges in the off-road environment. To reduce the considerable demand for human-annotated data for network training, we utilize the information from vast quantities of vehicle paths and auto-generated obstacle labels. Using these autogenerated annotations, the proposed network can be trained using weakly supervised or semi-supervised methods, which can achieve better performance with fewer human annotations. The experiments on our dataset illustrate the reasonability of our framework and the validity of our weakly and semi-supervised methods.

* Accepted by IEEE Intelligent Vehicles Symposium (IV2019)

Via

Access Paper or Ask Questions

SemanticPOSS: A Point Cloud Dataset with Large Quantity of Dynamic Instances

Feb 21, 2020

Yancheng Pan, Biao Gao, Jilin Mei, Sibo Geng, Chengkun Li, Huijing Zhao

Figure 1 for SemanticPOSS: A Point Cloud Dataset with Large Quantity of Dynamic Instances

Figure 2 for SemanticPOSS: A Point Cloud Dataset with Large Quantity of Dynamic Instances

Figure 3 for SemanticPOSS: A Point Cloud Dataset with Large Quantity of Dynamic Instances

Figure 4 for SemanticPOSS: A Point Cloud Dataset with Large Quantity of Dynamic Instances

Abstract:3D semantic segmentation is one of the key tasks for autonomous driving system. Recently, deep learning models for 3D semantic segmentation task have been widely researched, but they usually require large amounts of training data. However, the present datasets for 3D semantic segmentation are lack of point-wise annotation, diversiform scenes and dynamic objects. In this paper, we propose the SemanticPOSS dataset, which contains 2988 various and complicated LiDAR scans with large quantity of dynamic instances. The data is collected in Peking University and uses the same data format as SemanticKITTI. In addition, we evaluate several typical 3D semantic segmentation models on our SemanticPOSS dataset. Experimental results show that SemanticPOSS can help to improve the prediction accuracy of dynamic objects as people, car in some degree. SemanticPOSS will be published at \url{www.poss.pku.edu.cn}.

* submited to IEEE Intelligent Vehicles Symposium(2020)

Via

Access Paper or Ask Questions