Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yanjie Liang

SRLCG: Self-Rectified Large-Scale Code Generation with Multidimensional Chain-of-Thought and Dynamic Backtracking

Apr 01, 2025

Hongru Ma, Yanjie Liang, Jiasheng Si, Weiyu Zhang, Hongjiao Guan, Chaoqun Zheng, Bing Xu, Wenpeng Lu

Abstract:Large language models (LLMs) have revolutionized code generation, significantly enhancing developer productivity. However, for a vast number of users with minimal coding knowledge, LLMs provide little support, as they primarily generate isolated code snippets rather than complete, large-scale project code. Without coding expertise, these users struggle to interpret, modify, and iteratively refine the outputs of LLMs, making it impossible to assemble a complete project. To address this issue, we propose Self-Rectified Large-Scale Code Generator (SRLCG), a framework that generates complete multi-file project code from a single prompt. SRLCG employs a novel multidimensional chain-of-thought (CoT) and self-rectification to guide LLMs in generating correct and robust code files, then integrates them into a complete and coherent project using our proposed dynamic backtracking algorithm. Experimental results show that SRLCG generates code 15x longer than DeepSeek-V3, 16x longer than GPT-4, and at least 10x longer than other leading CoT-based baselines. Furthermore, they confirm its improved correctness, robustness, and performance compared to baselines in large-scale code generation.

* 23 pages

Via

Access Paper or Ask Questions

TSG: Target-Selective Gradient Backprop for Probing CNN Visual Saliency

Oct 11, 2021

Lin Cheng, Pengfei Fang, Yanjie Liang, Liao Zhang, Chunhua Shen, Hanzi Wang

Figure 1 for TSG: Target-Selective Gradient Backprop for Probing CNN Visual Saliency

Figure 2 for TSG: Target-Selective Gradient Backprop for Probing CNN Visual Saliency

Figure 3 for TSG: Target-Selective Gradient Backprop for Probing CNN Visual Saliency

Figure 4 for TSG: Target-Selective Gradient Backprop for Probing CNN Visual Saliency

Abstract:The explanation for deep neural networks has drawn extensive attention in the deep learning community over the past few years. In this work, we study the visual saliency, a.k.a. visual explanation, to interpret convolutional neural networks. Compared to iteration based saliency methods, single backward pass based saliency methods benefit from faster speed and are widely used in downstream visual tasks. Thus our work focuses on single backward pass approaches. However, existing methods in this category struggle to successfully produce fine-grained saliency maps concentrating on specific target classes. That said, producing faithful saliency maps satisfying both target-selectiveness and fine-grainedness using a single backward pass is a challenging problem in the field. To mitigate this problem, we revisit the gradient flow inside the network, and find that the entangled semantics and original weights may disturb the propagation of target-relevant saliency. Inspired by those observations, we propose a novel visual saliency framework, termed Target-Selective Gradient (TSG) backprop, which leverages rectification operations to effectively emphasize target classes and further efficiently propagate the saliency to the input space, thereby generating target-selective and fine-grained saliency maps. The proposed TSG consists of two components, namely, TSG-Conv and TSG-FC, which rectify the gradients for convolutional layers and fully-connected layers, respectively. Thorough qualitative and quantitative experiments on ImageNet and Pascal VOC show that the proposed framework achieves more accurate and reliable results than other competitive methods.

* Submitted to IEEE Transactions on Image Processing

Via

Access Paper or Ask Questions

Robust Visual Tracking via Statistical Positive Sample Generation and Gradient Aware Learning

Nov 09, 2020

Lijian Lin, Haosheng Chen, Yanjie Liang, Yan Yan, Hanzi Wang

Figure 1 for Robust Visual Tracking via Statistical Positive Sample Generation and Gradient Aware Learning

Figure 2 for Robust Visual Tracking via Statistical Positive Sample Generation and Gradient Aware Learning

Figure 3 for Robust Visual Tracking via Statistical Positive Sample Generation and Gradient Aware Learning

Figure 4 for Robust Visual Tracking via Statistical Positive Sample Generation and Gradient Aware Learning

Abstract:In recent years, Convolutional Neural Network (CNN) based trackers have achieved state-of-the-art performance on multiple benchmark datasets. Most of these trackers train a binary classifier to distinguish the target from its background. However, they suffer from two limitations. Firstly, these trackers cannot effectively handle significant appearance variations due to the limited number of positive samples. Secondly, there exists a significant imbalance of gradient contributions between easy and hard samples, where the easy samples usually dominate the computation of gradient. In this paper, we propose a robust tracking method via Statistical Positive sample generation and Gradient Aware learning (SPGA) to address the above two limitations. To enrich the diversity of positive samples, we present an effective and efficient statistical positive sample generation algorithm to generate positive samples in the feature space. Furthermore, to handle the issue of imbalance between easy and hard samples, we propose a gradient sensitive loss to harmonize the gradient contributions between easy and hard samples. Extensive experiments on three challenging benchmark datasets including OTB50, OTB100 and VOT2016 demonstrate that the proposed SPGA performs favorably against several state-of-the-art trackers.

* ACM MM Asia2019
* 6 pages

Via

Access Paper or Ask Questions

Correlation filter tracking with adaptive proposal selection for accurate scale estimation

Jul 14, 2020

Luo Xiong, Yanjie Liang, Yan Yan, Hanzi Wang

Figure 1 for Correlation filter tracking with adaptive proposal selection for accurate scale estimation

Figure 2 for Correlation filter tracking with adaptive proposal selection for accurate scale estimation

Figure 3 for Correlation filter tracking with adaptive proposal selection for accurate scale estimation

Figure 4 for Correlation filter tracking with adaptive proposal selection for accurate scale estimation

Abstract:Recently, some correlation filter based trackers with detection proposals have achieved state-of-the-art tracking results. However, a large number of redundant proposals given by the proposal generator may degrade the performance and speed of these trackers. In this paper, we propose an adaptive proposal selection algorithm which can generate a small number of high-quality proposals to handle the problem of scale variations for visual object tracking. Specifically, we firstly utilize the color histograms in the HSV color space to represent the instances (i.e., the initial target in the first frame and the predicted target in the previous frame) and proposals. Then, an adaptive strategy based on the color similarity is formulated to select high-quality proposals. We further integrate the proposed adaptive proposal selection algorithm with coarse-to-fine deep features to validate the generalization and efficiency of the proposed tracker. Experiments on two benchmark datasets demonstrate that the proposed algorithm performs favorably against several state-of-the-art trackers.

* 6 pages, 14 figures

Via

Access Paper or Ask Questions

Asynchronous Tracking-by-Detection on Adaptive Time Surfaces for Event-based Object Tracking

Feb 13, 2020

Haosheng Chen, Qiangqiang Wu, Yanjie Liang, Xinbo Gao, Hanzi Wang

Figure 1 for Asynchronous Tracking-by-Detection on Adaptive Time Surfaces for Event-based Object Tracking

Figure 2 for Asynchronous Tracking-by-Detection on Adaptive Time Surfaces for Event-based Object Tracking

Figure 3 for Asynchronous Tracking-by-Detection on Adaptive Time Surfaces for Event-based Object Tracking

Figure 4 for Asynchronous Tracking-by-Detection on Adaptive Time Surfaces for Event-based Object Tracking

Abstract:Event cameras, which are asynchronous bio-inspired vision sensors, have shown great potential in a variety of situations, such as fast motion and low illumination scenes. However, most of the event-based object tracking methods are designed for scenarios with untextured objects and uncluttered backgrounds. There are few event-based object tracking methods that support bounding box-based object tracking. The main idea behind this work is to propose an asynchronous Event-based Tracking-by-Detection (ETD) method for generic bounding box-based object tracking. To achieve this goal, we present an Adaptive Time-Surface with Linear Time Decay (ATSLTD) event-to-frame conversion algorithm, which asynchronously and effectively warps the spatio-temporal information of asynchronous retinal events to a sequence of ATSLTD frames with clear object contours. We feed the sequence of ATSLTD frames to the proposed ETD method to perform accurate and efficient object tracking, which leverages the high temporal resolution property of event cameras. We compare the proposed ETD method with seven popular object tracking methods, that are based on conventional cameras or event cameras, and two variants of ETD. The experimental results show the superiority of the proposed ETD method in handling various challenging environments.

* Proceedings of the 27th ACM International Conference on Multimedia (MM '19). 2019, Nice, France. ACM, New York, NY, USA
* 9 pages, 5 figures

Via

Access Paper or Ask Questions

DSNet: Deep and Shallow Feature Learning for Efficient Visual Tracking

Nov 06, 2018

Qiangqiang Wu, Yan Yan, Yanjie Liang, Yi Liu, Hanzi Wang

Figure 1 for DSNet: Deep and Shallow Feature Learning for Efficient Visual Tracking

Figure 2 for DSNet: Deep and Shallow Feature Learning for Efficient Visual Tracking

Figure 3 for DSNet: Deep and Shallow Feature Learning for Efficient Visual Tracking

Figure 4 for DSNet: Deep and Shallow Feature Learning for Efficient Visual Tracking

Abstract:In recent years, Discriminative Correlation Filter (DCF) based tracking methods have achieved great success in visual tracking. However, the multi-resolution convolutional feature maps trained from other tasks like image classification, cannot be naturally used in the conventional DCF formulation. Furthermore, these high-dimensional feature maps significantly increase the tracking complexity and thus limit the tracking speed. In this paper, we present a deep and shallow feature learning network, namely DSNet, to learn the multi-level same-resolution compressed (MSC) features for efficient online tracking, in an end-to-end offline manner. Specifically, the proposed DSNet compresses multi-level convolutional features to uniform spatial resolution features. The learned MSC features effectively encode both appearance and semantic information of objects in the same-resolution feature maps, thus enabling an elegant combination of the MSC features with any DCF-based methods. Additionally, a channel reliability measurement (CRM) method is presented to further refine the learned MSC features. We demonstrate the effectiveness of the MSC features learned from the proposed DSNet on two DCF tracking frameworks: the basic DCF framework and the continuous convolution operator framework. Extensive experiments show that the learned MSC features have the appealing advantage of allowing the equipped DCF-based tracking methods to perform favorably against the state-of-the-art methods while running at high frame rates.

* To appear at ACCV 2018. 14 pages, 8 figures

Via

Access Paper or Ask Questions