Li Cui

Contour Field based Elliptical Shape Prior for the Segment Anything Model

Apr 17, 2025

Xinyu Zhao, Jun Liu, Faqiang Wang, Li Cui, Yuping Duan

Abstract: The elliptical shape prior information plays a vital role in improving the accuracy of image segmentation for specific tasks in medical and natural images. Existing deep learning-based segmentation methods, including the Segment Anything Model (SAM), often struggle to produce segmentation results with elliptical shapes efficiently. This paper proposes a new approach to integrate the prior of elliptical shapes into the deep learning-based SAM image segmentation techniques using variational methods. The proposed method establishes a parameterized elliptical contour field, which constrains the segmentation results to align with predefined elliptical contours. Utilizing the dual algorithm, the model seamlessly integrates image features with elliptical priors and spatial regularization priors, thereby greatly enhancing segmentation accuracy. By decomposing SAM into four mathematical sub-problems, we integrate the variational ellipse prior to design a new SAM network structure, ensuring that the segmentation output of SAM consists of elliptical regions. Experimental results on some specific image datasets demonstrate an improvement over the original SAM.
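
As a rough illustration of what a parameterized elliptical field can look like, the Python sketch below evaluates an implicit ellipse function on a pixel grid; its zero level set is the ellipse contour and its non-positive region is an elliptical mask. This is an assumption made for illustration only: the name `ellipse_field`, its parameters, and the implicit-function form are hypothetical and not taken from the paper, whose contour field and variational model may be defined differently.

```python
import numpy as np

def ellipse_field(h, w, cx, cy, a, b, theta):
    """Implicit ellipse function on an h x w grid (hypothetical helper).

    Returns phi with phi < 0 inside the ellipse, phi = 0 on its contour and
    phi > 0 outside. (cx, cy) is the center, a and b are the semi-axes and
    theta is the rotation angle in radians.
    """
    ys, xs = np.mgrid[0:h, 0:w].astype(float)
    dx, dy = xs - cx, ys - cy
    # Rotate pixel coordinates into the ellipse's own axes.
    xr = np.cos(theta) * dx + np.sin(theta) * dy
    yr = -np.sin(theta) * dx + np.cos(theta) * dy
    return (xr / a) ** 2 + (yr / b) ** 2 - 1.0

phi = ellipse_field(101, 101, cx=50.0, cy=50.0, a=30.0, b=15.0, theta=0.3)
mask = phi <= 0  # elliptical region implied by the five parameters
```

A field of this kind is differentiable in its five parameters, which is one plausible reason to prefer it over a hard mask when the ellipse has to be fitted jointly with a segmentation.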

NF-SLAM: Effective, Normalizing Flow-supported Neural Field representations for object-level visual SLAM in automotive applications

Mar 14, 2025

Li Cui, Yang Ding, Richard Hartley, Zirui Xie, Laurent Kneip, Zhenghua Yu

Abstract: We propose a novel, vision-only object-level SLAM framework for automotive applications representing 3D shapes by implicit signed distance functions. Our key innovation consists of augmenting the standard neural representation by a normalizing flow network. As a result, achieving strong representation power on the specific class of road vehicles is made possible by compact networks with only 16-dimensional latent codes. Furthermore, the newly proposed architecture exhibits a significant performance improvement in the presence of only sparse and noisy data, which is demonstrated through comparative experiments on synthetic data. The module is embedded into the back-end of a stereo-vision based framework for joint, incremental shape optimization. The loss function is given by a combination of a sparse 3D point-based SDF loss, a sparse rendering loss, and a semantic mask-based silhouette-consistency term. We furthermore leverage semantic information to determine keypoint extraction density in the front-end. Finally, experimental results on real-world data reveal accurate and reliable performance comparable to alternative frameworks that make use of direct depth readings. The proposed method performs well with only sparse 3D points obtained from bundle adjustment, and eventually continues to deliver stable results even under exclusive use of the mask-consistency term.

* 9 pages, 5 figures, IROS 2024

An inexact Bregman proximal point method and its acceleration version for unbalanced optimal transport

Feb 26, 2024

Xiang Chen, Faqiang Wang, Jun Liu, Li Cui

Abstract: The Unbalanced Optimal Transport (UOT) problem plays an increasingly important role in computational biology, computational imaging and deep learning. The Scaling algorithm is widely used to solve UOT due to its convenience and good convergence properties. However, this algorithm has lower accuracy for large regularization parameters, and due to stability issues, small regularization parameters can easily lead to numerical overflow. We address this challenge by developing an inexact Bregman proximal point method for solving UOT. This algorithm approximates the proximal operator using the Scaling algorithm at each iteration. The algorithm (1) converges to the true solution of UOT, (2) has theoretical guarantees and robust regularization parameter selection, (3) mitigates numerical stability issues, and (4) can achieve computational complexity comparable to the Scaling algorithm in specific practical cases. Building upon this, we develop an accelerated version of the inexact Bregman proximal point method for solving UOT by using acceleration techniques for the Bregman proximal point method, and provide theoretical guarantees and experimental validation of convergence and acceleration.
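
The idea of approximating each proximal step with a few Scaling sweeps can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: it assumes the common UOT formulation with KL penalties of weight `tau` on both marginals and a KL Bregman distance with step parameter `eps`; the function names `uot_scaling` and `inexact_bregman_uot` and all default values are hypothetical, and the accelerated variant is not shown.

```python
import numpy as np

def uot_scaling(K, a, b, eps, tau, n_iter):
    """Run n_iter Scaling sweeps for KL-penalized UOT with kernel K."""
    u = np.ones_like(a)
    v = np.ones_like(b)
    power = tau / (tau + eps)  # exponent induced by the KL marginal penalty
    for _ in range(n_iter):
        u = (a / (K @ v)) ** power
        v = (b / (K.T @ u)) ** power
    return u[:, None] * K * v[None, :]

def inexact_bregman_uot(C, a, b, eps=1.0, tau=1.0, outer=50, inner=1):
    """Outer proximal loop: each step is solved only approximately."""
    P = np.outer(a, b)       # strictly positive starting plan
    G = np.exp(-C / eps)     # Gibbs factor of the cost matrix
    for _ in range(outer):
        # The KL proximal step around the current plan P amounts to an
        # entropic UOT problem whose kernel is P * G, so a few Scaling
        # sweeps (inner) give an inexact solution of that step.
        P = uot_scaling(P * G, a, b, eps, tau, inner)
    return P

rng = np.random.default_rng(0)
C = rng.random((5, 4))
plan = inexact_bregman_uot(C, np.full(5, 0.2), np.full(4, 0.3))
```

Because the kernel is refreshed from the current plan at every outer step, `eps` acts as a proximal step size rather than a fixed bias, which is why a moderate value can be used without the overflow that very small regularization causes in a single Scaling solve.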

Event-Based Visual Odometry on Non-Holonomic Ground Vehicles

Jan 17, 2024

Wanting Xu, Si'ao Zhang, Li Cui, Xin Peng, Laurent Kneip

Abstract: Despite the promise of superior performance under challenging conditions, event-based motion estimation remains a hard problem owing to the difficulty of extracting and tracking stable features from event streams. In order to robustify the estimation, it is generally believed that fusion with other sensors is a requirement. In this work, we demonstrate reliable, purely event-based visual odometry on planar ground vehicles by employing the constrained non-holonomic motion model of Ackermann steering platforms. We extend single feature n-linearities for regular frame-based cameras to the case of quasi time-continuous event-tracks, and achieve a polynomial form via variable degree Taylor expansions. Robust averaging over multiple event tracks is simply achieved via histogram voting. As demonstrated on both simulated and real data, our algorithm achieves accurate and robust estimates of the vehicle's instantaneous rotational velocity, and thus results that are comparable to the delta rotations obtained by frame-based sensors under normal conditions. We furthermore significantly outperform the more traditional alternatives in challenging illumination scenarios. The code is available at https://github.com/gowanting/NHEVO.
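
The histogram voting step mentioned above can be illustrated with a short Python sketch. It is a generic version of the technique, not code from the NHEVO repository: the function name `histogram_vote`, the bin count, and the value range are hypothetical choices, and the inputs stand in for per-track estimates of the rotational velocity.

```python
import numpy as np

def histogram_vote(estimates, lo, hi, n_bins=50):
    """Robust average: mean of the estimates in the most populated bin."""
    estimates = np.asarray(estimates, dtype=float)
    counts, edges = np.histogram(estimates, bins=n_bins, range=(lo, hi))
    k = int(np.argmax(counts))  # winning bin
    in_bin = (estimates >= edges[k]) & (estimates <= edges[k + 1])
    return float(estimates[in_bin].mean())

# Five consistent per-track estimates and three outliers (made-up values).
per_track = [0.50, 0.51, 0.49, 0.505, 0.495, -2.0, 1.7, 3.0]
omega = histogram_vote(per_track, lo=-4.0, hi=4.0, n_bins=40)
```

Outliers only influence the result if they happen to fall in the winning bin, so the estimate stays near the consensus value as long as the inlier tracks form the largest cluster.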

* Accepted by 3DV 2024

Image Segmentation with Adaptive Spatial Priors from Joint Registration

Mar 29, 2022

Haifeng Li, Weihong Guo, Jun Liu, Li Cui, Dongxing Xie

Abstract: Image segmentation is a crucial but challenging task that has many applications. In medical imaging, for instance, intensity inhomogeneity and noise are common. In thigh muscle images, different muscles are closely packed together and there are often no clear boundaries between them. Intensity-based segmentation models cannot separate one muscle from another. To solve such problems, in this work we present a segmentation model with adaptive spatial priors from joint registration. This model combines segmentation and registration in a unified framework to leverage their positive mutual influence. The segmentation is based on a modified Gaussian mixture model (GMM), which integrates intensity inhomogeneity and spatial smoothness. The registration plays the role of providing a shape prior. We adopt a modified sum of squared differences (SSD) fidelity term and a Tikhonov regularity term for registration, and also utilize a Gaussian pyramid and a parametric method for robustness. The connection between segmentation and registration is guaranteed by the cross-entropy metric, which aims to make the segmentation map (from segmentation) and the deformed atlas (from registration) as similar as possible. This joint framework is implemented within a constrained optimization framework, which leads to an efficient algorithm. We evaluate our proposed model on synthetic and thigh muscle MR images. Numerical results show an improvement compared to segmentation and registration performed separately and to other joint models.

Zero-Shot Instance Segmentation

Apr 14, 2021

Ye Zheng, Jiahong Wu, Yongqiang Qin, Faen Zhang, Li Cui

Abstract: Deep learning has significantly improved the precision of instance segmentation with abundant labeled data. However, in many areas such as medicine and manufacturing, collecting sufficient data is extremely hard and labeling this data requires high professional skills. We follow this motivation and propose a new task set named zero-shot instance segmentation (ZSI). In the training phase of ZSI, the model is trained with seen data, while in the testing phase, it is used to segment all seen and unseen instances. We first formulate the ZSI task and propose a method to tackle the challenge, which consists of Zero-shot Detector, Semantic Mask Head, Background Aware RPN and Synchronized Background Strategy. We present a new benchmark for zero-shot instance segmentation based on the MS-COCO dataset. The extensive empirical results on this benchmark show that our method not only surpasses the state-of-the-art results on the zero-shot object detection task but also achieves promising performance on ZSI. Our approach will serve as a solid baseline and facilitate future research in zero-shot instance segmentation.

* 8 pages, 6 figures

Background Learnable Cascade for Zero-Shot Object Detection

Oct 09, 2020

Ye Zheng, Ruoran Huang, Chuanqi Han, Xi Huang, Li Cui

Abstract: Zero-shot detection (ZSD) is crucial to large-scale object detection with the aim of simultaneously localizing and recognizing unseen objects. There remain several challenges for ZSD, including reducing the ambiguity between background and unseen objects as well as improving the alignment between visual and semantic concepts. In this work, we propose a novel framework named Background Learnable Cascade (BLC) to improve ZSD performance. The major contributions of BLC are as follows: (i) we propose a multi-stage cascade structure named Cascade Semantic R-CNN to progressively refine the visual-semantic alignment in ZSD; (ii) we develop the semantic information flow structure and directly add it between each stage in Cascade Semantic R-CNN to further improve the semantic feature learning; (iii) we propose the background learnable region proposal network (BLRPN) to learn an appropriate word vector for the background class and use this learned vector in Cascade Semantic R-CNN; this design makes the background class learnable ("Background Learnable") and reduces the confusion between background and unseen classes. Our extensive experiments show that BLC obtains significant performance improvements on MS-COCO over state-of-the-art methods.

* 18 pages, 5 figures

Volume Preserving Image Segmentation with Entropic Regularization Optimal Transport and Its Applications in Deep Learning

Sep 22, 2019

Haifeng Li, Jun Liu, Li Cui, Haiyang Huang, Xue-cheng Tai

Abstract: Image segmentation with a volume constraint is an important prior for many real applications. In this work, we present a novel volume preserving image segmentation algorithm, which is based on the framework of entropic regularized optimal transport theory. The classical Total Variation (TV) regularizer and volume preservation are integrated into a regularized optimal transport model, and the volume and classification constraints can be regarded as two measure-preserving constraints in the optimal transport problem. By studying the dual problem, we develop a simple and efficient dual algorithm for our model. Moreover, unlike many variational image segmentation algorithms, the proposed algorithm can be directly unrolled into a new Volume Preserving and TV regularized softmax (VPTV-softmax) layer for semantic segmentation in the popular Deep Convolutional Neural Network (DCNN). Experimental results show that our proposed model is very competitive and can improve the performance of many semantic segmentation nets such as the popular U-net.
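
To make the two measure-preserving constraints concrete, the Python sketch below alternately rescales a softmax-like kernel so that every pixel's class probabilities sum to one and every class receives a prescribed total volume. It is a simplified sketch under stated assumptions, not the paper's VPTV-softmax layer: the TV regularizer is omitted entirely, and the name `volume_preserving_softmax`, the parameter `eps`, and the iteration count are hypothetical.

```python
import numpy as np

def volume_preserving_softmax(scores, volumes, eps=1.0, n_iter=200):
    """Entropic-OT style softmax with per-class volume constraints.

    scores:  (n_pixels, n_classes) array of class scores.
    volumes: (n_classes,) target volumes; they should sum to n_pixels.
    """
    K = np.exp((scores - scores.max()) / eps)  # shifted for stability
    col = np.ones(scores.shape[1])
    for _ in range(n_iter):
        row = 1.0 / (K @ col)        # classification: each row sums to 1
        col = volumes / (K.T @ row)  # volume: each column sums to volumes
    return row[:, None] * K * col[None, :]

rng = np.random.default_rng(0)
scores = rng.random((60, 3))
U = volume_preserving_softmax(scores, np.array([30.0, 20.0, 10.0]))
```

Dropping the column update recovers an ordinary softmax with temperature `eps`, which is one way to see why a scheme of this form can be unrolled as a drop-in replacement for a softmax layer.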

EasiCS: the objective and fine-grained classification method of cervical spondylosis dysfunction

May 15, 2019

Nana Wang, Li Cui, Xi Huang, Yingcong Xiang, Jing Xiao, Yi Rao

Abstract: Precise diagnosis is of great significance in developing precise treatment plans to restore neck function and reduce the burden posed by cervical spondylosis (CS). However, the currently available neck function assessment methods are subjective and coarse-grained. In this paper, based on the relationship among CS, cervical structure, cervical vertebra function, and surface electromyography (sEMG), we seek to develop clustering algorithms on an sEMG data set collected from the clinical environment and use them to perform the division into classes. We propose and develop the framework EasiCS, which consists of dimension reduction, the clustering algorithm EasiSOM, and the spectral clustering algorithm EasiSC. Overall, EasiCS outperforms seven commonly used algorithms.

EasiCSDeep: A deep learning model for Cervical Spondylosis Identification using surface electromyography signal

Dec 12, 2018

Nana Wang, Li Cui, Xi Huang, Yingcong Xiang, Jing Xiao

Abstract: Cervical spondylosis (CS) is a common chronic disease that affects up to two-thirds of the population and poses a serious burden on individuals and society. Early identification has significant value in improving the cure rate and reducing costs. However, the pathology is complex, and the mild symptoms increase the difficulty of diagnosis, especially in the early stage. In addition, the time and cost of hospital medical services reduce the attention paid to CS identification. Thus, a convenient, low-cost, intelligent CS identification method is urgently needed. In this paper, we present an intelligent method based on deep learning to identify CS using the surface electromyography (sEMG) signal. Faced with the complexity, high dimensionality, and weak usability of the sEMG signal, we propose and develop a multi-channel EasiCSDeep algorithm based on the convolutional neural network, which consists of feature extraction, spatial relationship representation, and a classification algorithm. To the best of our knowledge, EasiCSDeep is the first effort to employ deep learning and sEMG data to identify CS. Compared with the previous state-of-the-art algorithm, our algorithm achieves a significant improvement.
