Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Qian Yin

Compressed Domain Prior-Guided Video Super-Resolution for Cloud Gaming Content

Jan 03, 2025

Qizhe Wang, Qian Yin, Zhimeng Huang, Weijia Jiang, Yi Su, Siwei Ma, Jiaqi Zhang

Abstract:Cloud gaming is an advanced form of Internet service that necessitates local terminals to decode within limited resources and time latency. Super-Resolution (SR) techniques are often employed on these terminals as an efficient way to reduce the required bit-rate bandwidth for cloud gaming. However, insufficient attention has been paid to SR of compressed game video content. Most SR networks amplify block artifacts and ringing effects in decoded frames while ignoring edge details of game content, leading to unsatisfactory reconstruction results. In this paper, we propose a novel lightweight network called Coding Prior-Guided Super-Resolution (CPGSR) to address the SR challenges in compressed game video content. First, we design a Compressed Domain Guided Block (CDGB) to extract features of different depths from coding priors, which are subsequently integrated with features from the U-net backbone. Then, a series of re-parameterization blocks are utilized for reconstruction. Ultimately, inspired by the quantization in video coding, we propose a partitioned focal frequency loss to effectively guide the model's focus on preserving high-frequency information. Extensive experiments demonstrate the advancement of our approach.

* 10 pages, 4 figures, Data Compression Conference2025

Via

Access Paper or Ask Questions

Constructing accurate machine-learned potentials and performing highly efficient atomistic simulations to predict structural and thermal properties

Nov 16, 2024

Junlan Liu, Qian Yin, Mengshu He, Jun Zhou

Figure 1 for Constructing accurate machine-learned potentials and performing highly efficient atomistic simulations to predict structural and thermal properties

Figure 2 for Constructing accurate machine-learned potentials and performing highly efficient atomistic simulations to predict structural and thermal properties

Figure 3 for Constructing accurate machine-learned potentials and performing highly efficient atomistic simulations to predict structural and thermal properties

Figure 4 for Constructing accurate machine-learned potentials and performing highly efficient atomistic simulations to predict structural and thermal properties

Abstract:The $\text{Cu}_7\text{P}\text{S}_6$ compound has garnered significant attention due to its potential in thermoelectric applications. In this study, we introduce a neuroevolution potential (NEP), trained on a dataset generated from ab initio molecular dynamics (AIMD) simulations, using the moment tensor potential (MTP) as a reference. The low root mean square errors (RMSEs) for total energy and atomic forces demonstrate the high accuracy and transferability of both the MTP and NEP. We further calculate the phonon density of states (DOS) and radial distribution function (RDF) using both machine learning potentials, comparing the results to density functional theory (DFT) calculations. While the MTP potential offers slightly higher accuracy, the NEP achieves a remarkable 41-fold increase in computational speed. These findings provide detailed microscopic insights into the dynamics and rapid Cu-ion diffusion, paving the way for future studies on Cu-based solid electrolytes and their applications in energy devices.

Via

Access Paper or Ask Questions

TransCouplet:Transformer based Chinese Couplet Generation

Dec 03, 2021

Kuan-Yu Chiang, Shihao Lin, Joe Chen, Qian Yin, Qizhen Jin

Figure 1 for TransCouplet:Transformer based Chinese Couplet Generation

Figure 2 for TransCouplet:Transformer based Chinese Couplet Generation

Figure 3 for TransCouplet:Transformer based Chinese Couplet Generation

Figure 4 for TransCouplet:Transformer based Chinese Couplet Generation

Abstract:Chinese couplet is a special form of poetry composed of complex syntax with ancient Chinese language. Due to the complexity of semantic and grammatical rules, creation of a suitable couplet is a formidable challenge. This paper presents a transformer-based sequence-to-sequence couplet generation model. With the utilization of AnchiBERT, the model is able to capture ancient Chinese language understanding. Moreover, we evaluate the Glyph, PinYin and Part-of-Speech tagging on the couplet grammatical rules to further improve the model.

Via

Access Paper or Ask Questions

Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark

Nov 25, 2021

Qian Yin, Qingyong Hu, Hao Liu, Feng Zhang, Yingqian Wang, Zaiping Lin, Wei An, Yulan Guo

Figure 1 for Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark

Figure 2 for Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark

Figure 3 for Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark

Figure 4 for Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark

Abstract:Satellite video cameras can provide continuous observation for a large-scale area, which is important for many remote sensing applications. However, achieving moving object detection and tracking in satellite videos remains challenging due to the insufficient appearance information of objects and lack of high-quality datasets. In this paper, we first build a large-scale satellite video dataset with rich annotations for the task of moving object detection and tracking. This dataset is collected by the Jilin-1 satellite constellation and composed of 47 high-quality videos with 1,646,038 instances of interest for object detection and 3,711 trajectories for object tracking. We then introduce a motion modeling baseline to improve the detection rate and reduce false alarms based on accumulative multi-frame differencing and robust matrix completion. Finally, we establish the first public benchmark for moving object detection and tracking in satellite videos, and extensively evaluate the performance of several representative approaches on our dataset. Comprehensive experimental analyses and insightful conclusions are also provided. The dataset is available at https://github.com/QingyongHu/VISO.

* This paper has been accepted by IEEE Transactions on Geoscience and Remote Sensing. Qian Yin and Qingyong Hu have equal contributions to this work and are co-first authors. The dataset is available at https://github.com/QingyongHu/VISO

Via

Access Paper or Ask Questions

Lossless Point Cloud Attribute Compression with Normal-based Intra Prediction

Jun 23, 2021

Qian Yin, Qingshan Ren, Lili Zhao, Wenyi Wang, Jianwen Chen

Figure 1 for Lossless Point Cloud Attribute Compression with Normal-based Intra Prediction

Figure 2 for Lossless Point Cloud Attribute Compression with Normal-based Intra Prediction

Figure 3 for Lossless Point Cloud Attribute Compression with Normal-based Intra Prediction

Figure 4 for Lossless Point Cloud Attribute Compression with Normal-based Intra Prediction

Abstract:The sparse LiDAR point clouds become more and more popular in various applications, e.g., the autonomous driving. However, for this type of data, there exists much under-explored space in the corresponding compression framework proposed by MPEG, i.e., geometry-based point cloud compression (G-PCC). In G-PCC, only the distance-based similarity is considered in the intra prediction for the attribute compression. In this paper, we propose a normal-based intra prediction scheme, which provides a more efficient lossless attribute compression by introducing the normals of point clouds. The angle between normals is used to further explore accurate local similarity, which optimizes the selection of predictors. We implement our method into the G-PCC reference software. Experimental results over LiDAR acquired datasets demonstrate that our proposed method is able to deliver better compression performance than the G-PCC anchor, with $2.1\%$ gains on average for lossless attribute coding.

* Accepted by the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting 2021

Via

Access Paper or Ask Questions

RAI-Net: Range-Adaptive LiDAR Point Cloud Frame Interpolation Network

Jun 01, 2021

Lili Zhao, Zezhi Zhu, Xuhu Lin, Xuezhou Guo, Qian Yin, Wenyi Wang, Jianwen Chen

Figure 1 for RAI-Net: Range-Adaptive LiDAR Point Cloud Frame Interpolation Network

Figure 2 for RAI-Net: Range-Adaptive LiDAR Point Cloud Frame Interpolation Network

Figure 3 for RAI-Net: Range-Adaptive LiDAR Point Cloud Frame Interpolation Network

Figure 4 for RAI-Net: Range-Adaptive LiDAR Point Cloud Frame Interpolation Network

Abstract:LiDAR point cloud frame interpolation, which synthesizes the intermediate frame between the captured frames, has emerged as an important issue for many applications. Especially for reducing the amounts of point cloud transmission, it is by predicting the intermediate frame based on the reference frames to upsample data to high frame rate ones. However, due to high-dimensional and sparse characteristics of point clouds, it is more difficult to predict the intermediate frame for LiDAR point clouds than videos. In this paper, we propose a novel LiDAR point cloud frame interpolation method, which exploits range images (RIs) as an intermediate representation with CNNs to conduct the frame interpolation process. Considering the inherited characteristics of RIs differ from that of color images, we introduce spatially adaptive convolutions to extract range features adaptively, while a high-efficient flow estimation method is presented to generate optical flows. The proposed model then warps the input frames and range features, based on the optical flows to synthesize the interpolated frame. Extensive experiments on the KITTI dataset have clearly demonstrated that our method consistently achieves superior frame interpolation results with better perceptual quality to that of using state-of-the-art video frame interpolation methods. The proposed method could be integrated into any LiDAR point cloud compression systems for inter prediction.

* Accepted by the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting 2021

Via

Access Paper or Ask Questions

Synergetic Learning Systems: Concept, Architecture, and Algorithms

Jun 14, 2020

Ping Guo, Qian Yin

Figure 1 for Synergetic Learning Systems: Concept, Architecture, and Algorithms

Abstract:Drawing on the idea that brain development is a Darwinian process of ``evolution + selection'' and the idea that the current state is a local equilibrium state of many bodies with self-organization and evolution processes driven by the temperature and gravity in our universe, in this work, we describe an artificial intelligence system called the ``Synergetic Learning Systems''. The system is composed of two or more subsystems (models, agents or virtual bodies), and it is an open complex giant system. Inspired by natural intelligence, the system achieves intelligent information processing and decision-making in a given environment through cooperative/competitive synergetic learning. The intelligence evolved by the natural law of ``it is not the strongest of the species that survives, but the one most responsive to change,'' while an artificial intelligence system should adopt the law of ``human selection'' in the evolution process. Therefore, we expect that the proposed system architecture can also be adapted in human-machine synergy or multi-agent synergetic systems. It is also expected that under our design criteria, the proposed system will eventually achieve artificial general intelligence through long term coevolution.

* This work is based on first principle for artificial intelligence. Some terminologies and syntax errors are revised in this version

Via

Access Paper or Ask Questions

Two-dimensional Multi-fiber Spectrum Image Correction Based on Machine Learning Techniques

Feb 16, 2020

Jiali Xu, Qian Yin, Ping Guo, Xin Zheng

Figure 1 for Two-dimensional Multi-fiber Spectrum Image Correction Based on Machine Learning Techniques

Figure 2 for Two-dimensional Multi-fiber Spectrum Image Correction Based on Machine Learning Techniques

Figure 3 for Two-dimensional Multi-fiber Spectrum Image Correction Based on Machine Learning Techniques

Figure 4 for Two-dimensional Multi-fiber Spectrum Image Correction Based on Machine Learning Techniques

Abstract:Due to limited size and imperfect of the optical components in a spectrometer, aberration has inevitably been brought into two-dimensional multi-fiber spectrum image in LAMOST, which leads to obvious spacial variation of the point spread functions (PSFs). Consequently, if spatial variant PSFs are estimated directly , the huge storage and intensive computation requirements result in deconvolutional spectral extraction method become intractable. In this paper, we proposed a novel method to solve the problem of spatial variation PSF through image aberration correction. When CCD image aberration is corrected, PSF, the convolution kernel, can be approximated by one spatial invariant PSF only. Specifically, machine learning techniques are adopted to calibrate distorted spectral image, including Total Least Squares (TLS) algorithm, intelligent sampling method, multi-layer feed-forward neural networks. The calibration experiments on the LAMOST CCD images show that the calibration effect of proposed method is effectible. At the same time, the spectrum extraction results before and after calibration are compared, results show the characteristics of the extracted one-dimensional waveform are more close to an ideal optics system, and the PSF of the corrected object spectrum image estimated by the blind deconvolution method is nearly central symmetry, which indicates that our proposed method can significantly reduce the complexity of spectrum extraction and improve extraction accuracy.

* 10 pages, 14 figures

Via

Access Paper or Ask Questions

Quadratic video interpolation

Nov 02, 2019

Xiangyu Xu, Li Siyao, Wenxiu Sun, Qian Yin, Ming-Hsuan Yang

Figure 1 for Quadratic video interpolation

Figure 2 for Quadratic video interpolation

Figure 3 for Quadratic video interpolation

Figure 4 for Quadratic video interpolation

Abstract:Video interpolation is an important problem in computer vision, which helps overcome the temporal limitation of camera sensors. Existing video interpolation methods usually assume uniform motion between consecutive frames and use linear models for interpolation, which cannot well approximate the complex motion in the real world. To address these issues, we propose a quadratic video interpolation method which exploits the acceleration information in videos. This method allows prediction with curvilinear trajectory and variable velocity, and generates more accurate interpolation results. For high-quality frame synthesis, we develop a flow reversal layer to estimate flow fields starting from the unknown target frame to the source frame. In addition, we present techniques for flow refinement. Extensive experiments demonstrate that our approach performs favorably against the existing linear models on a wide variety of video datasets.

* NeurIPS 2019, project website: https://sites.google.com/view/xiangyuxu/qvi_nips19

Via

Access Paper or Ask Questions

Iterative Residual Image Deconvolution

Nov 04, 2018

Li Si-Yao, Dongwei Ren, Furong Zhao, Zijian Hu, Junfeng Li, Qian Yin

Figure 1 for Iterative Residual Image Deconvolution

Figure 2 for Iterative Residual Image Deconvolution

Figure 3 for Iterative Residual Image Deconvolution

Figure 4 for Iterative Residual Image Deconvolution

Abstract:Image deblurring, a.k.a. image deconvolution, recovers a clear image from pixel superposition caused by blur degradation. Few deep convolutional neural networks (CNN) succeed in addressing this task. In this paper, we first demonstrate that the minimum-mean-square-error (MMSE) solution to image deblurring can be interestingly unfolded into a series of residual components. Based on this analysis, we propose a novel iterative residual deconvolution (IRD) algorithm. Further, IRD motivates us to take one step forward to design an explicable and effective CNN architecture for image deconvolution. Specifically, a sequence of residual CNN units are deployed, whose intermediate outputs are then concatenated and integrated, resulting in concatenated residual convolutional network (CRCNet). The experimental results demonstrate that proposed CRCNet not only achieves better quantitative metrics but also recovers more visually plausible texture details compared with state-of-the-art methods.

* rejected by AAAI 2019

Via

Access Paper or Ask Questions