Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ruichao Xiao

Dynamic Relational Inference in Multi-Agent Trajectories

Jul 16, 2020

Ruichao Xiao, Rose Yu

Figure 1 for Dynamic Relational Inference in Multi-Agent Trajectories

Figure 2 for Dynamic Relational Inference in Multi-Agent Trajectories

Figure 3 for Dynamic Relational Inference in Multi-Agent Trajectories

Figure 4 for Dynamic Relational Inference in Multi-Agent Trajectories

Abstract:Inferring interactions from multi-agent trajectories has broad applications in physics, vision and robotics. Neural relational inference (NRI) is a deep generative model that can reason about relations in complex dynamics without supervision. In this paper, we take a careful look at this approach for relational inference in multi-agent trajectories. First, we discover that NRI can be fundamentally limited without sufficient long-term observations. Its ability to accurately infer interactions degrades drastically for short output sequences. Next, we consider a more general setting of relational inference when interactions are changing overtime. We propose an extension ofNRI, which we call the DYnamic multi-AgentRelational Inference (DYARI) model that can reason about dynamic relations. We conduct exhaustive experiments to study the effect of model architecture, under-lying dynamics and training scheme on the performance of dynamic relational inference using a simulated physics system. We also showcase the usage of our model on real-world multi-agent basketball trajectories.

Via

Access Paper or Ask Questions

DSR: Direct Self-rectification for Uncalibrated Dual-lens Cameras

Sep 26, 2018

Ruichao Xiao, Wenxiu Sun, Jiahao Pang, Qiong Yan, Jimmy Ren

Figure 1 for DSR: Direct Self-rectification for Uncalibrated Dual-lens Cameras

Figure 2 for DSR: Direct Self-rectification for Uncalibrated Dual-lens Cameras

Figure 3 for DSR: Direct Self-rectification for Uncalibrated Dual-lens Cameras

Figure 4 for DSR: Direct Self-rectification for Uncalibrated Dual-lens Cameras

Abstract:With the developments of dual-lens camera modules,depth information representing the third dimension of thecaptured scenes becomes available for smartphones. It isestimated by stereo matching algorithms, taking as input thetwo views captured by dual-lens cameras at slightly differ-ent viewpoints. Depth-of-field rendering (also be referred toas synthetic defocus or bokeh) is one of the trending depth-based applications. However, to achieve fast depth estima-tion on smartphones, the stereo pairs need to be rectified inthe first place. In this paper, we propose a cost-effective so-lution to perform stereo rectification for dual-lens camerascalled direct self-rectification, short for DSR1. It removesthe need of individual offline calibration for every pair ofdual-lens cameras. In addition, the proposed solution isrobust to the slight movements, e.g., due to collisions, ofthe dual-lens cameras after fabrication. Different with ex-isting self-rectification approaches, our approach computesthe homography in a novel way with zero geometric distor-tions introduced to the master image. It is achieved by di-rectly minimizing the vertical displacements of correspond-ing points between the original master image and the trans-formed slave image. Our method is evaluated on both real-istic and synthetic stereo image pairs, and produces supe-rior results compared to the calibrated rectification or otherself-rectification approaches

* Accepted at 3DV2018

Via

Access Paper or Ask Questions

Confidence Inference for Focused Learning in Stereo Matching

Sep 25, 2018

Ruichao Xiao, Wenxiu Sun, Chengxi Yang

Figure 1 for Confidence Inference for Focused Learning in Stereo Matching

Figure 2 for Confidence Inference for Focused Learning in Stereo Matching

Figure 3 for Confidence Inference for Focused Learning in Stereo Matching

Figure 4 for Confidence Inference for Focused Learning in Stereo Matching

Abstract:In this paper, we present confidence inference approachin an unsupervised way in stereo matching. Deep Neu-ral Networks (DNNs) have recently been achieving state-of-the-art performance. However, it is often hard to tellwhether the trained model was making sensible predictionsor just guessing at random. To address this problem, westart from a probabilistic interpretation of theL1loss usedin stereo matching, which inherently assumes an indepen-dent and identical (aka i.i.d.) Laplacian distribution. Weshow that with the newly introduced dense confidence map,the identical assumption is relaxed. Intuitively, the vari-ance in the Laplacian distribution is large for low confidentpixels while small for high-confidence pixels. In practice,the network learns toattenuatelow-confidence pixels (e.g.,noisy input, occlusions, featureless regions) andfocusonhigh-confidence pixels. Moreover, it can be observed fromexperiments that the focused learning is very helpful in find-ing a better convergence state of the trained model, reduc-ing over-fitting on a given dataset.

Via

Access Paper or Ask Questions

Deep Graph Laplacian Regularization

Jul 31, 2018

Jin Zeng, Jiahao Pang, Wenxiu Sun, Gene Cheung, Ruichao Xiao

Figure 1 for Deep Graph Laplacian Regularization

Figure 2 for Deep Graph Laplacian Regularization

Figure 3 for Deep Graph Laplacian Regularization

Figure 4 for Deep Graph Laplacian Regularization

Abstract:We propose to combine the robustness merit of model-based approaches and the learning power of data-driven approaches for image restoration. Specifically, by integrating graph Laplacian regularization as a trainable module into a deep learning framework, we are less susceptible to overfitting than pure CNN-based approaches, achieving higher robustness to small dataset and cross-domain denoising. First, a sparse neighborhood graph is built from the output of a convolutional neural network (CNN). Then the image is restored by solving an unconstrained quadratic programming problem, using a corresponding graph Laplacian regularizer as a prior term. The proposed restoration pipeline is fully differentiable and hence can be end-to-end trained. Experimental results demonstrate that our work avoids overfitting given small training data. It is also endowed with strong cross-domain generalization power, outperforming the state-of-the-art approaches by remarkable margin.

Via

Access Paper or Ask Questions

Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains

Mar 18, 2018

Jiahao Pang, Wenxiu Sun, Chengxi Yang, Jimmy Ren, Ruichao Xiao, Jin Zeng, Liang Lin

Figure 1 for Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains

Figure 2 for Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains

Figure 3 for Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains

Figure 4 for Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains

Abstract:Despite the recent success of stereo matching with convolutional neural networks (CNNs), it remains arduous to generalize a pre-trained deep stereo model to a novel domain. A major difficulty is to collect accurate ground-truth disparities for stereo pairs in the target domain. In this work, we propose a self-adaptation approach for CNN training, utilizing both synthetic training data (with ground-truth disparities) and stereo pairs in the new domain (without ground-truths). Our method is driven by two empirical observations. By feeding real stereo pairs of different domains to stereo models pre-trained with synthetic data, we see that: i) a pre-trained model does not generalize well to the new domain, producing artifacts at boundaries and ill-posed regions; however, ii) feeding an up-sampled stereo pair leads to a disparity map with extra details. To avoid i) while exploiting ii), we formulate an iterative optimization problem with graph Laplacian regularization. At each iteration, the CNN adapts itself better to the new domain: we let the CNN learn its own higher-resolution output; at the meanwhile, a graph Laplacian regularization is imposed to discriminatively keep the desired edges while smoothing out the artifacts. We demonstrate the effectiveness of our method in two domains: daily scenes collected by smartphone cameras, and street views captured in a driving car.

* Accepted at CVPR 2018

Via

Access Paper or Ask Questions