Yufan Guo

PIG: Physically-based Multi-Material Interaction with 3D Gaussians

Jun 09, 2025

Zeyu Xiao, Zhenyi Wu, Mingyang Sun, Qipeng Yan, Yufan Guo, Zhuoer Liang, Lihua Zhang

Abstract: 3D Gaussian Splatting has achieved remarkable success in reconstructing both static and dynamic 3D scenes. However, in a scene represented by 3D Gaussian primitives, interactions between objects suffer from inaccurate 3D segmentation, imprecise deformation among different materials, and severe rendering artifacts. To address these challenges, we introduce PIG: Physically-Based Multi-Material Interaction with 3D Gaussians, a novel approach that combines 3D object segmentation with high-precision simulation of interacting objects. First, our method provides a fast and accurate mapping from 2D pixels to 3D Gaussians, enabling precise object-level 3D segmentation. Second, we assign distinct physical properties to each segmented object in the scene to support coupled multi-material interactions. Finally, we embed constraint scales into the deformation gradients, clamping the scaling and rotation properties of the Gaussian primitives to eliminate artifacts and achieve geometric fidelity and visual consistency. Experimental results demonstrate that our method not only outperforms the state of the art (SOTA) in visual quality but also opens up new directions and pipelines for physically realistic scene generation.

When and Why does a Model Fail? A Human-in-the-loop Error Detection Framework for Sentiment Analysis

Jun 02, 2021

Zhe Liu, Yufan Guo, Jalal Mahmud

Abstract: Although deep neural networks have been widely employed and proven effective in sentiment analysis tasks, it remains challenging for model developers to assess their models for erroneous predictions prior to deployment. Once a model is deployed, emergent errors can be hard to identify at prediction run time and impossible to trace back to their sources. To address these gaps, we propose an error detection framework for sentiment analysis based on explainable features. We perform global-level feature validation with human-in-the-loop assessment, followed by an integration of global and local-level feature contribution analysis. Experimental results show that, given limited human-in-the-loop intervention, our method is able to identify erroneous model predictions on unseen data with high precision.

* NAACL2021

Bimodal network architectures for automatic generation of image annotation from text

Sep 05, 2018

Mehdi Moradi, Ali Madani, Yaniv Gur, Yufan Guo, Tanveer Syeda-Mahmood

Abstract: Medical image analysis practitioners have embraced big data methodologies. This has created a need for large annotated datasets. The source of big data is typically large image collections and the clinical reports recorded for these images. In many cases, however, building algorithms for segmentation and detection of disease requires a training dataset in which the areas of interest marked on the image match the described anomalies. This annotation process is expensive and requires the involvement of clinicians. In this work we propose two separate deep neural network architectures for automatic marking of a region of interest (ROI) on the image best representing a finding location, given a textual report or a set of keywords. One architecture consists of LSTM and CNN components and is trained end to end with images, matching text, and markings of ROIs for those images. The output layer estimates the coordinates of the vertices of a polygonal region. The second architecture uses a network pre-trained on a large dataset of the same image types to learn feature representations of the findings of interest. We show that for a variety of findings from chest X-ray images, both proposed architectures learn to estimate the ROI, as validated by clinical annotations. The architecture with the pre-trained imaging network has a clear advantage. The centroids of the ROIs marked by this network were on average at a distance equivalent to 5.1% of the image width from the centroids of the ground truth ROIs.

* Lecture Notes in Computer Science (LNCS 11070), Proceedings of Medical Image Computing & Computer Assisted Intervention (MICCAI 2018)
* Accepted to MICCAI 2018, LNCS 11070
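
The abstract above reports its result as the distance between predicted and ground-truth ROI centroids, expressed as a percentage of image width. The following is a minimal sketch of how such a metric could be computed for polygonal ROIs. It assumes the area-weighted (shoelace) centroid; the abstract does not say which centroid definition the authors used, and the function names `polygon_centroid` and `centroid_error_pct` are hypothetical.

```python
from math import hypot


def polygon_centroid(vertices):
    """Area-weighted centroid of a simple polygon given as [(x, y), ...].

    Assumption: shoelace-formula centroid; the paper may define it differently.
    """
    area2 = 0.0  # twice the signed area of the polygon
    cx = cy = 0.0
    n = len(vertices)
    for i in range(n):
        x0, y0 = vertices[i]
        x1, y1 = vertices[(i + 1) % n]  # wrap around to close the polygon
        cross = x0 * y1 - x1 * y0
        area2 += cross
        cx += (x0 + x1) * cross
        cy += (y0 + y1) * cross
    if area2 == 0:
        raise ValueError("degenerate polygon with zero area")
    return cx / (3.0 * area2), cy / (3.0 * area2)


def centroid_error_pct(pred_vertices, true_vertices, image_width):
    """Euclidean distance between the two centroids as a percentage of image width."""
    px, py = polygon_centroid(pred_vertices)
    tx, ty = polygon_centroid(true_vertices)
    return 100.0 * hypot(px - tx, py - ty) / image_width


# Toy example: a 2x2 square ROI and the same square shifted by (3, 4) pixels.
truth = [(0, 0), (2, 0), (2, 2), (0, 2)]
pred = [(3, 4), (5, 4), (5, 6), (3, 6)]
error = centroid_error_pct(pred, truth, image_width=100)
```

In this toy case the centroids are 5 pixels apart on a 100-pixel-wide image, so `error` is 5.0. Averaging this value over a test set would give a figure comparable in form to the 5.1% quoted in the abstract.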

Erratum: Link prediction in drug-target interactions network using similarity indices

Nov 01, 2017

Yiding Lu, Yufan Guo, Anna Korhonen

Abstract: Background: In silico drug-target interaction (DTI) prediction plays an integral role in drug repositioning: the discovery of new uses for existing drugs. One popular method of drug repositioning is network-based DTI prediction, which uses complex network theory to predict DTIs from a drug-target network. Currently, most network-based DTI prediction is based on machine learning methods such as Restricted Boltzmann Machines (RBM) or Support Vector Machines (SVM). These methods require additional information about the characteristics of drugs, targets and DTIs, such as chemical structure, genome sequence, binding types, causes of interactions, etc., and do not perform satisfactorily when such information is unavailable. To address this problem, we propose an alternative method for DTI prediction that uses only network topology information. Results: We compare our method for DTI prediction against the well-known RBM approach. We show that when applied to the MATADOR database, our approach based on node neighborhoods yields higher precision for high-ranking predictions than RBM when no information regarding DTI types is available. Conclusion: This demonstrates that approaches based purely on network topology are better suited to DTI prediction in the many real-life situations where little or no prior knowledge is available about the characteristics of drugs, targets, or their interactions.

* 10 pages
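
The abstract above describes scoring candidate drug-target links from node neighborhoods alone. The following is a minimal sketch of that general idea, not the authors' method: it assumes a Jaccard similarity index and scores an unobserved (drug, target) pair by summing the similarity between that drug and every other drug already known to hit the target. The abstract does not state which similarity indices were used, and the names `jaccard`, `score_pairs`, and the toy network are hypothetical.

```python
def jaccard(a, b):
    """Jaccard index of two neighbor sets: |a & b| / |a | b|."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def score_pairs(interactions):
    """Rank unobserved (drug, target) pairs using network topology only.

    interactions: dict mapping each drug to the set of its known targets.
    Returns a list of (score, drug, target), highest score first.
    Assumption: the score is the summed Jaccard similarity between the drug
    and each other drug that already interacts with the target.
    """
    all_targets = set().union(*interactions.values())
    scores = []
    for drug, known in interactions.items():
        for target in all_targets - known:  # only pairs not yet in the network
            s = sum(
                jaccard(known, other_targets)
                for other, other_targets in interactions.items()
                if other != drug and target in other_targets
            )
            scores.append((s, drug, target))
    return sorted(scores, reverse=True)


# Toy bipartite network (hypothetical drugs A-C and targets t1-t4).
toy = {
    "A": {"t1", "t2"},
    "B": {"t1", "t2", "t3"},
    "C": {"t4"},
}
ranked = score_pairs(toy)
```

Here the top-ranked candidate is ("A", "t3"): drug A shares two of three targets with drug B, which already binds t3. Evaluating precision over the highest-ranked predictions, as the abstract does, would then require holding out known interactions and checking how many are recovered.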
