Dong-Ming Yan

iFlame: Interleaving Full and Linear Attention for Efficient Mesh Generation

Mar 20, 2025

Revisiting CAD Model Generation by Learning Raster Sketch

Mar 02, 2025

GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression

Dec 13, 2024

OCMG-Net: Neural Oriented Normal Refinement for Unstructured Point Clouds

Sep 02, 2024

Correspondence-Free Non-Rigid Point Set Registration Using Unsupervised Clustering Analysis

Jun 27, 2024

TCAN: Text-oriented Cross Attention Network for Multimodal Sentiment Analysis

Apr 06, 2024

Deep Learning-based Image and Video Inpainting: A Survey

Jan 07, 2024

CMG-Net: Robust Normal Estimation for Point Clouds via Chamfer Normal Distance and Multi-scale Geometry

Dec 14, 2023

M2HF: Multi-level Multi-modal Hybrid Fusion for Text-Video Retrieval

Aug 16, 2022

GraphFit: Learning Multi-scale Graph-Convolutional Representation for Point Cloud Normal Estimation

Jul 23, 2022