Picture for Wenyu Liu

Wenyu Liu

SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild

Add code
Jan 07, 2025
Figure 1 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Figure 2 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Figure 3 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Figure 4 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Viaarxiv icon

GaraMoSt: Parallel Multi-Granularity Motion and Structural Modeling for Efficient Multi-Frame Interpolation in DSA Images

Add code
Dec 19, 2024
Figure 1 for GaraMoSt: Parallel Multi-Granularity Motion and Structural Modeling for Efficient Multi-Frame Interpolation in DSA Images
Figure 2 for GaraMoSt: Parallel Multi-Granularity Motion and Structural Modeling for Efficient Multi-Frame Interpolation in DSA Images
Figure 3 for GaraMoSt: Parallel Multi-Granularity Motion and Structural Modeling for Efficient Multi-Frame Interpolation in DSA Images
Figure 4 for GaraMoSt: Parallel Multi-Granularity Motion and Structural Modeling for Efficient Multi-Frame Interpolation in DSA Images
Viaarxiv icon

GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding

Add code
Dec 17, 2024
Viaarxiv icon

Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation

Add code
Dec 05, 2024
Viaarxiv icon

Partial Scene Text Retrieval

Add code
Nov 15, 2024
Figure 1 for Partial Scene Text Retrieval
Figure 2 for Partial Scene Text Retrieval
Figure 3 for Partial Scene Text Retrieval
Figure 4 for Partial Scene Text Retrieval
Viaarxiv icon

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Add code
Oct 29, 2024
Figure 1 for Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Figure 2 for Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Figure 3 for Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Figure 4 for Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Viaarxiv icon

LCD-Net: A Lightweight Remote Sensing Change Detection Network Combining Feature Fusion and Gating Mechanism

Add code
Oct 14, 2024
Figure 1 for LCD-Net: A Lightweight Remote Sensing Change Detection Network Combining Feature Fusion and Gating Mechanism
Figure 2 for LCD-Net: A Lightweight Remote Sensing Change Detection Network Combining Feature Fusion and Gating Mechanism
Figure 3 for LCD-Net: A Lightweight Remote Sensing Change Detection Network Combining Feature Fusion and Gating Mechanism
Figure 4 for LCD-Net: A Lightweight Remote Sensing Change Detection Network Combining Feature Fusion and Gating Mechanism
Viaarxiv icon

FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification

Add code
Oct 14, 2024
Viaarxiv icon

ControlAR: Controllable Image Generation with Autoregressive Models

Add code
Oct 03, 2024
Figure 1 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 2 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 3 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 4 for ControlAR: Controllable Image Generation with Autoregressive Models
Viaarxiv icon

Dynamic 2D Gaussians: Geometrically accurate radiance fields for dynamic objects

Add code
Sep 21, 2024
Figure 1 for Dynamic 2D Gaussians: Geometrically accurate radiance fields for dynamic objects
Figure 2 for Dynamic 2D Gaussians: Geometrically accurate radiance fields for dynamic objects
Figure 3 for Dynamic 2D Gaussians: Geometrically accurate radiance fields for dynamic objects
Figure 4 for Dynamic 2D Gaussians: Geometrically accurate radiance fields for dynamic objects
Viaarxiv icon