Picture for Ling Shao

Ling Shao

Terminus Group, Beijing, China

Spatial Preference Rewarding for MLLMs Spatial Understanding

Add code
Oct 16, 2025
Viaarxiv icon

Aesthetic Image Captioning with Saliency Enhanced MLLMs

Add code
Sep 04, 2025
Viaarxiv icon

Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis

Add code
Jul 09, 2025
Figure 1 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Figure 2 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Figure 3 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Figure 4 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Viaarxiv icon

Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image

Add code
May 20, 2025
Viaarxiv icon

Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning

Add code
Mar 17, 2025
Viaarxiv icon

MambaIC: State Space Models for High-Performance Learned Image Compression

Add code
Mar 16, 2025
Viaarxiv icon

SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting

Add code
Mar 10, 2025
Viaarxiv icon

FE-UNet: Frequency Domain Enhanced U-Net with Segment Anything Capability for Versatile Image Segmentation

Add code
Feb 06, 2025
Figure 1 for FE-UNet: Frequency Domain Enhanced U-Net with Segment Anything Capability for Versatile Image Segmentation
Figure 2 for FE-UNet: Frequency Domain Enhanced U-Net with Segment Anything Capability for Versatile Image Segmentation
Figure 3 for FE-UNet: Frequency Domain Enhanced U-Net with Segment Anything Capability for Versatile Image Segmentation
Figure 4 for FE-UNet: Frequency Domain Enhanced U-Net with Segment Anything Capability for Versatile Image Segmentation
Viaarxiv icon

Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields

Add code
Jan 31, 2025
Figure 1 for Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields
Figure 2 for Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields
Figure 3 for Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields
Figure 4 for Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields
Viaarxiv icon

Enhanced Multi-Scale Cross-Attention for Person Image Generation

Add code
Jan 15, 2025
Figure 1 for Enhanced Multi-Scale Cross-Attention for Person Image Generation
Figure 2 for Enhanced Multi-Scale Cross-Attention for Person Image Generation
Figure 3 for Enhanced Multi-Scale Cross-Attention for Person Image Generation
Figure 4 for Enhanced Multi-Scale Cross-Attention for Person Image Generation
Viaarxiv icon