Ling Shao

Terminus Group, Beijing, China

Aesthetic Image Captioning with Saliency Enhanced MLLMs

Sep 04, 2025
Via arXiv

Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis

Jul 09, 2025
Via arXiv

Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image

May 20, 2025
Via arXiv

Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning

Mar 17, 2025
Via arXiv

MambaIC: State Space Models for High-Performance Learned Image Compression

Mar 16, 2025
Via arXiv

SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting

Mar 10, 2025
Via arXiv

FE-UNet: Frequency Domain Enhanced U-Net with Segment Anything Capability for Versatile Image Segmentation

Feb 06, 2025
Via arXiv

Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields

Jan 31, 2025
Via arXiv

Enhanced Multi-Scale Cross-Attention for Person Image Generation

Jan 15, 2025
Via arXiv

Novel View Extrapolation with Video Diffusion Priors

Nov 21, 2024
Via arXiv