Picture for Feng Yang

Feng Yang

Focus-N-Fix: Region-Aware Fine-Tuning for Text-to-Image Generation

Add code
Jan 11, 2025
Viaarxiv icon

HV-BEV: Decoupling Horizontal and Vertical Feature Sampling for Multi-View 3D Object Detection

Add code
Dec 25, 2024
Viaarxiv icon

LVMark: Robust Watermark for latent video diffusion models

Add code
Dec 12, 2024
Viaarxiv icon

Motion Artifact Removal in Pixel-Frequency Domain via Alternate Masks and Diffusion Model

Add code
Dec 10, 2024
Viaarxiv icon

FastTrackTr:Towards Fast Multi-Object Tracking with Transformers

Add code
Nov 24, 2024
Figure 1 for FastTrackTr:Towards Fast Multi-Object Tracking with Transformers
Figure 2 for FastTrackTr:Towards Fast Multi-Object Tracking with Transformers
Figure 3 for FastTrackTr:Towards Fast Multi-Object Tracking with Transformers
Figure 4 for FastTrackTr:Towards Fast Multi-Object Tracking with Transformers
Viaarxiv icon

Multi-object Tracking by Detection and Query: an efficient end-to-end manner

Add code
Nov 09, 2024
Figure 1 for Multi-object Tracking by Detection and Query: an efficient end-to-end manner
Figure 2 for Multi-object Tracking by Detection and Query: an efficient end-to-end manner
Figure 3 for Multi-object Tracking by Detection and Query: an efficient end-to-end manner
Figure 4 for Multi-object Tracking by Detection and Query: an efficient end-to-end manner
Viaarxiv icon

Towards Student Actions in Classroom Scenes: New Dataset and Baseline

Add code
Sep 02, 2024
Viaarxiv icon

Cropper: Vision-Language Model for Image Cropping through In-Context Learning

Add code
Aug 14, 2024
Figure 1 for Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Figure 2 for Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Figure 3 for Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Figure 4 for Cropper: Vision-Language Model for Image Cropping through In-Context Learning
Viaarxiv icon

ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling

Add code
Aug 07, 2024
Figure 1 for ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
Figure 2 for ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
Figure 3 for ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
Figure 4 for ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
Viaarxiv icon

Fluid-Antenna Enhanced ISAC: Joint Antenna Positioning and Dual-Functional Beamforming Design under Perfect and Imperfect CSI

Add code
Jul 25, 2024
Viaarxiv icon