Picture for Wei Feng

Wei Feng

Beijing StoneWise Technology Co Ltd

VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking

Add code
Oct 11, 2024
Viaarxiv icon

VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding

Add code
Oct 11, 2024
Viaarxiv icon

Deep Correlated Prompting for Visual Recognition with Missing Modalities

Add code
Oct 10, 2024
Figure 1 for Deep Correlated Prompting for Visual Recognition with Missing Modalities
Figure 2 for Deep Correlated Prompting for Visual Recognition with Missing Modalities
Figure 3 for Deep Correlated Prompting for Visual Recognition with Missing Modalities
Figure 4 for Deep Correlated Prompting for Visual Recognition with Missing Modalities
Viaarxiv icon

TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training

Add code
Oct 09, 2024
Viaarxiv icon

Pose-Guided Fine-Grained Sign Language Video Generation

Add code
Sep 25, 2024
Figure 1 for Pose-Guided Fine-Grained Sign Language Video Generation
Figure 2 for Pose-Guided Fine-Grained Sign Language Video Generation
Figure 3 for Pose-Guided Fine-Grained Sign Language Video Generation
Figure 4 for Pose-Guided Fine-Grained Sign Language Video Generation
Viaarxiv icon

Sight View Constraint for Robust Point Cloud Registration

Add code
Sep 08, 2024
Viaarxiv icon

Towards Reliable Advertising Image Generation Using Human Feedback

Add code
Aug 01, 2024
Viaarxiv icon

OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking

Add code
Jul 19, 2024
Figure 1 for OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking
Figure 2 for OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking
Figure 3 for OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking
Figure 4 for OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking
Viaarxiv icon

Multi-sentence Video Grounding for Long Video Generation

Add code
Jul 18, 2024
Figure 1 for Multi-sentence Video Grounding for Long Video Generation
Figure 2 for Multi-sentence Video Grounding for Long Video Generation
Figure 3 for Multi-sentence Video Grounding for Long Video Generation
Figure 4 for Multi-sentence Video Grounding for Long Video Generation
Viaarxiv icon

Towards stable training of parallel continual learning

Add code
Jul 11, 2024
Viaarxiv icon