Picture for Wei Ke

Wei Ke

InstructionBench: An Instructional Video Understanding Benchmark

Add code
Apr 07, 2025
Viaarxiv icon

Refining CLIP's Spatial Awareness: A Visual-Centric Perspective

Add code
Apr 03, 2025
Viaarxiv icon

Generating Multimodal Driving Scenes via Next-Scene Prediction

Add code
Mar 19, 2025
Viaarxiv icon

Towards Self-Improving Systematic Cognition for Next-Generation Foundation MLLMs

Add code
Mar 16, 2025
Viaarxiv icon

Monte Carlo Diffusion for Generalizable Learning-Based RANSAC

Add code
Mar 12, 2025
Viaarxiv icon

Coherent and Multi-modality Image Inpainting via Latent Space Optimization

Add code
Jul 10, 2024
Figure 1 for Coherent and Multi-modality Image Inpainting via Latent Space Optimization
Figure 2 for Coherent and Multi-modality Image Inpainting via Latent Space Optimization
Figure 3 for Coherent and Multi-modality Image Inpainting via Latent Space Optimization
Figure 4 for Coherent and Multi-modality Image Inpainting via Latent Space Optimization
Viaarxiv icon

Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification

Add code
May 24, 2024
Figure 1 for Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification
Figure 2 for Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification
Figure 3 for Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification
Figure 4 for Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification
Viaarxiv icon

Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange

Add code
Apr 11, 2024
Viaarxiv icon

Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation

Add code
Mar 13, 2024
Figure 1 for Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Figure 2 for Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Figure 3 for Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Figure 4 for Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Viaarxiv icon

Analysis of the Two-Step Heterogeneous Transfer Learning for Laryngeal Blood Vessel Classification: Issue and Improvement

Add code
Mar 05, 2024
Figure 1 for Analysis of the Two-Step Heterogeneous Transfer Learning for Laryngeal Blood Vessel Classification: Issue and Improvement
Figure 2 for Analysis of the Two-Step Heterogeneous Transfer Learning for Laryngeal Blood Vessel Classification: Issue and Improvement
Figure 3 for Analysis of the Two-Step Heterogeneous Transfer Learning for Laryngeal Blood Vessel Classification: Issue and Improvement
Figure 4 for Analysis of the Two-Step Heterogeneous Transfer Learning for Laryngeal Blood Vessel Classification: Issue and Improvement
Viaarxiv icon