Picture for Kaiyou Song

Kaiyou Song

M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance

Add code
Feb 26, 2025
Viaarxiv icon

Bootstrap Masked Visual Modeling via Hard Patches Mining

Add code
Dec 21, 2023
Viaarxiv icon

Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning

Add code
Dec 16, 2023
Viaarxiv icon

DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions

Add code
Sep 22, 2023
Viaarxiv icon

Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning

Add code
Apr 13, 2023
Viaarxiv icon

Hard Patches Mining for Masked Image Modeling

Add code
Apr 12, 2023
Figure 1 for Hard Patches Mining for Masked Image Modeling
Figure 2 for Hard Patches Mining for Masked Image Modeling
Figure 3 for Hard Patches Mining for Masked Image Modeling
Figure 4 for Hard Patches Mining for Masked Image Modeling
Viaarxiv icon

Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning

Add code
Dec 13, 2022
Viaarxiv icon