Bin Wen

Kwai-STaR: Transform LLMs into State-Transition Reasoners

Nov 07, 2024

EVLM: An Efficient Vision-Language Model for Visual Understanding

Jul 19, 2024

CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation

Jun 15, 2024

Recognize Any Regions

Nov 02, 2023

Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders

Oct 09, 2022

MetaFormer: A Unified Meta Framework for Fine-Grained Recognition

Mar 05, 2022

Unbiased Scene Graph Generation via Rich and Fair Semantic Extraction

Feb 01, 2020

An extended description logic system with knowledge element based on ALC

Apr 16, 2019