Picture for Zijie Zhai

Zijie Zhai

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Add code
Mar 11, 2025
Viaarxiv icon

Diffusion Model with Representation Alignment for Protein Inverse Folding

Add code
Dec 12, 2024
Figure 1 for Diffusion Model with Representation Alignment for Protein Inverse Folding
Figure 2 for Diffusion Model with Representation Alignment for Protein Inverse Folding
Figure 3 for Diffusion Model with Representation Alignment for Protein Inverse Folding
Figure 4 for Diffusion Model with Representation Alignment for Protein Inverse Folding
Viaarxiv icon

Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification

Add code
Dec 03, 2024
Figure 1 for Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification
Figure 2 for Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification
Figure 3 for Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification
Figure 4 for Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification
Viaarxiv icon