Picture for Bowen Zhang

Bowen Zhang

CLIP-UP: A Simple and Efficient Mixture-of-Experts CLIP Training Recipe with Sparse Upcycling

Add code
Feb 03, 2025
Viaarxiv icon

STIV: Scalable Text and Image Conditioned Video Generation

Add code
Dec 10, 2024
Viaarxiv icon

Structured 3D Latents for Scalable and Versatile 3D Generation

Add code
Dec 02, 2024
Figure 1 for Structured 3D Latents for Scalable and Versatile 3D Generation
Figure 2 for Structured 3D Latents for Scalable and Versatile 3D Generation
Figure 3 for Structured 3D Latents for Scalable and Versatile 3D Generation
Figure 4 for Structured 3D Latents for Scalable and Versatile 3D Generation
Viaarxiv icon

Learning from Different Samples: A Source-free Framework for Semi-supervised Domain Adaptation

Add code
Nov 11, 2024
Viaarxiv icon

CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation

Add code
Nov 05, 2024
Figure 1 for CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation
Figure 2 for CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation
Figure 3 for CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation
Figure 4 for CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation
Viaarxiv icon

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

Add code
Nov 05, 2024
Figure 1 for Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Figure 2 for Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Figure 3 for Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Figure 4 for Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Viaarxiv icon

Improve Vision Language Model Chain-of-thought Reasoning

Add code
Oct 21, 2024
Figure 1 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 2 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 3 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 4 for Improve Vision Language Model Chain-of-thought Reasoning
Viaarxiv icon

ReeFRAME: Reeb Graph based Trajectory Analysis Framework to Capture Top-Down and Bottom-Up Patterns of Life

Add code
Oct 19, 2024
Viaarxiv icon

Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles

Add code
Oct 09, 2024
Figure 1 for Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
Figure 2 for Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
Figure 3 for Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
Figure 4 for Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
Viaarxiv icon

Toward Physics-guided Time Series Embedding

Add code
Oct 09, 2024
Figure 1 for Toward Physics-guided Time Series Embedding
Figure 2 for Toward Physics-guided Time Series Embedding
Figure 3 for Toward Physics-guided Time Series Embedding
Figure 4 for Toward Physics-guided Time Series Embedding
Viaarxiv icon