Picture for Bowen Zhang

Bowen Zhang

STIV: Scalable Text and Image Conditioned Video Generation

Add code
Dec 10, 2024
Viaarxiv icon

Structured 3D Latents for Scalable and Versatile 3D Generation

Add code
Dec 02, 2024
Viaarxiv icon

Learning from Different Samples: A Source-free Framework for Semi-supervised Domain Adaptation

Add code
Nov 11, 2024
Viaarxiv icon

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

Add code
Nov 05, 2024
Figure 1 for Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Figure 2 for Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Figure 3 for Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Figure 4 for Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Viaarxiv icon

CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation

Add code
Nov 05, 2024
Figure 1 for CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation
Figure 2 for CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation
Figure 3 for CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation
Figure 4 for CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation
Viaarxiv icon

Improve Vision Language Model Chain-of-thought Reasoning

Add code
Oct 21, 2024
Figure 1 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 2 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 3 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 4 for Improve Vision Language Model Chain-of-thought Reasoning
Viaarxiv icon

ReeFRAME: Reeb Graph based Trajectory Analysis Framework to Capture Top-Down and Bottom-Up Patterns of Life

Add code
Oct 19, 2024
Viaarxiv icon

MM-Ego: Towards Building Egocentric Multimodal LLMs

Add code
Oct 09, 2024
Figure 1 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Figure 2 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Figure 3 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Figure 4 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Viaarxiv icon

Toward Physics-guided Time Series Embedding

Add code
Oct 09, 2024
Figure 1 for Toward Physics-guided Time Series Embedding
Figure 2 for Toward Physics-guided Time Series Embedding
Figure 3 for Toward Physics-guided Time Series Embedding
Figure 4 for Toward Physics-guided Time Series Embedding
Viaarxiv icon

Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles

Add code
Oct 09, 2024
Figure 1 for Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
Figure 2 for Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
Figure 3 for Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
Figure 4 for Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
Viaarxiv icon