Ziwei Liu

Nanyang Technological University

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Mar 16, 2025

Cosh-DiT: Co-Speech Gesture Video Synthesis via Hybrid Audio-Visual Diffusion Transformers

Mar 13, 2025

EgoLife: Towards Egocentric Life Assistant

Mar 05, 2025

Dynamic Parallel Tree Search for Efficient LLM Reasoning

Feb 22, 2025

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Feb 10, 2025

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Feb 06, 2025

IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait

Jan 31, 2025

Optimizing Sentence Embedding with Pseudo-Labeling and Model Ensembles: A Hierarchical Framework for Enhanced NLP Tasks

Jan 27, 2025

A Comprehensive Framework for Semantic Similarity Detection Using Transformer Architectures and Enhanced Ensemble Techniques

Jan 24, 2025

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Jan 23, 2025