Ziwei Liu

Nanyang Technological University

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Feb 10, 2025

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Feb 06, 2025

IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait

Jan 31, 2025

Optimizing Sentence Embedding with Pseudo-Labeling and Model Ensembles: A Hierarchical Framework for Enhanced NLP Tasks

Jan 27, 2025

A Comprehensive Framework for Semantic Similarity Detection Using Transformer Architectures and Enhanced Ensemble Techniques

Jan 24, 2025

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Jan 23, 2025

SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation

Jan 16, 2025

CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Jan 15, 2025

RepVideo: Rethinking Cross-Layer Representation for Video Generation

Jan 15, 2025

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Jan 14, 2025