Zhiyu Wu

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Dec 13, 2024

Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting

Nov 19, 2024

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Nov 12, 2024

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Oct 17, 2024

MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark

Aug 15, 2024

AllMatch: Exploiting All Unlabeled Data for Semi-Supervised Learning

Jun 22, 2024

Delta Tensor: Efficient Vector and Tensor Storage in Delta Lake

May 03, 2024

Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services

Apr 25, 2024

LA-Net: Landmark-Aware Learning for Reliable Facial Expression Recognition under Label Noise

Jul 20, 2023

BERT-ERC: Fine-tuning BERT is Enough for Emotion Recognition in Conversation

Jan 17, 2023