Xu Sun

UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?

Mar 13, 2025

Generative Frame Sampler for Long Video Understanding

Mar 12, 2025

Next Block Prediction: Video Generation via Semi-Autoregressive Modeling

Feb 12, 2025

VidTwin: Video VAE with Decoupled Structure and Dynamics

Dec 23, 2024

PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension

Dec 16, 2024

Hyperspectral Image Cross-Domain Object Detection Method based on Spectral-Spatial Feature Alignment

Nov 25, 2024

Unveiling Molecular Secrets: An LLM-Augmented Linear Model for Explainable and Calibratable Molecular Property Prediction

Oct 11, 2024

Temporal Reasoning Transfer from Text to Video

Oct 08, 2024

Enhancing Data Quality through Self-learning on Imbalanced Financial Risk Data

Sep 15, 2024

Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts

Aug 28, 2024