Picture for Chong Luo

Chong Luo

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Add code
Nov 07, 2024
Figure 1 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Figure 2 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Figure 3 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Figure 4 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Viaarxiv icon

PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting

Add code
Oct 29, 2024
Viaarxiv icon

Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms

Add code
Jun 13, 2024
Viaarxiv icon

MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion

Add code
May 30, 2024
Viaarxiv icon

Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild

Add code
Apr 29, 2024
Viaarxiv icon

OmniVid: A Generative Framework for Universal Video Understanding

Add code
Mar 26, 2024
Viaarxiv icon

Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering

Add code
Mar 14, 2024
Viaarxiv icon

Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs

Add code
Dec 12, 2023
Viaarxiv icon

ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models

Add code
Nov 30, 2023
Viaarxiv icon

MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation

Add code
Nov 30, 2023
Viaarxiv icon