Jiayuan Fan

Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE

Aug 10, 2024
Via arXiv

Lightweight Model Pre-training via Language Guided Knowledge Distillation

Jun 17, 2024
Via arXiv

DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification

Jun 11, 2024
Via arXiv

Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision

Jun 06, 2024
Via arXiv

MotionChain: Conversational Motion Controllers via Multimodal Prompts

Apr 03, 2024
Via arXiv

MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies

Mar 03, 2024
Via arXiv

Rethinking of Feature Interaction for Multi-task Learning on Dense Prediction

Dec 21, 2023
Via arXiv

ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model

Dec 01, 2023
Via arXiv

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

Nov 30, 2023
Via arXiv

PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation

Nov 03, 2023
Via arXiv