Wei Ji

MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks

Nov 29, 2024

Differentiable Gaussian Representation for Incomplete CT Reconstruction

Nov 07, 2024

Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach

Nov 03, 2024

Grounding is All You Need? Dual Temporal Grounding for Video Dialog

Oct 08, 2024

Personalized Knowledge Tracing through Student Representation Reconstruction and Class Imbalance Mitigation

Sep 10, 2024

Semantic Alignment for Multimodal Large Language Models

Aug 23, 2024

Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation

Jul 29, 2024

DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving

Jul 22, 2024

Described Spatial-Temporal Video Detection

Jul 08, 2024

Spider: A Unified Framework for Context-dependent Concept Understanding

May 02, 2024