Picture for Xiaoxin Chen

Xiaoxin Chen

Data Quality Enhancement on the Basis of Diversity with Large Language Models for Text Classification: Uncovered, Difficult, and Noisy

Add code
Dec 10, 2024
Viaarxiv icon

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Add code
Nov 16, 2024
Viaarxiv icon

A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models

Add code
Oct 05, 2024
Figure 1 for A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models
Figure 2 for A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models
Figure 3 for A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models
Figure 4 for A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models
Viaarxiv icon

ControlAR: Controllable Image Generation with Autoregressive Models

Add code
Oct 03, 2024
Figure 1 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 2 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 3 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 4 for ControlAR: Controllable Image Generation with Autoregressive Models
Viaarxiv icon

Efficient Test-Time Prompt Tuning for Vision-Language Models

Add code
Aug 11, 2024
Figure 1 for Efficient Test-Time Prompt Tuning for Vision-Language Models
Figure 2 for Efficient Test-Time Prompt Tuning for Vision-Language Models
Figure 3 for Efficient Test-Time Prompt Tuning for Vision-Language Models
Figure 4 for Efficient Test-Time Prompt Tuning for Vision-Language Models
Viaarxiv icon

EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model

Add code
Jun 28, 2024
Figure 1 for EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Figure 2 for EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Figure 3 for EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Figure 4 for EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Viaarxiv icon

FAGhead: Fully Animate Gaussian Head from Monocular Videos

Add code
Jun 27, 2024
Figure 1 for FAGhead: Fully Animate Gaussian Head from Monocular Videos
Figure 2 for FAGhead: Fully Animate Gaussian Head from Monocular Videos
Figure 3 for FAGhead: Fully Animate Gaussian Head from Monocular Videos
Figure 4 for FAGhead: Fully Animate Gaussian Head from Monocular Videos
Viaarxiv icon

Meta-Auxiliary Learning for Micro-Expression Recognition

Add code
Apr 18, 2024
Viaarxiv icon

GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning

Add code
Nov 21, 2023
Viaarxiv icon

ImageBind-LLM: Multi-modality Instruction Tuning

Add code
Sep 11, 2023
Viaarxiv icon