Picture for Xiaoxin Chen

Xiaoxin Chen

Autonomous Deep Agent

Add code
Feb 10, 2025
Viaarxiv icon

Data Quality Enhancement on the Basis of Diversity with Large Language Models for Text Classification: Uncovered, Difficult, and Noisy

Add code
Dec 10, 2024
Viaarxiv icon

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Add code
Nov 16, 2024
Viaarxiv icon

A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models

Add code
Oct 05, 2024
Figure 1 for A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models
Figure 2 for A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models
Figure 3 for A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models
Figure 4 for A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models
Viaarxiv icon

ControlAR: Controllable Image Generation with Autoregressive Models

Add code
Oct 03, 2024
Figure 1 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 2 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 3 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 4 for ControlAR: Controllable Image Generation with Autoregressive Models
Viaarxiv icon

Efficient Test-Time Prompt Tuning for Vision-Language Models

Add code
Aug 11, 2024
Figure 1 for Efficient Test-Time Prompt Tuning for Vision-Language Models
Figure 2 for Efficient Test-Time Prompt Tuning for Vision-Language Models
Figure 3 for Efficient Test-Time Prompt Tuning for Vision-Language Models
Figure 4 for Efficient Test-Time Prompt Tuning for Vision-Language Models
Viaarxiv icon

EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model

Add code
Jun 28, 2024
Figure 1 for EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Figure 2 for EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Figure 3 for EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Figure 4 for EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Viaarxiv icon

FAGhead: Fully Animate Gaussian Head from Monocular Videos

Add code
Jun 27, 2024
Figure 1 for FAGhead: Fully Animate Gaussian Head from Monocular Videos
Figure 2 for FAGhead: Fully Animate Gaussian Head from Monocular Videos
Figure 3 for FAGhead: Fully Animate Gaussian Head from Monocular Videos
Figure 4 for FAGhead: Fully Animate Gaussian Head from Monocular Videos
Viaarxiv icon

Meta-Auxiliary Learning for Micro-Expression Recognition

Add code
Apr 18, 2024
Viaarxiv icon

GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning

Add code
Nov 21, 2023
Viaarxiv icon