Xiaokang Chen

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Dec 13, 2024, via arXiv

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Nov 12, 2024, via arXiv

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Oct 17, 2024, via arXiv

The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Pre-trained Language Models

Jun 14, 2024, via arXiv

Improving Long Text Understanding with Knowledge Distilled from Summarization Model

May 08, 2024, via arXiv

InTeX: Interactive Text-to-texture Synthesis via Unified Depth-aware Inpainting

Mar 18, 2024, via arXiv

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

Feb 07, 2024, via arXiv

VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

May 25, 2023, via arXiv

Interactive Segment Anything NeRF with Feature Imitation

May 25, 2023, via arXiv

Uncovering and Categorizing Social Biases in Text-to-SQL

May 25, 2023, via arXiv