Zhi-Qi Cheng

ProMQA: Question Answering Dataset for Multimodal Procedural Activity Understanding

Oct 29, 2024
Via arXiv

Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios

Oct 22, 2024
Via arXiv

POPoS: Improving Efficient and Robust Facial Landmark Detection with Parallel Optimal Position Search

Oct 15, 2024
Via arXiv

DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing

Sep 02, 2024
Via arXiv

FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing

Aug 22, 2024
Via arXiv

SZTU-CMU at MER2024: Improving Emotion-LLaMA with Conv-Attention for Multimodal Emotion Recognition

Aug 21, 2024
Via arXiv

Combo: Co-speech holistic 3D human motion generation and efficient customizable adaptation in harmony

Aug 18, 2024
Via arXiv

SHIELD: LLM-Driven Schema Induction for Predictive Analytics in EV Battery Supply Chain Disruptions

Aug 09, 2024
Via arXiv

Prioritize Alignment in Dataset Distillation

Aug 06, 2024
Via arXiv

Music2P: A Multi-Modal AI-Driven Tool for Simplifying Album Cover Design

Aug 03, 2024
Via arXiv