Picture for Ziyu Liu

Ziyu Liu

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Add code
Oct 23, 2024
Figure 1 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Figure 2 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Figure 3 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Figure 4 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Viaarxiv icon

ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation

Add code
Jul 19, 2024
Viaarxiv icon

Guidelines for Augmentation Selection in Contrastive Learning for Time Series Classification

Add code
Jul 12, 2024
Viaarxiv icon

MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations

Add code
Jul 01, 2024
Viaarxiv icon

MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs

Add code
Jun 17, 2024
Viaarxiv icon

V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results

Add code
Jun 17, 2024
Figure 1 for V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results
Figure 2 for V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results
Figure 3 for V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results
Viaarxiv icon

TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models

Add code
May 20, 2024
Figure 1 for TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models
Figure 2 for TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models
Figure 3 for TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models
Viaarxiv icon

TBNet: A Neural Architectural Defense Framework Facilitating DNN Model Protection in Trusted Execution Environments

Add code
May 07, 2024
Viaarxiv icon

RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition

Add code
Mar 20, 2024
Viaarxiv icon

Self-Supervised Learning for Time Series: Contrastive or Generative?

Add code
Mar 14, 2024
Viaarxiv icon