Picture for Yi Bin

Yi Bin

Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping

Add code
Oct 11, 2024
Viaarxiv icon

PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs

Add code
Oct 07, 2024
Figure 1 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 2 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 3 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 4 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Viaarxiv icon

MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models

Add code
Aug 08, 2024
Viaarxiv icon

GalleryGPT: Analyzing Paintings with Large Multimodal Models

Add code
Aug 01, 2024
Viaarxiv icon

Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning

Add code
Aug 01, 2024
Figure 1 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 2 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 3 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 4 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Viaarxiv icon

Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection

Add code
Jul 17, 2024
Viaarxiv icon

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Add code
Jul 04, 2024
Viaarxiv icon

Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

Add code
Jun 26, 2024
Viaarxiv icon

Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives

Add code
Jun 09, 2024
Viaarxiv icon

Non-Autoregressive Sentence Ordering

Add code
Oct 19, 2023
Viaarxiv icon