Picture for Yi Bin

Yi Bin

Multi-Scale Contrastive Learning for Video Temporal Grounding

Add code
Dec 10, 2024
Viaarxiv icon

Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation

Add code
Dec 10, 2024
Viaarxiv icon

Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping

Add code
Oct 11, 2024
Figure 1 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 2 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 3 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 4 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Viaarxiv icon

PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs

Add code
Oct 07, 2024
Figure 1 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 2 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 3 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 4 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Viaarxiv icon

MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models

Add code
Aug 08, 2024
Viaarxiv icon

GalleryGPT: Analyzing Paintings with Large Multimodal Models

Add code
Aug 01, 2024
Figure 1 for GalleryGPT: Analyzing Paintings with Large Multimodal Models
Figure 2 for GalleryGPT: Analyzing Paintings with Large Multimodal Models
Figure 3 for GalleryGPT: Analyzing Paintings with Large Multimodal Models
Figure 4 for GalleryGPT: Analyzing Paintings with Large Multimodal Models
Viaarxiv icon

Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning

Add code
Aug 01, 2024
Figure 1 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 2 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 3 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Figure 4 for Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning
Viaarxiv icon

Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection

Add code
Jul 17, 2024
Figure 1 for Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection
Figure 2 for Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection
Figure 3 for Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection
Figure 4 for Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection
Viaarxiv icon

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Add code
Jul 04, 2024
Viaarxiv icon

Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

Add code
Jun 26, 2024
Viaarxiv icon