Yuxin Song

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Dec 25, 2024
Via arXiv

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Dec 24, 2024
Via arXiv

The Key of Understanding Vision Tasks: Explanatory Instructions

Dec 24, 2024
Via arXiv

DistinctAD: Distinctive Audio Description Generation in Contexts

Nov 27, 2024
Via arXiv

A Survey on Consumer IoT Traffic: Security and Privacy

Mar 24, 2024
Via arXiv

Multi-level graph learning for audio event classification and human-perceived annoyance rating prediction

Dec 15, 2023
Via arXiv

GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?

Nov 27, 2023
Via arXiv

What Can Simple Arithmetic Operations Do for Temporal Modeling?

Jul 18, 2023
Via arXiv

UATVR: Uncertainty-Adaptive Text-Video Retrieval

Jan 16, 2023
Via arXiv

GRATIS: Deep Learning Graph Representation with Task-specific Topology and Multi-dimensional Edge Features

Nov 19, 2022
Via arXiv