Picture for Bin Sun

Bin Sun

Member, IEEE

DrVideo: Document Retrieval Based Long Video Understanding

Add code
Jun 18, 2024
Figure 1 for DrVideo: Document Retrieval Based Long Video Understanding
Figure 2 for DrVideo: Document Retrieval Based Long Video Understanding
Figure 3 for DrVideo: Document Retrieval Based Long Video Understanding
Figure 4 for DrVideo: Document Retrieval Based Long Video Understanding
Viaarxiv icon

Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation

Add code
Jun 12, 2024
Figure 1 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Figure 2 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Figure 3 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Figure 4 for Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Viaarxiv icon

GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering

Add code
Feb 04, 2024
Viaarxiv icon

Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning

Add code
Jan 19, 2024
Viaarxiv icon

Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data

Add code
Dec 20, 2023
Viaarxiv icon

EXMODD: An EXplanatory Multimodal Open-Domain Dialogue dataset

Add code
Oct 17, 2023
Viaarxiv icon

Large Language Models Need Holistically Thought in Medical Conversational QA

Add code
May 10, 2023
Figure 1 for Large Language Models Need Holistically Thought in Medical Conversational QA
Figure 2 for Large Language Models Need Holistically Thought in Medical Conversational QA
Figure 3 for Large Language Models Need Holistically Thought in Medical Conversational QA
Figure 4 for Large Language Models Need Holistically Thought in Medical Conversational QA
Viaarxiv icon

LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition

Add code
May 05, 2023
Figure 1 for LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition
Figure 2 for LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition
Figure 3 for LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition
Figure 4 for LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition
Viaarxiv icon

Heterogeneous-Branch Collaborative Learning for Dialogue Generation

Add code
Mar 21, 2023
Viaarxiv icon

Image as Set of Points

Add code
Mar 02, 2023
Viaarxiv icon