Bin Sun

Member, IEEE

Multimodal Prompt Alignment for Facial Expression Recognition

Jun 26, 2025

FocalAD: Local Motion Planning for End-to-End Autonomous Driving

Jun 13, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

May 22, 2025

Enhancing User-Oriented Proactivity in Open-Domain Dialogues with Critic Guidance

May 18, 2025

Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning

Dec 21, 2024

DrVideo: Document Retrieval Based Long Video Understanding

Jun 18, 2024

Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation

Jun 12, 2024

GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering

Feb 04, 2024

Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning

Jan 19, 2024

Turning Dust into Gold: Distilling Complex Reasoning Capabilities from LLMs by Leveraging Negative Data

Dec 20, 2023