Picture for Yanhua Cheng

Yanhua Cheng

Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy

Add code
Nov 23, 2024
Figure 1 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 2 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 3 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 4 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Viaarxiv icon

Knowledge Condensation and Reasoning for Knowledge-based VQA

Add code
Mar 15, 2024
Viaarxiv icon

Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning

Add code
Jan 01, 2024
Viaarxiv icon

Cross-view Semantic Alignment for Livestreaming Product Recognition

Add code
Aug 19, 2023
Viaarxiv icon

Cross-Domain Product Representation Learning for Rich-Content E-Commerce

Add code
Aug 10, 2023
Viaarxiv icon

3rd Place Solution to "Google Landmark Retrieval 2020"

Add code
Aug 25, 2020
Figure 1 for 3rd Place Solution to "Google Landmark Retrieval 2020"
Figure 2 for 3rd Place Solution to "Google Landmark Retrieval 2020"
Figure 3 for 3rd Place Solution to "Google Landmark Retrieval 2020"
Figure 4 for 3rd Place Solution to "Google Landmark Retrieval 2020"
Viaarxiv icon