Picture for Junyu Gao

Junyu Gao

Scale Efficient Training for Large Datasets

Add code
Mar 17, 2025
Viaarxiv icon

From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models

Add code
Mar 08, 2025
Viaarxiv icon

A Benchmark for Multi-Lingual Vision-Language Learning in Remote Sensing Image Captioning

Add code
Mar 06, 2025
Viaarxiv icon

FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation

Add code
Jan 03, 2025
Viaarxiv icon

SignEye: Traffic Sign Interpretation from Vehicle First-Person View

Add code
Nov 18, 2024
Viaarxiv icon

Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes

Add code
Nov 05, 2024
Figure 1 for Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Figure 2 for Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Figure 3 for Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Figure 4 for Real-Time Text Detection with Similar Mask in Traffic, Industrial, and Natural Scenes
Viaarxiv icon

Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models

Add code
Oct 11, 2024
Viaarxiv icon

Revisiting Essential and Nonessential Settings of Evidential Deep Learning

Add code
Oct 01, 2024
Figure 1 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Figure 2 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Figure 3 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Figure 4 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Viaarxiv icon

Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection

Add code
Sep 25, 2024
Figure 1 for Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection
Figure 2 for Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection
Figure 3 for Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection
Figure 4 for Focus Entirety and Perceive Environment for Arbitrary-Shaped Text Detection
Viaarxiv icon

Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera

Add code
Sep 25, 2024
Figure 1 for Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
Figure 2 for Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
Figure 3 for Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
Figure 4 for Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera
Viaarxiv icon