Picture for Lixin Gu

Lixin Gu

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Add code
Dec 06, 2024
Figure 1 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Figure 2 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Figure 3 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Figure 4 for Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
Viaarxiv icon

Minimum Efforts to Build an End-to-End Spatial-Temporal Action Detector

Add code
Jun 07, 2022
Figure 1 for Minimum Efforts to Build an End-to-End Spatial-Temporal Action Detector
Figure 2 for Minimum Efforts to Build an End-to-End Spatial-Temporal Action Detector
Figure 3 for Minimum Efforts to Build an End-to-End Spatial-Temporal Action Detector
Figure 4 for Minimum Efforts to Build an End-to-End Spatial-Temporal Action Detector
Viaarxiv icon