Bo Ren

AlignFormer: Modality Matching Can Achieve Better Zero-shot Instruction-Following Speech-LLM

Dec 02, 2024

Masked Angle-Aware Autoencoder for Remote Sensing Images

Aug 04, 2024

Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis

Jun 10, 2024

On decoder-only architecture for speech-to-text and large language model integration

Jul 14, 2023

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution

May 12, 2023

Multi-Space Neural Radiance Fields

May 07, 2023

Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections

Apr 18, 2023

Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies

Mar 26, 2023

Turning a CLIP Model into a Scene Text Detector

Mar 01, 2023

SLAN: Self-Locator Aided Network for Cross-Modal Understanding

Dec 08, 2022