Picture for Jinlong Li

Jinlong Li

University of Science and Technology of China

A Training-free LLM Framework with Interaction between Contextually Related Subtasks in Solving Complex Tasks

Add code
Mar 29, 2025
Viaarxiv icon

Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding

Add code
Mar 20, 2025
Viaarxiv icon

V2X-DG: Domain Generalization for Vehicle-to-Everything Cooperative Perception

Add code
Mar 19, 2025
Viaarxiv icon

Survey on Single-Image Reflection Removal using Deep Learning Techniques

Add code
Feb 12, 2025
Viaarxiv icon

Language model driven: a PROTAC generation pipeline with dual constraints of structure and property

Add code
Dec 12, 2024
Viaarxiv icon

LESS: Label-Efficient and Single-Stage Referring 3D Segmentation

Add code
Oct 17, 2024
Figure 1 for LESS: Label-Efficient and Single-Stage Referring 3D Segmentation
Figure 2 for LESS: Label-Efficient and Single-Stage Referring 3D Segmentation
Figure 3 for LESS: Label-Efficient and Single-Stage Referring 3D Segmentation
Figure 4 for LESS: Label-Efficient and Single-Stage Referring 3D Segmentation
Viaarxiv icon

CoMamba: Real-time Cooperative Perception Unlocked with State Space Models

Add code
Sep 16, 2024
Figure 1 for CoMamba: Real-time Cooperative Perception Unlocked with State Space Models
Figure 2 for CoMamba: Real-time Cooperative Perception Unlocked with State Space Models
Figure 3 for CoMamba: Real-time Cooperative Perception Unlocked with State Space Models
Figure 4 for CoMamba: Real-time Cooperative Perception Unlocked with State Space Models
Viaarxiv icon

When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding

Add code
Aug 15, 2024
Figure 1 for When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding
Figure 2 for When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding
Figure 3 for When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding
Figure 4 for When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding
Viaarxiv icon

3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance

Add code
Jul 13, 2024
Viaarxiv icon

Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization

Add code
Jul 11, 2024
Figure 1 for Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization
Figure 2 for Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization
Figure 3 for Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization
Figure 4 for Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization
Viaarxiv icon