Xiaolong Li

From Visuals to Vocabulary: Establishing Equivalence Between Image and Text Token Through Autoregressive Pre-training in MLLMs

Feb 13, 2025

Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding

Jan 19, 2025

Memoryless Multimodal Anomaly Detection via Student-Teacher Network and Signed Distance Learning

Sep 09, 2024

Deep progressive reinforcement learning-based flexible resource scheduling framework for IRS and UAV-assisted MEC system

Aug 02, 2024

Grounded Compositional and Diverse Text-to-3D with Pretrained Multi-View Diffusion Model

Apr 28, 2024

Fast Sparse View Guided NeRF Update for Object Reconfigurations

Mar 16, 2024

A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D

Feb 29, 2024

Anchor function: a type of benchmark functions for studying language models

Jan 16, 2024

BD-MSA: Body decouple VHR Remote Sensing Image Change Detection method guided by multi-scale feature information aggregation

Jan 09, 2024

IntentDial: An Intent Graph based Multi-Turn Dialogue System with Reasoning Path Visualization

Oct 18, 2023