Picture for Zhijian Liu

Zhijian Liu

Token-Efficient Long Video Understanding for Multimodal LLMs

Add code
Mar 06, 2025
Viaarxiv icon

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

Add code
Feb 20, 2025
Viaarxiv icon

NVILA: Efficient Frontier Visual Language Models

Add code
Dec 05, 2024
Figure 1 for NVILA: Efficient Frontier Visual Language Models
Figure 2 for NVILA: Efficient Frontier Visual Language Models
Figure 3 for NVILA: Efficient Frontier Visual Language Models
Figure 4 for NVILA: Efficient Frontier Visual Language Models
Viaarxiv icon

VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge

Add code
Nov 19, 2024
Figure 1 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 2 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 3 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 4 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Viaarxiv icon

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Add code
Aug 21, 2024
Figure 1 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Figure 2 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Figure 3 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Figure 4 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Viaarxiv icon

Sparse Refinement for Efficient High-Resolution Semantic Segmentation

Add code
Jul 26, 2024
Viaarxiv icon

LidarDM: Generative LiDAR Simulation in a Generated World

Add code
Apr 03, 2024
Viaarxiv icon

StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation

Add code
Dec 19, 2023
Viaarxiv icon

Point Transformer V3: Simpler, Faster, Stronger

Add code
Dec 15, 2023
Figure 1 for Point Transformer V3: Simpler, Faster, Stronger
Figure 2 for Point Transformer V3: Simpler, Faster, Stronger
Figure 3 for Point Transformer V3: Simpler, Faster, Stronger
Figure 4 for Point Transformer V3: Simpler, Faster, Stronger
Viaarxiv icon

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Add code
Sep 21, 2023
Figure 1 for LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Figure 2 for LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Figure 3 for LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Figure 4 for LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Viaarxiv icon