Li Yuan

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Nov 15, 2024
Via arXiv

Sparse Orthogonal Parameters Tuning for Continual Learning

Nov 05, 2024
Via arXiv

ETTFS: An Efficient Training Framework for Time-to-First-Spike Neuron

Oct 31, 2024
Via arXiv

Spatial-Temporal Search for Spiking Neural Networks

Oct 24, 2024
Via arXiv

Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model

Oct 18, 2024
Via arXiv

MoH: Multi-Head Attention as Mixture-of-Head Attention

Oct 15, 2024
Via arXiv

Is Parameter Collision Hindering Continual Learning in LLMs?

Oct 14, 2024
Via arXiv

MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

Oct 09, 2024
Via arXiv

ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

Sep 03, 2024
Via arXiv

OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model

Sep 02, 2024
Via arXiv