Picture for Shitao Xiao

Shitao Xiao

OmniGen: Unified Image Generation

Add code
Sep 17, 2024
Figure 1 for OmniGen: Unified Image Generation
Figure 2 for OmniGen: Unified Image Generation
Figure 3 for OmniGen: Unified Image Generation
Figure 4 for OmniGen: Unified Image Generation
Viaarxiv icon

Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment

Add code
Aug 23, 2024
Figure 1 for Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment
Figure 2 for Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment
Figure 3 for Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment
Figure 4 for Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment
Viaarxiv icon

SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking

Add code
Jul 05, 2024
Figure 1 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 2 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 3 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 4 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Viaarxiv icon

VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval

Add code
Jun 06, 2024
Figure 1 for VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
Figure 2 for VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
Figure 3 for VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
Figure 4 for VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
Viaarxiv icon

MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding

Add code
Jun 06, 2024
Figure 1 for MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding
Figure 2 for MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding
Figure 3 for MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding
Figure 4 for MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding
Viaarxiv icon

SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms

Add code
Jun 05, 2024
Viaarxiv icon

Compressing Lengthy Context With UltraGist

Add code
May 26, 2024
Viaarxiv icon

Extending Llama-3's Context Ten-Fold Overnight

Add code
Apr 30, 2024
Figure 1 for Extending Llama-3's Context Ten-Fold Overnight
Figure 2 for Extending Llama-3's Context Ten-Fold Overnight
Figure 3 for Extending Llama-3's Context Ten-Fold Overnight
Figure 4 for Extending Llama-3's Context Ten-Fold Overnight
Viaarxiv icon

Extensible Embedding: A Flexible Multipler For LLM's Context Length

Add code
Feb 18, 2024
Viaarxiv icon

BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models

Add code
Feb 18, 2024
Viaarxiv icon