Picture for Wei Ping

Wei Ping

MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs

Add code
Nov 04, 2024
Viaarxiv icon

NVLM: Open Frontier-Class Multimodal LLMs

Add code
Sep 17, 2024
Figure 1 for NVLM: Open Frontier-Class Multimodal LLMs
Figure 2 for NVLM: Open Frontier-Class Multimodal LLMs
Figure 3 for NVLM: Open Frontier-Class Multimodal LLMs
Figure 4 for NVLM: Open Frontier-Class Multimodal LLMs
Viaarxiv icon

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Add code
Jul 19, 2024
Viaarxiv icon

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

Add code
Jul 02, 2024
Viaarxiv icon

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

X-VILA: Cross-Modality Alignment for Large Language Model

Add code
May 29, 2024
Figure 1 for X-VILA: Cross-Modality Alignment for Large Language Model
Figure 2 for X-VILA: Cross-Modality Alignment for Large Language Model
Figure 3 for X-VILA: Cross-Modality Alignment for Large Language Model
Figure 4 for X-VILA: Cross-Modality Alignment for Large Language Model
Viaarxiv icon

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

Add code
May 27, 2024
Viaarxiv icon

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Add code
Feb 02, 2024
Viaarxiv icon

ChatQA: Building GPT-4 Level Conversational QA Models

Add code
Jan 23, 2024
Viaarxiv icon

VILA: On Pre-training for Visual Language Models

Add code
Dec 14, 2023
Viaarxiv icon