Picture for Daniel Li

Daniel Li

Jack

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Perception Encoder: The best visual embeddings are not at the output of the network

Add code
Apr 17, 2025
Figure 1 for Perception Encoder: The best visual embeddings are not at the output of the network
Figure 2 for Perception Encoder: The best visual embeddings are not at the output of the network
Figure 3 for Perception Encoder: The best visual embeddings are not at the output of the network
Figure 4 for Perception Encoder: The best visual embeddings are not at the output of the network
Viaarxiv icon

Data Foundations for Large Scale Multimodal Clinical Foundation Models

Add code
Mar 09, 2025
Figure 1 for Data Foundations for Large Scale Multimodal Clinical Foundation Models
Figure 2 for Data Foundations for Large Scale Multimodal Clinical Foundation Models
Figure 3 for Data Foundations for Large Scale Multimodal Clinical Foundation Models
Figure 4 for Data Foundations for Large Scale Multimodal Clinical Foundation Models
Viaarxiv icon

DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment

Add code
Dec 20, 2024
Figure 1 for DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Figure 2 for DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Figure 3 for DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Figure 4 for DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
Viaarxiv icon

Creating a Cooperative AI Policymaking Platform through Open Source Collaboration

Add code
Dec 09, 2024
Figure 1 for Creating a Cooperative AI Policymaking Platform through Open Source Collaboration
Figure 2 for Creating a Cooperative AI Policymaking Platform through Open Source Collaboration
Viaarxiv icon

Incorporating Human Explanations for Robust Hate Speech Detection

Add code
Nov 09, 2024
Figure 1 for Incorporating Human Explanations for Robust Hate Speech Detection
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

Add code
Mar 25, 2024
Viaarxiv icon

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Add code
Mar 12, 2024
Figure 1 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 2 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 3 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 4 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Viaarxiv icon

HumanEval on Latest GPT Models -- 2024

Add code
Feb 20, 2024
Figure 1 for HumanEval on Latest GPT Models -- 2024
Viaarxiv icon