Zilong Xie

E2E-AFG: An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation

Nov 01, 2024
Via arXiv

From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities

Oct 03, 2024
Via arXiv

Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation

Mar 18, 2024
Via arXiv