Shi Yu

Tsinghua University

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Feb 19, 2025

Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search

Feb 18, 2025

KBAlign: Efficient Self Adaptation on Specific Knowledge Bases

Nov 25, 2024

Building A Coding Assistant via the Retrieval-Augmented Language Model

Oct 21, 2024

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards

Oct 17, 2024

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Oct 14, 2024

Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation

Oct 11, 2024

Multi-Modal Multi-Granularity Tokenizer for Chu Bamboo Slip Scripts

Sep 02, 2024

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework

Aug 02, 2024

Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression

Feb 25, 2024