Picture for Conghui He

Conghui He

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

Add code
Oct 29, 2024
Figure 1 for Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction
Figure 2 for Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction
Figure 3 for Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction
Figure 4 for Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction
Viaarxiv icon

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Add code
Oct 23, 2024
Figure 1 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Figure 2 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Figure 3 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Figure 4 for MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models
Viaarxiv icon

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

Add code
Oct 22, 2024
Viaarxiv icon

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Add code
Oct 16, 2024
Figure 1 for DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Figure 2 for DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Figure 3 for DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Figure 4 for DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Viaarxiv icon

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Add code
Oct 13, 2024
Figure 1 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Figure 2 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Figure 3 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Figure 4 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Viaarxiv icon

Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining

Add code
Oct 10, 2024
Figure 1 for Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining
Figure 2 for Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining
Figure 3 for Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining
Figure 4 for Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining
Viaarxiv icon

Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning

Add code
Oct 09, 2024
Viaarxiv icon

Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models

Add code
Oct 08, 2024
Figure 1 for Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models
Figure 2 for Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models
Figure 3 for Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models
Figure 4 for Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models
Viaarxiv icon

MinerU: An Open-Source Solution for Precise Document Content Extraction

Add code
Sep 27, 2024
Figure 1 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 2 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 3 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 4 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Viaarxiv icon

Harnessing Diversity for Important Data Selection in Pretraining Large Language Models

Add code
Sep 25, 2024
Viaarxiv icon