Picture for Renqiu Xia

Renqiu Xia

GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training

Add code
Dec 16, 2024
Viaarxiv icon

Chimera: Improving Generalist Model with Domain-Specific Experts

Add code
Dec 08, 2024
Viaarxiv icon

Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

Add code
Oct 13, 2024
Figure 1 for Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Figure 2 for Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Figure 3 for Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Figure 4 for Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Viaarxiv icon

CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation

Add code
Sep 05, 2024
Figure 1 for CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation
Figure 2 for CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation
Figure 3 for CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation
Figure 4 for CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation
Viaarxiv icon

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models

Add code
Jun 17, 2024
Figure 1 for DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models
Figure 2 for DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models
Figure 3 for DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models
Figure 4 for DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models
Viaarxiv icon

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

Add code
Mar 23, 2024
Viaarxiv icon

ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

Add code
Feb 19, 2024
Viaarxiv icon

REVO-LION: Evaluating and Refining Vision-Language Instruction Tuning Datasets

Add code
Oct 10, 2023
Viaarxiv icon

ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation

Add code
Sep 25, 2023
Figure 1 for ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation
Figure 2 for ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation
Figure 3 for ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation
Figure 4 for ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation
Viaarxiv icon

StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding

Add code
Sep 25, 2023
Viaarxiv icon