Conghui He

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Jan 22, 2026
Via arXiv

ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch

Jan 20, 2026
Via arXiv

Scientific Image Synthesis: Benchmarking, Methodologies, and Downstream Utility

Jan 17, 2026
Via arXiv

LRAS: Advanced Legal Reasoning with Agentic Search

Jan 12, 2026
Via arXiv

IPCV: Information-Preserving Compression for MLLM Visual Encoders

Dec 21, 2025
Via arXiv

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Dec 18, 2025
Via arXiv

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value

Dec 16, 2025
Via arXiv

DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM

Dec 11, 2025
Via arXiv

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

Dec 11, 2025
Via arXiv

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Nov 18, 2025
Via arXiv