Picture for Conghui He

Conghui He

Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification

Add code
Jan 30, 2026
Viaarxiv icon

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

Add code
Jan 29, 2026
Viaarxiv icon

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Add code
Jan 22, 2026
Viaarxiv icon

ChartVerse: Scaling Chart Reasoning via Reliable Programmatic Synthesis from Scratch

Add code
Jan 20, 2026
Viaarxiv icon

Scientific Image Synthesis: Benchmarking, Methodologies, and Downstream Utility

Add code
Jan 17, 2026
Viaarxiv icon

LRAS: Advanced Legal Reasoning with Agentic Search

Add code
Jan 12, 2026
Viaarxiv icon

IPCV: Information-Preserving Compression for MLLM Visual Encoders

Add code
Dec 21, 2025
Figure 1 for IPCV: Information-Preserving Compression for MLLM Visual Encoders
Figure 2 for IPCV: Information-Preserving Compression for MLLM Visual Encoders
Figure 3 for IPCV: Information-Preserving Compression for MLLM Visual Encoders
Figure 4 for IPCV: Information-Preserving Compression for MLLM Visual Encoders
Viaarxiv icon

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Add code
Dec 18, 2025
Viaarxiv icon

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value

Add code
Dec 16, 2025
Figure 1 for OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value
Figure 2 for OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value
Figure 3 for OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value
Figure 4 for OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value
Viaarxiv icon

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

Add code
Dec 11, 2025
Viaarxiv icon