Qi Liu

Victor

Entailment-Preserving First-order Logic Representations in Natural Language Entailment

Feb 24, 2025
Via arXiv

Geometry-Aware 3D Salient Object Detection Network

Feb 23, 2025
Via arXiv

SentiFormer: Metadata Enhanced Transformer for Image Sentiment Analysis

Feb 21, 2025
Via arXiv

Unveiling the Magic of Code Reasoning through Hypothesis Decomposition and Amendment

Feb 17, 2025
Via arXiv

Chinese Spelling Correction: A Comprehensive Survey of Progress, Challenges, and Opportunities

Feb 17, 2025
Via arXiv

CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD Editing

Feb 06, 2025
Via arXiv

Fast Underwater Scene Reconstruction using Multi-View Stereo and Physical Imaging

Jan 21, 2025
Via arXiv

Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems

Jan 17, 2025
Via arXiv

OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning

Dec 31, 2024
Via arXiv

Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner

Dec 30, 2024
Via arXiv