Hao Liu

Tony

OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning

Dec 31, 2024
Via arXiv

Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning

Dec 29, 2024
Via arXiv

Lightweight Method for Interactive 3D Medical Image Segmentation with Multi-Round Result Fusion

Dec 11, 2024
Via arXiv

Behavior Backdoor for Deep Learning Models

Dec 02, 2024
Via arXiv

Second FRCSyn-onGoing: Winning Solutions and Post-Challenge Analysis to Improve Face Recognition with Synthetic Data

Dec 02, 2024
Via arXiv

Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting

Nov 27, 2024
Via arXiv

3D Scene Graph Guided Vision-Language Pre-training

Nov 27, 2024
Via arXiv

Nd-BiMamba2: A Unified Bidirectional Architecture for Multi-Dimensional Data Processing

Nov 22, 2024
Via arXiv

Morph: A Motion-free Physics Optimization Framework for Human Motion Generation

Nov 22, 2024
Via arXiv

Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

Nov 20, 2024
Via arXiv