Picture for Zhiyuan Zhao

Zhiyuan Zhao

How Can Time Series Analysis Benefit From Multiple Modalities? A Survey and Outlook

Add code
Mar 14, 2025
Viaarxiv icon

From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models

Add code
Mar 08, 2025
Viaarxiv icon

FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation

Add code
Jan 03, 2025
Viaarxiv icon

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Add code
Dec 10, 2024
Figure 1 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Figure 2 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Figure 3 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Figure 4 for OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
Viaarxiv icon

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Add code
Oct 16, 2024
Figure 1 for DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Figure 2 for DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Figure 3 for DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Figure 4 for DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Viaarxiv icon

MinerU: An Open-Source Solution for Precise Document Content Extraction

Add code
Sep 27, 2024
Figure 1 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 2 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 3 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Figure 4 for MinerU: An Open-Source Solution for Precise Document Content Extraction
Viaarxiv icon

Quantum-inspired Interpretable Deep Learning Architecture for Text Sentiment Analysis

Add code
Aug 15, 2024
Viaarxiv icon

StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation

Add code
Aug 02, 2024
Viaarxiv icon

TSI-Bench: Benchmarking Time Series Imputation

Add code
Jun 18, 2024
Viaarxiv icon

Time-Series Forecasting for Out-of-Distribution Generalization Using Invariant Learning

Add code
Jun 13, 2024
Viaarxiv icon