Zhengyang Li

Noise-Robust AV-ASR Using Visual Features Both in the Whisper Encoder and Decoder

Jan 26, 2026

OpenViGA: Video Generation for Automotive Driving Scenes by Streamlining and Fine-Tuning Open Source Models with Public Data

Sep 18, 2025

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook

Sep 17, 2025

A Comprehensive Review of Multi-Agent Reinforcement Learning in Video Games

Sep 03, 2025

Integrated Snapshot Near-infrared Hyperspectral Imaging Framework with Diffractive Optics

Aug 20, 2025

Intelligent road crack detection and analysis based on improved YOLOv8

Apr 16, 2025

Calibrating Deep Neural Network using Euclidean Distance

Oct 23, 2024

Irregularity-Informed Time Series Analysis: Adaptive Modelling of Spatial and Temporal Dynamics

Oct 16, 2024

Boosting Certificate Robustness for Time Series Classification with Efficient Self-Ensemble

Sep 04, 2024

Correlation Analysis of Adversarial Attack in Time Series Classification

Aug 21, 2024