Picture for Shiliang Sun

Shiliang Sun

Global-Local Dual Perception for MLLMs in High-Resolution Text-Rich Image Translation

Add code
Feb 25, 2026
Viaarxiv icon

LEMON: How Well Do MLLMs Perform Temporal Multimodal Understanding on Instructional Videos?

Add code
Jan 27, 2026
Viaarxiv icon

Revealing the Challenges of Sim-to-Real Transfer in Model-Based Reinforcement Learning via Latent Space Modeling

Add code
Jun 15, 2025
Figure 1 for Revealing the Challenges of Sim-to-Real Transfer in Model-Based Reinforcement Learning via Latent Space Modeling
Figure 2 for Revealing the Challenges of Sim-to-Real Transfer in Model-Based Reinforcement Learning via Latent Space Modeling
Figure 3 for Revealing the Challenges of Sim-to-Real Transfer in Model-Based Reinforcement Learning via Latent Space Modeling
Figure 4 for Revealing the Challenges of Sim-to-Real Transfer in Model-Based Reinforcement Learning via Latent Space Modeling
Viaarxiv icon

Multimodal Machine Translation with Visual Scene Graph Pruning

Add code
May 26, 2025
Viaarxiv icon

Memory Reviving, Continuing Learning and Beyond: Evaluation of Pre-trained Encoders and Decoders for Multimodal Machine Translation

Add code
Apr 25, 2025
Viaarxiv icon

Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models

Add code
Apr 22, 2025
Viaarxiv icon

MST-GAT: A Multimodal Spatial-Temporal Graph Attention Network for Time Series Anomaly Detection

Add code
Oct 17, 2023
Viaarxiv icon

GNN-XML: Graph Neural Networks for Extreme Multi-label Text Classification

Add code
Dec 10, 2020
Figure 1 for GNN-XML: Graph Neural Networks for Extreme Multi-label Text Classification
Figure 2 for GNN-XML: Graph Neural Networks for Extreme Multi-label Text Classification
Figure 3 for GNN-XML: Graph Neural Networks for Extreme Multi-label Text Classification
Figure 4 for GNN-XML: Graph Neural Networks for Extreme Multi-label Text Classification
Viaarxiv icon

Manifold Partition Discriminant Analysis

Add code
Nov 23, 2020
Figure 1 for Manifold Partition Discriminant Analysis
Figure 2 for Manifold Partition Discriminant Analysis
Figure 3 for Manifold Partition Discriminant Analysis
Figure 4 for Manifold Partition Discriminant Analysis
Viaarxiv icon

Adversarial Attacks for Multi-view Deep Models

Add code
Jun 19, 2020
Figure 1 for Adversarial Attacks for Multi-view Deep Models
Figure 2 for Adversarial Attacks for Multi-view Deep Models
Figure 3 for Adversarial Attacks for Multi-view Deep Models
Figure 4 for Adversarial Attacks for Multi-view Deep Models
Viaarxiv icon