Picture for Yan Yang

Yan Yang

Growing a Twig to Accelerate Large Vision-Language Models

Add code
Mar 18, 2025
Viaarxiv icon

ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks

Add code
Mar 10, 2025
Viaarxiv icon

Spatial Transcriptomics Analysis of Spatially Dense Gene Expression Prediction

Add code
Mar 03, 2025
Viaarxiv icon

A space-decoupling framework for optimization on bounded-rank matrices with orthogonally invariant constraints

Add code
Jan 23, 2025
Viaarxiv icon

High Resolution Tree Height Mapping of the Amazon Forest using Planet NICFI Images and LiDAR-Informed U-Net Model

Add code
Jan 17, 2025
Figure 1 for High Resolution Tree Height Mapping of the Amazon Forest using Planet NICFI Images and LiDAR-Informed U-Net Model
Figure 2 for High Resolution Tree Height Mapping of the Amazon Forest using Planet NICFI Images and LiDAR-Informed U-Net Model
Figure 3 for High Resolution Tree Height Mapping of the Amazon Forest using Planet NICFI Images and LiDAR-Informed U-Net Model
Figure 4 for High Resolution Tree Height Mapping of the Amazon Forest using Planet NICFI Images and LiDAR-Informed U-Net Model
Viaarxiv icon

Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning

Add code
Dec 16, 2024
Viaarxiv icon

T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs

Add code
Dec 02, 2024
Figure 1 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Figure 2 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Figure 3 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Figure 4 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Viaarxiv icon

SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition

Add code
Oct 22, 2024
Figure 1 for SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition
Figure 2 for SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition
Figure 3 for SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition
Figure 4 for SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition
Viaarxiv icon

LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset

Add code
Oct 21, 2024
Figure 1 for LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset
Figure 2 for LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset
Figure 3 for LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset
Figure 4 for LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset
Viaarxiv icon

Storyboard guided Alignment for Fine-grained Video Action Recognition

Add code
Oct 18, 2024
Viaarxiv icon