Picture for Can Ma

Can Ma

Customizing Visual Emotion Evaluation for MLLMs: An Open-vocabulary, Multifaceted, and Scalable Approach

Add code
Sep 26, 2025
Viaarxiv icon

Gather and Trace: Rethinking Video TextVQA from an Instance-oriented Perspective

Add code
Aug 06, 2025
Viaarxiv icon

An Empirical Study on Configuring In-Context Learning Demonstrations for Unleashing MLLMs' Sentimental Perception Capability

Add code
May 22, 2025
Viaarxiv icon

Multi-Modal Molecular Representation Learning via Structure Awareness

Add code
May 09, 2025
Viaarxiv icon

Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition

Add code
Mar 24, 2025
Viaarxiv icon

AS-GCL: Asymmetric Spectral Augmentation on Graph Contrastive Learning

Add code
Feb 19, 2025
Viaarxiv icon

Communication-Efficient Personalized Federal Graph Learning via Low-Rank Decomposition

Add code
Dec 18, 2024
Figure 1 for Communication-Efficient Personalized Federal Graph Learning via Low-Rank Decomposition
Figure 2 for Communication-Efficient Personalized Federal Graph Learning via Low-Rank Decomposition
Figure 3 for Communication-Efficient Personalized Federal Graph Learning via Low-Rank Decomposition
Figure 4 for Communication-Efficient Personalized Federal Graph Learning via Low-Rank Decomposition
Viaarxiv icon

Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues

Add code
Dec 17, 2024
Figure 1 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 2 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 3 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 4 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Viaarxiv icon

Falcon-UI: Understanding GUI Before Following User Instructions

Add code
Dec 12, 2024
Figure 1 for Falcon-UI: Understanding GUI Before Following User Instructions
Figure 2 for Falcon-UI: Understanding GUI Before Following User Instructions
Figure 3 for Falcon-UI: Understanding GUI Before Following User Instructions
Figure 4 for Falcon-UI: Understanding GUI Before Following User Instructions
Viaarxiv icon

Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation

Add code
Nov 22, 2024
Figure 1 for Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation
Figure 2 for Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation
Figure 3 for Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation
Figure 4 for Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation
Viaarxiv icon