Picture for Jin Ma

Jin Ma

HD-VGGT: High-Resolution Visual Geometry Transformer

Add code
Mar 28, 2026
Viaarxiv icon

Towards Automated Community Notes Generation with Large Vision Language Models for Combating Contextual Deception

Add code
Mar 23, 2026
Viaarxiv icon

A Large-Scale Remote Sensing Dataset and VLM-based Algorithm for Fine-Grained Road Hierarchy Classification

Add code
Mar 22, 2026
Viaarxiv icon

Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models

Add code
Mar 16, 2026
Viaarxiv icon

Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment Localization

Add code
Feb 10, 2026
Viaarxiv icon

VDE Bench: Evaluating The Capability of Image Editing Models to Modify Visual Documents

Add code
Jan 27, 2026
Viaarxiv icon

Rank4Gen: RAG-Preference-Aligned Document Set Selection and Ranking

Add code
Jan 16, 2026
Viaarxiv icon

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Add code
Dec 29, 2025
Viaarxiv icon

Virtual Width Networks

Add code
Nov 17, 2025
Figure 1 for Virtual Width Networks
Figure 2 for Virtual Width Networks
Figure 3 for Virtual Width Networks
Figure 4 for Virtual Width Networks
Viaarxiv icon

DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models

Add code
Sep 04, 2025
Figure 1 for DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models
Figure 2 for DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models
Figure 3 for DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models
Figure 4 for DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models
Viaarxiv icon