Picture for Xing Zhang

Xing Zhang

LLM Enabled Multi-Agent System for 6G Networks: Framework and Method of Dual-Loop Edge-Terminal Collaboration

Add code
Sep 05, 2025
Viaarxiv icon

Repeating Words for Video-Language Retrieval with Coarse-to-Fine Objectives

Add code
Aug 20, 2025
Viaarxiv icon

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Add code
May 12, 2025
Viaarxiv icon

AnimeDL-2M: Million-Scale AI-Generated Anime Image Detection and Localization in Diffusion Era

Add code
Apr 15, 2025
Viaarxiv icon

MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering

Add code
Feb 26, 2025
Viaarxiv icon

Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation

Add code
Jan 27, 2025
Figure 1 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Figure 2 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Figure 3 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Figure 4 for Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
Viaarxiv icon

Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation

Add code
Oct 30, 2024
Figure 1 for Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation
Figure 2 for Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation
Figure 3 for Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation
Figure 4 for Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation
Viaarxiv icon

AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding

Add code
Jun 11, 2024
Figure 1 for AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding
Figure 2 for AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding
Figure 3 for AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding
Figure 4 for AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding
Viaarxiv icon

Logic Synthesis with Generative Deep Neural Networks

Add code
Jun 07, 2024
Figure 1 for Logic Synthesis with Generative Deep Neural Networks
Figure 2 for Logic Synthesis with Generative Deep Neural Networks
Figure 3 for Logic Synthesis with Generative Deep Neural Networks
Figure 4 for Logic Synthesis with Generative Deep Neural Networks
Viaarxiv icon

Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach

Add code
Mar 28, 2024
Figure 1 for Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach
Figure 2 for Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach
Figure 3 for Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach
Figure 4 for Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach
Viaarxiv icon