Picture for Guiguang Ding

Guiguang Ding

Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models

Add code
Dec 10, 2024
Viaarxiv icon

[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs

Add code
Dec 08, 2024
Viaarxiv icon

PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation

Add code
Dec 04, 2024
Viaarxiv icon

HEIE: MLLM-Based Hierarchical Explainable AIGC Image Implausibility Evaluator

Add code
Nov 26, 2024
Viaarxiv icon

Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning

Add code
Nov 26, 2024
Figure 1 for Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning
Figure 2 for Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning
Figure 3 for Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning
Figure 4 for Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning
Viaarxiv icon

LBPE: Long-token-first Tokenization to Improve Large Language Models

Add code
Nov 08, 2024
Viaarxiv icon

CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts

Add code
Oct 21, 2024
Viaarxiv icon

Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection

Add code
Sep 10, 2024
Figure 1 for Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection
Figure 2 for Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection
Figure 3 for Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection
Figure 4 for Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection
Viaarxiv icon

TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval

Add code
Sep 02, 2024
Viaarxiv icon

LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image

Add code
Aug 14, 2024
Figure 1 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 2 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 3 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Figure 4 for LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image
Viaarxiv icon