Tao Gong

Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection

Oct 03, 2024

Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization

Aug 05, 2024

MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection

Mar 08, 2024

Bootstrapping Audio-Visual Segmentation by Strengthening Audio Cues

Feb 06, 2024

Towards More Unified In-context Visual Understanding

Dec 05, 2023

CloudBrain-MRS: An Intelligent Cloud Computing Platform for in vivo Magnetic Resonance Spectroscopy Preprocessing, Quantification, and Analysis

Jun 19, 2023

MultiModal-GPT: A Vision and Language Model for Dialogue with Humans

May 09, 2023

Temporal RoI Align for Video Object Recognition

Sep 11, 2021

Mining Contextual Information Beyond Image for Semantic Segmentation

Aug 26, 2021

Towards Generalizable and Robust Face Manipulation Detection via Bag-of-local-feature

Mar 14, 2021