Yukang Gan

ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models

Nov 30, 2024

VoCo-LLaMA: Towards Vision Compression with Large Language Models

Jun 18, 2024

HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback

Mar 14, 2024

LLaMA Pro: Progressive LLaMA with Block Expansion

Jan 04, 2024

Binary Embedding-based Retrieval at Tencent

Feb 17, 2023

Cross-Modal Attentional Context Learning for RGB-D Object Detection

Oct 30, 2018

Knowledge-Guided Recurrent Neural Network Learning for Task-Oriented Action Prediction

Jul 15, 2017

LSTM-CF: Unifying Context Modeling and Fusion with LSTMs for RGB-D Scene Labeling

Jul 26, 2016