Qinglong Zhang

Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight

Jul 22, 2024
Via arXiv

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis

Feb 25, 2024
Via arXiv

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Jan 15, 2024
Via arXiv

EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought

May 24, 2023
Via arXiv

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

May 11, 2023
Via arXiv

FedKNOW: Federated Continual Learning with Signature Task Knowledge Integration at Edge

Dec 04, 2022
Via arXiv

LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision

Dec 18, 2021
Via arXiv

ResT: An Efficient Transformer for Visual Recognition

Jun 06, 2021
Via arXiv

Group-CAM: Group Score-Weighted Visual Explanations for Deep Convolutional Networks

Mar 26, 2021
Via arXiv