Picture for Wenze Hu

Wenze Hu

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Add code
Oct 03, 2024
Viaarxiv icon

Guiding Instruction-based Image Editing via Multimodal Large Language Models

Add code
Sep 29, 2023
Viaarxiv icon

Million-scale Object Detection with Large Vision Model

Add code
Dec 19, 2022
Viaarxiv icon

NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction

Add code
Nov 15, 2022
Viaarxiv icon

ParCNetV2: Oversized Kernel with Enhanced Attention

Add code
Nov 14, 2022
Viaarxiv icon

CabViT: Cross Attention among Blocks for Vision Transformer

Add code
Nov 14, 2022
Viaarxiv icon

Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs

Add code
Oct 08, 2022
Figure 1 for Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs
Figure 2 for Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs
Figure 3 for Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs
Figure 4 for Fast-ParC: Position Aware Global Kernel for ConvNets and ViTs
Viaarxiv icon

ALBench: A Framework for Evaluating Active Learning in Object Detection

Add code
Aug 10, 2022
Figure 1 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Figure 2 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Figure 3 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Figure 4 for ALBench: A Framework for Evaluating Active Learning in Object Detection
Viaarxiv icon

Implementation of an Automated Learning System for Non-experts

Add code
Mar 26, 2022
Figure 1 for Implementation of an Automated Learning System for Non-experts
Figure 2 for Implementation of an Automated Learning System for Non-experts
Figure 3 for Implementation of an Automated Learning System for Non-experts
Figure 4 for Implementation of an Automated Learning System for Non-experts
Viaarxiv icon

EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers

Add code
Mar 15, 2022
Figure 1 for EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers
Figure 2 for EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers
Figure 3 for EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers
Figure 4 for EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers
Viaarxiv icon