Title: Adaptive Aspect Ratios with Patch-Mixup-ViT-based Vehicle ReID

Nov 09, 2024

Mei Qiu, Lauren Ann Christopher, Stanley Chien, Lingxi Li

Abstract: Vision Transformers (ViTs) have shown exceptional performance in vehicle re-identification (ReID) tasks. However, non-square aspect ratios of image or video inputs can reduce re-identification accuracy. To address this challenge, we propose a novel, general, human-perception-driven ViT-based ReID framework that fuses models trained on a variety of aspect ratios. Our key contributions are threefold: (i) We analyze the impact of aspect ratios on performance using the VeRi-776 and VehicleID datasets, providing guidance for input settings based on the distribution of original image aspect ratios. (ii) We introduce a patch-wise mixup strategy during ViT patchification, guided by spatial attention scores, and implement an uneven stride for better alignment with object aspect ratios. (iii) We propose a dynamic feature fusion ReID network to enhance model robustness. Our method outperforms state-of-the-art transformer-based approaches on both datasets, with only a minimal increase in inference time per image.
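To make the "uneven stride" idea in contribution (ii) concrete, the following is a minimal sketch of how the patch grid of a sliding-window patchifier changes when the vertical and horizontal strides differ. It uses only the standard no-padding output-size formula, floor((size - patch) / stride) + 1. The function name `patch_grid` and all sizes and strides below are illustrative assumptions, not settings taken from the paper.

```python
def patch_grid(height, width, patch=16, stride_h=16, stride_w=16):
    """Return (rows, cols) of patches for a sliding-window patchifier.

    Hypothetical helper for illustration only: it assumes square patches,
    no padding, and independent strides along height and width.
    """
    if height < patch or width < patch:
        raise ValueError("image must be at least one patch in each dimension")
    rows = (height - patch) // stride_h + 1
    cols = (width - patch) // stride_w + 1
    return rows, cols


# Assumed example sizes (not from the paper).
square = patch_grid(224, 224, patch=16, stride_h=12, stride_w=12)
wide_even = patch_grid(224, 384, patch=16, stride_h=12, stride_w=12)
wide_uneven = patch_grid(224, 384, patch=16, stride_h=12, stride_w=21)

print(square)       # (18, 18)
print(wide_even)    # (18, 31)
print(wide_uneven)  # (18, 18)
```

With equal strides, the wider input produces many more columns of tokens than rows. In this assumed configuration, a larger horizontal stride brings the token grid back to the same shape as the square case without resizing the image, which is one way a stride could be matched to an object's aspect ratio.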
