Joe Dhanith P R

SAG-ViT: A Scale-Aware, High-Fidelity Patching Approach with Graph Attention for Vision Transformers

Nov 14, 2024
Via arXiv

Enhancing Diabetic Retinopathy Detection with CNN-Based Models: A Comparative Study of UNET and Stacked UNET Architectures

Nov 02, 2024
Via arXiv

Multimodal Emotion Recognition using Audio-Video Transformer Fusion with Cross Attention

Jul 26, 2024
Via arXiv