Efficient ViTs


Vision Transformers for Zero-Shot Clustering of Animal Images: A Comparative Benchmarking Study

Feb 03, 2026
Via arXiv

FOVI: A biologically-inspired foveated interface for deep vision models

Feb 03, 2026
Via arXiv

LoopViT: Scaling Visual ARC with Looped Transformers

Feb 02, 2026
Via arXiv

A Hybrid Mamba-SAM Architecture for Efficient 3D Medical Image Segmentation

Jan 31, 2026
Via arXiv

RepSFNet: A Single Fusion Network with Structural Reparameterization for Crowd Counting

Jan 28, 2026
Via arXiv

Comparison of Image Processing Models in Quark Gluon Jet Classification

Jan 29, 2026
Via arXiv

MetricAnything: Scaling Metric Depth Pretraining with Noisy Heterogeneous Sources

Jan 29, 2026
Via arXiv

Scalable Analytic Classifiers with Associative Drift Compensation for Class-Incremental Learning of Vision Transformers

Jan 29, 2026
Via arXiv

Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding

Jan 28, 2026
Via arXiv

Semi-Supervised Masked Autoencoders: Unlocking Vision Transformer Potential with Limited Data

Jan 27, 2026
Via arXiv