Picture for Georgios Tzimiropoulos

Georgios Tzimiropoulos

Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning

Add code
Dec 09, 2024
Viaarxiv icon

Discriminative Fine-tuning of LVLMs

Add code
Dec 05, 2024
Viaarxiv icon

FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion

Add code
Nov 27, 2024
Viaarxiv icon

CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition

Add code
Sep 27, 2024
Viaarxiv icon

MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance

Add code
Sep 17, 2024
Figure 1 for MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance
Figure 2 for MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance
Figure 3 for MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance
Figure 4 for MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance
Viaarxiv icon

MobileQuant: Mobile-friendly Quantization for On-device Language Models

Add code
Aug 25, 2024
Figure 1 for MobileQuant: Mobile-friendly Quantization for On-device Language Models
Figure 2 for MobileQuant: Mobile-friendly Quantization for On-device Language Models
Figure 3 for MobileQuant: Mobile-friendly Quantization for On-device Language Models
Figure 4 for MobileQuant: Mobile-friendly Quantization for On-device Language Models
Viaarxiv icon

CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs

Add code
Aug 19, 2024
Viaarxiv icon

CLIPCleaner: Cleaning Noisy Labels with CLIP

Add code
Aug 19, 2024
Viaarxiv icon

Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing

Add code
Jul 15, 2024
Viaarxiv icon

MeMSVD: Long-Range Temporal Structure Capturing Using Incremental SVD

Add code
Jun 11, 2024
Viaarxiv icon