Vajira Thambawita

Calliope: A TTS-based Narrated E-book Creator Ensuring Exact Synchronization, Privacy, and Layout Fidelity

Feb 11, 2026
Via arXiv

ECG-IMN: Interpretable Mesomorphic Neural Networks for 12-Lead Electrocardiogram Interpretation

Feb 10, 2026
Via arXiv

Anatomy-Preserving Latent Diffusion for Generation of Brain Segmentation Masks with Ischemic Infarct

Feb 10, 2026
Via arXiv

VideoHEDGE: Entropy-Based Hallucination Detection for Video-VLMs via Semantic Clustering and Spatiotemporal Perturbations

Jan 13, 2026
Via arXiv

Medical Imaging AI Competitions Lack Fairness

Dec 19, 2025
Via arXiv

From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars

Jun 16, 2025
Via arXiv

SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding

May 22, 2025
Via arXiv

Embryo 2.0: Merging Synthetic and Real Data for Advanced AI Predictions

Dec 02, 2024
Via arXiv

Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis

Nov 20, 2024
Via arXiv

Kvasir-VQA: A Text-Image Pair GI Tract Dataset

Sep 02, 2024
Via arXiv