Picture for Pritam Sarkar

Pritam Sarkar

Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization

Add code
Apr 16, 2025
Viaarxiv icon

Mitigating Object Hallucination via Data Augmented Contrastive Tuning

Add code
May 28, 2024
Viaarxiv icon

Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation

Add code
Aug 25, 2023
Figure 1 for Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation
Figure 2 for Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation
Figure 3 for Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation
Figure 4 for Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation
Viaarxiv icon

Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts

Add code
Jun 03, 2023
Viaarxiv icon

XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning

Add code
Dec 12, 2022
Viaarxiv icon

AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work

Add code
May 13, 2022
Figure 1 for AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
Figure 2 for AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
Figure 3 for AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
Figure 4 for AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
Viaarxiv icon

Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity

Add code
Nov 14, 2021
Figure 1 for Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity
Figure 2 for Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity
Figure 3 for Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity
Figure 4 for Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity
Viaarxiv icon

CardioGAN: Attentive Generative Adversarial Network with Dual Discriminators for Synthesis of ECG from PPG

Add code
Sep 30, 2020
Figure 1 for CardioGAN: Attentive Generative Adversarial Network with Dual Discriminators for Synthesis of ECG from PPG
Figure 2 for CardioGAN: Attentive Generative Adversarial Network with Dual Discriminators for Synthesis of ECG from PPG
Figure 3 for CardioGAN: Attentive Generative Adversarial Network with Dual Discriminators for Synthesis of ECG from PPG
Figure 4 for CardioGAN: Attentive Generative Adversarial Network with Dual Discriminators for Synthesis of ECG from PPG
Viaarxiv icon

Self-supervised ECG Representation Learning for Emotion Recognition

Add code
Feb 04, 2020
Figure 1 for Self-supervised ECG Representation Learning for Emotion Recognition
Figure 2 for Self-supervised ECG Representation Learning for Emotion Recognition
Figure 3 for Self-supervised ECG Representation Learning for Emotion Recognition
Figure 4 for Self-supervised ECG Representation Learning for Emotion Recognition
Viaarxiv icon

Self-supervised Learning for ECG-based Emotion Recognition

Add code
Oct 24, 2019
Figure 1 for Self-supervised Learning for ECG-based Emotion Recognition
Figure 2 for Self-supervised Learning for ECG-based Emotion Recognition
Figure 3 for Self-supervised Learning for ECG-based Emotion Recognition
Figure 4 for Self-supervised Learning for ECG-based Emotion Recognition
Viaarxiv icon