Picture for Minh Tran

Minh Tran

Amodal Instance Segmentation with Diffusion Shape Prior Estimation

Add code
Sep 26, 2024
Viaarxiv icon

HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model

Add code
Jun 01, 2024
Viaarxiv icon

S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling

Add code
May 07, 2024
Viaarxiv icon

CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect

Add code
Apr 17, 2024
Viaarxiv icon

Dyadic Interaction Modeling for Social Behavior Generation

Add code
Mar 27, 2024
Viaarxiv icon

ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation

Add code
Mar 22, 2024
Viaarxiv icon

3FM: Multi-modal Meta-learning for Federated Tasks

Add code
Dec 15, 2023
Viaarxiv icon

SolarFormer: Multi-scale Transformer for Solar PV Profiling

Add code
Oct 30, 2023
Viaarxiv icon

Privacy-preserving Representation Learning for Speech Understanding

Add code
Oct 26, 2023
Viaarxiv icon

Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation

Add code
Oct 05, 2023
Viaarxiv icon