Picture for Haoran Tang

Haoran Tang

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos

Add code
Dec 02, 2024
Viaarxiv icon

PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance

Add code
Nov 05, 2024
Figure 1 for PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance
Figure 2 for PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance
Figure 3 for PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance
Figure 4 for PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance
Viaarxiv icon

A Peaceman-Rachford Splitting Approach with Deep Equilibrium Network for Channel Estimation

Add code
Oct 31, 2024
Viaarxiv icon

MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval

Add code
Aug 20, 2024
Figure 1 for MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval
Figure 2 for MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval
Figure 3 for MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval
Figure 4 for MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval
Viaarxiv icon

Why does in-context learning fail sometimes? Evaluating in-context learning on open and closed questions

Add code
Jul 02, 2024
Figure 1 for Why does in-context learning fail sometimes? Evaluating in-context learning on open and closed questions
Figure 2 for Why does in-context learning fail sometimes? Evaluating in-context learning on open and closed questions
Figure 3 for Why does in-context learning fail sometimes? Evaluating in-context learning on open and closed questions
Figure 4 for Why does in-context learning fail sometimes? Evaluating in-context learning on open and closed questions
Viaarxiv icon

Toward Structure Fairness in Dynamic Graph Embedding: A Trend-aware Dual Debiasing Approach

Add code
Jun 19, 2024
Viaarxiv icon

Expert-Guided Extinction of Toxic Tokens for Debiased Generation

Add code
May 29, 2024
Viaarxiv icon

RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter

Add code
May 29, 2024
Viaarxiv icon

ST-LLM: Large Language Models Are Effective Temporal Learners

Add code
Mar 30, 2024
Figure 1 for ST-LLM: Large Language Models Are Effective Temporal Learners
Figure 2 for ST-LLM: Large Language Models Are Effective Temporal Learners
Figure 3 for ST-LLM: Large Language Models Are Effective Temporal Learners
Figure 4 for ST-LLM: Large Language Models Are Effective Temporal Learners
Viaarxiv icon

Surgment: Segmentation-enabled Semantic Search and Creation of Visual Question and Feedback to Support Video-Based Surgery Learning

Add code
Feb 27, 2024
Figure 1 for Surgment: Segmentation-enabled Semantic Search and Creation of Visual Question and Feedback to Support Video-Based Surgery Learning
Figure 2 for Surgment: Segmentation-enabled Semantic Search and Creation of Visual Question and Feedback to Support Video-Based Surgery Learning
Figure 3 for Surgment: Segmentation-enabled Semantic Search and Creation of Visual Question and Feedback to Support Video-Based Surgery Learning
Figure 4 for Surgment: Segmentation-enabled Semantic Search and Creation of Visual Question and Feedback to Support Video-Based Surgery Learning
Viaarxiv icon