Picture for Xun Yang

Xun Yang

A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli

Add code
Mar 20, 2025
Viaarxiv icon

EgoBlind: Towards Egocentric Visual Assistance for the Blind People

Add code
Mar 11, 2025
Viaarxiv icon

CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based Framework

Add code
Mar 05, 2025
Viaarxiv icon

Precise Localization of Memories: A Fine-grained Neuron-level Knowledge Editing Technique for LLMs

Add code
Mar 03, 2025
Viaarxiv icon

EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering

Add code
Feb 11, 2025
Viaarxiv icon

AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring

Add code
Jan 16, 2025
Figure 1 for AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
Figure 2 for AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
Figure 3 for AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
Figure 4 for AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
Viaarxiv icon

Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion

Add code
Jan 08, 2025
Figure 1 for Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion
Figure 2 for Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion
Figure 3 for Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion
Figure 4 for Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion
Viaarxiv icon

Learning states enhanced knowledge tracing: Simulating the diversity in real-world learning process

Add code
Dec 27, 2024
Viaarxiv icon

Repetitive Action Counting with Hybrid Temporal Relation Modeling

Add code
Dec 10, 2024
Figure 1 for Repetitive Action Counting with Hybrid Temporal Relation Modeling
Figure 2 for Repetitive Action Counting with Hybrid Temporal Relation Modeling
Figure 3 for Repetitive Action Counting with Hybrid Temporal Relation Modeling
Figure 4 for Repetitive Action Counting with Hybrid Temporal Relation Modeling
Viaarxiv icon

PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm

Add code
Dec 05, 2024
Figure 1 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Figure 2 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Figure 3 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Figure 4 for PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm
Viaarxiv icon