Hang Zhang

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Jan 08, 2025
Via arXiv

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Jan 03, 2025
Via arXiv

AltGen: AI-Driven Alt Text Generation for Enhancing EPUB Accessibility

Dec 30, 2024
Via arXiv

Comparative Analysis of Listwise Reranking with Large Language Models in Limited-Resource Language Contexts

Dec 28, 2024
Via arXiv

Robustness of Large Language Models Against Adversarial Attacks

Dec 22, 2024
Via arXiv

Feedback Regulated Opto-Mechanical Soft Robotic Actuators

Dec 20, 2024
Via arXiv

SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation

Dec 15, 2024
Via arXiv

Splats in Splats: Embedding Invisible 3D Watermark within Gaussian Splatting

Dec 04, 2024
Via arXiv

CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning

Dec 04, 2024
Via arXiv

Fidelity-Imposed Displacement Editing for the Learn2Reg 2024 SHG-BF Challenge

Oct 28, 2024
Via arXiv