Picture for Huaxin Zhang

Huaxin Zhang

Improving Multi-modal Large Language Model through Boosting Vision Capabilities

Add code
Oct 17, 2024
Figure 1 for Improving Multi-modal Large Language Model through Boosting Vision Capabilities
Figure 2 for Improving Multi-modal Large Language Model through Boosting Vision Capabilities
Figure 3 for Improving Multi-modal Large Language Model through Boosting Vision Capabilities
Figure 4 for Improving Multi-modal Large Language Model through Boosting Vision Capabilities
Viaarxiv icon

Cross-video Identity Correlating for Person Re-identification Pre-training

Add code
Sep 27, 2024
Viaarxiv icon

Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM

Add code
Jun 18, 2024
Viaarxiv icon

GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection

Add code
Mar 12, 2024
Viaarxiv icon

HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation

Add code
Aug 24, 2023
Viaarxiv icon

Optimization of Forcemyography Sensor Placement for Arm Movement Recognition

Add code
Jul 22, 2022
Figure 1 for Optimization of Forcemyography Sensor Placement for Arm Movement Recognition
Figure 2 for Optimization of Forcemyography Sensor Placement for Arm Movement Recognition
Figure 3 for Optimization of Forcemyography Sensor Placement for Arm Movement Recognition
Figure 4 for Optimization of Forcemyography Sensor Placement for Arm Movement Recognition
Viaarxiv icon

Context-aware Proposal Network for Temporal Action Detection

Add code
Jun 18, 2022
Figure 1 for Context-aware Proposal Network for Temporal Action Detection
Figure 2 for Context-aware Proposal Network for Temporal Action Detection
Figure 3 for Context-aware Proposal Network for Temporal Action Detection
Figure 4 for Context-aware Proposal Network for Temporal Action Detection
Viaarxiv icon