Picture for Bin Xiao

Bin Xiao

Stephen

Mobius: Text to Seamless Looping Video Generation via Latent Shift

Add code
Feb 27, 2025
Viaarxiv icon

Baichuan-Omni-1.5 Technical Report

Add code
Jan 26, 2025
Viaarxiv icon

UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping

Add code
Jan 10, 2025
Viaarxiv icon

Mobile Augmented Reality Framework with Fusional Localization and Pose Estimation

Add code
Jan 06, 2025
Figure 1 for Mobile Augmented Reality Framework with Fusional Localization and Pose Estimation
Figure 2 for Mobile Augmented Reality Framework with Fusional Localization and Pose Estimation
Figure 3 for Mobile Augmented Reality Framework with Fusional Localization and Pose Estimation
Figure 4 for Mobile Augmented Reality Framework with Fusional Localization and Pose Estimation
Viaarxiv icon

CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training

Add code
Dec 23, 2024
Viaarxiv icon

Dynamic Ensemble Reasoning for LLM Experts

Add code
Dec 10, 2024
Figure 1 for Dynamic Ensemble Reasoning for LLM Experts
Figure 2 for Dynamic Ensemble Reasoning for LLM Experts
Figure 3 for Dynamic Ensemble Reasoning for LLM Experts
Figure 4 for Dynamic Ensemble Reasoning for LLM Experts
Viaarxiv icon

Jointly RS Image Deblurring and Super-Resolution with Adjustable-Kernel and Multi-Domain Attention

Add code
Dec 07, 2024
Viaarxiv icon

Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts

Add code
Dec 05, 2024
Viaarxiv icon

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Add code
Dec 05, 2024
Figure 1 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Figure 2 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Figure 3 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Figure 4 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Viaarxiv icon

Improving Transferable Targeted Attacks with Feature Tuning Mixup

Add code
Nov 23, 2024
Figure 1 for Improving Transferable Targeted Attacks with Feature Tuning Mixup
Figure 2 for Improving Transferable Targeted Attacks with Feature Tuning Mixup
Figure 3 for Improving Transferable Targeted Attacks with Feature Tuning Mixup
Figure 4 for Improving Transferable Targeted Attacks with Feature Tuning Mixup
Viaarxiv icon