Wei Zhang

Alibaba Group

Vision-Language Model Predictive Control for Manipulation Planning and Trajectory Generation

Apr 07, 2025

The 1st Solution for 4th PVUW MeViS Challenge: Unleashing the Potential of Large Multimodal Models for Referring Video Segmentation

Apr 07, 2025

GROVE: A Generalized Reward for Learning Open-Vocabulary Physical Skill

Apr 05, 2025

Gaussian Process Tilted Nonparametric Density Estimation using Fisher Divergence Score Matching

Apr 04, 2025

CSF: Fixed-outline Floorplanning Based on the Conjugate Subgradient Algorithm Assisted by Q-Learning

Apr 04, 2025

Dexterous Manipulation through Imitation Learning: A Survey

Apr 04, 2025

ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement

Apr 03, 2025

Reconfigurable Codebook-Based Beamforming for RDARS-Aided mmWave MU-MIMO Systems

Apr 02, 2025

Hierarchical Attention Networks for Lossless Point Cloud Attribute Compression

Apr 01, 2025

Mapping Geopolitical Bias in 11 Large Language Models: A Bilingual, Dual-Framing Analysis of U.S.-China Tensions

Mar 31, 2025