Picture for Fangyun Wei

Fangyun Wei

UniGraspTransformer: Simplified Policy Distillation for Scalable Dexterous Robotic Grasping

Add code
Dec 03, 2024
Viaarxiv icon

CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Add code
Nov 29, 2024
Viaarxiv icon

Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning

Add code
Jul 01, 2024
Viaarxiv icon

EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees

Add code
Jun 24, 2024
Viaarxiv icon

Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models

Add code
Jun 24, 2024
Viaarxiv icon

Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%

Add code
Jun 17, 2024
Figure 1 for Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%
Figure 2 for Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%
Figure 3 for Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%
Figure 4 for Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%
Viaarxiv icon

A Hong Kong Sign Language Corpus Collected from Sign-interpreted TV News

Add code
May 02, 2024
Viaarxiv icon

Rethinking Generative Large Language Model Evaluation for Semantic Comprehension

Add code
Mar 12, 2024
Viaarxiv icon

Beyond Text: Frozen Large Language Models in Visual Signal Comprehension

Add code
Mar 12, 2024
Viaarxiv icon

AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls

Add code
Feb 06, 2024
Viaarxiv icon