Picture for Tao Yuan

Tao Yuan

Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs

Add code
Oct 21, 2024
Viaarxiv icon

PR2: A Physics- and Photo-realistic Testbed for Embodied AI and Humanoid Robots

Add code
Sep 03, 2024
Viaarxiv icon

FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models

Add code
Jul 16, 2024
Viaarxiv icon

LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K

Add code
Feb 06, 2024
Viaarxiv icon

Structured Attention for Unsupervised Dialogue Structure Induction

Add code
Oct 09, 2020
Figure 1 for Structured Attention for Unsupervised Dialogue Structure Induction
Figure 2 for Structured Attention for Unsupervised Dialogue Structure Induction
Figure 3 for Structured Attention for Unsupervised Dialogue Structure Induction
Figure 4 for Structured Attention for Unsupervised Dialogue Structure Induction
Viaarxiv icon

Joint Inference of States, Robot Knowledge, and Human Beliefs

Add code
Apr 25, 2020
Figure 1 for Joint Inference of States, Robot Knowledge, and Human Beliefs
Figure 2 for Joint Inference of States, Robot Knowledge, and Human Beliefs
Figure 3 for Joint Inference of States, Robot Knowledge, and Human Beliefs
Figure 4 for Joint Inference of States, Robot Knowledge, and Human Beliefs
Viaarxiv icon

PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points

Add code
Dec 16, 2019
Figure 1 for PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points
Figure 2 for PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points
Figure 3 for PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points
Figure 4 for PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points
Viaarxiv icon

Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense

Add code
Sep 04, 2019
Figure 1 for Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense
Figure 2 for Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense
Figure 3 for Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense
Figure 4 for Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense
Viaarxiv icon

HUGE2: a Highly Untangled Generative-model Engine for Edge-computing

Add code
Jul 25, 2019
Figure 1 for HUGE2: a Highly Untangled Generative-model Engine for Edge-computing
Figure 2 for HUGE2: a Highly Untangled Generative-model Engine for Edge-computing
Figure 3 for HUGE2: a Highly Untangled Generative-model Engine for Edge-computing
Figure 4 for HUGE2: a Highly Untangled Generative-model Engine for Edge-computing
Viaarxiv icon

Scene-centric Joint Parsing of Cross-view Videos

Add code
Feb 05, 2018
Figure 1 for Scene-centric Joint Parsing of Cross-view Videos
Figure 2 for Scene-centric Joint Parsing of Cross-view Videos
Figure 3 for Scene-centric Joint Parsing of Cross-view Videos
Figure 4 for Scene-centric Joint Parsing of Cross-view Videos
Viaarxiv icon