Picture for Danyang Zhang

Danyang Zhang

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Add code
Jul 15, 2024
Figure 1 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 2 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 3 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Figure 4 for Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Viaarxiv icon

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Add code
Apr 11, 2024
Figure 1 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Figure 2 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Figure 3 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Figure 4 for OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Viaarxiv icon

Large Language Model Is Semi-Parametric Reinforcement Learning Agent

Add code
Jun 09, 2023
Viaarxiv icon

Mobile-Env: A Universal Platform for Training and Evaluation of Mobile Interaction

Add code
May 14, 2023
Viaarxiv icon

Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition

Add code
Jul 17, 2022
Figure 1 for Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
Figure 2 for Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
Figure 3 for Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
Figure 4 for Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
Viaarxiv icon

WebSRC: A Dataset for Web-Based Structural Reading Comprehension

Add code
Jan 23, 2021
Figure 1 for WebSRC: A Dataset for Web-Based Structural Reading Comprehension
Figure 2 for WebSRC: A Dataset for Web-Based Structural Reading Comprehension
Figure 3 for WebSRC: A Dataset for Web-Based Structural Reading Comprehension
Figure 4 for WebSRC: A Dataset for Web-Based Structural Reading Comprehension
Viaarxiv icon

Uncertainty-aware Score Distribution Learning for Action Quality Assessment

Add code
Jun 13, 2020
Figure 1 for Uncertainty-aware Score Distribution Learning for Action Quality Assessment
Figure 2 for Uncertainty-aware Score Distribution Learning for Action Quality Assessment
Figure 3 for Uncertainty-aware Score Distribution Learning for Action Quality Assessment
Figure 4 for Uncertainty-aware Score Distribution Learning for Action Quality Assessment
Viaarxiv icon

COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis

Add code
Mar 07, 2019
Figure 1 for COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis
Figure 2 for COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis
Figure 3 for COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis
Figure 4 for COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis
Viaarxiv icon