Yue Yang

Institute for Transport Studies, University of Leeds, Leeds LS2 9JT, UK

ARCADE: Scalable Demonstration Collection and Generation via Augmented Reality for Imitation Learning

Oct 21, 2024

Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping

Oct 11, 2024

MiRAGeNews: Multimodal Realistic AI-Generated News Detection

Oct 11, 2024

DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects

Oct 03, 2024

StraightTrack: Towards Mixed Reality Navigation System for Percutaneous K-wire Insertion

Oct 02, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Sep 25, 2024

Deep reinforcement learning for tracking a moving target in jellyfish-like swimming

Sep 13, 2024

RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry

Sep 05, 2024

Optimizing Automated Picking Systems in Warehouse Robots Using Machine Learning

Aug 29, 2024

Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research

Jul 22, 2024