Alexander Toshev

Apple

Expanding LLM Agent Boundaries with Strategy-Guided Exploration

Mar 02, 2026
Via arXiv

GRACE: A Language Model Framework for Explainable Inverse Reinforcement Learning

Oct 02, 2025
Via arXiv

Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents

Sep 30, 2025
Via arXiv

MobileCLIP2: Improving Multi-Modal Reinforced Training

Aug 28, 2025
Via arXiv

Datasets, Documents, and Repetitions: The Practicalities of Unequal Data Quality

Mar 10, 2025
Via arXiv

From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons

Dec 11, 2024
Via arXiv

DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models

Dec 11, 2024
Via arXiv

World-consistent Video Diffusion with Explicit 3D Modeling

Dec 02, 2024
Via arXiv

On the Modeling Capabilities of Large Language Models for Sequential Decision Making

Oct 08, 2024
Via arXiv

DataComp-LM: In search of the next generation of training sets for language models

Jun 18, 2024
Via arXiv