Picture for Nicolas Chapados

Nicolas Chapados

UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

Add code
Mar 19, 2025
Viaarxiv icon

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Add code
Feb 03, 2025
Figure 1 for AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding
Figure 2 for AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding
Figure 3 for AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding
Figure 4 for AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding
Viaarxiv icon

The BrowserGym Ecosystem for Web Agent Research

Add code
Dec 10, 2024
Figure 1 for The BrowserGym Ecosystem for Web Agent Research
Figure 2 for The BrowserGym Ecosystem for Web Agent Research
Figure 3 for The BrowserGym Ecosystem for Web Agent Research
Figure 4 for The BrowserGym Ecosystem for Web Agent Research
Viaarxiv icon

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Add code
Dec 05, 2024
Figure 1 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 2 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 3 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 4 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Viaarxiv icon

Context is Key: A Benchmark for Forecasting with Essential Textual Information

Add code
Oct 24, 2024
Figure 1 for Context is Key: A Benchmark for Forecasting with Essential Textual Information
Figure 2 for Context is Key: A Benchmark for Forecasting with Essential Textual Information
Figure 3 for Context is Key: A Benchmark for Forecasting with Essential Textual Information
Figure 4 for Context is Key: A Benchmark for Forecasting with Essential Textual Information
Viaarxiv icon

InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation

Add code
Jul 08, 2024
Figure 1 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 2 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 3 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 4 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Viaarxiv icon

WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks

Add code
Jul 07, 2024
Figure 1 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Figure 2 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Figure 3 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Figure 4 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Viaarxiv icon

RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content

Add code
Jun 17, 2024
Viaarxiv icon

XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference

Add code
Apr 23, 2024
Figure 1 for XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference
Figure 2 for XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference
Figure 3 for XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference
Figure 4 for XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference
Viaarxiv icon

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Add code
Apr 09, 2024
Viaarxiv icon