Picture for Alexandre Drouin

Alexandre Drouin

Learning to Defer for Causal Discovery with Imperfect Experts

Add code
Feb 18, 2025
Viaarxiv icon

The BrowserGym Ecosystem for Web Agent Research

Add code
Dec 10, 2024
Figure 1 for The BrowserGym Ecosystem for Web Agent Research
Figure 2 for The BrowserGym Ecosystem for Web Agent Research
Figure 3 for The BrowserGym Ecosystem for Web Agent Research
Figure 4 for The BrowserGym Ecosystem for Web Agent Research
Viaarxiv icon

The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications

Add code
Dec 02, 2024
Figure 1 for The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications
Figure 2 for The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications
Figure 3 for The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications
Figure 4 for The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications
Viaarxiv icon

Context is Key: A Benchmark for Forecasting with Essential Textual Information

Add code
Oct 24, 2024
Figure 1 for Context is Key: A Benchmark for Forecasting with Essential Textual Information
Figure 2 for Context is Key: A Benchmark for Forecasting with Essential Textual Information
Figure 3 for Context is Key: A Benchmark for Forecasting with Essential Textual Information
Figure 4 for Context is Key: A Benchmark for Forecasting with Essential Textual Information
Viaarxiv icon

Sample Compression Hypernetworks: From Generalization Bounds to Meta-Learning

Add code
Oct 17, 2024
Viaarxiv icon

Causal Representation Learning in Temporal Data via Single-Parent Decoding

Add code
Oct 09, 2024
Figure 1 for Causal Representation Learning in Temporal Data via Single-Parent Decoding
Figure 2 for Causal Representation Learning in Temporal Data via Single-Parent Decoding
Figure 3 for Causal Representation Learning in Temporal Data via Single-Parent Decoding
Figure 4 for Causal Representation Learning in Temporal Data via Single-Parent Decoding
Viaarxiv icon

InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation

Add code
Jul 08, 2024
Figure 1 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 2 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 3 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 4 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Viaarxiv icon

WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks

Add code
Jul 07, 2024
Figure 1 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Figure 2 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Figure 3 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Figure 4 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Viaarxiv icon

Evaluating Interventional Reasoning Capabilities of Large Language Models

Add code
Apr 08, 2024
Viaarxiv icon

WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?

Add code
Mar 12, 2024
Figure 1 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 2 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 3 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 4 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Viaarxiv icon