Picture for Alexandre Drouin

Alexandre Drouin

The BrowserGym Ecosystem for Web Agent Research

Add code
Dec 10, 2024
Viaarxiv icon

The Landscape of Causal Discovery Data: Grounding Causal Discovery in Real-World Applications

Add code
Dec 02, 2024
Viaarxiv icon

Context is Key: A Benchmark for Forecasting with Essential Textual Information

Add code
Oct 24, 2024
Viaarxiv icon

Sample Compression Hypernetworks: From Generalization Bounds to Meta-Learning

Add code
Oct 17, 2024
Viaarxiv icon

Causal Representation Learning in Temporal Data via Single-Parent Decoding

Add code
Oct 09, 2024
Figure 1 for Causal Representation Learning in Temporal Data via Single-Parent Decoding
Figure 2 for Causal Representation Learning in Temporal Data via Single-Parent Decoding
Figure 3 for Causal Representation Learning in Temporal Data via Single-Parent Decoding
Figure 4 for Causal Representation Learning in Temporal Data via Single-Parent Decoding
Viaarxiv icon

InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation

Add code
Jul 08, 2024
Figure 1 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 2 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 3 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 4 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Viaarxiv icon

WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks

Add code
Jul 07, 2024
Figure 1 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Figure 2 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Figure 3 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Figure 4 for WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks
Viaarxiv icon

Evaluating Interventional Reasoning Capabilities of Large Language Models

Add code
Apr 08, 2024
Viaarxiv icon

WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?

Add code
Mar 12, 2024
Figure 1 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 2 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 3 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 4 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Viaarxiv icon

Capture the Flag: Uncovering Data Insights with Large Language Models

Add code
Dec 21, 2023
Viaarxiv icon