Picture for Mehran Kazemi

Mehran Kazemi

Dima

Gemma 3 Technical Report

Add code
Mar 25, 2025
Viaarxiv icon

BIG-Bench Extra Hard

Add code
Feb 26, 2025
Viaarxiv icon

GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models

Add code
Oct 17, 2024
Figure 1 for GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models
Figure 2 for GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models
Figure 3 for GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models
Figure 4 for GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models
Viaarxiv icon

Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling

Add code
Aug 29, 2024
Figure 1 for Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling
Figure 2 for Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling
Figure 3 for Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling
Figure 4 for Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling
Viaarxiv icon

Generative Verifiers: Reward Modeling as Next-Token Prediction

Add code
Aug 27, 2024
Figure 1 for Generative Verifiers: Reward Modeling as Next-Token Prediction
Figure 2 for Generative Verifiers: Reward Modeling as Next-Token Prediction
Figure 3 for Generative Verifiers: Reward Modeling as Next-Token Prediction
Figure 4 for Generative Verifiers: Reward Modeling as Next-Token Prediction
Viaarxiv icon

Gemma 2: Improving Open Language Models at a Practical Size

Add code
Aug 02, 2024
Figure 1 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 2 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 3 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 4 for Gemma 2: Improving Open Language Models at a Practical Size
Viaarxiv icon

SocialQuotes: Learning Contextual Roles of Social Media Quotes on the Web

Add code
Jul 22, 2024
Viaarxiv icon

ReMI: A Dataset for Reasoning with Multiple Images

Add code
Jun 13, 2024
Figure 1 for ReMI: A Dataset for Reasoning with Multiple Images
Figure 2 for ReMI: A Dataset for Reasoning with Multiple Images
Figure 3 for ReMI: A Dataset for Reasoning with Multiple Images
Figure 4 for ReMI: A Dataset for Reasoning with Multiple Images
Viaarxiv icon

Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning

Add code
Jun 13, 2024
Viaarxiv icon

Understanding Transformer Reasoning Capabilities via Graph Algorithms

Add code
May 28, 2024
Viaarxiv icon