Picture for Hanmeng Liu

Hanmeng Liu

ReEfBench: Quantifying the Reasoning Efficiency of LLMs

Add code
Jan 07, 2026
Viaarxiv icon

GeoSketch: A Neural-Symbolic Approach to Geometric Multimodal Reasoning with Auxiliary Line Construction and Affine Transformation

Add code
Sep 26, 2025
Viaarxiv icon

Evaluating the Logical Reasoning Abilities of Large Reasoning Models

Add code
May 17, 2025
Figure 1 for Evaluating the Logical Reasoning Abilities of Large Reasoning Models
Figure 2 for Evaluating the Logical Reasoning Abilities of Large Reasoning Models
Figure 3 for Evaluating the Logical Reasoning Abilities of Large Reasoning Models
Figure 4 for Evaluating the Logical Reasoning Abilities of Large Reasoning Models
Viaarxiv icon

Logical Reasoning in Large Language Models: A Survey

Add code
Feb 13, 2025
Figure 1 for Logical Reasoning in Large Language Models: A Survey
Figure 2 for Logical Reasoning in Large Language Models: A Survey
Figure 3 for Logical Reasoning in Large Language Models: A Survey
Viaarxiv icon

Break the Chain: Large Language Models Can be Shortcut Reasoners

Add code
Jun 04, 2024
Figure 1 for Break the Chain: Large Language Models Can be Shortcut Reasoners
Figure 2 for Break the Chain: Large Language Models Can be Shortcut Reasoners
Figure 3 for Break the Chain: Large Language Models Can be Shortcut Reasoners
Figure 4 for Break the Chain: Large Language Models Can be Shortcut Reasoners
Viaarxiv icon

Logic Agent: Enhancing Validity with Logic Rule Invocation

Add code
Apr 28, 2024
Viaarxiv icon

LogiCoT: Logical Chain-of-Thought Instruction-Tuning Data Collection with GPT-4

Add code
May 20, 2023
Viaarxiv icon

Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4

Add code
Apr 20, 2023
Figure 1 for Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4
Figure 2 for Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4
Figure 3 for Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4
Figure 4 for Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4
Viaarxiv icon

GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective

Add code
Nov 15, 2022
Figure 1 for GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Figure 2 for GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Figure 3 for GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Figure 4 for GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Viaarxiv icon

Solving Aspect Category Sentiment Analysis as a Text Generation Task

Add code
Oct 14, 2021
Figure 1 for Solving Aspect Category Sentiment Analysis as a Text Generation Task
Figure 2 for Solving Aspect Category Sentiment Analysis as a Text Generation Task
Figure 3 for Solving Aspect Category Sentiment Analysis as a Text Generation Task
Figure 4 for Solving Aspect Category Sentiment Analysis as a Text Generation Task
Viaarxiv icon