Picture for Liangming Pan

Liangming Pan

FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance

Add code
Mar 07, 2025
Figure 1 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 2 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 3 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Figure 4 for FinTMMBench: Benchmarking Temporal-Aware Multi-Modal RAG in Finance
Viaarxiv icon

InductionBench: LLMs Fail in the Simplest Complexity Class

Add code
Feb 26, 2025
Viaarxiv icon

Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework

Add code
Dec 22, 2024
Viaarxiv icon

AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge

Add code
Dec 18, 2024
Figure 1 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 2 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 3 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Figure 4 for AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge
Viaarxiv icon

Combating Multimodal LLM Hallucination via Bottom-up Holistic Reasoning

Add code
Dec 15, 2024
Viaarxiv icon

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Add code
Dec 12, 2024
Figure 1 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Figure 2 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Figure 3 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Figure 4 for RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Viaarxiv icon

Improving Causal Reasoning in Large Language Models: A Survey

Add code
Oct 22, 2024
Figure 1 for Improving Causal Reasoning in Large Language Models: A Survey
Figure 2 for Improving Causal Reasoning in Large Language Models: A Survey
Figure 3 for Improving Causal Reasoning in Large Language Models: A Survey
Figure 4 for Improving Causal Reasoning in Large Language Models: A Survey
Viaarxiv icon

COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement

Add code
Oct 12, 2024
Figure 1 for COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement
Figure 2 for COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement
Figure 3 for COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement
Figure 4 for COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement
Viaarxiv icon

Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models

Add code
Oct 10, 2024
Figure 1 for Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models
Figure 2 for Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models
Figure 3 for Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models
Figure 4 for Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models
Viaarxiv icon

Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement

Add code
Oct 06, 2024
Figure 1 for Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement
Figure 2 for Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement
Figure 3 for Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement
Figure 4 for Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement
Viaarxiv icon