Picture for Zhaowei Wang

Zhaowei Wang

ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty

Add code
Dec 28, 2024
Viaarxiv icon

What Really is Commonsense Knowledge?

Add code
Nov 06, 2024
Viaarxiv icon

Concept-Reversed Winograd Schema Challenge: Evaluating and Improving Robust Reasoning in Large Language Models via Abstraction

Add code
Oct 15, 2024
Figure 1 for Concept-Reversed Winograd Schema Challenge: Evaluating and Improving Robust Reasoning in Large Language Models via Abstraction
Figure 2 for Concept-Reversed Winograd Schema Challenge: Evaluating and Improving Robust Reasoning in Large Language Models via Abstraction
Figure 3 for Concept-Reversed Winograd Schema Challenge: Evaluating and Improving Robust Reasoning in Large Language Models via Abstraction
Figure 4 for Concept-Reversed Winograd Schema Challenge: Evaluating and Improving Robust Reasoning in Large Language Models via Abstraction
Viaarxiv icon

DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects

Add code
Oct 03, 2024
Figure 1 for DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Figure 2 for DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Figure 3 for DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Figure 4 for DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
Viaarxiv icon

CLLMate: A Multimodal LLM for Weather and Climate Events Forecasting

Add code
Sep 27, 2024
Figure 1 for CLLMate: A Multimodal LLM for Weather and Climate Events Forecasting
Figure 2 for CLLMate: A Multimodal LLM for Weather and Climate Events Forecasting
Figure 3 for CLLMate: A Multimodal LLM for Weather and Climate Events Forecasting
Figure 4 for CLLMate: A Multimodal LLM for Weather and Climate Events Forecasting
Viaarxiv icon

RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry

Add code
Sep 05, 2024
Viaarxiv icon

CodeGraph: Enhancing Graph Reasoning of LLMs with Code

Add code
Aug 25, 2024
Viaarxiv icon

KNOWCOMP POKEMON Team at DialAM-2024: A Two-Stage Pipeline for Detecting Relations in Dialogical Argument Mining

Add code
Jul 29, 2024
Viaarxiv icon

LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

Add code
Jun 25, 2024
Figure 1 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Figure 2 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Figure 3 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Figure 4 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Viaarxiv icon

AbsInstruct: Eliciting Abstraction Ability from LLMs through Explanation Tuning with Plausibility Estimation

Add code
Feb 16, 2024
Viaarxiv icon