Picture for Zhenting Qi

Zhenting Qi

Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge

Add code
Nov 05, 2024
Viaarxiv icon

P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains

Add code
Oct 11, 2024
Figure 1 for P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
Figure 2 for P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
Figure 3 for P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
Figure 4 for P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
Viaarxiv icon

Quantifying Generalization Complexity for Large Language Models

Add code
Oct 02, 2024
Viaarxiv icon

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Add code
Aug 12, 2024
Viaarxiv icon

Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems

Add code
Feb 27, 2024
Viaarxiv icon

PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching

Add code
Dec 09, 2023
Viaarxiv icon

RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations

Add code
Jun 25, 2023
Figure 1 for RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations
Figure 2 for RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations
Figure 3 for RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations
Figure 4 for RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations
Viaarxiv icon

QTSumm: A New Benchmark for Query-Focused Table Summarization

Add code
May 23, 2023
Figure 1 for QTSumm: A New Benchmark for Query-Focused Table Summarization
Figure 2 for QTSumm: A New Benchmark for Query-Focused Table Summarization
Figure 3 for QTSumm: A New Benchmark for Query-Focused Table Summarization
Figure 4 for QTSumm: A New Benchmark for Query-Focused Table Summarization
Viaarxiv icon

LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control

Add code
Feb 06, 2023
Figure 1 for LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control
Figure 2 for LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control
Figure 3 for LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control
Figure 4 for LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control
Viaarxiv icon

Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model

Add code
Sep 23, 2022
Figure 1 for Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model
Figure 2 for Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model
Figure 3 for Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model
Figure 4 for Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model
Viaarxiv icon