Picture for Yangqiu Song

Yangqiu Song

Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol

Add code
Apr 14, 2025
Viaarxiv icon

The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning

Add code
Apr 07, 2025
Viaarxiv icon

PrivaCI-Bench: Evaluating Privacy with Contextual Integrity and Legal Compliance

Add code
Feb 24, 2025
Viaarxiv icon

Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations

Add code
Feb 22, 2025
Viaarxiv icon

Top Ten Challenges Towards Agentic Neural Graph Databases

Add code
Jan 24, 2025
Viaarxiv icon

Enhancing Transformers for Generalizable First-Order Logical Entailment

Add code
Jan 01, 2025
Figure 1 for Enhancing Transformers for Generalizable First-Order Logical Entailment
Figure 2 for Enhancing Transformers for Generalizable First-Order Logical Entailment
Figure 3 for Enhancing Transformers for Generalizable First-Order Logical Entailment
Figure 4 for Enhancing Transformers for Generalizable First-Order Logical Entailment
Viaarxiv icon

ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty

Add code
Dec 28, 2024
Viaarxiv icon

ConceptEdit: Conceptualization-Augmented Knowledge Editing in Large Language Models for Commonsense Reasoning

Add code
Dec 16, 2024
Figure 1 for ConceptEdit: Conceptualization-Augmented Knowledge Editing in Large Language Models for Commonsense Reasoning
Figure 2 for ConceptEdit: Conceptualization-Augmented Knowledge Editing in Large Language Models for Commonsense Reasoning
Figure 3 for ConceptEdit: Conceptualization-Augmented Knowledge Editing in Large Language Models for Commonsense Reasoning
Figure 4 for ConceptEdit: Conceptualization-Augmented Knowledge Editing in Large Language Models for Commonsense Reasoning
Viaarxiv icon

Intention Knowledge Graph Construction for User Intention Relation Modeling

Add code
Dec 16, 2024
Viaarxiv icon

Assessing the Robustness of Retrieval-Augmented Generation Systems in K-12 Educational Question Answering with Knowledge Discrepancies

Add code
Dec 12, 2024
Figure 1 for Assessing the Robustness of Retrieval-Augmented Generation Systems in K-12 Educational Question Answering with Knowledge Discrepancies
Figure 2 for Assessing the Robustness of Retrieval-Augmented Generation Systems in K-12 Educational Question Answering with Knowledge Discrepancies
Figure 3 for Assessing the Robustness of Retrieval-Augmented Generation Systems in K-12 Educational Question Answering with Knowledge Discrepancies
Figure 4 for Assessing the Robustness of Retrieval-Augmented Generation Systems in K-12 Educational Question Answering with Knowledge Discrepancies
Viaarxiv icon