Zhengyang Wang

Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models

Oct 28, 2024
Via arXiv

Scaling Laws for Predicting Downstream Performance in LLMs

Oct 11, 2024
Via arXiv

Inductive or Deductive? Rethinking the Fundamental Reasoning Abilities of LLMs

Aug 07, 2024
Via arXiv

Relational Database Augmented Large Language Model

Jul 21, 2024
Via arXiv

CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery

Jun 12, 2024
Via arXiv

Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond

Mar 27, 2024
Via arXiv

Knowledge Editing on Black-box Large Language Models

Feb 17, 2024
Via arXiv

Enhancing User Intent Capture in Session-Based Recommendation with Attribute Patterns

Dec 23, 2023
Via arXiv

Language Models As Semantic Indexers

Oct 11, 2023
Via arXiv

Towards Robust and Generalizable Training: An Empirical Study of Noisy Slot Filling for Input Perturbations

Oct 05, 2023
Via arXiv