Xuewei Wang

Jack

Self-Generated Critiques Boost Reward Modeling for Language Models

Nov 25, 2024
Via arXiv

Law of the Weakest Link: Cross Capabilities of Large Language Models

Sep 30, 2024
Via arXiv

The Llama 3 Herd of Models

Jul 31, 2024
Via arXiv

Scaling User Modeling: Large-scale Online User Representations for Ads Personalization in Meta

Nov 16, 2023
Via arXiv

Towards the Better Ranking Consistency: A Multi-task Learning Framework for Early Stage Ads Ranking

Jul 12, 2023
Via arXiv

How to Build User Simulators to Train RL-based Dialog Systems

Sep 03, 2019
Via arXiv

Persuasion for Good: Towards a Personalized Persuasive Dialogue System for Social Good

Jun 16, 2019
Via arXiv