Hongning Wang

HPSS: Heuristic Prompting Strategy Search for LLM Evaluators

Feb 18, 2025
Via arXiv

Parametric Retrieval Augmented Generation

Jan 27, 2025
Via arXiv

MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science

Jan 18, 2025
Via arXiv

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Dec 19, 2024
Via arXiv

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Dec 16, 2024
Via arXiv

CharacterBench: Benchmarking Character Customization of Large Language Models

Dec 16, 2024
Via arXiv

Does RLHF Scale? Exploring the Impacts From Data, Model, and Method

Dec 08, 2024
Via arXiv

Unveiling User Satisfaction and Creator Productivity Trade-Offs in Recommendation Platforms

Oct 31, 2024
Via arXiv

RecFlow: An Industrial Full Flow Recommendation Dataset

Oct 28, 2024
Via arXiv

Data Selection via Optimal Control for Language Models

Oct 09, 2024
Via arXiv