Wenlin Yao

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Oct 25, 2024
Via arXiv

SePPO: Semi-Policy Preference Optimization for Diffusion Alignment

Oct 07, 2024
Via arXiv

DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search

Oct 04, 2024
Via arXiv

DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning

Oct 02, 2024
Via arXiv

IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation

Sep 27, 2024
Via arXiv

HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows

Sep 25, 2024
Via arXiv

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Sep 12, 2024
Via arXiv

When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives

Jun 17, 2024
Via arXiv

MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions

May 29, 2024
Via arXiv

Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era

Mar 13, 2024
Via arXiv