Longxu Dou

FlowReasoner: Reinforcing Query-Level Meta-Agents

Apr 21, 2025

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Apr 17, 2025

Efficient Process Reward Model Training via Active Learning

Apr 14, 2025

Reasoning Does Not Necessarily Improve Role-Playing Ability

Feb 24, 2025

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Feb 18, 2025

Can Large Language Models Understand You Better? An MBTI Personality Detection Dataset Aligned with Population Traits

Dec 17, 2024

SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types

Dec 16, 2024

A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios

Dec 05, 2024

SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages

Dec 02, 2024

In-Context Transfer Learning: Demonstration Synthesis by Transferring Similar Tasks

Oct 02, 2024