Picture for Qianglong Chen

Qianglong Chen

VCBench: A Controllable Benchmark for Symbolic and Abstract Challenges in Video Cognition

Add code
Nov 14, 2024
Figure 1 for VCBench: A Controllable Benchmark for Symbolic and Abstract Challenges in Video Cognition
Figure 2 for VCBench: A Controllable Benchmark for Symbolic and Abstract Challenges in Video Cognition
Figure 3 for VCBench: A Controllable Benchmark for Symbolic and Abstract Challenges in Video Cognition
Figure 4 for VCBench: A Controllable Benchmark for Symbolic and Abstract Challenges in Video Cognition
Viaarxiv icon

Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search

Add code
Oct 14, 2024
Figure 1 for Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search
Figure 2 for Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search
Figure 3 for Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search
Figure 4 for Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search
Viaarxiv icon

Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles

Add code
Sep 26, 2024
Figure 1 for Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles
Figure 2 for Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles
Figure 3 for Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles
Figure 4 for Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles
Viaarxiv icon

BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering

Add code
Jun 28, 2024
Figure 1 for BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering
Figure 2 for BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering
Figure 3 for BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering
Figure 4 for BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering
Viaarxiv icon

An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation

Add code
Jun 03, 2024
Figure 1 for An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation
Figure 2 for An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation
Figure 3 for An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation
Figure 4 for An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation
Viaarxiv icon

Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation

Add code
May 30, 2024
Figure 1 for Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
Figure 2 for Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
Figure 3 for Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
Figure 4 for Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
Viaarxiv icon

Apollo's Oracle: Retrieval-Augmented Reasoning in Multi-Agent Debates

Add code
Dec 08, 2023
Figure 1 for Apollo's Oracle: Retrieval-Augmented Reasoning in Multi-Agent Debates
Figure 2 for Apollo's Oracle: Retrieval-Augmented Reasoning in Multi-Agent Debates
Figure 3 for Apollo's Oracle: Retrieval-Augmented Reasoning in Multi-Agent Debates
Figure 4 for Apollo's Oracle: Retrieval-Augmented Reasoning in Multi-Agent Debates
Viaarxiv icon

TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models

Add code
Nov 29, 2023
Figure 1 for TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
Figure 2 for TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
Figure 3 for TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
Figure 4 for TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
Viaarxiv icon

Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications

Add code
Nov 10, 2023
Figure 1 for Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications
Figure 2 for Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications
Figure 3 for Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications
Figure 4 for Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications
Viaarxiv icon

A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

Add code
Nov 09, 2023
Viaarxiv icon