Picture for Weijian Lin

Weijian Lin

LongReason: A Synthetic Long-Context Reasoning Benchmark via Context Expansion

Add code
Jan 25, 2025
Figure 1 for LongReason: A Synthetic Long-Context Reasoning Benchmark via Context Expansion
Figure 2 for LongReason: A Synthetic Long-Context Reasoning Benchmark via Context Expansion
Figure 3 for LongReason: A Synthetic Long-Context Reasoning Benchmark via Context Expansion
Figure 4 for LongReason: A Synthetic Long-Context Reasoning Benchmark via Context Expansion
Viaarxiv icon

ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use

Add code
Jan 07, 2025
Viaarxiv icon

RubyStar: A Non-Task-Oriented Mixture Model Dialog System

Add code
Dec 16, 2017
Figure 1 for RubyStar: A Non-Task-Oriented Mixture Model Dialog System
Figure 2 for RubyStar: A Non-Task-Oriented Mixture Model Dialog System
Figure 3 for RubyStar: A Non-Task-Oriented Mixture Model Dialog System
Figure 4 for RubyStar: A Non-Task-Oriented Mixture Model Dialog System
Viaarxiv icon