Picture for Xuanjing Huang

Xuanjing Huang

ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use

Add code
Jan 07, 2025
Viaarxiv icon

TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use

Add code
Dec 20, 2024
Viaarxiv icon

Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective

Add code
Dec 18, 2024
Viaarxiv icon

EvoWiki: Evaluating LLMs on Evolving Knowledge

Add code
Dec 18, 2024
Viaarxiv icon

Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference

Add code
Dec 17, 2024
Figure 1 for Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference
Figure 2 for Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference
Figure 3 for Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference
Figure 4 for Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference
Viaarxiv icon

COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism

Add code
Dec 17, 2024
Figure 1 for COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism
Figure 2 for COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism
Figure 3 for COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism
Figure 4 for COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting Mechanism
Viaarxiv icon

From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents

Add code
Dec 04, 2024
Figure 1 for From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents
Figure 2 for From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents
Figure 3 for From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents
Figure 4 for From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents
Viaarxiv icon

VLSBench: Unveiling Visual Leakage in Multimodal Safety

Add code
Nov 29, 2024
Viaarxiv icon

ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric Videos

Add code
Nov 28, 2024
Viaarxiv icon

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Add code
Nov 25, 2024
Figure 1 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 2 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 3 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 4 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Viaarxiv icon