Ryo Kamoi

Bridging the Know-Act Gap via Task-Level Autoregressive Reasoning

Mar 23, 2026

Evaluating LLM-Simulated Conversations in Modeling Inconsistent and Uncollaborative Behaviors in Human Social Interaction

Mar 17, 2026

One Model, All Roles: Multi-Turn, Multi-Agent Self-Play Reinforcement Learning for Conversational Social Intelligence

Feb 03, 2026

Training Step-Level Reasoning Verifiers with Formal Verification Tools

May 21, 2025

HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?

Apr 29, 2025

GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

Dec 12, 2024

VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information

Dec 01, 2024

AAAR-1.0: Assessing AI's Potential to Assist Research

Oct 29, 2024

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs

Jun 03, 2024

Evaluating LLMs at Detecting Errors in LLM Responses

Apr 04, 2024