Jinyang Gao

Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning

Jun 11, 2025

Incentivizing Strong Reasoning from Weak Supervision

May 28, 2025

Incentivizing Reasoning from Weak Supervision

May 26, 2025

Evaluation Report on MCP Servers

Apr 15, 2025

RePO: ReLU-based Preference Optimization

Mar 10, 2025

ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models

Feb 17, 2025

XiYan-SQL: A Multi-Generator Ensemble Framework for Text-to-SQL

Nov 13, 2024

What is Wrong with Perplexity for Long-context Language Modeling?

Oct 31, 2024

MoMQ: Mixture-of-Experts Enhances Multi-Dialect Query Generation across Relational and Non-Relational Databases

Oct 24, 2024

$α$-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs

Oct 14, 2024