Xifeng Yan

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Apr 02, 2025
Via arXiv

Adaptive Layer-skipping in Pre-trained LLMs

Mar 31, 2025
Via arXiv

AgentOrca: A Dual-System Framework to Evaluate Language Agents on Operational Routine and Constraint Adherence

Mar 11, 2025
Via arXiv

Creative and Context-Aware Translation of East Asian Idioms with GPT-4

Oct 01, 2024
Via arXiv

Can Editing LLMs Inject Harm?

Jul 29, 2024
Via arXiv

MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

Jul 06, 2024
Via arXiv

Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

Mar 05, 2024
Via arXiv

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

Feb 16, 2024
Via arXiv

GPT-4V as a Generalist Evaluator for Vision-Language Tasks

Nov 02, 2023
Via arXiv

Do you really follow me? Adversarial Instructions for Evaluating the Robustness of Large Language Models

Aug 17, 2023
Via arXiv