Picture for Xiaoyu Shen

Xiaoyu Shen

Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models

Add code
Oct 31, 2024
Figure 1 for Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models
Figure 2 for Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models
Figure 3 for Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models
Figure 4 for Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models
Viaarxiv icon

Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye

Add code
Oct 29, 2024
Figure 1 for Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye
Figure 2 for Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye
Figure 3 for Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye
Figure 4 for Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye
Viaarxiv icon

Corrected Soft Actor Critic for Continuous Control

Add code
Oct 22, 2024
Figure 1 for Corrected Soft Actor Critic for Continuous Control
Figure 2 for Corrected Soft Actor Critic for Continuous Control
Figure 3 for Corrected Soft Actor Critic for Continuous Control
Figure 4 for Corrected Soft Actor Critic for Continuous Control
Viaarxiv icon

Large Language Models Empowered Personalized Web Agents

Add code
Oct 22, 2024
Figure 1 for Large Language Models Empowered Personalized Web Agents
Figure 2 for Large Language Models Empowered Personalized Web Agents
Figure 3 for Large Language Models Empowered Personalized Web Agents
Figure 4 for Large Language Models Empowered Personalized Web Agents
Viaarxiv icon

The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models

Add code
Oct 09, 2024
Viaarxiv icon

To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimodal Large Language Models

Add code
Oct 09, 2024
Viaarxiv icon

Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning

Add code
Oct 07, 2024
Figure 1 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Figure 2 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Figure 3 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Figure 4 for Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Viaarxiv icon

MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration

Add code
Oct 06, 2024
Figure 1 for MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration
Figure 2 for MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration
Figure 3 for MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration
Figure 4 for MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration
Viaarxiv icon

Achieving Stable High-Speed Locomotion for Humanoid Robots with Deep Reinforcement Learning

Add code
Sep 25, 2024
Viaarxiv icon

From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks

Add code
Sep 06, 2024
Viaarxiv icon