Picture for Huaxiu Yao

Huaxiu Yao

S$^{2}$FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity

Add code
Dec 10, 2024
Viaarxiv icon

MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization

Add code
Dec 09, 2024
Viaarxiv icon

SAUP: Situation Awareness Uncertainty Propagation on LLM Agent

Add code
Dec 02, 2024
Viaarxiv icon

GRAPE: Generalizing Robot Policy via Preference Alignment

Add code
Nov 28, 2024
Figure 1 for GRAPE: Generalizing Robot Policy via Preference Alignment
Figure 2 for GRAPE: Generalizing Robot Policy via Preference Alignment
Figure 3 for GRAPE: Generalizing Robot Policy via Preference Alignment
Figure 4 for GRAPE: Generalizing Robot Policy via Preference Alignment
Viaarxiv icon

Autoregressive Models in Vision: A Survey

Add code
Nov 08, 2024
Figure 1 for Autoregressive Models in Vision: A Survey
Figure 2 for Autoregressive Models in Vision: A Survey
Figure 3 for Autoregressive Models in Vision: A Survey
Figure 4 for Autoregressive Models in Vision: A Survey
Viaarxiv icon

FactTest: Factuality Testing in Large Language Models with Statistical Guarantees

Add code
Nov 04, 2024
Viaarxiv icon

Unveiling Context-Aware Criteria in Self-Assessing LLMs

Add code
Oct 28, 2024
Figure 1 for Unveiling Context-Aware Criteria in Self-Assessing LLMs
Figure 2 for Unveiling Context-Aware Criteria in Self-Assessing LLMs
Figure 3 for Unveiling Context-Aware Criteria in Self-Assessing LLMs
Figure 4 for Unveiling Context-Aware Criteria in Self-Assessing LLMs
Viaarxiv icon

Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment

Add code
Oct 18, 2024
Figure 1 for Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment
Figure 2 for Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment
Figure 3 for Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment
Figure 4 for Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment
Viaarxiv icon

CREAM: Consistency Regularized Self-Rewarding Language Models

Add code
Oct 17, 2024
Figure 1 for CREAM: Consistency Regularized Self-Rewarding Language Models
Figure 2 for CREAM: Consistency Regularized Self-Rewarding Language Models
Figure 3 for CREAM: Consistency Regularized Self-Rewarding Language Models
Figure 4 for CREAM: Consistency Regularized Self-Rewarding Language Models
Viaarxiv icon

MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Add code
Oct 16, 2024
Viaarxiv icon