Picture for Yiming Yang

Yiming Yang

Kuaishou Technology

Adaptive$^2$: Adaptive Domain Mining for Fine-grained Domain Adaptation Modeling

Add code
Dec 11, 2024
Viaarxiv icon

Dynamic Modality-Camera Invariant Clustering for Unsupervised Visible-Infrared Person Re-identification

Add code
Dec 11, 2024
Viaarxiv icon

LDACP: Long-Delayed Ad Conversions Prediction Model for Bidding Strategy

Add code
Nov 25, 2024
Figure 1 for LDACP: Long-Delayed Ad Conversions Prediction Model for Bidding Strategy
Figure 2 for LDACP: Long-Delayed Ad Conversions Prediction Model for Bidding Strategy
Figure 3 for LDACP: Long-Delayed Ad Conversions Prediction Model for Bidding Strategy
Figure 4 for LDACP: Long-Delayed Ad Conversions Prediction Model for Bidding Strategy
Viaarxiv icon

Improve Vision Language Model Chain-of-thought Reasoning

Add code
Oct 21, 2024
Figure 1 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 2 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 3 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 4 for Improve Vision Language Model Chain-of-thought Reasoning
Viaarxiv icon

Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo

Add code
Oct 02, 2024
Figure 1 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Figure 2 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Figure 3 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Figure 4 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Viaarxiv icon

An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models

Add code
Aug 01, 2024
Figure 1 for An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models
Figure 2 for An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models
Figure 3 for An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models
Figure 4 for An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models
Viaarxiv icon

Lean-STaR: Learning to Interleave Thinking and Proving

Add code
Jul 14, 2024
Viaarxiv icon

Few-shot Personalization of LLMs with Mis-aligned Responses

Add code
Jun 26, 2024
Viaarxiv icon

Learning to Correct for QA Reasoning with Black-box LLMs

Add code
Jun 26, 2024
Viaarxiv icon

Self-Play Preference Optimization for Language Model Alignment

Add code
May 01, 2024
Figure 1 for Self-Play Preference Optimization for Language Model Alignment
Figure 2 for Self-Play Preference Optimization for Language Model Alignment
Figure 3 for Self-Play Preference Optimization for Language Model Alignment
Figure 4 for Self-Play Preference Optimization for Language Model Alignment
Viaarxiv icon