Picture for Lingfeng Shen

Lingfeng Shen

It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF

Add code
Jun 12, 2024
Viaarxiv icon

DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation

Add code
May 22, 2024
Viaarxiv icon

AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies

Add code
Feb 19, 2024
Viaarxiv icon

Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

Add code
Feb 02, 2024
Viaarxiv icon

The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts

Add code
Jan 23, 2024
Viaarxiv icon

Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles

Add code
Nov 04, 2023
Viaarxiv icon

Do pretrained Transformers Really Learn In-context by Gradient Descent?

Add code
Oct 12, 2023
Viaarxiv icon

SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation

Add code
Oct 06, 2023
Viaarxiv icon

The Trickle-down Impact of Reward consistency on RLHF

Add code
Sep 28, 2023
Viaarxiv icon

Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model

Add code
Jun 04, 2023
Viaarxiv icon