Picture for Ruotian Ma

Ruotian Ma

S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Add code
Feb 18, 2025
Viaarxiv icon

Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models

Add code
Feb 13, 2025
Viaarxiv icon

Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs

Add code
Oct 20, 2024
Viaarxiv icon

Are Large Language Models Good Prompt Optimizers?

Add code
Feb 03, 2024
Figure 1 for Are Large Language Models Good Prompt Optimizers?
Figure 2 for Are Large Language Models Good Prompt Optimizers?
Figure 3 for Are Large Language Models Good Prompt Optimizers?
Figure 4 for Are Large Language Models Good Prompt Optimizers?
Viaarxiv icon

Making Harmful Behaviors Unlearnable for Large Language Models

Add code
Nov 02, 2023
Viaarxiv icon

Cross-Linguistic Syntactic Difference in Multilingual BERT: How Good is It and How Does It Affect Transfer?

Add code
Dec 21, 2022
Viaarxiv icon

Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER

Add code
Oct 10, 2022
Figure 1 for Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER
Figure 2 for Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER
Figure 3 for Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER
Figure 4 for Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER
Viaarxiv icon

Searching for Optimal Subword Tokenization in Cross-domain NER

Add code
Jun 07, 2022
Figure 1 for Searching for Optimal Subword Tokenization in Cross-domain NER
Figure 2 for Searching for Optimal Subword Tokenization in Cross-domain NER
Figure 3 for Searching for Optimal Subword Tokenization in Cross-domain NER
Figure 4 for Searching for Optimal Subword Tokenization in Cross-domain NER
Viaarxiv icon

Rebuild and Ensemble: Exploring Defense Against Text Adversaries

Add code
Mar 27, 2022
Figure 1 for Rebuild and Ensemble: Exploring Defense Against Text Adversaries
Figure 2 for Rebuild and Ensemble: Exploring Defense Against Text Adversaries
Figure 3 for Rebuild and Ensemble: Exploring Defense Against Text Adversaries
Figure 4 for Rebuild and Ensemble: Exploring Defense Against Text Adversaries
Viaarxiv icon

Plug-Tagger: A Pluggable Sequence Labeling Framework Using Language Models

Add code
Oct 14, 2021
Figure 1 for Plug-Tagger: A Pluggable Sequence Labeling Framework Using Language Models
Figure 2 for Plug-Tagger: A Pluggable Sequence Labeling Framework Using Language Models
Figure 3 for Plug-Tagger: A Pluggable Sequence Labeling Framework Using Language Models
Figure 4 for Plug-Tagger: A Pluggable Sequence Labeling Framework Using Language Models
Viaarxiv icon