Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yangguang Shao

UTF:Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification

Oct 16, 2024

Jiacheng Cai, Jiahao Yu, Yangguang Shao, Yuhang Wu, Xinyu Xing

Figure 1 for UTF:Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification

Figure 2 for UTF:Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification

Figure 3 for UTF:Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification

Figure 4 for UTF:Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification

Abstract:Fingerprinting large language models (LLMs) is essential for verifying model ownership, ensuring authenticity, and preventing misuse. Traditional fingerprinting methods often require significant computational overhead or white-box verification access. In this paper, we introduce UTF, a novel and efficient approach to fingerprinting LLMs by leveraging under-trained tokens. Under-trained tokens are tokens that the model has not fully learned during its training phase. By utilizing these tokens, we perform supervised fine-tuning to embed specific input-output pairs into the model. This process allows the LLM to produce predetermined outputs when presented with certain inputs, effectively embedding a unique fingerprint. Our method has minimal overhead and impact on model's performance, and does not require white-box access to target model's ownership identification. Compared to existing fingerprinting methods, UTF is also more effective and robust to fine-tuning and random guess.

Via

Access Paper or Ask Questions