Title: Randomization Techniques to Mitigate the Risk of Copyright Infringement

Aug 21, 2024

Wei-Ning Chen, Peter Kairouz, Sewoong Oh, Zheng Xu

Abstract: In this paper, we investigate potential randomization approaches that can complement current practices for copyright protection: input-based methods (such as licensing data and prompt filtering) and output-based methods (such as recitation checkers, license checkers, and model-based similarity scores). This is motivated by the inherent ambiguity of the rules that determine substantial similarity in copyright precedents. Given that there is no agreed-upon quantifiable measure of substantial similarity, complementary approaches can potentially further decrease liability. Similar randomized approaches, such as differential privacy, have been successful in mitigating privacy risks. This document focuses on the technical and research perspective on mitigating copyright violation and hence is not confidential. After investigating potential solutions and running numerical experiments, we concluded that using the notion of Near Access-Freeness (NAF) to measure the degree of substantial similarity is challenging, and that the standard approach of training a Differentially Private (DP) model incurs a significant cost when used to ensure NAF. Alternative approaches, such as retrieval models, might provide a more controllable scheme for mitigating substantial similarity.
