Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ruisi Zhang

Robust and Secure Code Watermarking for Large Language Models via ML/Crypto Codesign

Feb 04, 2025

Ruisi Zhang, Neusha Javidnia, Nojan Sheybani, Farinaz Koushanfar

Figure 1 for Robust and Secure Code Watermarking for Large Language Models via ML/Crypto Codesign

Figure 2 for Robust and Secure Code Watermarking for Large Language Models via ML/Crypto Codesign

Figure 3 for Robust and Secure Code Watermarking for Large Language Models via ML/Crypto Codesign

Figure 4 for Robust and Secure Code Watermarking for Large Language Models via ML/Crypto Codesign

Abstract:This paper introduces RoSe, the first-of-its-kind ML/Crypto codesign watermarking framework that regulates LLM-generated code to avoid intellectual property rights violations and inappropriate misuse in software development. High-quality watermarks adhering to the detectability-fidelity-robustness tri-objective are limited due to codes' low-entropy nature. Watermark verification, however, often needs to reveal the signature and requires re-encoding new ones for code reuse, which potentially compromising the system's usability. To overcome these challenges, RoSe obtains high-quality watermarks by training the watermark insertion and extraction modules end-to-end to ensure (i) unaltered watermarked code functionality and (ii) enhanced detectability and robustness leveraging pre-trained CodeT5 as the insertion backbone to enlarge the code syntactic and variable rename transformation search space. In the deployment, RoSe uses zero-knowledge proofs for secure verification without revealing the underlying signatures. Extensive evaluations demonstrated RoSe achieves high detection accuracy while preserving the code functionality. RoSe is also robust against attacks and provides efficient secure watermark verification.

Via

Access Paper or Ask Questions

SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile

Nov 01, 2024

Ruisi Zhang, Tianyu Liu, Will Feng, Andrew Gu, Sanket Purandare, Wanchao Liang, Francisco Massa

Figure 1 for SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile

Figure 2 for SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile

Figure 3 for SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile

Figure 4 for SimpleFSDP: Simpler Fully Sharded Data Parallel with torch.compile

Abstract:Distributed training of large models consumes enormous computation resources and requires substantial engineering efforts to compose various training techniques. This paper presents SimpleFSDP, a PyTorch-native compiler-based Fully Sharded Data Parallel (FSDP) framework, which has a simple implementation for maintenance and composability, allows full computation-communication graph tracing, and brings performance enhancement via compiler backend optimizations. SimpleFSDP's novelty lies in its unique torch.compile-friendly implementation of collective communications using existing PyTorch primitives, namely parametrizations, selective activation checkpointing, and DTensor. It also features the first-of-its-kind intermediate representation (IR) nodes bucketing and reordering in the TorchInductor backend for effective computation-communication overlapping. As a result, users can employ the aforementioned optimizations to automatically or manually wrap model components for minimal communication exposure. Extensive evaluations of SimpleFSDP on Llama 3 models (including the ultra-large 405B) using TorchTitan demonstrate up to 28.54% memory reduction and 68.67% throughput improvement compared to the most widely adopted FSDP2 eager framework, when composed with other distributed training techniques.

Via

Access Paper or Ask Questions

Watermarking Large Language Models and the Generated Content: Opportunities and Challenges

Oct 24, 2024

Ruisi Zhang, Farinaz Koushanfar

Figure 1 for Watermarking Large Language Models and the Generated Content: Opportunities and Challenges

Figure 2 for Watermarking Large Language Models and the Generated Content: Opportunities and Challenges

Figure 3 for Watermarking Large Language Models and the Generated Content: Opportunities and Challenges

Figure 4 for Watermarking Large Language Models and the Generated Content: Opportunities and Challenges

Abstract:The widely adopted and powerful generative large language models (LLMs) have raised concerns about intellectual property rights violations and the spread of machine-generated misinformation. Watermarking serves as a promising approch to establish ownership, prevent unauthorized use, and trace the origins of LLM-generated content. This paper summarizes and shares the challenges and opportunities we found when watermarking LLMs. We begin by introducing techniques for watermarking LLMs themselves under different threat models and scenarios. Next, we investigate watermarking methods designed for the content generated by LLMs, assessing their effectiveness and resilience against various attacks. We also highlight the importance of watermarking domain-specific models and data, such as those used in code generation, chip design, and medical applications. Furthermore, we explore methods like hardware acceleration to improve the efficiency of the watermarking process. Finally, we discuss the limitations of current approaches and outline future research directions for the responsible use and protection of these generative AI tools.

* invited paper to Asilomar Conference on Signals, Systems, and Computers

Via

Access Paper or Ask Questions

Transformer-Based Local Feature Matching for Multimodal Image Registration

Apr 25, 2024

Remi Delaunay, Ruisi Zhang, Filipe C. Pedrosa, Navid Feizi, Dianne Sacco, Rajni Patel, Jayender Jagadeesan

Abstract:Ultrasound imaging is a cost-effective and radiation-free modality for visualizing anatomical structures in real-time, making it ideal for guiding surgical interventions. However, its limited field-of-view, speckle noise, and imaging artifacts make it difficult to interpret the images for inexperienced users. In this paper, we propose a new 2D ultrasound to 3D CT registration method to improve surgical guidance during ultrasound-guided interventions. Our approach adopts a dense feature matching method called LoFTR to our multimodal registration problem. We learn to predict dense coarse-to-fine correspondences using a Transformer-based architecture to estimate a robust rigid transformation between a 2D ultrasound frame and a CT scan. Additionally, a fully differentiable pose estimation method is introduced, optimizing LoFTR on pose estimation error during training. Experiments conducted on a multimodal dataset of ex vivo porcine kidneys demonstrate the method's promising results for intraoperative, trackerless ultrasound pose estimation. By mapping 2D ultrasound frames into the 3D CT volume space, the method provides intraoperative guidance, potentially improving surgical workflows and image interpretation.

* Accepted to SPIE Medical Imaging 2024

Via

Access Paper or Ask Questions

Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models

Mar 07, 2024

Mingjia Huo, Sai Ashish Somayajula, Youwei Liang, Ruisi Zhang, Farinaz Koushanfar, Pengtao Xie

Figure 1 for Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models

Figure 2 for Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models

Figure 3 for Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models

Figure 4 for Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models

Abstract:Large language models generate high-quality responses with potential misinformation, underscoring the need for regulation by distinguishing AI-generated and human-written texts. Watermarking is pivotal in this context, which involves embedding hidden markers in texts during the LLM inference phase, which is imperceptible to humans. Current watermarking algorithms, however, face the challenge of achieving both the detectability of inserted watermarks and the semantic integrity of generated texts, where enhancing one aspect often undermines the other. To overcome this, we introduce a novel multi-objective optimization (MOO) approach for watermarking that utilizes lightweight networks to generate token-specific watermarking logits and splitting ratios. By leveraging MOO to optimize for both detection and semantic objective functions, our method simultaneously achieves detectability and semantic integrity. Experimental results show that our method outperforms current watermarking techniques in enhancing the detectability of texts generated by LLMs while maintaining their semantic coherence. Our code is available at https://github.com/mignonjia/TS_watermark.

* 16 pages, 9 figures, 2 tables

Via

Access Paper or Ask Questions

EmMark: Robust Watermarks for IP Protection of Embedded Quantized Large Language Models

Feb 27, 2024

Ruisi Zhang, Farinaz Koushanfar

Abstract:This paper introduces EmMark,a novel watermarking framework for protecting the intellectual property (IP) of embedded large language models deployed on resource-constrained edge devices. To address the IP theft risks posed by malicious end-users, EmMark enables proprietors to authenticate ownership by querying the watermarked model weights and matching the inserted signatures. EmMark's novelty lies in its strategic watermark weight parameters selection, nsuring robustness and maintaining model quality. Extensive proof-of-concept evaluations of models from OPT and LLaMA-2 families demonstrate EmMark's fidelity, achieving 100% success in watermark extraction with model performance preservation. EmMark also showcased its resilience against watermark removal and forging attacks.

* Accept to DAC 2024

Via

Access Paper or Ask Questions

REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models

Oct 18, 2023

Ruisi Zhang, Shehzeen Samarah Hussain, Paarth Neekhara, Farinaz Koushanfar

Figure 1 for REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models

Figure 2 for REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models

Figure 3 for REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models

Figure 4 for REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models

Abstract:We present REMARK-LLM, a novel efficient, and robust watermarking framework designed for texts generated by large language models (LLMs). Synthesizing human-like content using LLMs necessitates vast computational resources and extensive datasets, encapsulating critical intellectual property (IP). However, the generated content is prone to malicious exploitation, including spamming and plagiarism. To address the challenges, REMARK-LLM proposes three new components: (i) a learning-based message encoding module to infuse binary signatures into LLM-generated texts; (ii) a reparameterization module to transform the dense distributions from the message encoding to the sparse distribution of the watermarked textual tokens; (iii) a decoding module dedicated for signature extraction; Furthermore, we introduce an optimized beam search algorithm to guarantee the coherence and consistency of the generated content. REMARK-LLM is rigorously trained to encourage the preservation of semantic integrity in watermarked content, while ensuring effective watermark retrieval. Extensive evaluations on multiple unseen datasets highlight REMARK-LLM proficiency and transferability in inserting 2 times more signature bits into the same texts when compared to prior art, all while maintaining semantic integrity. Furthermore, REMARK-LLM exhibits better resilience against a spectrum of watermark detection and removal attacks.

Via

Access Paper or Ask Questions

XVTP3D: Cross-view Trajectory Prediction Using Shared 3D Queries for Autonomous Driving

Aug 17, 2023

Zijian Song, Huikun Bi, Ruisi Zhang, Tianlu Mao, Zhaoqi Wang

Abstract:Trajectory prediction with uncertainty is a critical and challenging task for autonomous driving. Nowadays, we can easily access sensor data represented in multiple views. However, cross-view consistency has not been evaluated by the existing models, which might lead to divergences between the multimodal predictions from different views. It is not practical and effective when the network does not comprehend the 3D scene, which could cause the downstream module in a dilemma. Instead, we predicts multimodal trajectories while maintaining cross-view consistency. We presented a cross-view trajectory prediction method using shared 3D Queries (XVTP3D). We employ a set of 3D queries shared across views to generate multi-goals that are cross-view consistent. We also proposed a random mask method and coarse-to-fine cross-attention to capture robust cross-view features. As far as we know, this is the first work that introduces the outstanding top-down paradigm in BEV detection field to a trajectory prediction problem. The results of experiments on two publicly available datasets show that XVTP3D achieved state-of-the-art performance with consistent cross-view predictions.

* 11 pages, 6 figures, accepted by IJCAI 23

Via

Access Paper or Ask Questions

SABRE: Robust Bayesian Peer-to-Peer Federated Learning

Aug 04, 2023

Nasimeh Heydaribeni, Ruisi Zhang, Tara Javidi, Cristina Nita-Rotaru, Farinaz Koushanfar

Figure 1 for SABRE: Robust Bayesian Peer-to-Peer Federated Learning

Figure 2 for SABRE: Robust Bayesian Peer-to-Peer Federated Learning

Figure 3 for SABRE: Robust Bayesian Peer-to-Peer Federated Learning

Figure 4 for SABRE: Robust Bayesian Peer-to-Peer Federated Learning

Abstract:We introduce SABRE, a novel framework for robust variational Bayesian peer-to-peer federated learning. We analyze the robustness of the known variational Bayesian peer-to-peer federated learning framework (BayP2PFL) against poisoning attacks and subsequently show that BayP2PFL is not robust against those attacks. The new SABRE aggregation methodology is then devised to overcome the limitations of the existing frameworks. SABRE works well in non-IID settings, does not require the majority of the benign nodes over the compromised ones, and even outperforms the baseline algorithm in benign settings. We theoretically prove the robustness of our algorithm against data / model poisoning attacks in a decentralized linear regression setting. Proof-of-Concept evaluations on benchmark data from image classification demonstrate the superiority of SABRE over the existing frameworks under various poisoning attacks.

Via

Access Paper or Ask Questions

Text Revealer: Private Text Reconstruction via Model Inversion Attacks against Transformers

Sep 21, 2022

Ruisi Zhang, Seira Hidano, Farinaz Koushanfar

Figure 1 for Text Revealer: Private Text Reconstruction via Model Inversion Attacks against Transformers

Figure 2 for Text Revealer: Private Text Reconstruction via Model Inversion Attacks against Transformers

Figure 3 for Text Revealer: Private Text Reconstruction via Model Inversion Attacks against Transformers

Figure 4 for Text Revealer: Private Text Reconstruction via Model Inversion Attacks against Transformers

Abstract:Text classification has become widely used in various natural language processing applications like sentiment analysis. Current applications often use large transformer-based language models to classify input texts. However, there is a lack of systematic study on how much private information can be inverted when publishing models. In this paper, we formulate \emph{Text Revealer} -- the first model inversion attack for text reconstruction against text classification with transformers. Our attacks faithfully reconstruct private texts included in training data with access to the target model. We leverage an external dataset and GPT-2 to generate the target domain-like fluent text, and then perturb its hidden state optimally with the feedback from the target model. Our extensive experiments demonstrate that our attacks are effective for datasets with different text lengths and can reconstruct private texts with accuracy.

Via

Access Paper or Ask Questions