Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Valentina Poggioni

Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases

Dec 24, 2024

Christian Di Maio, Cristian Cosci, Marco Maggini, Valentina Poggioni, Stefano Melacci

Figure 1 for Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases

Figure 2 for Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases

Figure 3 for Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases

Figure 4 for Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases

Abstract:The growing ubiquity of Retrieval-Augmented Generation (RAG) systems in several real-world services triggers severe concerns about their security. A RAG system improves the generative capabilities of a Large Language Models (LLM) by a retrieval mechanism which operates on a private knowledge base, whose unintended exposure could lead to severe consequences, including breaches of private and sensitive information. This paper presents a black-box attack to force a RAG system to leak its private knowledge base which, differently from existing approaches, is adaptive and automatic. A relevance-based mechanism and an attacker-side open-source LLM favor the generation of effective queries to leak most of the (hidden) knowledge base. Extensive experimentation proves the quality of the proposed algorithm in different RAG pipelines and domains, comparing to very recent related approaches, which turn out to be either not fully black-box, not adaptive, or not based on open-source models. The findings from our study remark the urgent need for more robust privacy safeguards in the design and deployment of RAG systems.

Via

Access Paper or Ask Questions

Black-box Attacks on Image Activity Prediction and its Natural Language Explanations

Sep 30, 2023

Alina Elena Baia, Valentina Poggioni, Andrea Cavallaro

Abstract:Explainable AI (XAI) methods aim to describe the decision process of deep neural networks. Early XAI methods produced visual explanations, whereas more recent techniques generate multimodal explanations that include textual information and visual representations. Visual XAI methods have been shown to be vulnerable to white-box and gray-box adversarial attacks, with an attacker having full or partial knowledge of and access to the target system. As the vulnerabilities of multimodal XAI models have not been examined, in this paper we assess for the first time the robustness to black-box attacks of the natural language explanations generated by a self-rationalizing image-based activity recognition model. We generate unrestricted, spatially variant perturbations that disrupt the association between the predictions and the corresponding explanations to mislead the model into generating unfaithful explanations. We show that we can create adversarial images that manipulate the explanations of an activity recognition model by having access only to its final output.

* Accepted at ICCV2023 AROW Workshop

Via

Access Paper or Ask Questions

Smart caching in a Data Lake for High Energy Physics analysis

Aug 02, 2022

Tommaso Tedeschi, Diego Ciangottini, Marco Baioletti, Valentina Poggioni, Daniele Spiga, Loriano Storchi, Mirco Tracolli

Figure 1 for Smart caching in a Data Lake for High Energy Physics analysis

Figure 2 for Smart caching in a Data Lake for High Energy Physics analysis

Figure 3 for Smart caching in a Data Lake for High Energy Physics analysis

Figure 4 for Smart caching in a Data Lake for High Energy Physics analysis

Abstract:The continuous growth of data production in almost all scientific areas raises new problems in data access and management, especially in a scenario where the end-users, as well as the resources that they can access, are worldwide distributed. This work is focused on the data caching management in a Data Lake infrastructure in the context of the High Energy Physics field. We are proposing an autonomous method, based on Reinforcement Learning techniques, to improve the user experience and to contain the maintenance costs of the infrastructure.

Via

Access Paper or Ask Questions