Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lorenzo Cassano

Scientific QA System with Verifiable Answers

Jul 16, 2024

Adela Ljajić, Miloš Košprdić, Bojana Bašaragin, Darija Medvecki, Lorenzo Cassano, Nikola Milošević

Figure 1 for Scientific QA System with Verifiable Answers

Figure 2 for Scientific QA System with Verifiable Answers

Figure 3 for Scientific QA System with Verifiable Answers

Abstract:In this paper, we introduce the VerifAI project, a pioneering open-source scientific question-answering system, designed to provide answers that are not only referenced but also automatically vetted and verifiable. The components of the system are (1) an Information Retrieval system combining semantic and lexical search techniques over scientific papers (PubMed), (2) a Retrieval-Augmented Generation (RAG) module using fine-tuned generative model (Mistral 7B) and retrieved articles to generate claims with references to the articles from which it was derived, and (3) a Verification engine, based on a fine-tuned DeBERTa and XLM-RoBERTa models on Natural Language Inference task using SciFACT dataset. The verification engine cross-checks the generated claim and the article from which the claim was derived, verifying whether there may have been any hallucinations in generating the claim. By leveraging the Information Retrieval and RAG modules, Verif.ai excels in generating factual information from a vast array of scientific sources. At the same time, the Verification engine rigorously double-checks this output, ensuring its accuracy and reliability. This dual-stage process plays a crucial role in acquiring and confirming factual information, significantly enhancing the information landscape. Our methodology could significantly enhance scientists' productivity, concurrently fostering trust in applying generative language models within scientific domains, where hallucinations and misinformation are unacceptable.

* Accepted at the 6th International Open Search Symposium 2024. arXiv admin note: substantial text overlap with arXiv:2402.18589

Via

Access Paper or Ask Questions

Trust and Resilience in Federated Learning Through Smart Contracts Enabled Decentralized Systems

Jul 09, 2024

Lorenzo Cassano, Jacopo D'Abramo, Siraj Munir, Stefano Ferretti

Figure 1 for Trust and Resilience in Federated Learning Through Smart Contracts Enabled Decentralized Systems

Figure 2 for Trust and Resilience in Federated Learning Through Smart Contracts Enabled Decentralized Systems

Figure 3 for Trust and Resilience in Federated Learning Through Smart Contracts Enabled Decentralized Systems

Figure 4 for Trust and Resilience in Federated Learning Through Smart Contracts Enabled Decentralized Systems

Abstract:In this paper, we present a study of a Federated Learning (FL) system, based on the use of decentralized architectures to ensure trust and increase reliability. The system is based on the idea that the FL collaborators upload the (ciphered) model parameters on the Inter-Planetary File System (IPFS) and interact with a dedicated smart contract to track their behavior. Thank to this smart contract, the phases of parameter updates are managed efficiently, thereby strengthening data security. We have carried out an experimental study that exploits two different methods of weight aggregation, i.e., a classic averaging scheme and a federated proximal aggregation. The results confirm the feasibility of the proposal.

* Proceedings of Blockchain-2024
* TRUSTCHAIN workshop

Via

Access Paper or Ask Questions

How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

Jul 06, 2024

Bojana Bašaragin, Adela Ljajić, Darija Medvecki, Lorenzo Cassano, Miloš Košprdić, Nikola Milošević

Figure 1 for How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

Figure 2 for How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

Figure 3 for How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

Figure 4 for How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

Abstract:Large language models (LLMs) have recently become the leading source of answers for users' questions online. Despite their ability to offer eloquent answers, their accuracy and reliability can pose a significant challenge. This is especially true for sensitive domains such as biomedicine, where there is a higher need for factually correct answers. This paper introduces a biomedical retrieval-augmented generation (RAG) system designed to enhance the reliability of generated responses. The system is based on a fine-tuned LLM for the referenced question-answering, where retrieved relevant abstracts from PubMed are passed to LLM's context as input through a prompt. Its output is an answer based on PubMed abstracts, where each statement is referenced accordingly, allowing the users to verify the answer. Our retrieval system achieves an absolute improvement of 23% compared to the PubMed search engine. Based on the manual evaluation on a small sample, our fine-tuned LLM component achieves comparable results to GPT-4 Turbo in referencing relevant abstracts. We make the dataset used to fine-tune the models and the fine-tuned models based on Mistral-7B-instruct-v0.1 and v0.2 publicly available.

* Accepted at BioNLP Workshop 2024, colocated with ACL 2024

Via

Access Paper or Ask Questions