Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Maximilian Idahl

OpenReviewer: A Specialized Large Language Model for Generating Critical Scientific Paper Reviews

Dec 16, 2024

Maximilian Idahl, Zahra Ahmadi

Figure 1 for OpenReviewer: A Specialized Large Language Model for Generating Critical Scientific Paper Reviews

Figure 2 for OpenReviewer: A Specialized Large Language Model for Generating Critical Scientific Paper Reviews

Figure 3 for OpenReviewer: A Specialized Large Language Model for Generating Critical Scientific Paper Reviews

Figure 4 for OpenReviewer: A Specialized Large Language Model for Generating Critical Scientific Paper Reviews

Abstract:We present OpenReviewer, an open-source system for generating high-quality peer reviews of machine learning and AI conference papers. At its core is Llama-OpenReviewer-8B, an 8B parameter language model specifically fine-tuned on 79,000 expert reviews from top ML conferences. Given a PDF paper submission and review template as input, OpenReviewer extracts the full text, including technical content like equations and tables, and generates a structured review following conference-specific guidelines. Our evaluation on 400 test papers shows that OpenReviewer produces significantly more critical and realistic reviews compared to general-purpose LLMs like GPT-4 and Claude-3.5. While other LLMs tend toward overly positive assessments, OpenReviewer's recommendations closely match the distribution of human reviewer ratings. The system provides authors with rapid, constructive feedback to improve their manuscripts before submission, though it is not intended to replace human peer review. OpenReviewer is available as an online demo and open-source tool.

* Demo: https://huggingface.co/spaces/maxidl/openreviewer Model: https://huggingface.co/maxidl/Llama-OpenReviewer-8B

Via

Access Paper or Ask Questions

Explainable Information Retrieval: A Survey

Nov 04, 2022

Avishek Anand, Lijun Lyu, Maximilian Idahl, Yumeng Wang, Jonas Wallat, Zijian Zhang

Abstract:Explainable information retrieval is an emerging research area aiming to make transparent and trustworthy information retrieval systems. Given the increasing use of complex machine learning models in search systems, explainability is essential in building and auditing responsible information retrieval models. This survey fills a vital gap in the otherwise topically diverse literature of explainable information retrieval. It categorizes and discusses recent explainability methods developed for different application domains in information retrieval, providing a common framework and unifying perspectives. In addition, it reflects on the common concern of evaluating explanations and highlights open challenges and opportunities.

* 35 pages, 10 figures. Under review

Via

Access Paper or Ask Questions

Towards Benchmarking the Utility of Explanations for Model Debugging

May 10, 2021

Maximilian Idahl, Lijun Lyu, Ujwal Gadiraju, Avishek Anand

Figure 1 for Towards Benchmarking the Utility of Explanations for Model Debugging

Abstract:Post-hoc explanation methods are an important class of approaches that help understand the rationale underlying a trained model's decision. But how useful are they for an end-user towards accomplishing a given task? In this vision paper, we argue the need for a benchmark to facilitate evaluations of the utility of post-hoc explanation methods. As a first step to this end, we enumerate desirable properties that such a benchmark should possess for the task of debugging text classifiers. Additionally, we highlight that such a benchmark facilitates not only assessing the effectiveness of explanations but also their efficiency.

* Short paper, to appear at TrustNLP @ NAACL 2021

Via

Access Paper or Ask Questions

Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency

Mar 23, 2020

Eric Müller-Budack, Jonas Theiner, Sebastian Diering, Maximilian Idahl, Ralph Ewerth

Figure 1 for Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency

Figure 2 for Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency

Figure 3 for Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency

Figure 4 for Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency

Abstract:The World Wide Web has become a popular source for gathering information and news. Multimodal information, e.g., enriching text with photos, is typically used to convey the news more effectively or to attract attention. Photo content can range from decorative, depict additional important information, or can even contain misleading information. Therefore, automatic approaches to quantify cross-modal consistency of entity representation can support human assessors to evaluate the overall multimodal message, for instance, with regard to bias or sentiment. In some cases such measures could give hints to detect fake news, which is an increasingly important topic in today's society. In this paper, we introduce a novel task of cross-modal consistency verification in real-world news and present a multimodal approach to quantify the entity coherence between image and text. Named entity linking is applied to extract persons, locations, and events from news texts. Several measures are suggested to calculate cross-modal similarity for these entities using state of the art approaches. In contrast to previous work, our system automatically gathers example data from the Web and is applicable to real-world news. Results on two novel datasets that cover different languages, topics, and domains demonstrate the feasibility of our approach. Datasets and code are publicly available to foster research towards this new direction.

* Accepted for publication in: International Conference on Multimedia Retrieval (ICMR), Dublin, 2020

Via

Access Paper or Ask Questions

Finding Interpretable Concept Spaces in Node Embeddings using Knowledge Bases

Oct 11, 2019

Maximilian Idahl, Megha Khosla, Avishek Anand

Figure 1 for Finding Interpretable Concept Spaces in Node Embeddings using Knowledge Bases

Figure 2 for Finding Interpretable Concept Spaces in Node Embeddings using Knowledge Bases

Figure 3 for Finding Interpretable Concept Spaces in Node Embeddings using Knowledge Bases

Figure 4 for Finding Interpretable Concept Spaces in Node Embeddings using Knowledge Bases

Abstract:In this paper we propose and study the novel problem of explaining node embeddings by finding embedded human interpretable subspaces in already trained unsupervised node representation embeddings. We use an external knowledge base that is organized as a taxonomy of human-understandable concepts over entities as a guide to identify subspaces in node embeddings learned from an entity graph derived from Wikipedia. We propose a method that given a concept finds a linear transformation to a subspace where the structure of the concept is retained. Our initial experiments show that we obtain low error in finding fine-grained concepts.

* Accepted for poster presentation at ECML PKDD AIMLAI-XKDD workshop

Via

Access Paper or Ask Questions