Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haytham Fayek

Less Is More? Examining Fairness in Pruned Large Language Models for Summarising Opinions

Aug 25, 2025

Nannan Huang, Haytham Fayek, Xiuzhen Zhang

Abstract:Model compression through post-training pruning offers a way to reduce model size and computational requirements without significantly impacting model performance. However, the effect of pruning on the fairness of LLM-generated summaries remains unexplored, particularly for opinion summarisation where biased outputs could influence public views.In this paper, we present a comprehensive empirical analysis of opinion summarisation, examining three state-of-the-art pruning methods and various calibration sets across three open-source LLMs using four fairness metrics. Our systematic analysis reveals that pruning methods have a greater impact on fairness than calibration sets. Building on these insights, we propose High Gradient Low Activation (HGLA) pruning, which identifies and removes parameters that are redundant for input processing but influential in output generation. Our experiments demonstrate that HGLA can better maintain or even improve fairness compared to existing methods, showing promise across models and tasks where traditional methods have limitations. Our human evaluation shows HGLA-generated outputs are fairer than existing state-of-the-art pruning methods. Code is available at: https://github.com/amberhuang01/HGLA.

* Accepted to EMNLP 2025 Main Conference

Via

Access Paper or Ask Questions

Foundation Models for Anomaly Detection: Vision and Challenges

Feb 10, 2025

Jing Ren, Tao Tang, Hong Jia, Haytham Fayek, Xiaodong Li, Suyu Ma, Xiwei Xu, Feng Xia

Figure 1 for Foundation Models for Anomaly Detection: Vision and Challenges

Figure 2 for Foundation Models for Anomaly Detection: Vision and Challenges

Figure 3 for Foundation Models for Anomaly Detection: Vision and Challenges

Figure 4 for Foundation Models for Anomaly Detection: Vision and Challenges

Abstract:As data continues to grow in volume and complexity across domains such as finance, manufacturing, and healthcare, effective anomaly detection is essential for identifying irregular patterns that may signal critical issues. Recently, foundation models (FMs) have emerged as a powerful tool for advancing anomaly detection. They have demonstrated unprecedented capabilities in enhancing anomaly identification, generating detailed data descriptions, and providing visual explanations. This survey presents the first comprehensive review of recent advancements in FM-based anomaly detection. We propose a novel taxonomy that classifies FMs into three categories based on their roles in anomaly detection tasks, i.e., as encoders, detectors, or interpreters. We provide a systematic analysis of state-of-the-art methods and discuss key challenges in leveraging FMs for improved anomaly detection. We also outline future research directions in this rapidly evolving field.

* 9 pages, 4 figures

Via

Access Paper or Ask Questions

Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias

Feb 01, 2024

Nannan Huang, Haytham Fayek, Xiuzhen Zhang

Figure 1 for Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias

Figure 2 for Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias

Figure 3 for Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias

Figure 4 for Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias

Abstract:Opinion summarisation aims to summarise the salient information and opinions presented in documents such as product reviews, discussion forums, and social media texts into short summaries that enable users to effectively understand the opinions therein. Generating biased summaries has the risk of potentially swaying public opinion. Previous studies focused on studying bias in opinion summarisation using extractive models, but limited research has paid attention to abstractive summarisation models. In this study, using political bias as a case study, we first establish a methodology to quantify bias in abstractive models, then trace it from the pre-trained models to the task of summarising social media opinions using different models and adaptation methods. We find that most models exhibit intrinsic bias. Using a social media text summarisation dataset and contrasting various adaptation methods, we find that tuning a smaller number of parameters is less biased compared to standard fine-tuning; however, the diversity of topics in training data used for fine-tuning is critical.

* 15 pages, 1 figure, 6 tables, Accepted to EACL 2024

Via

Access Paper or Ask Questions

Examining Bias in Opinion Summarisation Through the Perspective of Opinion Diversity

Jun 07, 2023

Nannan Huang, Lin Tian, Haytham Fayek, Xiuzhen Zhang

Figure 1 for Examining Bias in Opinion Summarisation Through the Perspective of Opinion Diversity

Figure 2 for Examining Bias in Opinion Summarisation Through the Perspective of Opinion Diversity

Figure 3 for Examining Bias in Opinion Summarisation Through the Perspective of Opinion Diversity

Figure 4 for Examining Bias in Opinion Summarisation Through the Perspective of Opinion Diversity

Abstract:Opinion summarisation is a task that aims to condense the information presented in the source documents while retaining the core message and opinions. A summary that only represents the majority opinions will leave the minority opinions unrepresented in the summary. In this paper, we use the stance towards a certain target as an opinion. We study bias in opinion summarisation from the perspective of opinion diversity, which measures whether the model generated summary can cover a diverse set of opinions. In addition, we examine opinion similarity, a measure of how closely related two opinions are in terms of their stance on a given topic, and its relationship with opinion diversity. Through the lens of stances towards a topic, we examine opinion diversity and similarity using three debatable topics under COVID-19. Experimental results on these topics revealed that a higher degree of similarity of opinions did not indicate good diversity or fairly cover the various opinions originally presented in the source documents. We found that BART and ChatGPT can better capture diverse opinions presented in the source documents.

* 9 pages, 3 figures, accepted at WASSA, ACL 2023

Via

Access Paper or Ask Questions

Knowledge Capture and Replay for Continual Learning

Dec 12, 2020

Saisubramaniam Gopalakrishnan, Pranshu Ranjan Singh, Haytham Fayek, Savitha Ramasamy, Arulmurugan Ambikapathi

Figure 1 for Knowledge Capture and Replay for Continual Learning

Figure 2 for Knowledge Capture and Replay for Continual Learning

Figure 3 for Knowledge Capture and Replay for Continual Learning

Figure 4 for Knowledge Capture and Replay for Continual Learning

Abstract:Deep neural networks have shown promise in several domains, and the learned task-specific information is implicitly stored in the network parameters. It will be vital to utilize representations from these networks for downstream tasks such as continual learning. In this paper, we introduce the notion of {\em flashcards} that are visual representations to {\em capture} the encoded knowledge of a network, as a function of random image patterns. We demonstrate the effectiveness of flashcards in capturing representations and show that they are efficient replay methods for general and task agnostic continual learning setting. Thus, while adapting to a new task, a limited number of constructed flashcards, help to prevent catastrophic forgetting of the previously learned tasks. Most interestingly, such flashcards neither require external memory storage nor need to be accumulated over multiple tasks and only need to be constructed just before learning the subsequent new task, irrespective of the number of tasks trained before and are hence task agnostic. We first demonstrate the efficacy of flashcards in capturing knowledge representation from a trained network, and empirically validate the efficacy of flashcards on a variety of continual learning tasks: continual unsupervised reconstruction, continual denoising, and new-instance learning classification, using a number of heterogeneous benchmark datasets. These studies also indicate that continual learning algorithms with flashcards as the replay strategy perform better than other state-of-the-art replay methods, and exhibits on par performance with the best possible baseline using coreset sampling, with the least additional computational complexity and storage.

Via

Access Paper or Ask Questions