Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xingwei Tan

SafeSpeech: A Comprehensive and Interactive Tool for Analysing Sexist and Abusive Language in Conversations

Mar 09, 2025

Xingwei Tan, Chen Lyu, Hafiz Muhammad Umer, Sahrish Khan, Mahathi Parvatham, Lois Arthurs, Simon Cullen, Shelley Wilson, Arshad Jhumka, Gabriele Pergola

Abstract:Detecting toxic language including sexism, harassment and abusive behaviour, remains a critical challenge, particularly in its subtle and context-dependent forms. Existing approaches largely focus on isolated message-level classification, overlooking toxicity that emerges across conversational contexts. To promote and enable future research in this direction, we introduce SafeSpeech, a comprehensive platform for toxic content detection and analysis that bridges message-level and conversation-level insights. The platform integrates fine-tuned classifiers and large language models (LLMs) to enable multi-granularity detection, toxic-aware conversation summarization, and persona profiling. SafeSpeech also incorporates explainability mechanisms, such as perplexity gain analysis, to highlight the linguistic elements driving predictions. Evaluations on benchmark datasets, including EDOS, OffensEval, and HatEval, demonstrate the reproduction of state-of-the-art performance across multiple tasks, including fine-grained sexism detection.

* NAACL 2025 system demonstration camera-ready

Via

Access Paper or Ask Questions

Cascading Large Language Models for Salient Event Graph Generation

Jun 26, 2024

Xingwei Tan, Yuxiang Zhou, Gabriele Pergola, Yulan He

Figure 1 for Cascading Large Language Models for Salient Event Graph Generation

Figure 2 for Cascading Large Language Models for Salient Event Graph Generation

Figure 3 for Cascading Large Language Models for Salient Event Graph Generation

Figure 4 for Cascading Large Language Models for Salient Event Graph Generation

Abstract:Generating event graphs from long documents is challenging due to the inherent complexity of multiple tasks involved such as detecting events, identifying their relationships, and reconciling unstructured input with structured graphs. Recent studies typically consider all events with equal importance, failing to distinguish salient events crucial for understanding narratives. This paper presents CALLMSAE, a CAscading Large Language Model framework for SAlient Event graph generation, which leverages the capabilities of LLMs and eliminates the need for costly human annotations. We first identify salient events by prompting LLMs to generate summaries, from which salient events are identified. Next, we develop an iterative code refinement prompting strategy to generate event relation graphs, removing hallucinated relations and recovering missing edges. Fine-tuning contextualised graph generation models on the LLM-generated graphs outperforms the models trained on CAEVO-generated data. Experimental results on a human-annotated test set show that the proposed method generates salient and more accurate graphs, outperforming competitive baselines.

* 9 + 12 pages

Via

Access Paper or Ask Questions

Set-Aligning Framework for Auto-Regressive Event Temporal Graph Generation

Apr 01, 2024

Xingwei Tan, Yuxiang Zhou, Gabriele Pergola, Yulan He

Figure 1 for Set-Aligning Framework for Auto-Regressive Event Temporal Graph Generation

Figure 2 for Set-Aligning Framework for Auto-Regressive Event Temporal Graph Generation

Figure 3 for Set-Aligning Framework for Auto-Regressive Event Temporal Graph Generation

Figure 4 for Set-Aligning Framework for Auto-Regressive Event Temporal Graph Generation

Abstract:Event temporal graphs have been shown as convenient and effective representations of complex temporal relations between events in text. Recent studies, which employ pre-trained language models to auto-regressively generate linearised graphs for constructing event temporal graphs, have shown promising results. However, these methods have often led to suboptimal graph generation as the linearised graphs exhibit set characteristics which are instead treated sequentially by language models. This discrepancy stems from the conventional text generation objectives, leading to erroneous penalisation of correct predictions caused by the misalignment of elements in target sequences. To address these challenges, we reframe the task as a conditional set generation problem, proposing a Set-aligning Framework tailored for the effective utilisation of Large Language Models (LLMs). The framework incorporates data augmentations and set-property regularisations designed to alleviate text generation loss penalties associated with the linearised graph edge sequences, thus encouraging the generation of more relation edges. Experimental results show that our framework surpasses existing baselines for event temporal graph generation. Furthermore, under zero-shot settings, the structural knowledge introduced through our framework notably improves model generalisation, particularly when the training examples available are limited.

* Accepted to NAACL 2024. 9 + 10 pages

Via

Access Paper or Ask Questions

Event Temporal Relation Extraction with Bayesian Translational Model

Feb 10, 2023

Xingwei Tan, Gabriele Pergola, Yulan He

Abstract:Existing models to extract temporal relations between events lack a principled method to incorporate external knowledge. In this study, we introduce Bayesian-Trans, a Bayesian learning-based method that models the temporal relation representations as latent variables and infers their values via Bayesian inference and translational functions. Compared to conventional neural approaches, instead of performing point estimation to find the best set parameters, the proposed model infers the parameters' posterior distribution directly, enhancing the model's capability to encode and express uncertainty about the predictions. Experimental results on the three widely used datasets show that Bayesian-Trans outperforms existing approaches for event temporal relation extraction. We additionally present detailed analyses on uncertainty quantification, comparison of priors, and ablation studies, illustrating the benefits of the proposed approach.

* 9 pages + 2

Via

Access Paper or Ask Questions

Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation

Oct 24, 2022

Junru Lu, Xingwei Tan, Gabriele Pergola, Lin Gui, Yulan He

Figure 1 for Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation

Figure 2 for Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation

Figure 3 for Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation

Figure 4 for Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation

Abstract:Human reading comprehension often requires reasoning of event semantic relations in narratives, represented by Event-centric Question-Answering (QA). To address event-centric QA, we propose a novel QA model with contrastive learning and invertible event transformation, call TranCLR. Our proposed model utilizes an invertible transformation matrix to project semantic vectors of events into a common event embedding space, trained with contrastive learning, and thus naturally inject event semantic knowledge into mainstream QA pipelines. The transformation matrix is fine-tuned with the annotated event relation types between events that occurred in questions and those in answers, using event-aware question vectors. Experimental results on the Event Semantic Relation Reasoning (ESTER) dataset show significant improvements in both generative and extractive settings compared to the existing strong baselines, achieving over 8.4% gain in the token-level F1 score and 3.0% gain in Exact Match (EM) score under the multi-answer setting. Qualitative analysis reveals the high quality of the generated answers by TranCLR, demonstrating the feasibility of injecting event knowledge into QA model learning. Our code and models can be found at https://github.com/LuJunru/TranCLR.

* Findings of EMNLP 2022

Via

Access Paper or Ask Questions

Extracting Event Temporal Relations via Hyperbolic Geometry

Sep 12, 2021

Xingwei Tan, Gabriele Pergola, Yulan He

Figure 1 for Extracting Event Temporal Relations via Hyperbolic Geometry

Figure 2 for Extracting Event Temporal Relations via Hyperbolic Geometry

Figure 3 for Extracting Event Temporal Relations via Hyperbolic Geometry

Figure 4 for Extracting Event Temporal Relations via Hyperbolic Geometry

Abstract:Detecting events and their evolution through time is a crucial task in natural language understanding. Recent neural approaches to event temporal relation extraction typically map events to embeddings in the Euclidean space and train a classifier to detect temporal relations between event pairs. However, embeddings in the Euclidean space cannot capture richer asymmetric relations such as event temporal relations. We thus propose to embed events into hyperbolic spaces, which are intrinsically oriented at modeling hierarchical structures. We introduce two approaches to encode events and their temporal relations in hyperbolic spaces. One approach leverages hyperbolic embeddings to directly infer event relations through simple geometrical operations. In the second one, we devise an end-to-end architecture composed of hyperbolic neural units tailored for the temporal relation extraction task. Thorough experimental assessments on widely used datasets have shown the benefits of revisiting the tasks on a different geometrical space, resulting in state-of-the-art performance on several standard metrics. Finally, the ablation study and several qualitative analyses highlighted the rich event semantics implicitly encoded into hyperbolic spaces.

* Accepted by EMNLP 2021, 9 pages + 4 pages (References and Appendix)

Via

Access Paper or Ask Questions