Bardh Prenkaj

Reinforcement Unlearning via Group Relative Policy Optimization

Jan 28, 2026

Moral Lenses, Political Coordinates: Towards Ideological Positioning of Morally Conditioned LLMs

Jan 13, 2026

Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs

Nov 08, 2025

CURE: Controlled Unlearning for Robust Embeddings -- Mitigating Conceptual Shortcuts in Pre-Trained Language Models

Sep 05, 2025

Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models

Jun 18, 2025

SCISSOR: Mitigating Semantic Bias through Cluster-Aware Siamese Networks for Robust Classification

Jun 17, 2025

Graph Style Transfer for Counterfactual Explainability

May 23, 2025

RAZOR: Sharpening Knowledge by Cutting Bias with Unsupervised Text Rewriting

Dec 10, 2024

Seamless Monitoring of Stress Levels Leveraging a Universal Model for Time Sequences

Jul 04, 2024

Towards Non-Adversarial Algorithmic Recourse

Mar 15, 2024