
Mirco Musolesi

Reward Model Overoptimisation in Iterated RLHF

May 23, 2025
Via arXiv

Multi-Agent Reinforcement Learning Simulation for Environmental Policy Synthesis

Apr 17, 2025
Via arXiv

DiffSampling: Enhancing Diversity and Accuracy in Neural Text Generation

Feb 19, 2025
Via arXiv

Thinking Outside the (Gray) Box: A Context-Based Score for Assessing Value and Originality in Neural Text Generation

Feb 18, 2025
Via arXiv

Feature Selection for Network Intrusion Detection

Nov 18, 2024
Via arXiv

Mutual Information Preserving Neural Network Pruning

Oct 31, 2024
Via arXiv

Moral Alignment for LLM Agents

Oct 02, 2024
Via arXiv

Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies

Sep 12, 2024
Via arXiv

Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law

Jul 18, 2024
Via arXiv

Partial Information Decomposition for Data Interpretability and Feature Selection

May 29, 2024
Via arXiv