Giordano d'Aloisio

University of L'Aquila

REPAIR Approach for Social-based City Reconstruction Planning in case of natural disasters

Oct 21, 2025

Ghulam Mudassir, Antinisca Di Marco, Giordano d'Aloisio

Abstract: Natural disasters always have several effects on human lives. It is challenging for governments to tackle these incidents and to rebuild the economic, social and physical infrastructures and facilities with the available resources (mainly budget and time). Governments always define plans and policies according to the law and political strategies that should maximise social benefits. The severity of damage and the vast resources needed to bring life back to normality make such reconstruction a challenge. This article extends our previously published work with a comprehensive comparative analysis that integrates additional deep learning models and a random agent used as a baseline. Our prior research introduced a decision support system that uses Deep Reinforcement Learning to plan post-disaster city reconstruction, maximizing the social benefit of the reconstruction process, considering available resources, meeting the needs of the broad community of stakeholders (such as citizens' social benefits and politicians' priorities) and taking into account the city's structural constraints (such as dependencies among roads and buildings). The proposed approach, named post disaster REbuilding plAn ProvIdeR (REPAIR), is generic. It can determine a set of alternative plans for local administrators, who select the ideal one to implement, and it can be applied to areas of any extension. We show the application of REPAIR in a real use case, i.e., the reconstruction of L'Aquila, a city damaged in 2009 by a major earthquake.

* Accepted at International Journal of Data Science and Analytics

How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias

Jan 15, 2025

Tosin Fadahunsi, Giordano d'Aloisio, Antinisca Di Marco, Federica Sarro

Abstract: Generative models are nowadays widely used to generate graphical content for multiple purposes, e.g., web, art, advertisement. However, it has been shown that the images generated by these models could reinforce societal biases already existing in specific contexts. In this paper, we focus on understanding if this is the case when one generates images related to various software engineering tasks. In fact, the Software Engineering (SE) community is not immune from gender and ethnicity disparities, which could be amplified by the use of these models. Hence, if used without awareness, artificially generated images could reinforce these biases in the SE domain. Specifically, we perform an extensive empirical evaluation of the gender and ethnicity bias exhibited by three versions of the Stable Diffusion (SD) model (a very popular open-source text-to-image model) - SD 2, SD XL, and SD 3 - towards SE tasks. We obtain 6,720 images by feeding each model with two sets of prompts describing different software-related tasks: one set includes the Software Engineer keyword, and one set does not include any specification of the person performing the task. Next, we evaluate the gender and ethnicity disparities in the generated images. Results show that all models are significantly biased towards male figures when representing software engineers. In contrast, while SD 2 and SD XL are strongly biased towards White figures, SD 3 is slightly more biased towards Asian figures. Nevertheless, all models significantly under-represent Black and Arab figures, regardless of the prompt style used. The results of our analysis highlight severe concerns about adopting those models to generate content for SE tasks and open the field for future research on bias mitigation in this context.

On the Compression of Language Models for Code: An Empirical Study on CodeBERT

Dec 18, 2024

Giordano d'Aloisio, Luca Traini, Federica Sarro, Antinisca Di Marco

Abstract: Language models have proven successful across a wide range of software engineering tasks, but their significant computational costs often hinder their practical adoption. To address this challenge, researchers have begun applying various compression strategies to improve the efficiency of language models for code. These strategies aim to optimize inference latency and memory usage, though often at the cost of reduced model effectiveness. However, there is still a significant gap in understanding how these strategies influence the efficiency and effectiveness of language models for code. Here, we empirically investigate the impact of three well-known compression strategies -- knowledge distillation, quantization, and pruning -- across three different classes of software engineering tasks: vulnerability detection, code summarization, and code search. Our findings reveal that the impact of these strategies varies greatly depending on the task and the specific compression method employed. Practitioners and researchers can use these insights to make informed decisions when selecting the most appropriate compression strategy, balancing both efficiency and effectiveness based on their specific needs.

Exploring LLM-Driven Explanations for Quantum Algorithms

Sep 26, 2024

Giordano d'Aloisio, Sophie Fortz, Carol Hanna, Daniel Fortunato, Avner Bensoussan, Eñaut Mendiluze Usandizaga, Federica Sarro

Abstract: Background: Quantum computing is a rapidly growing new programming paradigm that brings significant changes to the design and implementation of algorithms. Understanding quantum algorithms requires knowledge of physics and mathematics, which can be challenging for software developers. Aims: In this work, we provide a first analysis of how LLMs can support developers' understanding of quantum code. Method: We empirically analyse and compare the quality of explanations provided by three widely adopted LLMs (Gpt3.5, Llama2, and Tinyllama) using two different human-written prompt styles for seven state-of-the-art quantum algorithms. We also analyse how consistent LLM explanations are over multiple rounds and how LLMs can improve existing descriptions of quantum algorithms. Results: Llama2 provides the highest quality explanations from scratch, while Gpt3.5 emerged as the LLM best suited to improve existing explanations. In addition, we show that adding a small amount of context to the prompt significantly improves the quality of explanations. Finally, we observe how explanations are qualitatively and syntactically consistent over multiple rounds. Conclusions: This work highlights promising results, and opens challenges for future research in the field of LLMs for quantum code explanation. Future work includes refining the methods through prompt optimisation and parsing of quantum code explanations, as well as carrying out a systematic assessment of the quality of explanations.

GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation

Jul 20, 2024

Jingzhi Gong, Sisi Li, Giordano d'Aloisio, Zishuo Ding, Yulong Ye, William B. Langdon, Federica Sarro

Abstract: Tuning the parameters and prompts for improving AI-based text-to-image generation has remained a substantial yet unaddressed challenge. Hence we introduce GreenStableYolo, which improves the parameters and prompts for Stable Diffusion to both reduce GPU inference time and increase image generation quality using NSGA-II and Yolo. Our experiments show that despite a relatively slight trade-off (18%) in image quality compared to StableYolo (which only considers image quality), GreenStableYolo achieves a substantial reduction in inference time (266% less) and a 526% higher hypervolume, thereby advancing the state-of-the-art for text-to-image generation.

* This paper is published in the SSBSE Challenge Track 2024

Towards a Prediction of Machine Learning Training Time to Support Continuous Learning Systems Development

Sep 20, 2023

Francesca Marzi, Giordano d'Aloisio, Antinisca Di Marco, Giovanni Stilo

Abstract: The problem of predicting the training time of machine learning (ML) models has become extremely relevant in the scientific community. Being able to predict a priori the training time of an ML model would enable the automatic selection of the best model both in terms of energy efficiency and in terms of performance in the context of, for instance, MLOps architectures. In this paper, we present the work we are conducting in this direction. In particular, we present an extensive empirical study of the Full Parameter Time Complexity (FPTC) approach by Zheng et al., which is, to the best of our knowledge, the only approach formalizing the training time of ML models as a function of both the dataset's and the model's parameters. We study the formulations proposed for the Logistic Regression and Random Forest classifiers, and we highlight the main strengths and weaknesses of the approach. Finally, we observe from the conducted study that the prediction of training time is strictly related to the context (i.e., the involved dataset) and that the FPTC approach is not generalizable.

Modeling Quality and Machine Learning Pipelines through Extended Feature Models

Jul 15, 2022

Giordano d'Aloisio, Antinisca Di Marco, Giovanni Stilo

Abstract: The recent increase in the complexity of Machine Learning (ML) methods has made it necessary to lighten both the research and industry development processes. ML pipelines have become an essential tool for experts of many domains, data scientists and researchers, allowing them to easily put together several ML models to cover the full analytic process starting from raw datasets. Over the years, several solutions have been proposed to automate the building of ML pipelines, most of them focused on semantic aspects and characteristics of the input dataset. However, an approach taking into account the new quality concerns needed by ML systems (like fairness, interpretability, privacy, etc.) is still missing. In this paper, we first identify, from the literature, key quality attributes of ML systems. Further, we propose a new engineering approach for quality ML pipelines by properly extending the Feature Models meta-model. The presented approach allows modeling ML pipelines, their quality requirements (on the whole pipeline and on single phases), and the quality characteristics of the algorithms used to implement each pipeline phase. Finally, we demonstrate the expressiveness of our model considering the classification problem.
