Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Maurizio Gabbrielli

A Computational Model of Inclusive Pedagogy: From Understanding to Application

May 02, 2025

Francesco Balzan, Pedro P. Santos, Maurizio Gabbrielli, Mahault Albarracin, Manuel Lopes

Abstract:Human education transcends mere knowledge transfer, it relies on co-adaptation dynamics -- the mutual adjustment of teaching and learning strategies between agents. Despite its centrality, computational models of co-adaptive teacher-student interactions (T-SI) remain underdeveloped. We argue that this gap impedes Educational Science in testing and scaling contextual insights across diverse settings, and limits the potential of Machine Learning systems, which struggle to emulate and adaptively support human learning processes. To address this, we present a computational T-SI model that integrates contextual insights on human education into a testable framework. We use the model to evaluate diverse T-SI strategies in a realistic synthetic classroom setting, simulating student groups with unequal access to sensory information. Results show that strategies incorporating co-adaptation principles (e.g., bidirectional agency) outperform unilateral approaches (i.e., where only the teacher or the student is active), improving the learning outcomes for all learning types. Beyond the testing and scaling of context-dependent educational insights, our model enables hypothesis generation in controlled yet adaptable environments. This work bridges non-computational theories of human education with scalable, inclusive AI in Education systems, providing a foundation for equitable technologies that dynamically adapt to learner needs.

* This is a preprint version of a manuscript intended for submission to the International Journal of Artificial Intelligence in Education (IJAIED)

Via

Access Paper or Ask Questions

One Model to Train them All: Hierarchical Self-Distillation for Enhanced Early Layer Embeddings

Mar 04, 2025

Andrea Gurioli, Federico Pennino, João Monteiro, Maurizio Gabbrielli

Abstract:Deploying language models often requires handling model size vs. performance trade-offs to satisfy downstream latency constraints while preserving the model's usefulness. Model distillation is commonly employed to reduce model size while maintaining acceptable performance. However, distillation can be inefficient since it involves multiple training steps. In this work, we introduce MODULARSTARENCODER, a modular multi-exit encoder with 1B parameters, useful for multiple tasks within the scope of code retrieval. MODULARSTARENCODER is trained with a novel self-distillation mechanism that significantly improves lower-layer representations-allowing different portions of the model to be used while still maintaining a good trade-off in terms of performance. Our architecture focuses on enhancing text-to-code and code-to-code search by systematically capturing syntactic and semantic structures across multiple levels of representation. Specific encoder layers are targeted as exit heads, allowing higher layers to guide earlier layers during training. This self-distillation effect improves intermediate representations, increasing retrieval recall at no extra training cost. In addition to the multi-exit scheme, our approach integrates a repository-level contextual loss that maximally utilizes the training context window, further enhancing the learned representations. We also release a new dataset constructed via code translation, seamlessly expanding traditional text-to-code benchmarks with code-to-code pairs across diverse programming languages. Experimental results highlight the benefits of self-distillation through multi-exit supervision.

Via

Access Paper or Ask Questions

Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs

Jan 10, 2025

Bianca Raimondi, Saverio Giallorenzo, Maurizio Gabbrielli

Figure 1 for Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs

Figure 2 for Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs

Figure 3 for Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs

Figure 4 for Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs

Abstract:In education, the capability of generating human-like text of Large Language Models (LLMs) inspired work on how they can increase the efficiency of learning and teaching. We study the affordability of these models for educators and students by investigating how LLMs answer multiple-choice questions (MCQs) with respect to hardware constraints and refinement techniques. We explore this space by using generic pre-trained LLMs (the 7B, 13B, and 70B variants of LLaMA-2) to answer 162 undergraduate-level MCQs from a course on Programming Languages (PL) -- the MCQ dataset is a contribution of this work, which we make publicly available. Specifically, we dissect how different factors, such as using readily-available material -- (parts of) the course's textbook -- for fine-tuning and quantisation (to decrease resource usage) can change the accuracy of the responses. The main takeaway is that smaller textbook-based fine-tuned models outperform generic larger ones (whose pre-training requires conspicuous resources), making the usage of LLMs for answering MCQs resource- and material-wise affordable.

* The 40th ACM/SIGAPP Symposium On Applied Computing

Via

Access Paper or Ask Questions

Multimodal Side-Tuning for Document Classification

Jan 23, 2023

Stefano Pio Zingaro, Giuseppe Lisanti, Maurizio Gabbrielli

Figure 1 for Multimodal Side-Tuning for Document Classification

Figure 2 for Multimodal Side-Tuning for Document Classification

Figure 3 for Multimodal Side-Tuning for Document Classification

Figure 4 for Multimodal Side-Tuning for Document Classification

Abstract:In this paper, we propose to exploit the side-tuning framework for multimodal document classification. Side-tuning is a methodology for network adaptation recently introduced to solve some of the problems related to previous approaches. Thanks to this technique it is actually possible to overcome model rigidity and catastrophic forgetting of transfer learning by fine-tuning. The proposed solution uses off-the-shelf deep learning architectures leveraging the side-tuning framework to combine a base model with a tandem of two side networks. We show that side-tuning can be successfully employed also when different data sources are considered, e.g. text and images in document classification. The experimental results show that this approach pushes further the limit for document classification accuracy with respect to the state of the art.

* 2020 25th International Conference on Pattern Recognition (ICPR)

Via

Access Paper or Ask Questions

On the evaluation of (meta-)solver approaches

Feb 17, 2022

Roberto Amadini, Maurizio Gabbrielli, Tong Liu, Jacopo Mauro

Figure 1 for On the evaluation of (meta-)solver approaches

Figure 2 for On the evaluation of (meta-)solver approaches

Figure 3 for On the evaluation of (meta-)solver approaches

Figure 4 for On the evaluation of (meta-)solver approaches

Abstract:Meta-solver approaches exploits a number of individual solvers to potentially build a better solver. To assess the performance of meta-solvers, one can simply adopt the metrics typically used for individual solvers (e.g., runtime or solution quality), or employ more specific evaluation metrics (e.g., by measuring how close the meta-solver gets to its virtual best performance). In this paper, based on some recently published works, we provide an overview of different performance metrics for evaluating (meta-)solvers, by underlying their strengths and weaknesses.

Via

Access Paper or Ask Questions

Content-Based Textual File Type Detection at Scale

Jan 21, 2021

Francesca Del Bonifro, Maurizio Gabbrielli, Stefano Zacchiroli

Figure 1 for Content-Based Textual File Type Detection at Scale

Figure 2 for Content-Based Textual File Type Detection at Scale

Figure 3 for Content-Based Textual File Type Detection at Scale

Figure 4 for Content-Based Textual File Type Detection at Scale

Abstract:Programming language detection is a common need in the analysis of large source code bases. It is supported by a number of existing tools that rely on several features, and most notably file extensions, to determine file types. We consider the problem of accurately detecting the type of files commonly found in software code bases, based solely on textual file content. Doing so is helpful to classify source code that lack file extensions (e.g., code snippets posted on the Web or executable scripts), to avoid misclassifying source code that has been recorded with wrong or uncommon file extensions, and also shed some light on the intrinsic recognizability of source code files. We propose a simple model that (a) use a language-agnostic word tokenizer for textual files, (b) group tokens in 1-/2-grams, (c) build feature vectors based on N-gram frequencies, and (d) use a simple fully connected neural network as classifier. As training set we use textual files extracted from GitHub repositories with at least 1000 stars, using existing file extensions as ground truth. Despite its simplicity the proposed model reaches 85% in our experiments for a relatively high number of recognized classes (more than 130 file types).

Via

Access Paper or Ask Questions

sunny-as2: Enhancing SUNNY for Algorithm Selection

Sep 07, 2020

Tong Liu, Roberto Amadini, Jacopo Mauro, Maurizio Gabbrielli

Figure 1 for sunny-as2: Enhancing SUNNY for Algorithm Selection

Figure 2 for sunny-as2: Enhancing SUNNY for Algorithm Selection

Figure 3 for sunny-as2: Enhancing SUNNY for Algorithm Selection

Figure 4 for sunny-as2: Enhancing SUNNY for Algorithm Selection

Abstract:SUNNY is an Algorithm Selection (AS) technique originally tailored for Constraint Programming (CP). SUNNY enables to schedule, from a portfolio of solvers, a subset of solvers to be run on a given CP problem. This approach has proved to be effective for CP problems, and its parallel version won many gold medals in the Open category of the MiniZinc Challenge -- the yearly international competition for CP solvers. In 2015, the ASlib benchmarks were released for comparing AS systems coming from disparate fields (e.g., ASP, QBF, and SAT) and SUNNY was extended to deal with generic AS problems. This led to the development of sunny-as2, an algorithm selector based on SUNNY for ASlib scenarios. A preliminary version of sunny-as2 was submitted to the Open Algorithm Selection Challenge (OASC) in 2017, where it turned out to be the best approach for the runtime minimization of decision problems. In this work, we present the technical advancements of sunny-as2, including: (i) wrapper-based feature selection; (ii) a training approach combining feature selection and neighbourhood size configuration; (iii) the application of nested cross-validation. We show how sunny-as2 performance varies depending on the considered AS scenarios, and we discuss its strengths and weaknesses. Finally, we also show how sunny-as2 improves on its preliminary version submitted to OASC.

Via

Access Paper or Ask Questions

SUNNY-CP and the MiniZinc Challenge

Jul 05, 2017

Roberto Amadini, Maurizio Gabbrielli, Jacopo Mauro

Figure 1 for SUNNY-CP and the MiniZinc Challenge

Figure 2 for SUNNY-CP and the MiniZinc Challenge

Figure 3 for SUNNY-CP and the MiniZinc Challenge

Figure 4 for SUNNY-CP and the MiniZinc Challenge

Abstract:In Constraint Programming (CP) a portfolio solver combines a variety of different constraint solvers for solving a given problem. This fairly recent approach enables to significantly boost the performance of single solvers, especially when multicore architectures are exploited. In this work we give a brief overview of the portfolio solver sunny-cp, and we discuss its performance in the MiniZinc Challenge---the annual international competition for CP solvers---where it won two gold medals in 2015 and 2016. Under consideration in Theory and Practice of Logic Programming (TPLP)

* Under consideration in Theory and Practice of Logic Programming (TPLP)

Via

Access Paper or Ask Questions

A Multicore Tool for Constraint Solving

Apr 30, 2015

Roberto Amadini, Maurizio Gabbrielli, Jacopo Mauro

Figure 1 for A Multicore Tool for Constraint Solving

Figure 2 for A Multicore Tool for Constraint Solving

Figure 3 for A Multicore Tool for Constraint Solving

Figure 4 for A Multicore Tool for Constraint Solving

Abstract:*** To appear in IJCAI 2015 proceedings *** In Constraint Programming (CP), a portfolio solver uses a variety of different solvers for solving a given Constraint Satisfaction / Optimization Problem. In this paper we introduce sunny-cp2: the first parallel CP portfolio solver that enables a dynamic, cooperative, and simultaneous execution of its solvers in a multicore setting. It incorporates state-of-the-art solvers, providing also a usable and configurable framework. Empirical results are very promising. sunny-cp2 can even outperform the performance of the oracle solver which always selects the best solver of the portfolio for a given problem.

Via

Access Paper or Ask Questions

SUNNY: a Lazy Portfolio Approach for Constraint Solving

May 13, 2014

Roberto Amadini, Maurizio Gabbrielli, Jacopo Mauro

Figure 1 for SUNNY: a Lazy Portfolio Approach for Constraint Solving

Figure 2 for SUNNY: a Lazy Portfolio Approach for Constraint Solving

Figure 3 for SUNNY: a Lazy Portfolio Approach for Constraint Solving

Figure 4 for SUNNY: a Lazy Portfolio Approach for Constraint Solving

Abstract:*** To appear in Theory and Practice of Logic Programming (TPLP) *** Within the context of constraint solving, a portfolio approach allows one to exploit the synergy between different solvers in order to create a globally better solver. In this paper we present SUNNY: a simple and flexible algorithm that takes advantage of a portfolio of constraint solvers in order to compute --- without learning an explicit model --- a schedule of them for solving a given Constraint Satisfaction Problem (CSP). Motivated by the performance reached by SUNNY vs. different simulations of other state of the art approaches, we developed sunny-csp, an effective portfolio solver that exploits the underlying SUNNY algorithm in order to solve a given CSP. Empirical tests conducted on exhaustive benchmarks of MiniZinc models show that the actual performance of SUNNY conforms to the predictions. This is encouraging both for improving the power of CSP portfolio solvers and for trying to export them to fields such as Answer Set Programming and Constraint Logic Programming.

Via

Access Paper or Ask Questions