Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ricardo Luna Gutiérrez

N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics

Nov 08, 2023

Sajad Mousavi, Ricardo Luna Gutiérrez, Desik Rengarajan, Vineet Gundecha, Ashwin Ramesh Babu, Avisek Naug, Antonio Guillen, Soumyendu Sarkar

Figure 1 for N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics

Figure 2 for N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics

Figure 3 for N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics

Figure 4 for N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics

Abstract:We propose a self-correction mechanism for Large Language Models (LLMs) to mitigate issues such as toxicity and fact hallucination. This method involves refining model outputs through an ensemble of critics and the model's own feedback. Drawing inspiration from human behavior, we explore whether LLMs can emulate the self-correction process observed in humans who often engage in self-reflection and seek input from others to refine their understanding of complex topics. Our approach is model-agnostic and can be applied across various domains to enhance trustworthiness by addressing fairness, bias, and robustness concerns. We consistently observe performance improvements in LLMs for reducing toxicity and correcting factual errors.

* NeurIPS 2023 Workshop on Robustness of Few-shot and Zero-shot Learning in Foundation Models 2023(NeurIPS 2023)

Via

Access Paper or Ask Questions

PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability

Oct 15, 2023

Avisek Naug, Antonio Guillen, Ricardo Luna Gutiérrez, Vineet Gundecha, Dejan Markovikj, Lekhapriya Dheeraj Kashyap, Lorenz Krause, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu(+1 more)

Figure 1 for PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability

Figure 2 for PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability

Figure 3 for PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability

Figure 4 for PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability

Abstract:The increasing global emphasis on sustainability and reducing carbon emissions is pushing governments and corporations to rethink their approach to data center design and operation. Given their high energy consumption and exponentially large computational workloads, data centers are prime candidates for optimizing power consumption, especially in areas such as cooling and IT energy usage. A significant challenge in this pursuit is the lack of a configurable and scalable thermal data center model that offers an end-to-end pipeline. Data centers consist of multiple IT components whose geometric configuration and heat dissipation make thermal modeling difficult. This paper presents PyDCM, a customizable Data Center Model implemented in Python, that allows users to create unique configurations of IT equipment with custom server specifications and geometric arrangements of IT cabinets. The use of vectorized thermal calculations makes PyDCM orders of magnitude faster (30 times) than current Energy Plus modeling implementations and scales sublinearly with the number of CPUs. Also, PyDCM enables the use of Deep Reinforcement Learning via the Gymnasium wrapper to optimize data center cooling and offers a user-friendly platform for testing various data center design prototypes.

* The 10th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (BuildSys '23), November 15--16, 2023, Istanbul, Turkey

Via

Access Paper or Ask Questions