Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chaithanya Bandi

REL: Working out is all you need

Dec 05, 2024

Toby Simonds, Jey Han Lau, Chaithanya Bandi

Abstract:Recent developments, particularly OpenAI's O1 model, have demonstrated the remarkable potential of Large Language Models (LLMs) for complex reasoning tasks. Through analysis of O1's outputs and provided sample Chain-of-Thought (CoT) demonstrations, we observe that it approaches problem-solving in a distinctly human-like manner, systematically brainstorming ideas, testing hypotheses, verifying results, and planning comprehensive solutions. These sophisticated reasoning capabilities remain notably absent in other state-of-the-art language models. In this paper, we hypothesize that this performance gap stems from the limited availability of high-quality reasoning process data in current training sets. We demonstrate that by constructing a specialized dataset focused on explicit problem-solving workflows ("worked solutions"), we can elicit substantially improved planning capabilities from existing models. Additionally, we propose the Reasoning Enhancement Loop (REL), a method for generating synthetic worked solutions.

Via

Access Paper or Ask Questions

Adversarial Multi-Agent Evaluation of Large Language Models through Iterative Debates

Oct 07, 2024

Chaithanya Bandi, Hari Bandi, Abir Harrasse

Abstract:This paper explores optimal architectures for evaluating the outputs of large language models (LLMs) using LLMs themselves. We propose a novel framework that interprets LLMs as advocates within an ensemble of interacting agents, allowing them to defend their answers and reach conclusions through a judge and jury system. This approach offers a more dynamic and comprehensive evaluation process compared to traditional human-based assessments or automated metrics. We discuss the motivation behind this framework, its key components, and comparative advantages. We also present a probabilistic model to evaluate the error reduction achieved by iterative advocate systems. Finally, we outline experiments to validate the effectiveness of multi-advocate architectures and discuss future research directions.

Via

Access Paper or Ask Questions

An Application of Newsboy Problem in Supply Chain Optimisation of Online Fashion E-Commerce

Jul 06, 2020

Chandramouli Kamanchi, Gopinath Ashok Kumar, Nachiappan Sundaram, Ravindra Babu T, Chaithanya Bandi

Figure 1 for An Application of Newsboy Problem in Supply Chain Optimisation of Online Fashion E-Commerce

Figure 2 for An Application of Newsboy Problem in Supply Chain Optimisation of Online Fashion E-Commerce

Figure 3 for An Application of Newsboy Problem in Supply Chain Optimisation of Online Fashion E-Commerce

Figure 4 for An Application of Newsboy Problem in Supply Chain Optimisation of Online Fashion E-Commerce

Abstract:We describe a supply chain optimization model deployed in an online fashion e-commerce company in India called Myntra. Our model is simple, elegant and easy to put into service. The model utilizes historic data and predicts the quantity of Stock Keeping Units (SKUs) to hold so that the metrics "Fulfilment Index" and "Utilization Index" are optimized. We present the mathematics central to our model as well as compare the performance of our model with baseline regression based solutions.

Via

Access Paper or Ask Questions