Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Felipe Soares

Dedicated Feedback and Edit Models Empower Inference-Time Scaling for Open-Ended General-Domain Tasks

Mar 06, 2025

Zhilin Wang, Jiaqi Zeng, Olivier Delalleau, Daniel Egert, Ellie Evans, Hoo-Chang Shin, Felipe Soares, Yi Dong, Oleksii Kuchaiev

Abstract:Inference-Time Scaling has been critical to the success of recent models such as OpenAI o1 and DeepSeek R1. However, many techniques used to train models for inference-time scaling require tasks to have answers that can be verified, limiting their application to domains such as math, coding and logical reasoning. We take inspiration from how humans make first attempts, ask for detailed feedback from others and make improvements based on such feedback across a wide spectrum of open-ended endeavors. To this end, we collect data for and train dedicated Feedback and Edit Models that are capable of performing inference-time scaling for open-ended general-domain tasks. In our setup, one model generates an initial response, which are given feedback by a second model, that are then used by a third model to edit the response. We show that performance on Arena Hard, a benchmark strongly predictive of Chatbot Arena Elo can be boosted by scaling the number of initial response drafts, effective feedback and edited responses. When scaled optimally, our setup based on 70B models from the Llama 3 family can reach SoTA performance on Arena Hard at 92.7 as of 5 Mar 2025, surpassing OpenAI o1-preview-2024-09-12 with 90.4 and DeepSeek R1 with 92.3.

* 22 pages, 2 figures

Via

Access Paper or Ask Questions

Nemotron-4 340B Technical Report

Jun 17, 2024

Nvidia, :, Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro(+73 more)

Figure 1 for Nemotron-4 340B Technical Report

Figure 2 for Nemotron-4 340B Technical Report

Figure 3 for Nemotron-4 340B Technical Report

Figure 4 for Nemotron-4 340B Technical Report

Abstract:We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation benchmarks, and were sized to fit on a single DGX H100 with 8 GPUs when deployed in FP8 precision. We believe that the community can benefit from these models in various research studies and commercial applications, especially for generating synthetic data to train smaller language models. Notably, over 98% of data used in our model alignment process is synthetically generated, showcasing the effectiveness of these models in generating synthetic data. To further support open research and facilitate model development, we are also open-sourcing the synthetic data generation pipeline used in our model alignment process.

Via

Access Paper or Ask Questions

UFRGS Participation on the WMT Biomedical Translation Shared Task

May 06, 2019

Felipe Soares, Karin Becker

Figure 1 for UFRGS Participation on the WMT Biomedical Translation Shared Task

Figure 2 for UFRGS Participation on the WMT Biomedical Translation Shared Task

Figure 3 for UFRGS Participation on the WMT Biomedical Translation Shared Task

Figure 4 for UFRGS Participation on the WMT Biomedical Translation Shared Task

Abstract:This paper describes the machine translation systems developed by the Universidade Federal do Rio Grande do Sul (UFRGS) team for the biomedical translation shared task. Our systems are based on statistical machine translation and neural machine translation, using the Moses and OpenNMT toolkits, respectively. We participated in four translation directions for the English/Spanish and English/Portuguese language pairs. To create our training data, we concatenated several parallel corpora, both from in-domain and out-of-domain sources, as well as terminological resources from UMLS. Our systems achieved the best BLEU scores according to the official shared task evaluation.

* Published on the Third Conference on Machine Translation (WMT18)

Via

Access Paper or Ask Questions

A Large Parallel Corpus of Full-Text Scientific Articles

May 06, 2019

Felipe Soares, Viviane Pereira Moreira, Karin Becker

Figure 1 for A Large Parallel Corpus of Full-Text Scientific Articles

Figure 2 for A Large Parallel Corpus of Full-Text Scientific Articles

Figure 3 for A Large Parallel Corpus of Full-Text Scientific Articles

Figure 4 for A Large Parallel Corpus of Full-Text Scientific Articles

Abstract:The Scielo database is an important source of scientific information in Latin America, containing articles from several research domains. A striking characteristic of Scielo is that many of its full-text contents are presented in more than one language, thus being a potential source of parallel corpora. In this article, we present the development of a parallel corpus from Scielo in three languages: English, Portuguese, and Spanish. Sentences were automatically aligned using the Hunalign algorithm for all language pairs, and for a subset of trilingual articles also. We demonstrate the capabilities of our corpus by training a Statistical Machine Translation system (Moses) for each language pair, which outperformed related works on scientific articles. Sentence alignment was also manually evaluated, presenting an average of 98.8% correctly aligned sentences across all languages. Our parallel corpus is freely available in the TMX format, with complementary information regarding article metadata.

* Published in Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

Via

Access Paper or Ask Questions

A Parallel Corpus of Theses and Dissertations Abstracts

May 05, 2019

Felipe Soares, Gabrielli Harumi Yamashita, Michel Jose Anzanello

Figure 1 for A Parallel Corpus of Theses and Dissertations Abstracts

Figure 2 for A Parallel Corpus of Theses and Dissertations Abstracts

Figure 3 for A Parallel Corpus of Theses and Dissertations Abstracts

Figure 4 for A Parallel Corpus of Theses and Dissertations Abstracts

Abstract:In Brazil, the governmental body responsible for overseeing and coordinating post-graduate programs, CAPES, keeps records of all theses and dissertations presented in the country. Information regarding such documents can be accessed online in the Theses and Dissertations Catalog (TDC), which contains abstracts in Portuguese and English, and additional metadata. Thus, this database can be a potential source of parallel corpora for the Portuguese and English languages. In this article, we present the development of a parallel corpus from TDC, which is made available by CAPES under the open data initiative. Approximately 240,000 documents were collected and aligned using the Hunalign tool. We demonstrate the capability of our developed corpus by training Statistical Machine Translation (SMT) and Neural Machine Translation (NMT) models for both language directions, followed by a comparison with Google Translate (GT). Both translation models presented better BLEU scores than GT, with NMT system being the most accurate one. Sentence alignment was also manually evaluated, presenting an average of 82.30% correctly aligned sentences. Our parallel corpus is freely available in TMX format, with complementary information regarding document metadata

* Computational Processing of the Portuguese Language 2018
* Published in the PROPOR Conference. arXiv admin note: text overlap with arXiv:1905.01712

Via

Access Paper or Ask Questions

BVS Corpus: A Multilingual Parallel Corpus of Biomedical Scientific Texts

May 05, 2019

Felipe Soares, Martin Krallinger

Figure 1 for BVS Corpus: A Multilingual Parallel Corpus of Biomedical Scientific Texts

Figure 2 for BVS Corpus: A Multilingual Parallel Corpus of Biomedical Scientific Texts

Figure 3 for BVS Corpus: A Multilingual Parallel Corpus of Biomedical Scientific Texts

Figure 4 for BVS Corpus: A Multilingual Parallel Corpus of Biomedical Scientific Texts

Abstract:The BVS database (Health Virtual Library) is a centralized source of biomedical information for Latin America and Carib, created in 1998 and coordinated by BIREME (Biblioteca Regional de Medicina) in agreement with the Pan American Health Organization (OPAS). Abstracts are available in English, Spanish, and Portuguese, with a subset in more than one language, thus being a possible source of parallel corpora. In this article, we present the development of parallel corpora from BVS in three languages: English, Portuguese, and Spanish. Sentences were automatically aligned using the Hunalign algorithm for EN/ES and EN/PT language pairs, and for a subset of trilingual articles also. We demonstrate the capabilities of our corpus by training a Neural Machine Translation (OpenNMT) system for each language pair, which outperformed related works on scientific biomedical articles. Sentence alignment was also manually evaluated, presenting an average 96% of correctly aligned sentences across all languages. Our parallel corpus is freely available, with complementary information regarding article metadata.

* Accepted at the Copora conference. arXiv admin note: text overlap with arXiv:1905.01715

Via

Access Paper or Ask Questions