Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Wenjia Zhang

FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only

Aug 02, 2024

He Zhu, Junyou Su, Tianle Lun, Yicheng Tao, Wenjia Zhang, Zipei Fan, Guanhua Chen

Figure 1 for FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only

Figure 2 for FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only

Figure 3 for FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only

Figure 4 for FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only

Abstract:Instruction fine-tuning stands as a crucial advancement in leveraging large language models (LLMs) for enhanced task performance. However, the annotation of instruction datasets has traditionally been expensive and laborious, often relying on manual annotations or costly API calls of proprietary LLMs. To address these challenges, we introduce FANNO, a fully autonomous, open-sourced framework that revolutionizes the annotation process without the need for pre-existing annotated data. Utilizing a Mistral-7b-instruct model, FANNO efficiently produces diverse and high-quality datasets through a structured process involving document pre-screening, instruction generation, and response generation. Experiments on Open LLM Leaderboard and AlpacaEval benchmark show that the FANNO can generate high-quality data with diversity and complexity for free, comparable to human-annotated or cleaned datasets like Alpaca-GPT4-Cleaned.

Via

Access Paper or Ask Questions

Multi-Layer Ranking with Large Language Models for News Source Recommendation

Jun 17, 2024

Wenjia Zhang, Lin Gui, Rob Procter, Yulan He

Abstract:To seek reliable information sources for news events, we introduce a novel task of expert recommendation, which aims to identify trustworthy sources based on their previously quoted statements. To achieve this, we built a novel dataset, called NewsQuote, consisting of 23,571 quote-speaker pairs sourced from a collection of news articles. We formulate the recommendation task as the retrieval of experts based on their likelihood of being associated with a given query. We also propose a multi-layer ranking framework employing Large Language Models to improve the recommendation performance. Our results show that employing an in-context learning based LLM ranker and a multi-layer ranking-based filter significantly improve both the predictive quality and behavioural quality of the recommender system.

* Accepted by the SIGIR 2024. arXiv admin note: text overlap with arXiv:2305.04825

Via

Access Paper or Ask Questions

PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval

Feb 29, 2024

He Zhu, Wenjia Zhang, Nuoxian Huang, Boyang Li, Luyao Niu, Zipei Fan, Tianle Lun, Yicheng Tao, Junyou Su, Zhaoya Gong(+2 more)

Figure 1 for PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval

Figure 2 for PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval

Figure 3 for PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval

Figure 4 for PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval

Abstract:In the field of urban planning, general-purpose large language models often struggle to meet the specific needs of planners. Tasks like generating urban planning texts, retrieving related information, and evaluating planning documents pose unique challenges. To enhance the efficiency of urban professionals and overcome these obstacles, we introduce PlanGPT, the first specialized Large Language Model tailored for urban and spatial planning. Developed through collaborative efforts with institutions like the Chinese Academy of Urban Planning, PlanGPT leverages a customized local database retrieval framework, domain-specific fine-tuning of base models, and advanced tooling capabilities. Empirical tests demonstrate that PlanGPT has achieved advanced performance, delivering responses of superior quality precisely tailored to the intricacies of urban planning.

Via

Access Paper or Ask Questions

NarrativePlay: Interactive Narrative Understanding

Oct 02, 2023

Runcong Zhao, Wenjia Zhang, Jiazheng Li, Lixing Zhu, Yanran Li, Yulan He, Lin Gui

Abstract:In this paper, we introduce NarrativePlay, a novel system that allows users to role-play a fictional character and interact with other characters in narratives such as novels in an immersive environment. We leverage Large Language Models (LLMs) to generate human-like responses, guided by personality traits extracted from narratives. The system incorporates auto-generated visual display of narrative settings, character portraits, and character speech, greatly enhancing user experience. Our approach eschews predefined sandboxes, focusing instead on main storyline events extracted from narratives from the perspective of a user-selected character. NarrativePlay has been evaluated on two types of narratives, detective and adventure stories, where users can either explore the world or improve their favorability with the narrative characters through conversations.

Via

Access Paper or Ask Questions

Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL

Jun 08, 2023

Peng Cheng, Xianyuan Zhan, Zhihao Wu, Wenjia Zhang, Shoucheng Song, Han Wang, Youfang Lin, Li Jiang

Figure 1 for Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL

Figure 2 for Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL

Figure 3 for Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL

Figure 4 for Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL

Abstract:Offline reinforcement learning (RL) offers an appealing approach to real-world tasks by learning policies from pre-collected datasets without interacting with the environment. However, the performance of existing offline RL algorithms heavily depends on the scale and state-action space coverage of datasets. Real-world data collection is often expensive and uncontrollable, leading to small and narrowly covered datasets and posing significant challenges for practical deployments of offline RL. In this paper, we provide a new insight that leveraging the fundamental symmetry of system dynamics can substantially enhance offline RL performance under small datasets. Specifically, we propose a Time-reversal symmetry (T-symmetry) enforced Dynamics Model (TDM), which establishes consistency between a pair of forward and reverse latent dynamics. TDM provides both well-behaved representations for small datasets and a new reliability measure for OOD samples based on compliance with the T-symmetry. These can be readily used to construct a new offline RL algorithm (TSRL) with less conservative policy constraints and a reliable latent space data augmentation procedure. Based on extensive experiments, we find TSRL achieves great performance on small benchmark datasets with as few as 1% of the original samples, which significantly outperforms the recent offline RL algorithms in terms of data efficiency and generalizability.

* The first two authors contributed equally

Via

Access Paper or Ask Questions

NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-Checking

May 05, 2023

Wenjia Zhang, Lin Gui, Rob Procter, Yulan He

Figure 1 for NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-Checking

Figure 2 for NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-Checking

Figure 3 for NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-Checking

Figure 4 for NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-Checking

Abstract:To enhance the ability to find credible evidence in news articles, we propose a novel task of expert recommendation, which aims to identify trustworthy experts on a specific news topic. To achieve the aim, we describe the construction of a novel NewsQuote dataset consisting of 24,031 quote-speaker pairs that appeared on a COVID-19 news corpus. We demonstrate an automatic pipeline for speaker and quote extraction via a BERT-based Question Answering model. Then, we formulate expert recommendations as document retrieval task by retrieving relevant quotes first as an intermediate step for expert identification, and expert retrieval by directly retrieving sources based on the probability of a query conditional on a candidate expert. Experimental results on NewsQuote show that document retrieval is more effective in identifying relevant experts for a given news topic compared to expert retrieval

* 11 pages, 5 figures. 17TH International AAAI Conference on Web and Social Media; Mediate 2023: News Media and Computational Journalism Workshop

Via

Access Paper or Ask Questions

Discriminator-Guided Model-Based Offline Imitation Learning

Jul 05, 2022

Wenjia Zhang, Haoran Xu, Haoyi Niu, Peng Cheng, Ming Li, Heming Zhang, Guyue Zhou, Xianyuan Zhan

Figure 1 for Discriminator-Guided Model-Based Offline Imitation Learning

Figure 2 for Discriminator-Guided Model-Based Offline Imitation Learning

Figure 3 for Discriminator-Guided Model-Based Offline Imitation Learning

Figure 4 for Discriminator-Guided Model-Based Offline Imitation Learning

Abstract:Offline imitation learning (IL) is a powerful method to solve decision-making problems from expert demonstrations without reward labels. Existing offline IL methods suffer from severe performance degeneration under limited expert data due to covariate shift. Including a learned dynamics model can potentially improve the state-action space coverage of expert data, however, it also faces challenging issues like model approximation/generalization errors and suboptimality of rollout data. In this paper, we propose the Discriminator-guided Model-based offline Imitation Learning (DMIL) framework, which introduces a discriminator to simultaneously distinguish the dynamics correctness and suboptimality of model rollout data against real expert demonstrations. DMIL adopts a novel cooperative-yet-adversarial learning strategy, which uses the discriminator to guide and couple the learning process of the policy and dynamics model, resulting in improved model performance and robustness. Our framework can also be extended to the case when demonstrations contain a large proportion of suboptimal data. Experimental results show that DMIL and its extension achieve superior performance and robustness compared to state-of-the-art offline IL methods under small datasets.

Via

Access Paper or Ask Questions

A Manifold View of Adversarial Risk

Apr 08, 2022

Wenjia Zhang, Yikai Zhang, Xiaoling Hu, Mayank Goswami, Chao Chen, Dimitris Metaxas

Figure 1 for A Manifold View of Adversarial Risk

Figure 2 for A Manifold View of Adversarial Risk

Figure 3 for A Manifold View of Adversarial Risk

Figure 4 for A Manifold View of Adversarial Risk

Abstract:The adversarial risk of a machine learning model has been widely studied. Most previous works assume that the data lies in the whole ambient space. We propose to take a new angle and take the manifold assumption into consideration. Assuming data lies in a manifold, we investigate two new types of adversarial risk, the normal adversarial risk due to perturbation along normal direction, and the in-manifold adversarial risk due to perturbation within the manifold. We prove that the classic adversarial risk can be bounded from both sides using the normal and in-manifold adversarial risks. We also show with a surprisingly pessimistic case that the standard adversarial risk can be nonzero even when both normal and in-manifold risks are zero. We finalize the paper with empirical studies supporting our theoretical results. Our results suggest the possibility of improving the robustness of a classifier by only focusing on the normal adversarial risk.

Via

Access Paper or Ask Questions

Supervised Contrastive Learning for Multimodal Unreliable News Detection in COVID-19 Pandemic

Sep 09, 2021

Wenjia Zhang, Lin Gui, Yulan He

Figure 1 for Supervised Contrastive Learning for Multimodal Unreliable News Detection in COVID-19 Pandemic

Figure 2 for Supervised Contrastive Learning for Multimodal Unreliable News Detection in COVID-19 Pandemic

Figure 3 for Supervised Contrastive Learning for Multimodal Unreliable News Detection in COVID-19 Pandemic

Figure 4 for Supervised Contrastive Learning for Multimodal Unreliable News Detection in COVID-19 Pandemic

Abstract:As the digital news industry becomes the main channel of information dissemination, the adverse impact of fake news is explosively magnified. The credibility of a news report should not be considered in isolation. Rather, previously published news articles on the similar event could be used to assess the credibility of a news report. Inspired by this, we propose a BERT-based multimodal unreliable news detection framework, which captures both textual and visual information from unreliable articles utilising the contrastive learning strategy. The contrastive learner interacts with the unreliable news classifier to push similar credible news (or similar unreliable news) closer while moving news articles with similar content but opposite credibility labels away from each other in the multimodal embedding space. Experimental results on a COVID-19 related dataset, ReCOVery, show that our model outperforms a number of competitive baseline in unreliable news detection.

* Accepted by the CIKM 2021

Via

Access Paper or Ask Questions

Stability of SGD: Tightness Analysis and Improved Bounds

Feb 10, 2021

Yikai Zhang, Wenjia Zhang, Sammy Bald, Vamsi Pingali, Chao Chen, Mayank Goswami

Figure 1 for Stability of SGD: Tightness Analysis and Improved Bounds

Abstract:Stochastic Gradient Descent (SGD) based methods have been widely used for training large-scale machine learning models that also generalize well in practice. Several explanations have been offered for this generalization performance, a prominent one being algorithmic stability [18]. However, there are no known examples of smooth loss functions for which the analysis can be shown to be tight. Furthermore, apart from the properties of the loss function, data distribution has also been shown to be an important factor in generalization performance. This raises the question: is the stability analysis of [18] tight for smooth functions, and if not, for what kind of loss functions and data distributions can the stability analysis be improved? In this paper we first settle open questions regarding tightness of bounds in the data-independent setting: we show that for general datasets, the existing analysis for convex and strongly-convex loss functions is tight, but it can be improved for non-convex loss functions. Next, we give a novel and improved data-dependent bounds: we show stability upper bounds for a large class of convex regularized loss functions, with negligible regularization parameters, and improve existing data-dependent bounds in the non-convex setting. We hope that our results will initiate further efforts to better understand the data-dependent setting under non-convex loss functions, leading to an improved understanding of the generalization abilities of deep networks.

Via

Access Paper or Ask Questions