Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haijing Liu

Institute of Software, Chinese Academy of Sciences, University of Chinese Academy of Sciences

Category-Adaptive Cross-Modal Semantic Refinement and Transfer for Open-Vocabulary Multi-Label Recognition

Dec 09, 2024

Haijing Liu, Tao Pu, Hefeng Wu, Keze Wang, Liang Lin

Abstract:Benefiting from the generalization capability of CLIP, recent vision language pre-training (VLP) models have demonstrated an impressive ability to capture virtually any visual concept in daily images. However, due to the presence of unseen categories in open-vocabulary settings, existing algorithms struggle to effectively capture strong semantic correlations between categories, resulting in sub-optimal performance on the open-vocabulary multi-label recognition (OV-MLR). Furthermore, the substantial variation in the number of discriminative areas across diverse object categories is misaligned with the fixed-number patch matching used in current methods, introducing noisy visual cues that hinder the accurate capture of target semantics. To tackle these challenges, we propose a novel category-adaptive cross-modal semantic refinement and transfer (C$^2$SRT) framework to explore the semantic correlation both within each category and across different categories, in a category-adaptive manner. The proposed framework consists of two complementary modules, i.e., intra-category semantic refinement (ISR) module and inter-category semantic transfer (IST) module. Specifically, the ISR module leverages the cross-modal knowledge of the VLP model to adaptively find a set of local discriminative regions that best represent the semantics of the target category. The IST module adaptively discovers a set of most correlated categories for a target category by utilizing the commonsense capabilities of LLMs to construct a category-adaptive correlation graph and transfers semantic knowledge from the correlated seen categories to unseen ones. Extensive experiments on OV-MLR benchmarks clearly demonstrate that the proposed C$^2$SRT framework outperforms current state-of-the-art algorithms.

* 15 pages

Via

Access Paper or Ask Questions

Using Argument-based Features to Predict and Analyse Review Helpfulness

Jul 23, 2017

Haijing Liu, Yang Gao, Pin Lv, Mengxue Li, Shiqiang Geng, Minglan Li, Hao Wang

Figure 1 for Using Argument-based Features to Predict and Analyse Review Helpfulness

Figure 2 for Using Argument-based Features to Predict and Analyse Review Helpfulness

Abstract:We study the helpful product reviews identification problem in this paper. We observe that the evidence-conclusion discourse relations, also known as arguments, often appear in product reviews, and we hypothesise that some argument-based features, e.g. the percentage of argumentative sentences, the evidences-conclusions ratios, are good indicators of helpful reviews. To validate this hypothesis, we manually annotate arguments in 110 hotel reviews, and investigate the effectiveness of several combinations of argument-based features. Experiments suggest that, when being used together with the argument-based features, the state-of-the-art baseline features can enjoy a performance boost (in terms of F1) of 11.01\% in average.

* 6 pages, EMNLP2017

Via

Access Paper or Ask Questions

Joint RNN Model for Argument Component Boundary Detection

May 05, 2017

Minglan Li, Yang Gao, Hui Wen, Yang Du, Haijing Liu, Hao Wang

Figure 1 for Joint RNN Model for Argument Component Boundary Detection

Figure 2 for Joint RNN Model for Argument Component Boundary Detection

Figure 3 for Joint RNN Model for Argument Component Boundary Detection

Figure 4 for Joint RNN Model for Argument Component Boundary Detection

Abstract:Argument Component Boundary Detection (ACBD) is an important sub-task in argumentation mining; it aims at identifying the word sequences that constitute argument components, and is usually considered as the first sub-task in the argumentation mining pipeline. Existing ACBD methods heavily depend on task-specific knowledge, and require considerable human efforts on feature-engineering. To tackle these problems, in this work, we formulate ACBD as a sequence labeling problem and propose a variety of Recurrent Neural Network (RNN) based methods, which do not use domain specific or handcrafted features beyond the relative position of the sentence in the document. In particular, we propose a novel joint RNN model that can predict whether sentences are argumentative or not, and use the predicted results to more precisely detect the argument component boundaries. We evaluate our techniques on two corpora from two different genres; results suggest that our joint RNN model obtain the state-of-the-art performance on both datasets.

* 6 pages, 3 figures, submitted to IEEE SMC 2017

Via

Access Paper or Ask Questions

Crowdsourcing Argumentation Structures in Chinese Hotel Reviews

May 05, 2017

Mengxue Li, Shiqiang Geng, Yang Gao, Haijing Liu, Hao Wang

Figure 1 for Crowdsourcing Argumentation Structures in Chinese Hotel Reviews

Figure 2 for Crowdsourcing Argumentation Structures in Chinese Hotel Reviews

Figure 3 for Crowdsourcing Argumentation Structures in Chinese Hotel Reviews

Figure 4 for Crowdsourcing Argumentation Structures in Chinese Hotel Reviews

Abstract:Argumentation mining aims at automatically extracting the premises-claim discourse structures in natural language texts. There is a great demand for argumentation corpora for customer reviews. However, due to the controversial nature of the argumentation annotation task, there exist very few large-scale argumentation corpora for customer reviews. In this work, we novelly use the crowdsourcing technique to collect argumentation annotations in Chinese hotel reviews. As the first Chinese argumentation dataset, our corpus includes 4814 argument component annotations and 411 argument relation annotations, and its annotations qualities are comparable to some widely used argumentation corpora in other languages.

* 6 pages,3 figures,This article has been submitted to "The 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC2017)"

Via

Access Paper or Ask Questions