Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xiaohong Li

Explore-Construct-Filter: An Automated Framework for Rich and Reliable API Knowledge Graph Construction

Feb 19, 2025

Yanbang Sun, Qing Huang, Xiaoxue Ren, Zhenchang Xing, Xiaohong Li, Junjie Wang

Abstract:The API Knowledge Graph (API KG) is a structured network that models API entities and their relations, providing essential semantic insights for tasks such as API recommendation, code generation, and API misuse detection. However, constructing a knowledge-rich and reliable API KG presents several challenges. Existing schema-based methods rely heavily on manual annotations to design KG schemas, leading to excessive manual overhead. On the other hand, schema-free methods, due to the lack of schema guidance, are prone to introducing noise, reducing the KG's reliability. To address these issues, we propose the Explore-Construct-Filter framework, an automated approach for API KG construction based on large language models (LLMs). This framework consists of three key modules: 1) KG exploration: LLMs simulate the workflow of annotators to automatically design a schema with comprehensive type triples, minimizing human intervention; 2) KG construction: Guided by the schema, LLMs extract instance triples to construct a rich yet unreliable API KG; 3) KG filtering: Removing invalid type triples and suspicious instance triples to construct a rich and reliable API KG. Experimental results demonstrate that our method surpasses the state-of-the-art method, achieving a 25.2% improvement in F1 score. Moreover, the Explore-Construct-Filter framework proves effective, with the KG exploration module increasing KG richness by 133.6% and the KG filtering module improving reliability by 26.6%. Finally, cross-model experiments confirm the generalizability of our framework.

Via

Access Paper or Ask Questions

Vision-Based Defect Classification and Weight Estimation of Rice Kernels

Oct 06, 2022

Xiang Wang, Kai Wang, Xiaohong Li, Shiguo Lian

Figure 1 for Vision-Based Defect Classification and Weight Estimation of Rice Kernels

Figure 2 for Vision-Based Defect Classification and Weight Estimation of Rice Kernels

Figure 3 for Vision-Based Defect Classification and Weight Estimation of Rice Kernels

Figure 4 for Vision-Based Defect Classification and Weight Estimation of Rice Kernels

Abstract:Rice is one of the main staple food in many areas of the world. The quality estimation of rice kernels are crucial in terms of both food safety and socio-economic impact. This was usually carried out by quality inspectors in the past, which may result in both objective and subjective inaccuracies. In this paper, we present an automatic visual quality estimation system of rice kernels, to classify the sampled rice kernels according to their types of flaws, and evaluate their quality via the weight ratios of the perspective kernel types. To compensate for the imbalance of different kernel numbers and classify kernels with multiple flaws accurately, we propose a multi-stage workflow which is able to locate the kernels in the captured image and classify their properties. We define a novel metric to measure the relative weight of each kernel in the image from its area, such that the relative weight of each type of kernels with regard to the all samples can be computed and used as the basis for rice quality estimation. Various experiments are carried out to show that our system is able to output precise results in a contactless way and replace tedious and error-prone manual works.

* 10 pages, 10 figures

Via

Access Paper or Ask Questions

A Novel Speech-Driven Lip-Sync Model with CNN and LSTM

May 02, 2022

Xiaohong Li, Xiang Wang, Kai Wang, Shiguo Lian

Figure 1 for A Novel Speech-Driven Lip-Sync Model with CNN and LSTM

Figure 2 for A Novel Speech-Driven Lip-Sync Model with CNN and LSTM

Figure 3 for A Novel Speech-Driven Lip-Sync Model with CNN and LSTM

Figure 4 for A Novel Speech-Driven Lip-Sync Model with CNN and LSTM

Abstract:Generating synchronized and natural lip movement with speech is one of the most important tasks in creating realistic virtual characters. In this paper, we present a combined deep neural network of one-dimensional convolutions and LSTM to generate vertex displacement of a 3D template face model from variable-length speech input. The motion of the lower part of the face, which is represented by the vertex movement of 3D lip shapes, is consistent with the input speech. In order to enhance the robustness of the network to different sound signals, we adapt a trained speech recognition model to extract speech feature, and a velocity loss term is adopted to reduce the jitter of generated facial animation. We recorded a series of videos of a Chinese adult speaking Mandarin and created a new speech-animation dataset to compensate the lack of such public data. Qualitative and quantitative evaluations indicate that our model is able to generate smooth and natural lip movements synchronized with speech.

* This paper has been published on CISP-BMEI 2021. See https://ieeexplore.ieee.org/document/9624360

Via

Access Paper or Ask Questions

AVA: Adversarial Vignetting Attack against Visual Recognition

May 12, 2021

Binyu Tian, Felix Juefei-Xu, Qing Guo, Xiaofei Xie, Xiaohong Li, Yang Liu

Figure 1 for AVA: Adversarial Vignetting Attack against Visual Recognition

Figure 2 for AVA: Adversarial Vignetting Attack against Visual Recognition

Figure 3 for AVA: Adversarial Vignetting Attack against Visual Recognition

Figure 4 for AVA: Adversarial Vignetting Attack against Visual Recognition

Abstract:Vignetting is an inherited imaging phenomenon within almost all optical systems, showing as a radial intensity darkening toward the corners of an image. Since it is a common effect for photography and usually appears as a slight intensity variation, people usually regard it as a part of a photo and would not even want to post-process it. Due to this natural advantage, in this work, we study vignetting from a new viewpoint, i.e., adversarial vignetting attack (AVA), which aims to embed intentionally misleading information into vignetting and produce a natural adversarial example without noise patterns. This example can fool the state-of-the-art deep convolutional neural networks (CNNs) but is imperceptible to humans. To this end, we first propose the radial-isotropic adversarial vignetting attack (RI-AVA) based on the physical model of vignetting, where the physical parameters (e.g., illumination factor and focal length) are tuned through the guidance of target CNN models. To achieve higher transferability across different CNNs, we further propose radial-anisotropic adversarial vignetting attack (RA-AVA) by allowing the effective regions of vignetting to be radial-anisotropic and shape-free. Moreover, we propose the geometry-aware level-set optimization method to solve the adversarial vignetting regions and physical parameters jointly. We validate the proposed methods on three popular datasets, i.e., DEV, CIFAR10, and Tiny ImageNet, by attacking four CNNs, e.g., ResNet50, EfficientNet-B0, DenseNet121, and MobileNet-V2, demonstrating the advantages of our methods over baseline methods on both transferability and image quality.

* This work has been accepted to IJCAI2021

Via

Access Paper or Ask Questions

IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation

Apr 16, 2021

Samuel Cahyawijaya, Genta Indra Winata, Bryan Wilie, Karissa Vincentio, Xiaohong Li, Adhiguna Kuncoro, Sebastian Ruder, Zhi Yuan Lim, Syafri Bahar, Masayu Leylia Khodra(+2 more)

Figure 1 for IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation

Figure 2 for IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation

Figure 3 for IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation

Figure 4 for IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation

Abstract:A benchmark provides an ecosystem to measure the advancement of models with standard datasets and automatic and human evaluation metrics. We introduce IndoNLG, the first such benchmark for the Indonesian language for natural language generation (NLG). It covers six tasks: summarization, question answering, open chitchat, as well as three different language-pairs of machine translation tasks. We provide a vast and clean pre-training corpus of Indonesian, Sundanese, and Javanese datasets called Indo4B-Plus, which is used to train our pre-trained NLG model, IndoBART. We evaluate the effectiveness and efficiency of IndoBART by conducting extensive evaluation on all IndoNLG tasks. Our findings show that IndoBART achieves competitive performance on Indonesian tasks with five times fewer parameters compared to the largest multilingual model in our benchmark, mBART-LARGE (Liu et al., 2020), and an almost 4x and 2.5x faster inference time on the CPU and GPU respectively. We additionally demonstrate the ability of IndoBART to learn Javanese and Sundanese, and it achieves decent performance on machine translation tasks.

* Work in progress, 10 pages

Via

Access Paper or Ask Questions

Stereo Correspondence and Reconstruction of Endoscopic Data Challenge

Jan 28, 2021

Max Allan, Jonathan Mcleod, Congcong Wang, Jean Claude Rosenthal, Zhenglei Hu, Niklas Gard, Peter Eisert, Ke Xue Fu, Trevor Zeffiro, Wenyao Xia(+14 more)

Figure 1 for Stereo Correspondence and Reconstruction of Endoscopic Data Challenge

Figure 2 for Stereo Correspondence and Reconstruction of Endoscopic Data Challenge

Figure 3 for Stereo Correspondence and Reconstruction of Endoscopic Data Challenge

Figure 4 for Stereo Correspondence and Reconstruction of Endoscopic Data Challenge

Abstract:The stereo correspondence and reconstruction of endoscopic data sub-challenge was organized during the Endovis challenge at MICCAI 2019 in Shenzhen, China. The task was to perform dense depth estimation using 7 training datasets and 2 test sets of structured light data captured using porcine cadavers. These were provided by a team at Intuitive Surgical. 10 teams participated in the challenge day. This paper contains 3 additional methods which were submitted after the challenge finished as well as a supplemental section from these teams on issues they found with the dataset.

Via

Access Paper or Ask Questions

Generating Informative CVE Description From ExploitDB Posts by Extractive Summarization

Jan 05, 2021

Jiamou Sun, Zhenchang Xing, Hao Guo, Deheng Ye, Xiaohong Li, Xiwei Xu, Liming Zhu

Figure 1 for Generating Informative CVE Description From ExploitDB Posts by Extractive Summarization

Figure 2 for Generating Informative CVE Description From ExploitDB Posts by Extractive Summarization

Figure 3 for Generating Informative CVE Description From ExploitDB Posts by Extractive Summarization

Figure 4 for Generating Informative CVE Description From ExploitDB Posts by Extractive Summarization

Abstract:ExploitDB is one of the important public websites, which contributes a large number of vulnerabilities to official CVE database. Over 60\% of these vulnerabilities have high- or critical-security risks. Unfortunately, over 73\% of exploits appear publicly earlier than the corresponding CVEs, and about 40\% of exploits do not even have CVEs. To assist in documenting CVEs for the ExploitDB posts, we propose an open information method to extract 9 key vulnerability aspects (vulnerable product/version/component, vulnerability type, vendor, attacker type, root cause, attack vector and impact) from the verbose and noisy ExploitDB posts. The extracted aspects from an ExploitDB post are then composed into a CVE description according to the suggested CVE description templates, which is must-provided information for requesting new CVEs. Through the evaluation on 13,017 manually labeled sentences and the statistically sampling of 3,456 extracted aspects, we confirm the high accuracy of our extraction method. Compared with 27,230 reference CVE descriptions. Our composed CVE descriptions achieve high ROUGH-L (0.38), a longest common subsequence based metric for evaluating text summarization methods.

Via

Access Paper or Ask Questions

IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding

Oct 08, 2020

Bryan Wilie, Karissa Vincentio, Genta Indra Winata, Samuel Cahyawijaya, Xiaohong Li, Zhi Yuan Lim, Sidik Soleman, Rahmad Mahendra, Pascale Fung, Syafri Bahar(+1 more)

Figure 1 for IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding

Figure 2 for IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding

Figure 3 for IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding

Figure 4 for IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding

Abstract:Although Indonesian is known to be the fourth most frequently used language over the internet, the research progress on this language in the natural language processing (NLP) is slow-moving due to a lack of available resources. In response, we introduce the first-ever vast resource for the training, evaluating, and benchmarking on Indonesian natural language understanding (IndoNLU) tasks. IndoNLU includes twelve tasks, ranging from single sentence classification to pair-sentences sequence labeling with different levels of complexity. The datasets for the tasks lie in different domains and styles to ensure task diversity. We also provide a set of Indonesian pre-trained models (IndoBERT) trained from a large and clean Indonesian dataset Indo4B collected from publicly available sources such as social media texts, blogs, news, and websites. We release baseline models for all twelve tasks, as well as the framework for benchmark evaluation, and thus it enables everyone to benchmark their system performances.

* This paper will be presented in AACL-IJCNLP 2020 (with new results and acknowledgment)

Via

Access Paper or Ask Questions

Bias Field Poses a Threat to DNN-based X-Ray Recognition

Sep 19, 2020

Binyu Tian, Qing Guo, Felix Juefei-Xu, Wen Le Chan, Yupeng Cheng, Xiaohong Li, Xiaofei Xie, Shengchao Qin

Figure 1 for Bias Field Poses a Threat to DNN-based X-Ray Recognition

Figure 2 for Bias Field Poses a Threat to DNN-based X-Ray Recognition

Figure 3 for Bias Field Poses a Threat to DNN-based X-Ray Recognition

Figure 4 for Bias Field Poses a Threat to DNN-based X-Ray Recognition

Abstract:The chest X-ray plays a key role in screening and diagnosis of many lung diseases including the COVID-19. More recently, many works construct deep neural networks (DNNs) for chest X-ray images to realize automated and efficient diagnosis of lung diseases. However, bias field caused by the improper medical image acquisition process widely exists in the chest X-ray images while the robustness of DNNs to the bias field is rarely explored, which definitely poses a threat to the X-ray-based automated diagnosis system. In this paper, we study this problem based on the recent adversarial attack and propose a brand new attack, i.e., the adversarial bias field attack where the bias field instead of the additive noise works as the adversarial perturbations for fooling the DNNs. This novel attack posts a key problem: how to locally tune the bias field to realize high attack success rate while maintaining its spatial smoothness to guarantee high realisticity. These two goals contradict each other and thus has made the attack significantly challenging. To overcome this challenge, we propose the adversarial-smooth bias field attack that can locally tune the bias field with joint smooth & adversarial constraints. As a result, the adversarial X-ray images can not only fool the DNNs effectively but also retain very high level of realisticity. We validate our method on real chest X-ray datasets with powerful DNNs, e.g., ResNet50, DenseNet121, and MobileNet, and show different properties to the state-of-the-art attacks in both image realisticity and attack transferability. Our method reveals the potential threat to the DNN-based X-ray automated diagnosis and can definitely benefit the development of bias-field-robust automated diagnosis system.

* 9 pages, 6 figures

Via

Access Paper or Ask Questions

An Empirical Study towards Characterizing Deep Learning Development and Deployment across Different Frameworks and Platforms

Sep 15, 2019

Qianyu Guo, Sen Chen, Xiaofei Xie, Lei Ma, Qiang Hu, Hongtao Liu, Yang Liu, Jianjun Zhao, Xiaohong Li

Figure 1 for An Empirical Study towards Characterizing Deep Learning Development and Deployment across Different Frameworks and Platforms

Figure 2 for An Empirical Study towards Characterizing Deep Learning Development and Deployment across Different Frameworks and Platforms

Figure 3 for An Empirical Study towards Characterizing Deep Learning Development and Deployment across Different Frameworks and Platforms

Figure 4 for An Empirical Study towards Characterizing Deep Learning Development and Deployment across Different Frameworks and Platforms

Abstract:Deep Learning (DL) has recently achieved tremendous success. A variety of DL frameworks and platforms play a key role to catalyze such progress. However, the differences in architecture designs and implementations of existing frameworks and platforms bring new challenges for DL software development and deployment. Till now, there is no study on how various mainstream frameworks and platforms influence both DL software development and deployment in practice. To fill this gap, we take the first step towards understanding how the most widely-used DL frameworks and platforms support the DL software development and deployment. We conduct a systematic study on these frameworks and platforms by using two types of DNN architectures and three popular datasets. (1) For development process, we investigate the prediction accuracy under the same runtime training configuration or same model weights/biases. We also study the adversarial robustness of trained models by leveraging the existing adversarial attack techniques. The experimental results show that the computing differences across frameworks could result in an obvious prediction accuracy decline, which should draw the attention of DL developers. (2) For deployment process, we investigate the prediction accuracy and performance (refers to time cost and memory consumption) when the trained models are migrated/quantized from PC to real mobile devices and web browsers. The DL platform study unveils that the migration and quantization still suffer from compatibility and reliability issues. Meanwhile, we find several DL software bugs by using the results as a benchmark. We further validate the results through bug confirmation from stakeholders and industrial positive feedback to highlight the implications of our study. Through our study, we summarize practical guidelines, identify challenges and pinpoint new research directions.

Via

Access Paper or Ask Questions