Yanhang Zhang

V-Doc : Visual questions answers with Documents

May 31, 2022

Yihao Ding, Zhe Huang, Runlin Wang, Yanhang Zhang, Xianru Chen, Yuzhong Ma, Hyunsuk Chung, Soyeon Caren Han

Abstract: We propose V-Doc, a question-answering tool using document images and PDFs, mainly for researchers and general non-deep-learning experts looking to generate, process, and understand document visual question answering tasks. V-Doc supports generating and using both extractive and abstractive question-answer pairs from document images. Extractive QA selects a subset of tokens or phrases from the document contents to predict the answers, while abstractive QA recognises the language in the content and generates the answer based on the trained model. Both aspects are crucial to understanding documents, especially in an image format. We include a detailed scenario of question generation for the abstractive QA task. V-Doc supports a wide range of datasets and models, and is highly extensible through a declarative, framework-agnostic platform.

* Accepted by CVPR 2022

abess: A Fast Best Subset Selection Library in Python and R

Oct 19, 2021

Jin Zhu, Liyuan Hu, Junhao Huang, Kangkang Jiang, Yanhang Zhang, Shiyun Lin, Junxian Zhu, Xueqin Wang

Abstract: We introduce a new library named abess that implements a unified framework of best-subset selection for solving diverse machine learning problems, e.g., linear regression, classification, and principal component analysis. In particular, abess certifiably attains the optimal solution in polynomial time under the linear model. Our efficient implementation allows abess to solve best-subset selection problems as fast as, or even 100x faster than, existing competing variable (model) selection toolboxes. Furthermore, it supports common variants such as best group subset selection and $\ell_2$-regularized best-subset selection. The core of the library is programmed in C++. For ease of use, a Python package is designed to integrate conveniently with scikit-learn, and it can be installed from the Python Package Index. In addition, a user-friendly R package is available from the Comprehensive R Archive Network. The source code is available at: https://github.com/abess-team/abess.
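
To make the problem statement concrete, the sketch below solves best-subset selection for a linear model by exhaustive search: it fits least squares on every subset of a given size and keeps the one with the smallest residual sum of squares. This is not the abess algorithm or its API; it is the naive baseline whose combinatorial cost a polynomial-time method is meant to avoid. The function names (`solve_linear_system`, `fit_subset`, `best_subset`) and the toy data are hypothetical, chosen only for illustration.

```python
from itertools import combinations


def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting.

    Assumes `a` is square and non-singular (illustrative helper only).
    """
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_subset(X, y, subset):
    """Least squares on the columns in `subset`; returns (coefficients, rss)."""
    cols = [[row[j] for row in X] for j in subset]
    gram = [[sum(u * v for u, v in zip(ci, cj)) for cj in cols] for ci in cols]
    rhs = [sum(u * v for u, v in zip(ci, y)) for ci in cols]
    beta = solve_linear_system(gram, rhs)  # normal equations
    rss = sum(
        (yi - sum(b * row[j] for b, j in zip(beta, subset))) ** 2
        for row, yi in zip(X, y)
    )
    return beta, rss


def best_subset(X, y, size):
    """Exhaustively search all column subsets of the given size."""
    best = None
    for subset in combinations(range(len(X[0])), size):
        beta, rss = fit_subset(X, y, subset)
        if best is None or rss < best[2]:
            best = (subset, beta, rss)
    return best


# Toy data (made up): y depends only on columns 0 and 2.
X = [[1, 1, 0, 1], [2, 0, 1, 1], [3, 1, 0, 2],
     [4, 0, 2, 3], [5, 1, 1, 5], [6, 0, 3, 8]]
y = [2 * row[0] - 3 * row[2] for row in X]
subset, beta, rss = best_subset(X, y, size=2)
print(subset, beta, rss)
```

The search visits every one of the C(p, s) subsets, so it is only usable for very small p; it serves as a reference point for what "optimal solution" means in the abstract above.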

Certifiably Polynomial Algorithm for Best Group Subset Selection

Apr 23, 2021

Yanhang Zhang, Junxian Zhu, Jin Zhu, Xueqin Wang

Abstract: Best group subset selection aims to choose a small number of non-overlapping groups that best explain the response variable. It is practically attractive for group variable selection; however, because it is computationally intractable in high-dimensional settings, it has not received enough attention. To fill the gap in efficient algorithms for best group subset selection, in this paper we propose a group-splicing algorithm that iteratively detects effective groups and excludes unhelpful ones. Moreover, coupled with a novel Bayesian group information criterion, an adaptive algorithm is developed to determine the true group subset size. It is certifiable that our algorithms identify the optimal group subset in polynomial time under mild conditions. We demonstrate the efficiency and accuracy of our proposal by comparing it with state-of-the-art algorithms on both synthetic and real-world datasets.

* 45 pages, 2 figures
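
For orientation, best group subset selection under a linear model is commonly written as the constrained least-squares problem below. The notation is an assumption made here, not taken from the abstract: $n$ observations, response $y$, $J$ non-overlapping groups $G_1, \dots, G_J$ with design blocks $X_{G_j}$ and coefficients $\beta_{G_j}$, and a group subset size $s$.

```latex
\min_{\beta} \; \frac{1}{2n} \Bigl\| y - \sum_{j=1}^{J} X_{G_j} \beta_{G_j} \Bigr\|_2^2
\quad \text{subject to} \quad
\sum_{j=1}^{J} \mathbb{1}\bigl( \| \beta_{G_j} \|_2 \neq 0 \bigr) \le s .
```

The constraint counts the groups with any nonzero coefficient, so at most $s$ groups enter the model; with every group of size one, this reduces to ordinary best-subset selection.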
