Yuxiang Zhong

AcademicGPT: Empowering Academic Research

Nov 21, 2023

Shufa Wei, Xiaolong Xu, Xianbiao Qi, Xi Yin, Jun Xia, Jingyi Ren, Peijun Tang, Yuxiang Zhong, Yihao Chen, Xiaoqin Ren(+14 more)

Abstract: Large Language Models (LLMs) have demonstrated exceptional capabilities across various natural language processing tasks. Yet many of these advanced LLMs are tailored for broad, general-purpose applications. In this technical report, we introduce AcademicGPT, designed specifically to empower academic research. AcademicGPT is a model derived from LLaMA2-70B through continual training. Our training corpus consists mainly of academic papers, theses, content from some academic domains, high-quality Chinese data, and other sources. While its data scale may not be extensive, AcademicGPT marks our initial venture into a domain-specific GPT tailored for research. We evaluate AcademicGPT on several established public benchmarks, such as MMLU and CEval, as well as on specialized academic benchmarks such as PubMedQA, SCIEval, and our newly created ComputerScienceQA, to demonstrate its general-knowledge, Chinese-language, and academic abilities. Building on the AcademicGPT foundation model, we also developed several applications for academic use, including General Academic Question Answering, AI-assisted Paper Reading, Paper Review, and AI-assisted Title and Abstract Generation.

* Technical Report. arXiv admin note: text overlap with arXiv:2310.12081, arXiv:2310.10053 by other authors

1st Place Solution for ICDAR 2021 Competition on Mathematical Formula Detection

Jul 12, 2021

Yuxiang Zhong, Xianbiao Qi, Shanjun Li, Dengyi Gu, Yihao Chen, Peiyang Ning, Rong Xiao

Abstract: In this technical report, we present our 1st place solution for the ICDAR 2021 competition on mathematical formula detection (MFD). The MFD task poses three key challenges: a large scale span, large variation in the height-to-width ratio, and a rich character set and range of mathematical expressions. Considering these challenges, we used Generalized Focal Loss (GFL), an anchor-free method, instead of an anchor-based method, and show that Adaptive Training Sample Selection (ATSS) and a proper Feature Pyramid Network (FPN) handle the important issue of scale variation well. We also found that some tricks, e.g., Deformable Convolution Network (DCN), SyncBN, and Weighted Box Fusion (WBF), were effective for the MFD task. Our proposed method ranked 1st among the 15 teams in the final.

* 1st Place Solution for ICDAR 2021 Competition on Mathematical Formula Detection. http://transcriptorium.eu/~htrcontest/MathsICDAR2021/
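
The abstract above names Weighted Box Fusion (WBF) as one of the tricks that helped, without describing it. As a rough illustration only, the sketch below shows the general idea of fusing overlapping detections by confidence-weighted averaging of their coordinates. It is a simplified version and not the authors' implementation: the names fuse_boxes, weighted_mean, and iou, the 0.55 IoU threshold, and the use of the plain mean score as the fused confidence are all assumptions made for this example, and it assumes every score is positive.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), hypothetical layout


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def weighted_mean(boxes: List[Box], weights: List[float]) -> Box:
    """Average each coordinate, weighting every box by its confidence."""
    total = sum(weights)
    return tuple(
        sum(w * b[k] for b, w in zip(boxes, weights)) / total for k in range(4)
    )


def fuse_boxes(
    boxes: List[Box], scores: List[float], iou_thr: float = 0.55
) -> List[Tuple[Box, float]]:
    """Simplified box fusion: cluster by IoU, then average within clusters.

    Boxes are visited in descending score order. Each box joins the first
    cluster whose current fused box overlaps it by more than iou_thr;
    otherwise it starts a new cluster. The iou_thr default is an assumption.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    clusters: List[List[int]] = []  # indices of the boxes in each cluster
    fused: List[Box] = []           # current fused box of each cluster
    for i in order:
        for c, fused_box in enumerate(fused):
            if iou(boxes[i], fused_box) > iou_thr:
                clusters[c].append(i)
                members = clusters[c]
                fused[c] = weighted_mean(
                    [boxes[j] for j in members], [scores[j] for j in members]
                )
                break
        else:
            clusters.append([i])
            fused.append(boxes[i])
    # Fused confidence here is the plain mean of member scores (a simplification).
    return [
        (fused[c], sum(scores[j] for j in members) / len(members))
        for c, members in enumerate(clusters)
    ]


if __name__ == "__main__":
    detections = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
    confidences = [0.9, 0.6, 0.8]
    for box, score in fuse_boxes(detections, confidences):
        print(box, score)
```

Unlike non-maximum suppression, which keeps one box per overlapping group and discards the rest, this kind of fusion lets every overlapping prediction contribute to the final coordinates, which is why it is often used when combining the outputs of several detectors.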
