Abstract: Tabular data is the most common data type in real-world scenarios. In this study, we propose TabKANet, an architecture that uses a Kolmogorov-Arnold network (KAN) to encode numerical features and fuses them with categorical feature embeddings, enabling unified modeling of tabular data within a Transformer architecture. The model demonstrates outstanding performance on six widely used binary classification tasks, suggesting that TabKANet has the potential to become a standard approach for tabular modeling, surpassing traditional neural networks. Furthermore, this research reveals the significant advantages of the Kolmogorov-Arnold network in encoding numerical features. Our code is available at https://github.com/tsinghuamedgao20/TabKANet.
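
The abstract describes the design only at a high level. The following is a minimal PyTorch sketch of that design under stated assumptions: a KAN-style numerical encoder (approximated here with Gaussian radial-basis functions in place of learnable splines, a common simplification), per-column categorical embeddings, and a Transformer encoder over the merged token sequence. All names (KANLayer, TabKANetSketch, num_bases, etc.) are hypothetical; the authors' actual implementation lives in the linked repository.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class KANLayer(nn.Module):
    """KAN-style layer: each input passes through learnable univariate
    functions, approximated here with Gaussian RBF bases (a common
    simplification of spline-based KAN layers) plus a SiLU residual branch."""
    def __init__(self, in_dim, out_dim, num_bases=8, grid_min=-2.0, grid_max=2.0):
        super().__init__()
        self.register_buffer("centers", torch.linspace(grid_min, grid_max, num_bases))
        self.gamma = (num_bases - 1) / (grid_max - grid_min)
        # One coefficient vector per (input feature, basis) pair.
        self.coef = nn.Parameter(torch.randn(in_dim * num_bases, out_dim) * 0.1)
        self.base = nn.Linear(in_dim, out_dim)

    def forward(self, x):                        # x: (batch, in_dim)
        rbf = torch.exp(-((x.unsqueeze(-1) - self.centers) * self.gamma) ** 2)
        return self.base(F.silu(x)) + rbf.flatten(1) @ self.coef

class TabKANetSketch(nn.Module):
    """Encode numerical features with a KAN, embed categorical features,
    and model the merged token sequence with a Transformer encoder."""
    def __init__(self, num_numerical, cat_cardinalities, d_model=64, n_heads=4):
        super().__init__()
        self.num_numerical, self.d_model = num_numerical, d_model
        # The KAN maps the numerical features to one d_model token per feature.
        self.num_encoder = KANLayer(num_numerical, num_numerical * d_model)
        self.cat_embeds = nn.ModuleList(
            [nn.Embedding(card, d_model) for card in cat_cardinalities]
        )
        layer = nn.TransformerEncoderLayer(
            d_model, n_heads, dim_feedforward=128, batch_first=True
        )
        self.transformer = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(d_model, 1)        # binary classification logit

    def forward(self, x_num, x_cat):
        # x_num: (batch, num_numerical) floats; x_cat: (batch, n_cat) ints
        num_tokens = self.num_encoder(x_num).view(-1, self.num_numerical, self.d_model)
        cat_tokens = torch.stack(
            [emb(x_cat[:, i]) for i, emb in enumerate(self.cat_embeds)], dim=1
        )
        tokens = torch.cat([num_tokens, cat_tokens], dim=1)
        return self.head(self.transformer(tokens).mean(dim=1)).squeeze(-1)

# Example: 3 numerical features, 2 categorical columns with 5 and 10 levels.
model = TabKANetSketch(3, [5, 10])
logits = model(torch.randn(32, 3), torch.randint(0, 5, (32, 2)))  # shape (32,)
```

The key design point the abstract emphasizes is that numerical features get a learned nonlinear encoding (the KAN) rather than a plain linear projection, so they enter the Transformer on equal footing with the categorical embeddings.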
Abstract: Large multimodal models (LMMs) have achieved significant success in general domains. However, due to the significant differences between medical image-text data and general web content, the performance of LMMs in medical scenarios remains limited. In ophthalmology, clinical diagnosis relies on multiple modalities of medical images, yet multimodal ophthalmic large language models have not been explored to date. In this paper, we study and construct an ophthalmic large multimodal model. First, we use fundus images as an entry point to build a disease assessment and diagnosis pipeline covering common ophthalmic disease diagnosis and lesion segmentation. Then, we establish a new ophthalmic multimodal instruction-following and dialogue fine-tuning dataset based on disease-related knowledge data and publicly available real-world medical dialogues. We introduce visual ability into the large language model to build the Ophthalmic Large Language and Vision Assistant (OphGLM). Our experimental results demonstrate that OphGLM performs exceptionally well and has the potential to revolutionize clinical applications in ophthalmology. The dataset, code, and models will be made publicly available at https://github.com/ML-AILab/OphGLM.
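
The abstract does not specify how visual ability is wired into the language model. Below is a minimal, hypothetical sketch of one common pattern for this step (a vision encoder whose features are linearly projected into the LLM's embedding space and prepended to the text tokens, as in LLaVA-style systems); the class name VisionLanguageSketch, the projector layer, and the HuggingFace-style get_input_embeddings/inputs_embeds interface are all assumptions, not the authors' actual API.

```python
import torch
import torch.nn as nn

class VisionLanguageSketch(nn.Module):
    """Project vision-encoder features into the LLM embedding space and
    prepend them to the text tokens before running the language model."""
    def __init__(self, vision_encoder, language_model, vision_dim, text_dim):
        super().__init__()
        self.vision_encoder = vision_encoder    # e.g., a ViT over fundus images
        self.language_model = language_model    # a causal LM (HF-style, assumed)
        self.projector = nn.Linear(vision_dim, text_dim)  # aligns the two spaces

    def forward(self, images, input_ids):
        # Patch features from the fundus image: (batch, n_patches, vision_dim)
        vis_tokens = self.projector(self.vision_encoder(images))
        # Text token embeddings from the LLM's own embedding table.
        txt_tokens = self.language_model.get_input_embeddings()(input_ids)
        # Visual tokens act as a prefix the LLM attends to while generating.
        fused = torch.cat([vis_tokens, txt_tokens], dim=1)
        return self.language_model(inputs_embeds=fused)
```

Under this pattern, the instruction-following and dialogue dataset described in the abstract would supervise the combined model end to end, teaching the LLM to ground its ophthalmic answers in the projected image tokens.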