Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yingli Li

3D-CT-GPT: Generating 3D Radiology Reports through Integration of Large Vision-Language Models

Sep 28, 2024

Hao Chen, Wei Zhao, Yingli Li, Tianyang Zhong, Yisong Wang, Youlan Shang, Lei Guo, Junwei Han, Tianming Liu, Jun Liu(+1 more)

Figure 1 for 3D-CT-GPT: Generating 3D Radiology Reports through Integration of Large Vision-Language Models

Figure 2 for 3D-CT-GPT: Generating 3D Radiology Reports through Integration of Large Vision-Language Models

Figure 3 for 3D-CT-GPT: Generating 3D Radiology Reports through Integration of Large Vision-Language Models

Figure 4 for 3D-CT-GPT: Generating 3D Radiology Reports through Integration of Large Vision-Language Models

Abstract:Medical image analysis is crucial in modern radiological diagnostics, especially given the exponential growth in medical imaging data. The demand for automated report generation systems has become increasingly urgent. While prior research has mainly focused on using machine learning and multimodal language models for 2D medical images, the generation of reports for 3D medical images has been less explored due to data scarcity and computational complexities. This paper introduces 3D-CT-GPT, a Visual Question Answering (VQA)-based medical visual language model specifically designed for generating radiology reports from 3D CT scans, particularly chest CTs. Extensive experiments on both public and private datasets demonstrate that 3D-CT-GPT significantly outperforms existing methods in terms of report accuracy and quality. Although current methods are few, including the partially open-source CT2Rep and the open-source M3D, we ensured fair comparison through appropriate data conversion and evaluation methodologies. Experimental results indicate that 3D-CT-GPT enhances diagnostic accuracy and report coherence, establishing itself as a robust solution for clinical radiology report generation. Future work will focus on expanding the dataset and further optimizing the model to enhance its performance and applicability.

Via

Access Paper or Ask Questions

A Multi-Domain Feature Learning Method for Visual Place Recognition

Feb 26, 2019

Peng Yin, Lingyun Xu, Xueqian Li, Chen Yin, Yingli Li, Rangaprasad Arun Srivatsan, Lu Li, Jianmin Ji, Yuqing He

Figure 1 for A Multi-Domain Feature Learning Method for Visual Place Recognition

Figure 2 for A Multi-Domain Feature Learning Method for Visual Place Recognition

Figure 3 for A Multi-Domain Feature Learning Method for Visual Place Recognition

Figure 4 for A Multi-Domain Feature Learning Method for Visual Place Recognition

Abstract:Visual Place Recognition (VPR) is an important component in both computer vision and robotics applications, thanks to its ability to determine whether a place has been visited and where specifically. A major challenge in VPR is to handle changes of environmental conditions including weather, season and illumination. Most VPR methods try to improve the place recognition performance by ignoring the environmental factors, leading to decreased accuracy decreases when environmental conditions change significantly, such as day versus night. To this end, we propose an end-to-end conditional visual place recognition method. Specifically, we introduce the multi-domain feature learning method (MDFL) to capture multiple attribute-descriptions for a given place, and then use a feature detaching module to separate the environmental condition-related features from those that are not. The only label required within this feature learning pipeline is the environmental condition. Evaluation of the proposed method is conducted on the multi-season \textit{NORDLAND} dataset, and the multi-weather \textit{GTAV} dataset. Experimental results show that our method improves the feature robustness against variant environmental conditions.

* 6 pages, 5 figures, ICRA 2019 accepted

Via

Access Paper or Ask Questions