Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jingshu Liu

Large Language Model Adaptation for Financial Sentiment Analysis

Jan 26, 2024

Pau Rodriguez Inserte, Mariam Nakhlé, Raheel Qader, Gaetan Caillaut, Jingshu Liu

Abstract:Natural language processing (NLP) has recently gained relevance within financial institutions by providing highly valuable insights into companies and markets' financial documents. However, the landscape of the financial domain presents extra challenges for NLP, due to the complexity of the texts and the use of specific terminology. Generalist language models tend to fall short in tasks specifically tailored for finance, even when using large language models (LLMs) with great natural language understanding and generative capabilities. This paper presents a study on LLM adaptation methods targeted at the financial domain and with high emphasis on financial sentiment analysis. To this purpose, two foundation models with less than 1.5B parameters have been adapted using a wide range of strategies. We show that through careful fine-tuning on both financial documents and instructions, these foundation models can be adapted to the target domain. Moreover, we observe that small LLMs have comparable performance to larger scale models, while being more efficient in terms of parameters and data. In addition to the models, we show how to generate artificial instructions through LLMs to augment the number of samples of the instruction dataset.

Via

Access Paper or Ask Questions

A Machine Learning Approach for Recruitment Prediction in Clinical Trial Design

Nov 14, 2021

Jingshu Liu, Patricia J Allen, Luke Benz, Daniel Blickstein, Evon Okidi, Xiao Shi

Figure 1 for A Machine Learning Approach for Recruitment Prediction in Clinical Trial Design

Figure 2 for A Machine Learning Approach for Recruitment Prediction in Clinical Trial Design

Figure 3 for A Machine Learning Approach for Recruitment Prediction in Clinical Trial Design

Figure 4 for A Machine Learning Approach for Recruitment Prediction in Clinical Trial Design

Abstract:Significant advancements have been made in recent years to optimize patient recruitment for clinical trials, however, improved methods for patient recruitment prediction are needed to support trial site selection and to estimate appropriate enrollment timelines in the trial design stage. In this paper, using data from thousands of historical clinical trials, we explore machine learning methods to predict the number of patients enrolled per month at a clinical trial site over the course of a trial's enrollment duration. We show that these methods can reduce the error that is observed with current industry standards and propose opportunities for further improvement.

* Machine Learning for Health (ML4H) - Extended Abstract

Via

Access Paper or Ask Questions

BERT-XML: Large Scale Automated ICD Coding Using BERT Pretraining

May 26, 2020

Zachariah Zhang, Jingshu Liu, Narges Razavian

Figure 1 for BERT-XML: Large Scale Automated ICD Coding Using BERT Pretraining

Figure 2 for BERT-XML: Large Scale Automated ICD Coding Using BERT Pretraining

Figure 3 for BERT-XML: Large Scale Automated ICD Coding Using BERT Pretraining

Figure 4 for BERT-XML: Large Scale Automated ICD Coding Using BERT Pretraining

Abstract:Clinical interactions are initially recorded and documented in free text medical notes. ICD coding is the task of classifying and coding all diagnoses, symptoms and procedures associated with a patient's visit. The process is often manual and extremely time-consuming and expensive for hospitals. In this paper, we propose a machine learning model, BERT-XML, for large scale automated ICD coding from EHR notes, utilizing recently developed unsupervised pretraining that have achieved state of the art performance on a variety of NLP tasks. We train a BERT model from scratch on EHR notes, learning with vocabulary better suited for EHR tasks and thus outperform off-the-shelf models. We adapt the BERT architecture for ICD coding with multi-label attention. While other works focus on small public medical datasets, we have produced the first large scale ICD-10 classification model using millions of EHR notes to predict thousands of unique ICD codes.

Via

Access Paper or Ask Questions

An Image Based Visual Servo Approach with Deep Learning for Robotic Manipulation

Sep 17, 2019

Jingshu Liu, Yuan Li

Figure 1 for An Image Based Visual Servo Approach with Deep Learning for Robotic Manipulation

Figure 2 for An Image Based Visual Servo Approach with Deep Learning for Robotic Manipulation

Figure 3 for An Image Based Visual Servo Approach with Deep Learning for Robotic Manipulation

Figure 4 for An Image Based Visual Servo Approach with Deep Learning for Robotic Manipulation

Abstract:Aiming at the difficulty of extracting image features and estimating the Jacobian matrix in image based visual servo, this paper proposes an image based visual servo approach with deep learning. With the powerful learning capabilities of convolutional neural networks(CNN), autonomous learning to extract features from images and fitting the nonlinear relationships from image space to task space is achieved, which can greatly facilitate the image based visual servo procedure. Based on the above ideas a two-stream network based on convolutional neural network is designed and the corresponding control scheme is proposed to realize the four degrees of freedom visual servo of the robot manipulator. Collecting images of observed target under different pose parameters of the manipulator as training samples for CNN, the trained network can be used to estimate the nonlinear relationship from 2D image space to 3D Cartesian space. The two-stream network takes the current image and the desirable image as inputs and makes them equal to guide the manipulator to the desirable pose. The effectiveness of the approach is verified with experimental results.

* Accepted by The 6th International Workshop on Advanced Computational Intelligence and Intelligent Informatics (IWACIII2019)

Via

Access Paper or Ask Questions

Deep EHR: Chronic Disease Prediction Using Medical Notes

Aug 15, 2018

Jingshu Liu, Zachariah Zhang, Narges Razavian

Figure 1 for Deep EHR: Chronic Disease Prediction Using Medical Notes

Figure 2 for Deep EHR: Chronic Disease Prediction Using Medical Notes

Figure 3 for Deep EHR: Chronic Disease Prediction Using Medical Notes

Figure 4 for Deep EHR: Chronic Disease Prediction Using Medical Notes

Abstract:Early detection of preventable diseases is important for better disease management, improved inter-ventions, and more efficient health-care resource allocation. Various machine learning approacheshave been developed to utilize information in Electronic Health Record (EHR) for this task. Majorityof previous attempts, however, focus on structured fields and lose the vast amount of information inthe unstructured notes. In this work we propose a general multi-task framework for disease onsetprediction that combines both free-text medical notes and structured information. We compareperformance of different deep learning architectures including CNN, LSTM and hierarchical models.In contrast to traditional text-based prediction models, our approach does not require disease specificfeature engineering, and can handle negations and numerical values that exist in the text. Ourresults on a cohort of about 1 million patients show that models using text outperform modelsusing just structured data, and that models capable of using numerical values and negations in thetext, in addition to the raw text, further improve performance. Additionally, we compare differentvisualization methods for medical professionals to interpret model predictions.

* Machine Learning for Health Care conference

Via

Access Paper or Ask Questions