Abstract: Accurate skill extraction from job descriptions is crucial to the hiring process but remains challenging. Named Entity Recognition (NER) is a common approach to this problem. Given the demonstrated success of large language models (LLMs) on various NLP tasks, including NER, we propose fine-tuning a specialized Skill-LLM, together with a lightweight model, to improve the precision and quality of skill extraction. In our study, we evaluated the fine-tuned Skill-LLM and the lightweight model on a benchmark dataset and compared their performance against state-of-the-art (SOTA) methods. Our results show that this approach outperforms existing SOTA techniques.
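As a concrete illustration of the NER formulation of skill extraction, the following is a minimal sketch, not the authors' exact Skill-LLM setup, of fine-tuning a token-classification model to tag skill spans with BIO labels using the Hugging Face transformers API. The base model, the tag set, and the toy training example are illustrative assumptions.

```python
# Minimal sketch (assumptions, not the paper's exact setup): fine-tune a
# token-classification model to tag skill spans in job-description text.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

labels = ["O", "B-SKILL", "I-SKILL"]  # assumed BIO tag scheme for skill spans
tok = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModelForTokenClassification.from_pretrained(
    "bert-base-cased", num_labels=len(labels))

# One toy job-description sentence with word-level BIO tags (illustrative).
words = ["Experience", "with", "Python", "and", "SQL", "required"]
tags = ["O", "O", "B-SKILL", "O", "B-SKILL", "O"]

enc = tok(words, is_split_into_words=True, return_tensors="pt")
# Align word-level tags to subword tokens; special tokens get -100 (ignored
# by the cross-entropy loss).
word_ids = enc.word_ids(0)
label_ids = [-100 if w is None else labels.index(tags[w]) for w in word_ids]
enc["labels"] = torch.tensor([label_ids])

optim = torch.optim.AdamW(model.parameters(), lr=5e-5)
loss = model(**enc).loss  # token-level cross-entropy over BIO labels
loss.backward()
optim.step()
```

In practice this single step would be wrapped in a loop over a labeled corpus of job descriptions; the same alignment logic handles any subword tokenizer.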
Abstract: Image clustering is one of the most important computer vision applications and has been extensively studied in the literature. However, current clustering methods mostly suffer from a lack of efficiency and scalability when dealing with large-scale, high-dimensional data. In this paper, we propose a new clustering model, called DEeP Embedded RegularIzed ClusTering (DEPICT), which efficiently maps data into a discriminative embedding subspace and precisely predicts cluster assignments. DEPICT consists of a multinomial logistic regression function stacked on top of a multi-layer convolutional autoencoder. We define a clustering objective function using relative entropy (KL divergence) minimization, regularized by a prior on the frequency of cluster assignments. An alternating strategy is then derived to optimize the objective by updating parameters and estimating cluster assignments. Furthermore, we employ the reconstruction loss function of our autoencoder as a data-dependent regularization term to prevent the deep embedding function from overfitting. To benefit from end-to-end optimization and eliminate the need for layer-wise pretraining, we introduce a joint learning framework that minimizes the unified clustering and reconstruction loss functions together and trains all network layers simultaneously. Experimental results indicate the superiority and faster running time of DEPICT in real-world clustering tasks, where no labeled data are available for hyper-parameter tuning.
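To make the unified objective concrete, below is a minimal PyTorch sketch of a DEPICT-style joint loss: a softmax clustering head on an autoencoder embedding, a KL-divergence clustering term whose target distribution encodes a balanced-frequency prior, and the reconstruction loss as a data-dependent regularizer, all minimized end to end. The layer sizes, the DEC-style target-distribution choice, and the linear (rather than convolutional) autoencoder are simplifying assumptions, not the paper's exact formulation.

```python
# Minimal sketch of a DEPICT-style unified objective (sizes, target
# distribution, and data are illustrative assumptions).
import torch
import torch.nn.functional as F

K, d, h = 10, 784, 32  # clusters, input dim, embedding dim (assumed)
enc = torch.nn.Sequential(torch.nn.Linear(d, h), torch.nn.ReLU())
dec = torch.nn.Sequential(torch.nn.Linear(h, d))
head = torch.nn.Linear(h, K)  # multinomial logistic regression on embeddings

x = torch.rand(256, d)        # stand-in batch of flattened images
z = enc(x)
q = F.softmax(head(z), dim=1) # soft cluster assignments

# Target distribution: sharpen q and normalize by cluster frequency, so the
# prior pushes assignments toward balanced clusters (a common DEC-style
# choice, used here as an assumption).
p = q ** 2 / q.sum(dim=0)
p = p / p.sum(dim=1, keepdim=True)

# KL divergence between the target and the predictions, plus the
# reconstruction loss as a data-dependent regularizer.
cluster_loss = F.kl_div(q.log(), p.detach(), reduction="batchmean")
recon_loss = F.mse_loss(dec(z), x)
loss = cluster_loss + recon_loss  # joint end-to-end objective

opt = torch.optim.Adam(
    list(enc.parameters()) + list(dec.parameters()) + list(head.parameters()),
    lr=1e-3)
loss.backward()
opt.step()
```

Because the clustering and reconstruction terms share the encoder and are optimized in one backward pass, no layer-wise pretraining is needed, which is the point of the joint learning framework described above.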