Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Khoi-Nguyen Tran

A Library of LLM Intrinsics for Retrieval-Augmented Generation

Apr 16, 2025

Marina Danilevsky, Kristjan Greenewald, Chulaka Gunasekara, Maeda Hanafi, Lihong He, Yannis Katsis, Krishnateja Killamsetty, Yatin Nandwani, Lucian Popa, Dinesh Raghu(+5 more)

Abstract:In the developer community for large language models (LLMs), there is not yet a clean pattern analogous to a software library, to support very large scale collaboration. Even for the commonplace use case of Retrieval-Augmented Generation (RAG), it is not currently possible to write a RAG application against a well-defined set of APIs that are agreed upon by different LLM providers. Inspired by the idea of compiler intrinsics, we propose some elements of such a concept through introducing a library of LLM Intrinsics for RAG. An LLM intrinsic is defined as a capability that can be invoked through a well-defined API that is reasonably stable and independent of how the LLM intrinsic itself is implemented. The intrinsics in our library are released as LoRA adapters on HuggingFace, and through a software interface with clear structured input/output characteristics on top of vLLM as an inference platform, accompanied in both places with documentation and code. This article describes the intended usage, training details, and evaluations for each intrinsic, as well as compositions of multiple intrinsics.

Via

Access Paper or Ask Questions

PriMeSRL-Eval: A Practical Quality Metric for Semantic Role Labeling Systems Evaluation

Oct 12, 2022

Ishan Jindal, Alexandre Rademaker, Khoi-Nguyen Tran, Huaiyu Zhu, Hiroshi Kanayama, Marina Danilevsky, Yunyao Li

Figure 1 for PriMeSRL-Eval: A Practical Quality Metric for Semantic Role Labeling Systems Evaluation

Figure 2 for PriMeSRL-Eval: A Practical Quality Metric for Semantic Role Labeling Systems Evaluation

Figure 3 for PriMeSRL-Eval: A Practical Quality Metric for Semantic Role Labeling Systems Evaluation

Figure 4 for PriMeSRL-Eval: A Practical Quality Metric for Semantic Role Labeling Systems Evaluation

Abstract:Semantic role labeling (SRL) identifies the predicate-argument structure in a sentence. This task is usually accomplished in four steps: predicate identification, predicate sense disambiguation, argument identification, and argument classification. Errors introduced at one step propagate to later steps. Unfortunately, the existing SRL evaluation scripts do not consider the full effect of this error propagation aspect. They either evaluate arguments independent of predicate sense (CoNLL09) or do not evaluate predicate sense at all (CoNLL05), yielding an inaccurate SRL model performance on the argument classification task. In this paper, we address key practical issues with existing evaluation scripts and propose a more strict SRL evaluation metric PriMeSRL. We observe that by employing PriMeSRL, the quality evaluation of all SoTA SRL models drops significantly, and their relative rankings also change. We also show that PriMeSRLsuccessfully penalizes actual failures in SoTA SRL models.

Via

Access Paper or Ask Questions

Towards a Predictive Patent Analytics and Evaluation Platform

Oct 31, 2019

Nebula Alam, Khoi-Nguyen Tran, Sue Ann Chen, John Wagner, Josh Andres, Mukesh Mohania

Figure 1 for Towards a Predictive Patent Analytics and Evaluation Platform

Figure 2 for Towards a Predictive Patent Analytics and Evaluation Platform

Abstract:The importance of patents is well recognised across many regions of the world. Many patent mining systems have been proposed, but with limited predictive capabilities. In this demo, we showcase how predictive algorithms leveraging the state-of-the-art machine learning and deep learning techniques can be used to improve understanding of patents for inventors, patent evaluators, and business analysts alike. Our demo video is available at http://ibm.biz/ecml2019-demo-patent-analytics

* ECML-PKDD 2019 - Demo Track

Via

Access Paper or Ask Questions

Assessing Regulatory Risk in Personal Financial Advice Documents: a Pilot Study

Oct 11, 2019

Wanita Sherchan, Simon Harris, Sue Ann Chen, Nebula Alam, Khoi-Nguyen Tran, Adam J. Makarucha, Christopher J. Butler

Figure 1 for Assessing Regulatory Risk in Personal Financial Advice Documents: a Pilot Study

Figure 2 for Assessing Regulatory Risk in Personal Financial Advice Documents: a Pilot Study

Figure 3 for Assessing Regulatory Risk in Personal Financial Advice Documents: a Pilot Study

Figure 4 for Assessing Regulatory Risk in Personal Financial Advice Documents: a Pilot Study

Abstract:Assessing regulatory compliance of personal financial advice is currently a complex manual process. In Australia, only 5%- 15% of advice documents are audited annually and 75% of these are found to be non-compliant(ASI 2018b). This paper describes a pilot with an Australian government regulation agency where Artificial Intelligence (AI) models based on techniques such natural language processing (NLP), machine learning and deep learning were developed to methodically characterise the regulatory risk status of personal financial advice documents. The solution provides traffic light rating of advice documents for various risk factors enabling comprehensive coverage of documents in the review and allowing rapid identification of documents that are at high risk of non-compliance with government regulations. This pilot serves as a case study of public-private partnership in developing AI systems for government and public sector.

* Presented at AAAI FSS-19: Artificial Intelligence in Government and Public Sector, Arlington, Virginia, USA

Via

Access Paper or Ask Questions

Document Chunking and Learning Objective Generation for Instruction Design

Aug 06, 2018

Khoi-Nguyen Tran, Jey Han Lau, Danish Contractor, Utkarsh Gupta, Bikram Sengupta, Christopher J. Butler, Mukesh Mohania

Figure 1 for Document Chunking and Learning Objective Generation for Instruction Design

Figure 2 for Document Chunking and Learning Objective Generation for Instruction Design

Figure 3 for Document Chunking and Learning Objective Generation for Instruction Design

Figure 4 for Document Chunking and Learning Objective Generation for Instruction Design

Abstract:Instructional Systems Design is the practice of creating of instructional experiences that make the acquisition of knowledge and skill more efficient, effective, and appealing. Specifically in designing courses, an hour of training material can require between 30 to 500 hours of effort in sourcing and organizing reference data for use in just the preparation of course material. In this paper, we present the first system of its kind that helps reduce the effort associated with sourcing reference material and course creation. We present algorithms for document chunking and automatic generation of learning objectives from content, creating descriptive content metadata to improve content-discoverability. Unlike existing methods, the learning objectives generated by our system incorporate pedagogically motivated Bloom's verbs. We demonstrate the usefulness of our methods using real world data from the banking industry and through a live deployment at a large pharmaceutical company.

* Proceedings of the 11th International Conference on Education Data Mining (EDM 2018)

Via

Access Paper or Ask Questions

End-to-end Network for Twitter Geolocation Prediction and Hashing

Oct 13, 2017

Jey Han Lau, Lianhua Chi, Khoi-Nguyen Tran, Trevor Cohn

Figure 1 for End-to-end Network for Twitter Geolocation Prediction and Hashing

Figure 2 for End-to-end Network for Twitter Geolocation Prediction and Hashing

Figure 3 for End-to-end Network for Twitter Geolocation Prediction and Hashing

Figure 4 for End-to-end Network for Twitter Geolocation Prediction and Hashing

Abstract:We propose an end-to-end neural network to predict the geolocation of a tweet. The network takes as input a number of raw Twitter metadata such as the tweet message and associated user account information. Our model is language independent, and despite minimal feature engineering, it is interpretable and capable of learning location indicative words and timing patterns. Compared to state-of-the-art systems, our model outperforms them by 2%-6%. Additionally, we propose extensions to the model to compress representation learnt by the network into binary codes. Experiments show that it produces compact codes compared to benchmark hashing algorithms. An implementation of the model is released publicly.

* 10 pages, in Proceedings of the 8th International Joint Conference on Natural Language Processing (IJCNLP 2017) (to appear)

Via

Access Paper or Ask Questions