Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Keiji Shinzato

RakutenAI-7B: Extending Large Language Models for Japanese

Mar 21, 2024

Rakuten Group, Aaron Levine, Connie Huang, Chenguang Wang, Eduardo Batista, Ewa Szymanska, Hongyi Ding, Hou Wei Chou, Jean-François Pessiot, Johanes Effendi(+20 more)

Figure 1 for RakutenAI-7B: Extending Large Language Models for Japanese

Figure 2 for RakutenAI-7B: Extending Large Language Models for Japanese

Figure 3 for RakutenAI-7B: Extending Large Language Models for Japanese

Figure 4 for RakutenAI-7B: Extending Large Language Models for Japanese

Abstract:We introduce RakutenAI-7B, a suite of Japanese-oriented large language models that achieve the best performance on the Japanese LM Harness benchmarks among the open 7B models. Along with the foundation model, we release instruction- and chat-tuned models, RakutenAI-7B-instruct and RakutenAI-7B-chat respectively, under the Apache 2.0 license.

Via

Access Paper or Ask Questions

A Unified Generative Approach to Product Attribute-Value Identification

Jun 09, 2023

Keiji Shinzato, Naoki Yoshinaga, Yandi Xia, Wei-Te Chen

Figure 1 for A Unified Generative Approach to Product Attribute-Value Identification

Figure 2 for A Unified Generative Approach to Product Attribute-Value Identification

Figure 3 for A Unified Generative Approach to Product Attribute-Value Identification

Figure 4 for A Unified Generative Approach to Product Attribute-Value Identification

Abstract:Product attribute-value identification (PAVI) has been studied to link products on e-commerce sites with their attribute values (e.g., <Material, Cotton>) using product text as clues. Technical demands from real-world e-commerce platforms require PAVI methods to handle unseen values, multi-attribute values, and canonicalized values, which are only partly addressed in existing extraction- and classification-based approaches. Motivated by this, we explore a generative approach to the PAVI task. We finetune a pre-trained generative model, T5, to decode a set of attribute-value pairs as a target sequence from the given product text. Since the attribute value pairs are unordered set elements, how to linearize them will matter; we, thus, explore methods of composing an attribute-value pair and ordering the pairs for the task. Experimental results confirm that our generation-based approach outperforms the existing extraction and classification-based methods on large-scale real-world datasets meant for those methods.

* Accepted to the Findings of ACL 2023

Via

Access Paper or Ask Questions

Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction

Jun 28, 2022

Keiji Shinzato, Naoki Yoshinaga, Yandi Xia, Wei-Te Chen

Figure 1 for Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction

Figure 2 for Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction

Figure 3 for Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction

Figure 4 for Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction

Abstract:A key challenge in attribute value extraction (AVE) from e-commerce sites is how to handle a large number of attributes for diverse products. Although this challenge is partially addressed by a question answering (QA) approach which finds a value in product data for a given query (attribute), it does not work effectively for rare and ambiguous queries. We thus propose simple knowledge-driven query expansion based on possible answers (values) of a query (attribute) for QA-based AVE. We retrieve values of a query (attribute) from the training data to expand the query. We train a model with two tricks, knowledge dropout and knowledge token mixing, which mimic the imperfection of the value knowledge in testing. Experimental results on our cleaned version of AliExpress dataset show that our method improves the performance of AVE (+6.08 macro F1), especially for rare and ambiguous attributes (+7.82 and +6.86 macro F1, respectively).

* Proceedings of ACL 2022 (Volume 2: Short Papers), 227--234
* Published at ACL 2022

Via

Access Paper or Ask Questions