Picture for Kyosuke Nishida

Kyosuke Nishida

Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes

Add code
Oct 07, 2024
Viaarxiv icon

InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions

Add code
Jan 24, 2024
Viaarxiv icon

Robust Text-driven Image Editing Method that Adaptively Explores Directions in Latent Spaces of StyleGAN and CLIP

Add code
Apr 03, 2023
Viaarxiv icon

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images

Add code
Jan 12, 2023
Viaarxiv icon

Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge

Add code
Oct 14, 2022
Figure 1 for Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge
Figure 2 for Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge
Figure 3 for Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge
Figure 4 for Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge
Viaarxiv icon

Improving Few-Shot Image Classification Using Machine- and User-Generated Natural Language Descriptions

Add code
Jul 07, 2022
Figure 1 for Improving Few-Shot Image Classification Using Machine- and User-Generated Natural Language Descriptions
Figure 2 for Improving Few-Shot Image Classification Using Machine- and User-Generated Natural Language Descriptions
Figure 3 for Improving Few-Shot Image Classification Using Machine- and User-Generated Natural Language Descriptions
Figure 4 for Improving Few-Shot Image Classification Using Machine- and User-Generated Natural Language Descriptions
Viaarxiv icon

Towards Interpretable and Reliable Reading Comprehension: A Pipeline Model with Unanswerability Prediction

Add code
Nov 18, 2021
Figure 1 for Towards Interpretable and Reliable Reading Comprehension: A Pipeline Model with Unanswerability Prediction
Figure 2 for Towards Interpretable and Reliable Reading Comprehension: A Pipeline Model with Unanswerability Prediction
Figure 3 for Towards Interpretable and Reliable Reading Comprehension: A Pipeline Model with Unanswerability Prediction
Figure 4 for Towards Interpretable and Reliable Reading Comprehension: A Pipeline Model with Unanswerability Prediction
Viaarxiv icon

Task-adaptive Pre-training of Language Models with Word Embedding Regularization

Add code
Sep 17, 2021
Figure 1 for Task-adaptive Pre-training of Language Models with Word Embedding Regularization
Figure 2 for Task-adaptive Pre-training of Language Models with Word Embedding Regularization
Figure 3 for Task-adaptive Pre-training of Language Models with Word Embedding Regularization
Figure 4 for Task-adaptive Pre-training of Language Models with Word Embedding Regularization
Viaarxiv icon

VisualMRC: Machine Reading Comprehension on Document Images

Add code
Jan 27, 2021
Figure 1 for VisualMRC: Machine Reading Comprehension on Document Images
Figure 2 for VisualMRC: Machine Reading Comprehension on Document Images
Figure 3 for VisualMRC: Machine Reading Comprehension on Document Images
Figure 4 for VisualMRC: Machine Reading Comprehension on Document Images
Viaarxiv icon

A Transformer-based Audio Captioning Model with Keyword Estimation

Add code
Jul 01, 2020
Figure 1 for A Transformer-based Audio Captioning Model with Keyword Estimation
Figure 2 for A Transformer-based Audio Captioning Model with Keyword Estimation
Figure 3 for A Transformer-based Audio Captioning Model with Keyword Estimation
Viaarxiv icon