Picture for Tosho Hirasawa

Tosho Hirasawa

Am I More Pointwise or Pairwise? Revealing Position Bias in Rubric-Based LLM-as-a-Judge

Add code
Feb 02, 2026
Viaarxiv icon

WarrantScore: Modeling Warrants between Claims and Evidence for Substantiation Evaluation in Peer Reviews

Add code
Jan 24, 2026
Viaarxiv icon

Evaluating the Capability of Video Question Generation for Expert Knowledge Elicitation

Add code
Dec 17, 2025
Viaarxiv icon

Assessing the Capabilities of LLMs in Humor:A Multi-dimensional Analysis of Oogiri Generation and Evaluation

Add code
Nov 12, 2025
Figure 1 for Assessing the Capabilities of LLMs in Humor:A Multi-dimensional Analysis of Oogiri Generation and Evaluation
Figure 2 for Assessing the Capabilities of LLMs in Humor:A Multi-dimensional Analysis of Oogiri Generation and Evaluation
Figure 3 for Assessing the Capabilities of LLMs in Humor:A Multi-dimensional Analysis of Oogiri Generation and Evaluation
Figure 4 for Assessing the Capabilities of LLMs in Humor:A Multi-dimensional Analysis of Oogiri Generation and Evaluation
Viaarxiv icon

SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images

Add code
Dec 23, 2024
Figure 1 for SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images
Figure 2 for SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images
Figure 3 for SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images
Figure 4 for SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images
Viaarxiv icon

Pruning Multilingual Large Language Models for Multilingual Inference

Add code
Sep 25, 2024
Viaarxiv icon

COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark

Add code
Aug 05, 2024
Viaarxiv icon

Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction

Add code
Jan 20, 2022
Figure 1 for Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction
Figure 2 for Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction
Figure 3 for Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction
Figure 4 for Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction
Viaarxiv icon

Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020

Add code
Jun 23, 2020
Figure 1 for Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020
Figure 2 for Keyframe Segmentation and Positional Encoding for Video-guided Machine Translation Challenge 2020
Viaarxiv icon

Towards Multimodal Simultaneous Neural Machine Translation

Add code
Apr 07, 2020
Figure 1 for Towards Multimodal Simultaneous Neural Machine Translation
Figure 2 for Towards Multimodal Simultaneous Neural Machine Translation
Figure 3 for Towards Multimodal Simultaneous Neural Machine Translation
Figure 4 for Towards Multimodal Simultaneous Neural Machine Translation
Viaarxiv icon