Picture for Masahiro Kaneko

Masahiro Kaneko

Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models

Add code
Mar 08, 2025
Viaarxiv icon

Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning

Add code
Feb 28, 2025
Viaarxiv icon

Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models

Add code
Feb 17, 2025
Viaarxiv icon

Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI

Add code
Feb 17, 2025
Viaarxiv icon

GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human

Add code
Jan 19, 2025
Viaarxiv icon

HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model

Add code
Dec 19, 2024
Figure 1 for HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model
Figure 2 for HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model
Figure 3 for HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model
Figure 4 for HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model
Viaarxiv icon

Social Bias Evaluation for Large Language Models Requires Prompt Variations

Add code
Jul 03, 2024
Viaarxiv icon

Sampling-based Pseudo-Likelihood for Membership Inference Attacks

Add code
Apr 17, 2024
Viaarxiv icon

A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish

Add code
Mar 24, 2024
Viaarxiv icon

Likelihood-based Mitigation of Evaluation Bias in Large Language Models

Add code
Mar 01, 2024
Viaarxiv icon