Picture for Masahiro Kaneko

Masahiro Kaneko

Social Bias Evaluation for Large Language Models Requires Prompt Variations

Add code
Jul 03, 2024
Viaarxiv icon

Sampling-based Pseudo-Likelihood for Membership Inference Attacks

Add code
Apr 17, 2024
Viaarxiv icon

A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish

Add code
Mar 24, 2024
Viaarxiv icon

Likelihood-based Mitigation of Evaluation Bias in Large Language Models

Add code
Mar 01, 2024
Viaarxiv icon

Eagle: Ethical Dataset Given from Real Interactions

Add code
Feb 22, 2024
Viaarxiv icon

Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting

Add code
Jan 28, 2024
Viaarxiv icon

The Gaps between Pre-train and Downstream Settings in Bias Evaluation and Debiasing

Add code
Jan 16, 2024
Viaarxiv icon

SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training with Adversarial Remarks

Add code
Nov 14, 2023
Viaarxiv icon

How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection

Add code
Nov 14, 2023
Viaarxiv icon

Controlled Generation with Prompt Insertion for Natural Language Explanations in Grammatical Error Correction

Add code
Sep 20, 2023
Viaarxiv icon