Ivan P. Yamshchikov

Toxicity of the Commons: Curating Open-Source Pre-Training Data

Oct 29, 2024
Via arXiv

Sui Generis: Large Language Models for Authorship Attribution and Verification in Latin

Oct 11, 2024
Via arXiv

Individuation in Neural Models with and without Visual Grounding

Sep 27, 2024
Via arXiv

BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training

Sep 06, 2024
Via arXiv

Knowledge Graph Representation for Political Information Sources

Apr 04, 2024
Via arXiv

Echo-chambers and Idea Labs: Communication Styles on Twitter

Mar 28, 2024
Via arXiv

Vygotsky Distance: Measure for Benchmark Task Similarity

Feb 26, 2024
Via arXiv

Neural Machine Translation for Malayalam Paraphrase Generation

Jan 31, 2024
Via arXiv

LLMs Simulate Big Five Personality Traits: Further Evidence

Jan 31, 2024
Via arXiv

Post Turing: Mapping the landscape of LLM Evaluation

Nov 03, 2023
Via arXiv