Kai Chen

LLM App Squatting and Cloning

Nov 12, 2024
Via arXiv

Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models

Oct 30, 2024
Via arXiv

Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models

Oct 28, 2024
Via arXiv

FRTree Planner: Robot Navigation in Cluttered and Unknown Environments with Tree of Free Regions

Oct 26, 2024
Via arXiv

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Oct 21, 2024
Via arXiv

InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems

Oct 21, 2024
Via arXiv

Training Language Models to Critique With Multi-agent Feedback

Oct 20, 2024
Via arXiv

The Latent Road to Atoms: Backmapping Coarse-grained Protein Structures with Latent Diffusion

Oct 17, 2024
Via arXiv

ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs

Oct 16, 2024
Via arXiv

Uncertainty-aware t-distributed Stochastic Neighbor Embedding for Single-cell RNA-seq Data

Oct 01, 2024
Via arXiv