Picture for Ruizhe Li

Ruizhe Li

Standardizing Longitudinal Radiology Report Evaluation via Large Language Model Annotation

Add code
Jan 23, 2026
Viaarxiv icon

Benchmarking Text-to-Python against Text-to-SQL: The Impact of Explicit Logic and Ambiguity

Add code
Jan 23, 2026
Viaarxiv icon

Race, Ethnicity and Their Implication on Bias in Large Language Models

Add code
Jan 19, 2026
Viaarxiv icon

Spurious Rewards Paradox: Mechanistically Understanding How RLVR Activates Memorization Shortcuts in LLMs

Add code
Jan 16, 2026
Viaarxiv icon

DeepResearch Bench II: Diagnosing Deep Research Agents via Rubrics from Expert Report

Add code
Jan 13, 2026
Viaarxiv icon

Breast Cancer Neoadjuvant Chemotherapy Treatment Response Prediction Using Aligned Longitudinal MRI and Clinical Data

Add code
Dec 19, 2025
Figure 1 for Breast Cancer Neoadjuvant Chemotherapy Treatment Response Prediction Using Aligned Longitudinal MRI and Clinical Data
Figure 2 for Breast Cancer Neoadjuvant Chemotherapy Treatment Response Prediction Using Aligned Longitudinal MRI and Clinical Data
Figure 3 for Breast Cancer Neoadjuvant Chemotherapy Treatment Response Prediction Using Aligned Longitudinal MRI and Clinical Data
Figure 4 for Breast Cancer Neoadjuvant Chemotherapy Treatment Response Prediction Using Aligned Longitudinal MRI and Clinical Data
Viaarxiv icon

Behind the Scenes: Mechanistic Interpretability of LoRA-adapted Whisper for Speech Emotion Recognition

Add code
Sep 11, 2025
Viaarxiv icon

Helix 1.0: An Open-Source Framework for Reproducible and Interpretable Machine Learning on Tabular Scientific Data

Add code
Jul 23, 2025
Viaarxiv icon

Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models

Add code
Jul 16, 2025
Viaarxiv icon

Business as Rulesual: A Benchmark and Framework for Business Rule Flow Modeling with LLMs

Add code
May 29, 2025
Viaarxiv icon