Picture for Jingyi Yang

Jingyi Yang

Sid

A Unified Framework for the Evaluation of LLM Agentic Capabilities

Add code
May 27, 2026
Viaarxiv icon

ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents

Add code
May 12, 2026
Viaarxiv icon

Delving Aleatoric Uncertainty in Medical Image Segmentation via Vision Foundation Models

Add code
Apr 13, 2026
Viaarxiv icon

DARE: Diffusion Large Language Models Alignment and Reinforcement Executor

Add code
Apr 05, 2026
Viaarxiv icon

GeoFusion-CAD: Structure-Aware Diffusion with Geometric State Space for Parametric 3D Design

Add code
Mar 23, 2026
Viaarxiv icon

DeepSight: An All-in-One LM Safety Toolkit

Add code
Feb 12, 2026
Viaarxiv icon

$ρ$-$\texttt{EOS}$: Training-free Bidirectional Variable-Length Control for Masked Diffusion LLMs

Add code
Jan 30, 2026
Viaarxiv icon

FP8-RL: A Practical and Stable Low-Precision Stack for LLM Reinforcement Learning

Add code
Jan 26, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Shall Your Data Strategy Work? Perform a Swift Study

Add code
Feb 19, 2025
Viaarxiv icon