Picture for Yixin Ren

Yixin Ren

AgentIF-OneDay: A Task-level Instruction-Following Benchmark for General AI Agents in Daily Scenarios

Add code
Jan 28, 2026
Viaarxiv icon

BabyVision: Visual Reasoning Beyond Language

Add code
Jan 10, 2026
Viaarxiv icon

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

Add code
Jun 16, 2025
Figure 1 for xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations
Figure 2 for xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations
Figure 3 for xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations
Figure 4 for xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations
Viaarxiv icon

Score-based Generative Modeling for Conditional Independence Testing

Add code
May 29, 2025
Viaarxiv icon

Fast Causal Discovery by Approximate Kernel-based Generalized Score Functions with Linear Computational Complexity

Add code
Dec 23, 2024
Figure 1 for Fast Causal Discovery by Approximate Kernel-based Generalized Score Functions with Linear Computational Complexity
Figure 2 for Fast Causal Discovery by Approximate Kernel-based Generalized Score Functions with Linear Computational Complexity
Figure 3 for Fast Causal Discovery by Approximate Kernel-based Generalized Score Functions with Linear Computational Complexity
Figure 4 for Fast Causal Discovery by Approximate Kernel-based Generalized Score Functions with Linear Computational Complexity
Viaarxiv icon