Picture for Renjie Pi

Renjie Pi

May

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Apr 14, 2026
Viaarxiv icon

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Add code
Mar 19, 2026
Viaarxiv icon

On Data Engineering for Scaling LLM Terminal Capabilities

Add code
Feb 24, 2026
Viaarxiv icon

Beyond Accuracy: Evaluating Grounded Visual Evidence in Thinking with Images

Add code
Jan 14, 2026
Viaarxiv icon

LongVideoAgent: Multi-Agent Reasoning with Long Videos

Add code
Dec 23, 2025
Figure 1 for LongVideoAgent: Multi-Agent Reasoning with Long Videos
Figure 2 for LongVideoAgent: Multi-Agent Reasoning with Long Videos
Figure 3 for LongVideoAgent: Multi-Agent Reasoning with Long Videos
Figure 4 for LongVideoAgent: Multi-Agent Reasoning with Long Videos
Viaarxiv icon

Look Less, Reason More: Rollout-Guided Adaptive Pixel-Space Reasoning

Add code
Oct 02, 2025
Viaarxiv icon

Pointing to a Llama and Call it a Camel: On the Sycophancy of Multimodal Large Language Models

Add code
Sep 19, 2025
Figure 1 for Pointing to a Llama and Call it a Camel: On the Sycophancy of Multimodal Large Language Models
Figure 2 for Pointing to a Llama and Call it a Camel: On the Sycophancy of Multimodal Large Language Models
Figure 3 for Pointing to a Llama and Call it a Camel: On the Sycophancy of Multimodal Large Language Models
Figure 4 for Pointing to a Llama and Call it a Camel: On the Sycophancy of Multimodal Large Language Models
Viaarxiv icon

Generalizable Geometric Image Caption Synthesis

Add code
Sep 18, 2025
Viaarxiv icon

ExeSQL: Self-Taught Text-to-SQL Models with Execution-Driven Bootstrapping for SQL Dialects

Add code
May 22, 2025
Viaarxiv icon

MR. Judge: Multimodal Reasoner as a Judge

Add code
May 19, 2025
Viaarxiv icon