Picture for Caiming Xiong

Caiming Xiong

Salesforce AI Research

CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification

Add code
Feb 12, 2025
Viaarxiv icon

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Add code
Feb 06, 2025
Figure 1 for BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation
Figure 2 for BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation
Figure 3 for BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation
Figure 4 for BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation
Viaarxiv icon

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Add code
Jan 31, 2025
Figure 1 for Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Figure 2 for Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Figure 3 for Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Figure 4 for Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Viaarxiv icon

Demystifying Domain-adaptive Post-training for Financial LLMs

Add code
Jan 09, 2025
Figure 1 for Demystifying Domain-adaptive Post-training for Financial LLMs
Figure 2 for Demystifying Domain-adaptive Post-training for Financial LLMs
Figure 3 for Demystifying Domain-adaptive Post-training for Financial LLMs
Figure 4 for Demystifying Domain-adaptive Post-training for Financial LLMs
Viaarxiv icon

StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs

Add code
Dec 23, 2024
Viaarxiv icon

Bridging the Data Provenance Gap Across Text, Speech and Video

Add code
Dec 19, 2024
Figure 1 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 2 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 3 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 4 for Bridging the Data Provenance Gap Across Text, Speech and Video
Viaarxiv icon

Unanswerability Evaluation for Retreival Augmented Generation

Add code
Dec 16, 2024
Viaarxiv icon

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Add code
Dec 12, 2024
Viaarxiv icon

ViUniT: Visual Unit Tests for More Robust Visual Programming

Add code
Dec 12, 2024
Figure 1 for ViUniT: Visual Unit Tests for More Robust Visual Programming
Figure 2 for ViUniT: Visual Unit Tests for More Robust Visual Programming
Figure 3 for ViUniT: Visual Unit Tests for More Robust Visual Programming
Figure 4 for ViUniT: Visual Unit Tests for More Robust Visual Programming
Viaarxiv icon

GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

Add code
Dec 12, 2024
Viaarxiv icon