Picture for Huzefa Rangwala

Huzefa Rangwala

Relatron: Automating Relational Machine Learning over Relational Databases

Add code
Feb 26, 2026
Viaarxiv icon

ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models

Add code
Feb 23, 2026
Viaarxiv icon

Train Less, Learn More: Adaptive Efficient Rollout Optimization for Group-Based Reinforcement Learning

Add code
Feb 15, 2026
Viaarxiv icon

Generating Data-Driven Reasoning Rubrics for Domain-Adaptive Reward Modeling

Add code
Feb 06, 2026
Viaarxiv icon

SQL-Trail: Multi-Turn Reinforcement Learning with Interleaved Feedback for Text-to-SQL

Add code
Jan 25, 2026
Viaarxiv icon

VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks

Add code
Nov 06, 2025
Figure 1 for VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks
Figure 2 for VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks
Figure 3 for VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks
Figure 4 for VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks
Viaarxiv icon

Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval

Add code
Jun 09, 2025
Figure 1 for Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval
Figure 2 for Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval
Figure 3 for Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval
Figure 4 for Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval
Viaarxiv icon

MLZero: A Multi-Agent System for End-to-end Machine Learning Automation

Add code
May 20, 2025
Viaarxiv icon

Teaching Large Language Models to Reason through Learning and Forgetting

Add code
Apr 15, 2025
Viaarxiv icon

From Demonstrations to Rewards: Alignment Without Explicit Human Preferences

Add code
Mar 15, 2025
Viaarxiv icon