Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models

Apr 02, 2025
Via arXiv

ASGO: Adaptive Structured Gradient Optimization

Mar 26, 2025
Via arXiv

FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing

Mar 24, 2025
Via arXiv

Generating Multimodal Driving Scenes via Next-Scene Prediction

Mar 19, 2025
Via arXiv

RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Mar 17, 2025
Via arXiv

Monte Carlo Diffusion for Generalizable Learning-Based RANSAC

Mar 12, 2025
Via arXiv

ROCM: RLHF on consistency models

Mar 08, 2025
Via arXiv

SEOE: A Scalable and Reliable Semantic Evaluation Framework for Open Domain Event Detection

Mar 05, 2025
Via arXiv

MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving

Mar 05, 2025
Via arXiv

FANS -- Formal Answer Selection for Natural Language Math Reasoning Using Lean4

Mar 05, 2025
Via arXiv