Picture for Fuzheng Zhang

Fuzheng Zhang

Kuaishou Natural Language Processing Center and Audio Center

From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models

Add code
Mar 20, 2025
Viaarxiv icon

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

Add code
Feb 28, 2025
Viaarxiv icon

SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin

Add code
Feb 19, 2025
Viaarxiv icon

VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation

Add code
Feb 18, 2025
Viaarxiv icon

Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models

Add code
Dec 10, 2024
Viaarxiv icon

Video-Text Dataset Construction from Multi-AI Feedback: Promoting Weak-to-Strong Preference Learning for Video Large Language Models

Add code
Nov 25, 2024
Figure 1 for Video-Text Dataset Construction from Multi-AI Feedback: Promoting Weak-to-Strong Preference Learning for Video Large Language Models
Figure 2 for Video-Text Dataset Construction from Multi-AI Feedback: Promoting Weak-to-Strong Preference Learning for Video Large Language Models
Figure 3 for Video-Text Dataset Construction from Multi-AI Feedback: Promoting Weak-to-Strong Preference Learning for Video Large Language Models
Figure 4 for Video-Text Dataset Construction from Multi-AI Feedback: Promoting Weak-to-Strong Preference Learning for Video Large Language Models
Viaarxiv icon

DMQR-RAG: Diverse Multi-Query Rewriting for RAG

Add code
Nov 20, 2024
Viaarxiv icon

Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding

Add code
Sep 29, 2024
Figure 1 for Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding
Figure 2 for Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding
Figure 3 for Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding
Figure 4 for Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding
Viaarxiv icon

TSO: Self-Training with Scaled Preference Optimization

Add code
Aug 31, 2024
Figure 1 for TSO: Self-Training with Scaled Preference Optimization
Figure 2 for TSO: Self-Training with Scaled Preference Optimization
Figure 3 for TSO: Self-Training with Scaled Preference Optimization
Figure 4 for TSO: Self-Training with Scaled Preference Optimization
Viaarxiv icon

Towards Comprehensive Preference Data Collection for Reward Modeling

Add code
Jun 24, 2024
Viaarxiv icon