Picture for Xiaochen Ma

Xiaochen Ma

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Add code
Dec 18, 2025
Viaarxiv icon

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling

Add code
Dec 14, 2025
Viaarxiv icon

Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions

Add code
Jun 09, 2025
Figure 1 for Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Figure 2 for Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Figure 3 for Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Figure 4 for Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Viaarxiv icon

ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization

Add code
May 16, 2025
Viaarxiv icon

Dataset Distillation via Committee Voting

Add code
Jan 13, 2025
Viaarxiv icon

Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization Through Spare-Coding Transformer

Add code
Dec 19, 2024
Figure 1 for Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization Through Spare-Coding Transformer
Figure 2 for Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization Through Spare-Coding Transformer
Figure 3 for Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization Through Spare-Coding Transformer
Figure 4 for Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization Through Spare-Coding Transformer
Viaarxiv icon

Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Localization

Add code
Dec 18, 2024
Figure 1 for Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Localization
Figure 2 for Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Localization
Figure 3 for Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Localization
Figure 4 for Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Localization
Viaarxiv icon

M^3:Manipulation Mask Manufacturer for Arbitrary-Scale Super-Resolution Mask

Add code
Jul 04, 2024
Viaarxiv icon

IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization

Add code
Jun 15, 2024
Figure 1 for IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization
Figure 2 for IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization
Figure 3 for IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization
Figure 4 for IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization
Viaarxiv icon

Perceptual MAE for Image Manipulation Localization: A High-level Vision Learner Focusing on Low-level Features

Add code
Oct 10, 2023
Figure 1 for Perceptual MAE for Image Manipulation Localization: A High-level Vision Learner Focusing on Low-level Features
Figure 2 for Perceptual MAE for Image Manipulation Localization: A High-level Vision Learner Focusing on Low-level Features
Figure 3 for Perceptual MAE for Image Manipulation Localization: A High-level Vision Learner Focusing on Low-level Features
Figure 4 for Perceptual MAE for Image Manipulation Localization: A High-level Vision Learner Focusing on Low-level Features
Viaarxiv icon