Picture for Kongming Liang

Kongming Liang

Generative Visual Chain-of-Thought for Image Editing

Add code
Mar 02, 2026
Viaarxiv icon

Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing

Add code
Mar 02, 2026
Viaarxiv icon

Hepato-LLaVA: An Expert MLLM with Sparse Topo-Pack Attention for Hepatocellular Pathology Analysis on Whole Slide Images

Add code
Feb 26, 2026
Viaarxiv icon

Geometric Image Editing via Effects-Sensitive In-Context Inpainting with Diffusion Transformers

Add code
Feb 09, 2026
Viaarxiv icon

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Add code
Jan 05, 2026
Viaarxiv icon

MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision

Add code
Aug 11, 2025
Viaarxiv icon

FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models

Add code
Jul 03, 2025
Viaarxiv icon

DriveRX: A Vision-Language Reasoning Model for Cross-Task Autonomous Driving

Add code
May 27, 2025
Viaarxiv icon

Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation

Add code
May 21, 2025
Viaarxiv icon

CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation

Add code
May 21, 2025
Figure 1 for CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation
Figure 2 for CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation
Figure 3 for CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation
Figure 4 for CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation
Viaarxiv icon