Picture for Yu Zhou

Yu Zhou

National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing, China, Fanyu AI Laboratory, Zhongke Fanyu Technology Co., Ltd, Beijing, China

Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance

Add code
Dec 13, 2024
Viaarxiv icon

Falcon-UI: Understanding GUI Before Following User Instructions

Add code
Dec 12, 2024
Viaarxiv icon

A4-Unet: Deformable Multi-Scale Attention Network for Brain Tumor Segmentation

Add code
Dec 08, 2024
Viaarxiv icon

M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction

Add code
Nov 20, 2024
Viaarxiv icon

SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning

Add code
Oct 19, 2024
Viaarxiv icon

AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios

Add code
Oct 18, 2024
Figure 1 for AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Figure 2 for AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Figure 3 for AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Figure 4 for AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Viaarxiv icon

TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control

Add code
Oct 14, 2024
Figure 1 for TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Figure 2 for TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Figure 3 for TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Figure 4 for TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Viaarxiv icon

First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending

Add code
Oct 14, 2024
Figure 1 for First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending
Figure 2 for First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending
Figure 3 for First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending
Figure 4 for First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending
Viaarxiv icon

Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making

Add code
Oct 09, 2024
Figure 1 for Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
Figure 2 for Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
Figure 3 for Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
Figure 4 for Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
Viaarxiv icon

Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study of Chinese Criminal Law

Add code
Oct 07, 2024
Figure 1 for Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study of Chinese Criminal Law
Figure 2 for Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study of Chinese Criminal Law
Figure 3 for Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study of Chinese Criminal Law
Figure 4 for Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study of Chinese Criminal Law
Viaarxiv icon