Picture for Yu Zhou

Yu Zhou

National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing, China, Fanyu AI Laboratory, Zhongke Fanyu Technology Co., Ltd, Beijing, China

SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning

Add code
Oct 19, 2024
Viaarxiv icon

AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios

Add code
Oct 18, 2024
Figure 1 for AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Figure 2 for AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Figure 3 for AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Figure 4 for AnomalyNCD: Towards Novel Anomaly Class Discovery in Industrial Scenarios
Viaarxiv icon

TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control

Add code
Oct 14, 2024
Figure 1 for TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Figure 2 for TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Figure 3 for TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Figure 4 for TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Viaarxiv icon

First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending

Add code
Oct 14, 2024
Viaarxiv icon

Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making

Add code
Oct 09, 2024
Viaarxiv icon

Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study of Chinese Criminal Law

Add code
Oct 07, 2024
Viaarxiv icon

HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models

Add code
Sep 27, 2024
Figure 1 for HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models
Figure 2 for HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models
Figure 3 for HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models
Figure 4 for HM3: Hierarchical Multi-Objective Model Merging for Pretrained Models
Viaarxiv icon

Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper

Add code
Aug 20, 2024
Viaarxiv icon

ARMADA: Attribute-Based Multimodal Data Augmentation

Add code
Aug 19, 2024
Viaarxiv icon

Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval

Add code
Aug 01, 2024
Viaarxiv icon