Dong Chen

Diffusion Models without Classifier-free Guidance

Feb 17, 2025
Via arXiv

Improved YOLOv7 model for insulator defect detection

Feb 11, 2025
Via arXiv

A Simple Aerial Detection Baseline of Multimodal Language Models

Jan 16, 2025
Via arXiv

SmartEraser: Remove Anything from Images using Masked-Region Guidance

Jan 14, 2025
Via arXiv

Experimental Study of RCS Diversity with Novel No-divergent OAM Beams

Dec 25, 2024
Via arXiv

CodeV: Issue Resolving with Visual Data

Dec 23, 2024
Via arXiv

Protecting Confidentiality, Privacy and Integrity in Collaborative Learning

Dec 11, 2024
Via arXiv

Structured 3D Latents for Scalable and Versatile 3D Generation

Dec 02, 2024
Via arXiv

CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Nov 29, 2024
Via arXiv

Robotic transcatheter tricuspid valve replacement with hybrid enhanced intelligence: a new paradigm and first-in-vivo study

Nov 19, 2024
Via arXiv