Xue Yang

Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection

Feb 13, 2025
Via arXiv

Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances

Feb 07, 2025
Via arXiv

PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection

Jan 23, 2025
Via arXiv

A Simple Aerial Detection Baseline of Multimodal Language Models

Jan 16, 2025
Via arXiv

Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding

Jan 14, 2025
Via arXiv

RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark

Jan 08, 2025
Via arXiv

Efficiently Achieving Secure Model Training and Secure Aggregation to Ensure Bidirectional Privacy-Preservation in Federated Learning

Dec 16, 2024
Via arXiv

DiffCLIP: Few-shot Language-driven Multimodal Classifier

Dec 10, 2024
Via arXiv

Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement

Dec 05, 2024
Via arXiv

GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Nov 27, 2024
Via arXiv