Picture for Qiang Zhou

Qiang Zhou

Active management of battery degradation in wireless sensor network using deep reinforcement learning for group battery replacement

Add code
Mar 20, 2025
Viaarxiv icon

EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?

Add code
Mar 19, 2025
Viaarxiv icon

Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos

Add code
Feb 28, 2025
Viaarxiv icon

Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario

Add code
Dec 24, 2024
Figure 1 for Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario
Figure 2 for Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario
Figure 3 for Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario
Figure 4 for Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario
Viaarxiv icon

DLF: Disentangled-Language-Focused Multimodal Sentiment Analysis

Add code
Dec 16, 2024
Viaarxiv icon

KAAE: Numerical Reasoning for Knowledge Graphs via Knowledge-aware Attributes Learning

Add code
Nov 20, 2024
Viaarxiv icon

Motion Control for Enhanced Complex Action Video Generation

Add code
Nov 13, 2024
Viaarxiv icon

Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval

Add code
Sep 30, 2024
Figure 1 for Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval
Figure 2 for Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval
Figure 3 for Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval
Figure 4 for Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval
Viaarxiv icon

I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing

Add code
Aug 26, 2024
Figure 1 for I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing
Figure 2 for I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing
Figure 3 for I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing
Figure 4 for I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing
Viaarxiv icon

INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model

Add code
Jul 23, 2024
Viaarxiv icon