Feng Li

A Powered Prosthetic Hand with Vision System for Enhancing the Anthropopathic Grasp

Dec 10, 2024
Via arXiv

WavFusion: Towards wav2vec 2.0 Multimodal Speech Emotion Recognition

Dec 07, 2024

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

Nov 21, 2024

ROSS: RObust decentralized Stochastic learning based on Shapley values

Nov 01, 2024

Long Term Memory: The Foundation of AI Self-Evolution

Oct 21, 2024

PSVMA+: Exploring Multi-granularity Semantic-visual Adaption for Generalized Zero-shot Learning

Oct 15, 2024

Optimal starting point for time series forecasting

Sep 25, 2024

Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues

Sep 22, 2024

PDSR: A Privacy-Preserving Diversified Service Recommendation Method on Distributed Data

Aug 28, 2024

MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Aug 23, 2024