Picture for Ke Li

Ke Li

Jack

FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection

Add code
Dec 12, 2024
Viaarxiv icon

FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression

Add code
Dec 05, 2024
Figure 1 for FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression
Figure 2 for FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression
Figure 3 for FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression
Figure 4 for FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression
Viaarxiv icon

Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment

Add code
Nov 13, 2024
Figure 1 for Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment
Figure 2 for Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment
Figure 3 for Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment
Figure 4 for Scale Contrastive Learning with Selective Attentions for Blind Image Quality Assessment
Viaarxiv icon

MBL-CPDP: A Multi-objective Bilevel Method for Cross-Project Defect Prediction via Automated Machine Learning

Add code
Nov 10, 2024
Viaarxiv icon

Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Add code
Nov 01, 2024
Viaarxiv icon

Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection

Add code
Oct 31, 2024
Figure 1 for Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection
Figure 2 for Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection
Figure 3 for Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection
Figure 4 for Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection
Viaarxiv icon

OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation Models

Add code
Oct 02, 2024
Viaarxiv icon

Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech

Add code
Oct 02, 2024
Figure 1 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Figure 2 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Figure 3 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Figure 4 for Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech
Viaarxiv icon

Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis

Add code
Sep 26, 2024
Viaarxiv icon

Test-time Training for Hyperspectral Image Super-resolution

Add code
Sep 13, 2024
Viaarxiv icon