Quan Chen

Text-Video Multi-Grained Integration for Video Moment Montage

Dec 12, 2024
Via arXiv

Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads

Nov 28, 2024
Via arXiv

Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy

Nov 23, 2024
Via arXiv

LiTformer: Efficient Modeling and Analysis of High-Speed Link Transmitters Using Non-Autoregressive Transformer

Nov 18, 2024
Via arXiv

A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligence

Sep 25, 2024
Via arXiv

Whole Heart Perfusion with High-Multiband Simultaneous Multislice Imaging via Linear Phase Modulated Extended Field of View (SMILE)

Sep 06, 2024
Via arXiv

D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching

Aug 23, 2024
Via arXiv

ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval

Aug 06, 2024
Via arXiv

Spatiotemporal Graph Guided Multi-modal Network for Livestreaming Product Retrieval

Jul 24, 2024
Via arXiv

Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation

May 11, 2024
Via arXiv