Picture for Jian Jia

Jian Jia

Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads

Add code
Nov 28, 2024
Figure 1 for Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads
Figure 2 for Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads
Figure 3 for Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads
Figure 4 for Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads
Viaarxiv icon

Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy

Add code
Nov 23, 2024
Figure 1 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 2 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 3 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Figure 4 for Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Viaarxiv icon

ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval

Add code
Aug 06, 2024
Figure 1 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Figure 2 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Figure 3 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Figure 4 for ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval
Viaarxiv icon

Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application

Add code
May 07, 2024
Figure 1 for Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
Figure 2 for Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
Figure 3 for Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
Figure 4 for Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
Viaarxiv icon

Knowledge Condensation and Reasoning for Knowledge-based VQA

Add code
Mar 15, 2024
Viaarxiv icon

Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks

Add code
Mar 30, 2023
Figure 1 for Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks
Figure 2 for Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks
Figure 3 for Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks
Figure 4 for Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks
Viaarxiv icon

InsPro: Propagating Instance Query and Proposal for Online Video Instance Segmentation

Add code
Jan 05, 2023
Viaarxiv icon

Learning Disentangled Label Representations for Multi-label Classification

Add code
Dec 02, 2022
Figure 1 for Learning Disentangled Label Representations for Multi-label Classification
Figure 2 for Learning Disentangled Label Representations for Multi-label Classification
Figure 3 for Learning Disentangled Label Representations for Multi-label Classification
Figure 4 for Learning Disentangled Label Representations for Multi-label Classification
Viaarxiv icon

QueryProp: Object Query Propagation for High-Performance Video Object Detection

Add code
Jul 22, 2022
Figure 1 for QueryProp: Object Query Propagation for High-Performance Video Object Detection
Figure 2 for QueryProp: Object Query Propagation for High-Performance Video Object Detection
Figure 3 for QueryProp: Object Query Propagation for High-Performance Video Object Detection
Figure 4 for QueryProp: Object Query Propagation for High-Performance Video Object Detection
Viaarxiv icon

PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation

Add code
Jun 01, 2022
Figure 1 for PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation
Figure 2 for PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation
Figure 3 for PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation
Figure 4 for PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation
Viaarxiv icon