Picture for Yi Jiang

Yi Jiang

SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World

Add code
Mar 20, 2025
Viaarxiv icon

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Add code
Feb 27, 2025
Viaarxiv icon

Goku: Flow Based Video Generative Foundation Models

Add code
Feb 10, 2025
Viaarxiv icon

FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Add code
Feb 07, 2025
Viaarxiv icon

Liquid: Language Models are Scalable Multi-modal Generators

Add code
Dec 05, 2024
Figure 1 for Liquid: Language Models are Scalable Multi-modal Generators
Figure 2 for Liquid: Language Models are Scalable Multi-modal Generators
Figure 3 for Liquid: Language Models are Scalable Multi-modal Generators
Figure 4 for Liquid: Language Models are Scalable Multi-modal Generators
Viaarxiv icon

Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Add code
Dec 05, 2024
Viaarxiv icon

TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation

Add code
Dec 04, 2024
Figure 1 for TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
Figure 2 for TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
Figure 3 for TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
Figure 4 for TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
Viaarxiv icon

MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework

Add code
Nov 26, 2024
Figure 1 for MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework
Figure 2 for MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework
Figure 3 for MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework
Figure 4 for MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework
Viaarxiv icon

NACNet: A Histology Context-aware Transformer Graph Convolution Network for Predicting Treatment Response to Neoadjuvant Chemotherapy in Triple Negative Breast Cancer

Add code
Nov 14, 2024
Viaarxiv icon

PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents

Add code
Oct 11, 2024
Figure 1 for PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents
Figure 2 for PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents
Figure 3 for PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents
Figure 4 for PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents
Viaarxiv icon