Qi Yang

MiniMax-01: Scaling Foundation Models with Lightning Attention

Jan 14, 2025
Via arXiv

Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild

Jan 07, 2025
Via arXiv

Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation

Dec 15, 2024
Via arXiv

Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset

Dec 09, 2024
Via arXiv

Continuous Speculative Decoding for Autoregressive Image Generation

Nov 18, 2024
Via arXiv

A Hierarchical Compression Technique for 3D Gaussian Splatting Compression

Nov 11, 2024
Via arXiv

Brain age identification from diffusion MRI synergistically predicts neurodegenerative disease

Oct 29, 2024
Via arXiv

Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis

Sep 10, 2024
Via arXiv

AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation

Aug 03, 2024
Via arXiv

A Benchmark for Gaussian Splatting Compression and Quality Assessment Study

Jul 19, 2024
Via arXiv