Picture for Yichi Zhang

Yichi Zhang

AI Lab, Netease

Scaling Laws for Black box Adversarial Attacks

Add code
Nov 25, 2024
Viaarxiv icon

Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance

Add code
Nov 21, 2024
Viaarxiv icon

MKGL: Mastery of a Three-Word Language

Add code
Oct 10, 2024
Figure 1 for MKGL: Mastery of a Three-Word Language
Figure 2 for MKGL: Mastery of a Three-Word Language
Figure 3 for MKGL: Mastery of a Three-Word Language
Figure 4 for MKGL: Mastery of a Three-Word Language
Viaarxiv icon

MetaOOD: Automatic Selection of OOD Detection Models

Add code
Oct 04, 2024
Viaarxiv icon

A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation

Add code
Oct 02, 2024
Figure 1 for A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
Figure 2 for A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
Figure 3 for A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
Figure 4 for A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
Viaarxiv icon

Robust Training of Neural Networks at Arbitrary Precision and Sparsity

Add code
Sep 14, 2024
Figure 1 for Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Figure 2 for Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Figure 3 for Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Figure 4 for Robust Training of Neural Networks at Arbitrary Precision and Sparsity
Viaarxiv icon

Unleashing the Potential of SAM2 for Biomedical Images and Videos: A Survey

Add code
Aug 23, 2024
Viaarxiv icon

Prompt Your Brain: Scaffold Prompt Tuning for Efficient Adaptation of fMRI Pre-trained Model

Add code
Aug 20, 2024
Figure 1 for Prompt Your Brain: Scaffold Prompt Tuning for Efficient Adaptation of fMRI Pre-trained Model
Figure 2 for Prompt Your Brain: Scaffold Prompt Tuning for Efficient Adaptation of fMRI Pre-trained Model
Figure 3 for Prompt Your Brain: Scaffold Prompt Tuning for Efficient Adaptation of fMRI Pre-trained Model
Figure 4 for Prompt Your Brain: Scaffold Prompt Tuning for Efficient Adaptation of fMRI Pre-trained Model
Viaarxiv icon

Timeliness-Fidelity Tradeoff in 3D Scene Representations

Add code
Jul 23, 2024
Figure 1 for Timeliness-Fidelity Tradeoff in 3D Scene Representations
Figure 2 for Timeliness-Fidelity Tradeoff in 3D Scene Representations
Figure 3 for Timeliness-Fidelity Tradeoff in 3D Scene Representations
Figure 4 for Timeliness-Fidelity Tradeoff in 3D Scene Representations
Viaarxiv icon

MAVIS: Mathematical Visual Instruction Tuning

Add code
Jul 11, 2024
Figure 1 for MAVIS: Mathematical Visual Instruction Tuning
Figure 2 for MAVIS: Mathematical Visual Instruction Tuning
Figure 3 for MAVIS: Mathematical Visual Instruction Tuning
Figure 4 for MAVIS: Mathematical Visual Instruction Tuning
Viaarxiv icon