Yichi Zhang

AI Lab, Netease

HonkaiChat: Companions from Anime that feel alive!

Jan 05, 2025

Have We Designed Generalizable Structural Knowledge Promptings? Systematic Evaluation and Rethinking

Dec 31, 2024

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Dec 30, 2024

6DMA-Aided Hybrid Beamforming with Joint Antenna Position and Orientation Optimization

Dec 22, 2024

PyOD 2: A Python Library for Outlier Detection with LLM-powered Model Selection

Dec 11, 2024

Scaling Laws for Black box Adversarial Attacks

Nov 25, 2024

Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance

Nov 21, 2024

MKGL: Mastery of a Three-Word Language

Oct 10, 2024

MetaOOD: Automatic Selection of OOD Detection Models

Oct 04, 2024

A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation

Oct 02, 2024