Pei Zhang

additional authors not shown

WeVibe: Weight Change Estimation Through Audio-Induced Shelf Vibrations In Autonomous Stores

Feb 17, 2025
Via arXiv

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Jan 10, 2025
Via arXiv

A Separable Self-attention Inspired by the State Space Model for Computer Vision

Jan 03, 2025
Via arXiv

MATEY: multiscale adaptive foundation models for spatiotemporal physical systems

Dec 29, 2024
Via arXiv

Qwen2.5 Technical Report

Dec 19, 2024
Via arXiv

ZhoBLiMP: a Systematic Assessment of Language Models with Linguistic Minimal Pairs in Chinese

Nov 09, 2024
Via arXiv

Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation

Oct 17, 2024
Via arXiv

Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning

Oct 03, 2024
Via arXiv

Hierarchical learning control for autonomous robots inspired by central nervous system

Aug 07, 2024
Via arXiv

Fusion Flow-enhanced Graph Pooling Residual Networks for Unmanned Aerial Vehicles Surveillance in Day and Night Dual Visions

Jul 17, 2024
Via arXiv