Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mengyang Wang

PoseX: AI Defeats Physics Approaches on Protein-Ligand Cross Docking

May 03, 2025

Yize Jiang, Xinze Li, Yuanyuan Zhang, Jin Han, Youjun Xu, Ayush Pandit, Zaixi Zhang, Mengdi Wang, Mengyang Wang, Chong Liu(+6 more)

Abstract:Recently, significant progress has been made in protein-ligand docking, especially in modern deep learning methods, and some benchmarks were proposed, e.g., PoseBench, Plinder. However, these benchmarks suffer from less practical evaluation setups (e.g., blind docking, self docking), or heavy framework that involves training, raising challenges to assess docking methods efficiently. To fill this gap, we proposed PoseX, an open-source benchmark focusing on self-docking and cross-docking, to evaluate the algorithmic advances practically and comprehensively. Specifically, first, we curate a new evaluation dataset with 718 entries for self docking and 1,312 for cross docking; second, we incorporate 22 docking methods across three methodological categories, including (1) traditional physics-based methods (e.g., Schr\"odinger Glide), (2) AI docking methods (e.g., DiffDock), (3) AI co-folding methods (e.g., AlphaFold3); third, we design a relaxation method as post-processing to minimize conformation energy and refine binding pose; fourth, we released a leaderboard to rank submitted models in real time. We draw some key insights via extensive experiments: (1) AI-based approaches have already surpassed traditional physics-based approaches in overall docking accuracy (RMSD). The longstanding generalization issues that have plagued AI molecular docking have been significantly alleviated in the latest models. (2) The stereochemical deficiencies of AI-based approaches can be greatly alleviated with post-processing relaxation. Combining AI docking methods with the enhanced relaxation method achieves the best performance to date. (3) AI co-folding methods commonly face ligand chirality issues, which cannot be resolved by relaxation. The code, curated dataset and leaderboard are released at https://github.com/CataAI/PoseX.

Via

Access Paper or Ask Questions

The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report

Apr 14, 2025

Bin Ren, Hang Guo, Lei Sun, Zongwei Wu, Radu Timofte, Yawei Li, Yao Zhang, Xinning Chai, Zhengxue Cheng, Yingsheng Qin(+136 more)

Abstract:This paper presents a comprehensive review of the NTIRE 2025 Challenge on Single-Image Efficient Super-Resolution (ESR). The challenge aimed to advance the development of deep models that optimize key computational metrics, i.e., runtime, parameters, and FLOPs, while achieving a PSNR of at least 26.90 dB on the $\operatorname{DIV2K\_LSDIR\_valid}$ dataset and 26.99 dB on the $\operatorname{DIV2K\_LSDIR\_test}$ dataset. A robust participation saw \textbf{244} registered entrants, with \textbf{43} teams submitting valid entries. This report meticulously analyzes these methods and results, emphasizing groundbreaking advancements in state-of-the-art single-image ESR techniques. The analysis highlights innovative approaches and establishes benchmarks for future research in the field.

* Accepted by CVPR2025 NTIRE Workshop, Efficient Super-Resolution Challenge Report. 50 pages

Via

Access Paper or Ask Questions

Intelligent Understanding of Large Language Models in Traditional Chinese Medicine Based on Prompt Engineering Framework

Oct 25, 2024

Yirui Chen, Qinyu Xiao, Jia Yi, Jing Chen, Mengyang Wang

Figure 1 for Intelligent Understanding of Large Language Models in Traditional Chinese Medicine Based on Prompt Engineering Framework

Figure 2 for Intelligent Understanding of Large Language Models in Traditional Chinese Medicine Based on Prompt Engineering Framework

Abstract:This paper explores the application of prompt engineering to enhance the performance of large language models (LLMs) in the domain of Traditional Chinese Medicine (TCM). We propose TCM-Prompt, a framework that integrates various pre-trained language models (PLMs), templates, tokenization, and verbalization methods, allowing researchers to easily construct and fine-tune models for specific TCM-related tasks. We conducted experiments on disease classification, syndrome identification, herbal medicine recommendation, and general NLP tasks, demonstrating the effectiveness and superiority of our approach compared to baseline methods. Our findings suggest that prompt engineering is a promising technique for improving the performance of LLMs in specialized domains like TCM, with potential applications in digitalization, modernization, and personalized medicine.

Via

Access Paper or Ask Questions

Spiking Semantic Communication for Feature Transmission with HARQ

Oct 13, 2023

Mengyang Wang, Jiahui Li, Mengyao Ma, Xiaopeng Fan

Abstract:In Collaborative Intelligence (CI), the Artificial Intelligence (AI) model is divided between the edge and the cloud, with intermediate features being sent from the edge to the cloud for inference. Several deep learning-based Semantic Communication (SC) models have been proposed to reduce feature transmission overhead and mitigate channel noise interference. Previous research has demonstrated that Spiking Neural Network (SNN)-based SC models exhibit greater robustness on digital channels compared to Deep Neural Network (DNN)-based SC models. However, the existing SNN-based SC models require fixed time steps, resulting in fixed transmission bandwidths that cannot be adaptively adjusted based on channel conditions. To address this issue, this paper introduces a novel SC model called SNN-SC-HARQ, which combines the SNN-based SC model with the Hybrid Automatic Repeat Request (HARQ) mechanism. SNN-SC-HARQ comprises an SNN-based SC model that supports the transmission of features at varying bandwidths, along with a policy model that determines the appropriate bandwidth. Experimental results show that SNN-SC-HARQ can dynamically adjust the bandwidth according to the channel conditions without performance loss.

Via

Access Paper or Ask Questions

S-JSCC: A Digital Joint Source-Channel Coding Framework based on Spiking Neural Network

Oct 13, 2022

Mengyang Wang, Jiahui Li, Mengyao Ma, Xiaopeng Fan

Figure 1 for S-JSCC: A Digital Joint Source-Channel Coding Framework based on Spiking Neural Network

Figure 2 for S-JSCC: A Digital Joint Source-Channel Coding Framework based on Spiking Neural Network

Figure 3 for S-JSCC: A Digital Joint Source-Channel Coding Framework based on Spiking Neural Network

Figure 4 for S-JSCC: A Digital Joint Source-Channel Coding Framework based on Spiking Neural Network

Abstract:Nowadays, deep learning-based joint source-channel coding (JSCC) is getting attention, and it shows excellent performance compared with separate source and channel coding (SSCC). However, most JSCC works are only designed, trained, and tested on additive white Gaussian noise (AWGN) channels to transmit analog signals. In current communication systems, digital signals are considered more. Hence, it is necessary to design an end-to-end JSCC framework for digital signal transmission. In this paper, we propose a digital JSCC framework (S-JSCC) based on spiking neural network (SNN) to tackle this problem. The SNN is used to compress the feature of the deep model, and the compressed results are transmitted over digital channels such as binary symmetric channel (BSC) and binary erasure channel (BEC). Since the outputs of SNN are binary spikes, the framework can be applied directly to digital channels without extra quantization. Moreover, we propose a new spiking neuron and regularization method to improve the performance and robustness of the system. The experimental results show that under digital channels, the proposed S-JSCC framework performs better than the state-of-the-art convolution neural network (CNN)-based JSCC framework, which needs extra quantization.

Via

Access Paper or Ask Questions

Constellation Design for Deep Joint Source-Channel Coding

Jun 08, 2022

Mengyang Wang, Jiahui Li, Mengyao Ma, Xiaopeng Fan

Figure 1 for Constellation Design for Deep Joint Source-Channel Coding

Figure 2 for Constellation Design for Deep Joint Source-Channel Coding

Figure 3 for Constellation Design for Deep Joint Source-Channel Coding

Figure 4 for Constellation Design for Deep Joint Source-Channel Coding

Abstract:Deep learning-based joint source-channel coding (JSCC) has shown excellent performance in image and feature transmission. However, the output values of the JSCC encoder are continuous, which makes the constellation of modulation complex and dense. It is hard and expensive to design radio frequency chains for transmitting such full-resolution constellation points. In this paper, two methods of mapping the full-resolution constellation to finite constellation are proposed for real system implementation. The constellation mapping results of the proposed methods correspond to regular constellation and irregular constellation, respectively. We apply the methods to existing deep JSCC models and evaluate them on AWGN channels with different signal-to-noise ratios (SNRs). Experimental results show that the proposed methods outperform the traditional uniform quadrature amplitude modulation (QAM) constellation mapping method by only adding a few additional parameters.

Via

Access Paper or Ask Questions

Deep Joint Source-Channel Coding for Multi-Task Network

Sep 27, 2021

Mengyang Wang, Zhicong Zhang, Jiahui Li, Mengyao Ma, Xiaopeng Fan

Figure 1 for Deep Joint Source-Channel Coding for Multi-Task Network

Figure 2 for Deep Joint Source-Channel Coding for Multi-Task Network

Figure 3 for Deep Joint Source-Channel Coding for Multi-Task Network

Figure 4 for Deep Joint Source-Channel Coding for Multi-Task Network

Abstract:Multi-task learning (MTL) is an efficient way to improve the performance of related tasks by sharing knowledge. However, most existing MTL networks run on a single end and are not suitable for collaborative intelligence (CI) scenarios. In this work, we propose an MTL network with a deep joint source-channel coding (JSCC) framework, which allows operating under CI scenarios. We first propose a feature fusion based MTL network (FFMNet) for joint object detection and semantic segmentation. Compared with other MTL networks, FFMNet gets higher performance with fewer parameters. Then FFMNet is split into two parts, which run on a mobile device and an edge server respectively. The feature generated by the mobile device is transmitted through the wireless channel to the edge server. To reduce the transmission overhead of the intermediate feature, a deep JSCC network is designed. By combining two networks together, the whole model achieves 512x compression for the intermediate feature and a performance loss within 2% on both tasks. At last, by training with noise, the FFMNet with JSCC is robust to various channel conditions and outperforms the separate source and channel coding scheme.

* Accpeted by IEEE Signal Processing Letters

Via

Access Paper or Ask Questions