ARTÍCULO
TITULO

TranScreen: Transfer Learning on Graph-Based Anti-Cancer Virtual Screening Model

Milad Salem    
Aminollah Khormali    
Arash Keshavarzi Arshadi    
Julia Webb and Jiann-Shiun Yuan    

Resumen

Deep learning?s automatic feature extraction has proven its superior performance over traditional fingerprint-based features in the implementation of virtual screening models. However, these models face multiple challenges in the field of early drug discovery, such as over-training and generalization to unseen data, due to the inherently unbalanced and small datasets. In this work, the TranScreen pipeline is proposed, which utilizes transfer learning and a collection of weight initializations to overcome these challenges. An amount of 182 graph convolutional neural networks are trained on molecular source datasets and the learned knowledge is transferred to the target task for fine-tuning. The target task of p53-based bioactivity prediction, an important factor for anti-cancer discovery, is chosen to showcase the capability of the pipeline. Having trained a collection of source models, three different approaches are implemented to compare and rank them for a given task before fine-tuning. The results show improvement in performance of the model in multiple cases, with the best model increasing the area under receiver operating curve ROC-AUC from 0.75 to 0.91 and the recall from 0.25 to 1. This improvement is vital for practical virtual screening via lowering the false negatives and demonstrates the potential of transfer learning. The code and pre-trained models are made accessible online.

 Artículos similares

       
 
Yong Liu, Xiaohui Yan, Wenying Du, Tianqi Zhang, Xiaopeng Bai and Ruichuan Nan    
The current work proposes a novel super-resolution convolutional transposed network (SRCTN) deep learning architecture for downscaling daily climatic variables. The algorithm was established based on a super-resolution convolutional neural network with t... ver más
Revista: Water

 
Ziyi Wang, Jinqing Jia, Lihua Zhang and Ziqi Li    
The direct-shear test is the primary method used to test the shear strength of transparent soil, but this experiment is complex and easily influenced by experimental conditions. In order to simplify the process of obtaining the shear strength of transpar... ver más
Revista: Buildings

 
Tahir Mehmood, Ivan Serina, Alberto Lavelli, Luca Putelli and Alfonso Gerevini    
Biomedical named entity recognition (BioNER) is a preliminary task for many other tasks, e.g., relation extraction and semantic search. Extracting the text of interest from biomedical documents becomes more demanding as the availability of online data is... ver más
Revista: Future Internet

 
Lorenzo Ridolfi, David Naseh, Swapnil Sadashiv Shinde and Daniele Tarchi    
With the advent of 6G technology, the proliferation of interconnected devices necessitates a robust, fully connected intelligence network. Federated Learning (FL) stands as a key distributed learning technique, showing promise in recent advancements. How... ver más
Revista: Future Internet

 
Nehad M. Ibrahim, Dalia G. Gabr, Atta Rahman, Dhiaa Musleh, Dania AlKhulaifi and Mariam AlKharraa    
Plant taxonomy is the scientific study of the classification and naming of various plant species. It is a branch of biology that aims to categorize and organize the diverse variety of plant life on earth. Traditionally, plant taxonomy has been performed ... ver más