Are you sure you want to delete this task? Once this task is deleted, it cannot be recovered.
Katherine1216 8335da8ba5 | 2 years ago | |
---|---|---|
SPVec | 2 years ago | |
examples | 2 years ago | |
original data | 2 years ago | |
source codes | 2 years ago | |
LICENSE | 2 years ago | |
README | 2 years ago | |
README.md | 2 years ago | |
requirements.txt | 2 years ago | |
setup.py | 2 years ago |
Codes and datasets for "SPVec: A Word2vec-inspired feature representation method for Drug-Target Interaction Prediction"
Drug discovery is an academical and commercial process of global importance. Accurate identification of drug-target interactions (DTIs) can significantly facilitate the drug discovery process. Compared to the costly, labour-intensive and time-consuming experimental methods, machine learning (ML) plays an ever-increasingly important role in effective, efficient and high-throughput identification of DTIs. However, upstream feature extraction methods require tremendous human resources and expert insights, which limited the application of ML approaches.Inspired by the unsupervised representation learning methods like Word2vec, we proposed SPVec, a novel way to automatically represent raw data such as SMILES strings and protein sequences into continuous, information-rich and lower-dimensional vectors, so as to avoid the sparseness and bit collisions from the cumbersomely manually extracted features. SPVec illustrated that the similar compounds or proteins occupy similar vector space, which indicated that SPVec not only encodes compound substructures or protein sequences efficiently, but also implicitly reveals some important biophysical and biochemical patterns. SPVec can provide an effective and efficient way to discover reliable DTIs, which would be beneficial for drug reprofiling.
datasets_construction.py
are codes for data preprocessing and datasets construction.
feature_reprensentation.py
are codes for SPVec implemention
models_training.py
are codes for parameters tuning and models training.
raw data can be downloaded from https://www.bindingdb.org/bind/chemsearch/marvin/SDFdownload.jsp?all_download=yes and https://www.drugbank.ca/release
SPVec: A Word2vec-inspired feature representation method for Drug-Target Interaction Prediction
No Description
Text CSV Python
Dear OpenI User
Thank you for your continuous support to the Openl Qizhi Community AI Collaboration Platform. In order to protect your usage rights and ensure network security, we updated the Openl Qizhi Community AI Collaboration Platform Usage Agreement in January 2024. The updated agreement specifies that users are prohibited from using intranet penetration tools. After you click "Agree and continue", you can continue to use our services. Thank you for your cooperation and understanding.
For more agreement content, please refer to the《Openl Qizhi Community AI Collaboration Platform Usage Agreement》