Protein-Protein Interactions (PPIs) which have a vital role in many cellular processes, can be inhibited by small synthetic molecules at hotspot of PPIs. Those small molecules are called Protein-Protein Interaction Modulators (PPIMs). Therefore, we built a web based prediction server named as Prediction of Protein-Protein Interaction Modulators (PPIMpred) which may complement High-throughput Docking studies using small chemicals. This webserver can perform faster with better accuracy to indentify the ligands on target PPIs using support vector machine (SVM) based method and similarity search algorithm (Tanimoto Co-efficient). Here three popular PPIs were studied : i) Mdm2/P53 ii) Bcl2/bak and and iii) c-Myc/Max.
Datasets:
Cross-validation dataset: The data of distinct small molecules (inhibitors) for three PPIs: Mdm2/P53, Bcl2/Bak, and c-Myc/Max were downloaded from TIMBAL and PubChem databases. The positive datasets of Mdm2/P53, Bcl2/Bak, and c-Myc/Max consisted of 250, 735 and 15 small molecules respectively. No two small molecules have 100% similarity, based on PubChem BioAssay structure clustering. To see structure clustering of Mdm2/P53, click here. In case of Mdm2/P53 and Bcl2/Bak the negative sets were prepared by choosing 1040 random chemicals from PubChem and adding the other two positive set of PPIMs. For example, Bcl2/Bak and c-Myc/Max positive sets were included in Mdm2/P53 negative set along with 1040 random chemicals. In a case of c-Myc/Max there were only 15 positive PPIMs, so we selected ten times of positive set from random small chemicals as a negative set. Therefore, the negative datasets of three PPIs (Mdm2/P53, Bcl2/Bak, and c-Myc/Max) became 1790, 1305 and 150 molecules respectively. The positive and negative sets were further divided into five equal parts for training/testing purpose in five-fold cross validation technique.
Training/Testing Dataset | ||
TARGET PPI | POSITIVE | NEGATIVE |
Mdm2/P53 | Mdm2/P53 Training Positive Data | Mdm2/P53 Training Negative Data | Bcl2/Bak | Bcl2/Bak Training Positive Data | Bcl2/BakTraining Negative Data |
c-Myc/Max | c-Myc/Max Training Positive Data | c-Myc/Max Training Negative Data |
Independent/blind Dataset | |||
TARGET PPI | POSITIVE | NEGATIVE I | NEGATIVE II |
Mdm2/P53 | Mdm2/P53 Training Positive Independent Data | Mdm2/P53 Training NEGATIVE I Independent Data | Mdm2/P53 Training NEGATIVE II Independent Data |
Bcl2/Bak | Bcl2/BakTraining Positive Independent Data | Bcl2/BakTraining NEGATIVE I Independent Data | Bcl2/BakTraining NEGATIVE II Independent Data |
c-Myc/Max | c-Myc/MaxTraining Positive Independent Data | c-Myc/Max Training NEGATIVE I Independent Data | c-Myc/Max Training NEGATIVE II Independent Data |
Methods:
fig.1 Density plot of CID 11433190 which can inhibit Mdm2/P53 PPI and its positiveness: 52.14 & negativeness: 23.27
Target PPI | Sensitivity | Specficity | Accuracy | PPV | F1 | AUC |
Mdm2/P53 | 0.83 | 0.82 | 0.83 | 0.45 | 0.57 | 0.88 RBF kernel |
Bcl2/Bak | 0.86 | 0.75 | 0.79 | 0.72 | 0.77 | 0.83 RBF kernel |
c-Myc/Max | 0.87 | 0.91 | 0.90 | 0.50 | 0.63 | 0.91 RBF kernel |
Mdm2/P53 |
Bcl2/Bak |
c-Myc/Max |
Fig.2 ROC plots in three different targets Mdm2/P53, Bcl2/BAK and c-Myc/Max
Result of known PPIMs:
There are some well known PPIMs which were clinically tested. Such as Nutlin 3a, ABT-263 and GX15-070 etc. Those were evaluted by our SVM based models. The ouput of those PPIMs are tabulated here.Users input chemical/ (CID) | Target protein-protein interaction complex | Result of prediction | SVM score | Positiveness | Negativeness |
Nutlin 3a (11433190) | Mdm2/P53 | positive PPIM | 1.17 | 52.14% | 23.27% |
ABT-263 | Bcl2/Bak | positive PPIM | 1.01 | 42.86 | 3.46 |
GX15-070 | Bcl2/Bak | positive PPIM | 0.56 | 17.02% | 13.27% |
Position of Nutlin 3a in frequency density plot of Mdm2/P53. | Position of ABT-263 & GX15-070 in frequency density plot of Bcl2/Bak |
Conclusion : In summary, PPIMpred can be useful for high throughput screening of small chemicals and besides categorical classification it also give hints of structural similarity with known drug like molecules for further insights.
Cite this article:
Jana T, Ghosh A, Das Mandal S, Banerjee R, Saha S. 2017 PPIMpred:a web server for high-throughput screening of small molecules targeting protein–protein interaction.R. Soc. open sci.4: 160501. http://dx.doi.org/10.1098/rsos.1605 PUBMED: 28484602