Multiclass Classification of Drug Resistance in MTB clinical isolates (MCDR-MTB) is a webserver that uses a Support Vector Machine (SVM) model with a linear kernel to predict the drug resistance class of Mycobacterium tuberculosis (MTB) isolates from Variant Call Format (VCF) files derived from whole genome sequencing (WGS) data.
It classifies MTB isolates into three classes: 1) extensively drug-resistant (XDR), 2) multidrug-resistant (MDR), and 3) drug-susceptible (Susc).
The VCF files contain sequence variation information, such as SNPs, InDels and other types of mutations, obtained from WGS data analysis. Variant calling was performed on 16 targeted regions associated with anti-tuberculosis drugs. The model achieved an overall accuracy of 0.957. The predicted class is determined from the prediction probabilities, and the prediction confidence is measured using a reliability index (RI).
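As a rough illustration of how a class label and a confidence score can be derived from per-class probabilities, the sketch below picks the most probable class and reports the gap between the two highest probabilities. This is an assumption made for illustration only: the page does not state how MCDR-MTB computes its RI, and the function name predict_class and the example probabilities are hypothetical.

```python
from typing import Dict, Tuple


def predict_class(probabilities: Dict[str, float]) -> Tuple[str, float]:
    """Return (predicted class, illustrative reliability score)."""
    if len(probabilities) < 2:
        raise ValueError("need probabilities for at least two classes")
    # Rank classes from most to least probable.
    ranked = sorted(probabilities.items(), key=lambda kv: kv[1], reverse=True)
    (best_label, best_p), (_, second_p) = ranked[0], ranked[1]
    # ASSUMPTION: reliability is taken as the gap between the top two
    # probabilities. The actual RI formula used by MCDR-MTB is not given here.
    reliability = best_p - second_p
    return best_label, reliability


# Hypothetical probabilities for one isolate.
label, ri = predict_class({"XDR": 0.10, "MDR": 0.75, "Susc": 0.15})
print(label, round(ri, 2))
```

A larger gap means the top class is more clearly separated from the runner-up; a gap near zero flags a borderline call.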
This webserver can perform multiclass classification of a single MTB isolate from a VCF file or of multiple isolates from a merged VCF file. Users with a large number of MTB WGS datasets (in either FASTQ or VCF format) can use the standalone version of MCDR-MTB, which is available here, to predict the drug resistance class.
Two input options are provided for uploading .vcf file(s) (maximum file size: 18 MB).
An example of a VCF file | An example of a merged VCF file
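Before uploading, it can help to check whether a file holds a single isolate or is a merged VCF, and whether it fits the size limit. The sketch below relies on the standard VCF layout, in which the #CHROM header line lists eight fixed columns, then FORMAT, then one column per sample. The helper names and the reading of 18 MB as 18,000,000 bytes are assumptions made for illustration and are not part of MCDR-MTB.

```python
from typing import Iterable, List

# ASSUMPTION: the 18 MB limit is read as 18 * 10**6 bytes; the server may
# count the limit differently.
MAX_UPLOAD_BYTES = 18 * 10**6


def sample_names(vcf_lines: Iterable[str]) -> List[str]:
    """Return the sample column names from the #CHROM header line."""
    for line in vcf_lines:
        if line.startswith("#CHROM"):
            columns = line.rstrip("\r\n").split("\t")
            # Columns 0-7 are fixed fields, column 8 is FORMAT,
            # and every column after that is one sample.
            return columns[9:]
    raise ValueError("no #CHROM header line found")


def describe_vcf(vcf_lines: Iterable[str]) -> str:
    """Label the input as a single-isolate or merged VCF."""
    count = len(sample_names(vcf_lines))
    if count == 0:
        return "no samples"
    return "single isolate" if count == 1 else f"merged ({count} isolates)"


def fits_upload_limit(size_bytes: int) -> bool:
    """Check a file size (e.g. from os.path.getsize) against the limit."""
    return size_bytes <= MAX_UPLOAD_BYTES


# Hypothetical header with two isolates.
header = "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tiso1\tiso2\n"
print(describe_vcf(["##fileformat=VCFv4.2\n", header]))
```

For a file on disk, passing the open file handle to describe_vcf and os.path.getsize(path) to fits_upload_limit covers both checks without loading the whole file.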
The VCF file input for the MCDR-MTB webserver can be generated using the scripts provided in the GitHub repository.