VirClust is a bioinformatics tool which can be used for:
• virus clustering
• protein annotation
• core protein calculation
At its core is the grouping of viral proteins into clusters of three different levels:
• at the first level, proteins are grouped based on their reciprocal BLASTP similarities into protein clusters, or PCs.
• at the second level, PCs are grouped based on their Hidden Markov Model (HMM) similarities into protein superclusters, or PSCs.
• at the third, still experimental level, PSCs are grouped based on their HMM similarities into protein super-superclusters, or PSSC.
More about the how it works can be read here DOI: 10.1101/2021.06.14.448304.
If you are using VirClust, please cite the following pre-print publication: • Moraru, Cristina (2021): VirClust, a tool for hierarchical clustering, core gene detection and annotation of (prokaryotic) viruses. In BioRxiv. DOI: 10.1101/2021.06.14.448304.
Additionally, if you are performing viral protein annotations using VirClust, please also cite the respective databases used for the annotations, see VirClust manuscript for the complete citations