HierCC (Hierarchical Clustering of CgMLST) ========================================== Hierarchical Clustering of CgMLST (HierCC) defines clusters based on cgMLST. Distances between genomes is calculated using the number of shared cgMLST alleles and genomes are linked on a single-linkage clustering criteria. These clusters are assigned stable cluster group numbers at different, fixed cgMLST allele distances. *Salmonella* for instance, has cut-offs such as 1, 2, 5, 10, 20, 50, 100, etc. HierCC Methods -------------- Construction of Hierarchical Clustering scheme for cgMLST (HierCC) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Hierarchical clustering schemes in EnteroBase were initially developed as sets of sub-trees of a minimum spanning tree (MSTree) constructed of all the cgMLST STs. In particular, in order to minimize the effect of missing data, the genetic similarity of STs that were used to infer the MSTree was defined as the number of shared core genomic alleles. Then for a cgMLST scheme with D loci, given any cutoff value d