iEsGene-ZCPseKNC: identify eseential genes based on Z curve pseudo k-tuple nucleotide composition |
Supporting Information S1. The benchmark dataset for archeae. It contains 518 essential genes (positive samples), and 1072 non-essential genes (negative samples). None of the sequences included has ≥ 80% pairwise sequence identity with any other in a same subset.