sgRNA-PSM: predict sgRNAs on-target activity based on Position Specific Mismatch |
Supporting Information. The benchmark dataset for sgRNA. It consists of 5310 guide sequences targeting 11 human genes (CD13, CD15, CD33, CCDC101, MED12, TADA2B, TADA1, HPRT, CUL3, NF1, NF2) and 6 mouse genes (Cd45, Cd28,Cd43,Cd5, H2-K, Thy1). The positive subset contains 1059 high on-target activity sgRNAs, while the negative subset contains 4251 low on-target activity sgRNAs.