Download datasets and source code
Resource | Link |
---|---|
Dataset for protein remote homology detection [1] | Download |
Dataset for circRNA-disease association identification [2] | Download |
Dataset for protein function annotation [3] | Download |
Web server dataset for homogeneous biological sequence similarity analysis | Download |
Web server dataset for heterogeneous biological sequence similarity analysis | Download |
Source code for BioSeq-Diabolo | Link |

After downloading the dataset and source code, users can decompress the dataset file into the "/data" folder of the stand-alone package and then run the experiments with the command lines given at https://github.com/Zimiao1025/Sesica or in the manual of the stand-alone package.

The above datasets were obtained from, or constructed by, the following works:

[1] Murzin AG, Brenner SE, Hubbard T, Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol. 1995;247(4):536-40. doi: 10.1016/s0022-2836(05)80134-2. WOS:A1995QQ85600003.

[2] Wei H, Xu Y, Liu B. iCircDA-LTR: identification of circRNA–disease associations based on Learning to Rank. Bioinformatics. 2021;37(19):3302-10. doi: 10.1093/bioinformatics/btab334.

[3] Cao Y, Shen Y. TALE: Transformer-based protein function Annotation with joint sequence–Label Embedding. Bioinformatics. 2021;37(18):2825-33. doi: 10.1093/bioinformatics/btab198.
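
The decompression step described above, placing a downloaded dataset in the "/data" folder of the stand-alone package, could be scripted roughly as follows. This is a minimal sketch, not part of the package: the function name `unpack_dataset` and both path arguments are hypothetical, and it assumes the download is a standard archive (zip or tar) that Python's `shutil` can recognise from its file extension.

```python
import shutil
from pathlib import Path


def unpack_dataset(archive_path, package_root):
    """Extract a downloaded dataset archive into <package_root>/data.

    Both arguments are placeholders: point them at the file you actually
    downloaded and at your local copy of the stand-alone package.
    """
    archive = Path(archive_path)
    if not archive.is_file():
        raise FileNotFoundError(f"dataset archive not found: {archive}")

    # The page asks for the dataset to live in the package's "data" folder.
    data_dir = Path(package_root) / "data"
    data_dir.mkdir(parents=True, exist_ok=True)

    # unpack_archive infers the format (zip, tar, tar.gz, ...) from the extension.
    shutil.unpack_archive(str(archive), str(data_dir))
    return data_dir
```

After calling the helper with your own paths, the experiments are run with the command lines given in the repository or the package manual, which are not reproduced here.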