Download dataset and source code

Dataset for protein remote homology detection [1] Download
Dataset for circRNA-disease association identification [2] Download
Dataset for protein function annotation [3] Download
Dataset for homogeneous biological sequence similarity analysis on the web server Download
Dataset for heterogeneous biological sequence similarity analysis on the web server Download
Source code for BioSeq-Diabolo Link
After downloading the datasets and source code, users can decompress the dataset files into the "/data" folder of the stand-alone package and then run the experiments using the command lines given in the manual of the stand-alone package.
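
The decompression step can be scripted. The following is a minimal sketch, not part of the BioSeq-Diabolo package: the function name extract_dataset is hypothetical, and it assumes the downloaded dataset is a zip or tar archive whose format can be inferred from its file extension.

```python
import shutil
from pathlib import Path


def extract_dataset(archive_path, package_root):
    """Unpack a downloaded dataset archive into <package_root>/data.

    Hypothetical helper for illustration; it is not shipped with the package.
    """
    data_dir = Path(package_root) / "data"
    # Create the data folder if the package does not already contain it
    data_dir.mkdir(parents=True, exist_ok=True)
    # unpack_archive infers the format (zip, tar, tar.gz, ...) from the extension
    shutil.unpack_archive(str(archive_path), str(data_dir))
    # Return the names of the top-level entries now present in the data folder
    return sorted(p.name for p in data_dir.iterdir())
```

For example, extract_dataset("dataset.zip", "BioSeq-Diabolo") would unpack the archive into "BioSeq-Diabolo/data"; both names here are placeholders for the actual downloaded file and the location of the stand-alone package.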

The above datasets were obtained from or constructed by the following works:

[1] Murzin AG, Brenner SE, Hubbard T, Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol. 1995;247(4):536-40. doi: 10.1016/s0022-2836(05)80134-2. Web of Science accession number: WOS:A1995QQ85600003.

[2] Wei H, Xu Y, Liu B. iCircDA-LTR: identification of circRNA–disease associations based on Learning to Rank. Bioinformatics. 2021;37(19):3302-10. doi: 10.1093/bioinformatics/btab334.

[3] Cao Y, Shen Y. TALE: Transformer-based protein function Annotation with joint sequence–Label Embedding. Bioinformatics. 2021;37(18):2825-33. doi: 10.1093/bioinformatics/btab198.