Download dataset and source code

Dataset for protein remote homology detection [1] Download
Dataset for circRNA-disease association identification [2] Download
Dataset for protein function annotation [3] Download
Dataset for homogeneous biological sequence similarity analysis of webserver Download
Dataset for heterogeneous biological sequence similarity analysis of webserver Download
Source code for BioSeq-Diabolo Link
After the users download the dataset and source code, they can decompress the dataset file to the "/data" folder of stand alone package, and then run the experiment according to the given command line in https://github.com/Zimiao1025/Sesica or the manual of stand alone package.

Above datasets are obtain from or constructed by following works:

[1] Murzin AG, Brenner SE, Hubbard T, Chothia C. SCOP - A STRUCTURAL CLASSIFICATION OF PROTEINS DATABASE FOR THE INVESTIGATION OF SEQUENCES AND STRUCTURES. J Mol Biol. 1995;247(4):536-40. doi: 10.1016/s0022-2836(05)80134-2. PubMed PMID: WOS:A1995QQ85600003.

[2] Wei H, Xu Y, Liu B. iCircDA-LTR: identification of circRNA–disease associations based on Learning to Rank. Bioinformatics. 2021;37(19):3302-10. doi: 10.1093/bioinformatics/btab334.

[3] Cao Y, Shen Y. TALE: Transformer-based protein function Annotation with joint sequence–Label Embedding. Bioinformatics. 2021;37(18):2825-33. doi: 10.1093/bioinformatics/btab198.