This website presents a method for protein remote homology detection that uses distances between Top-n-grams and residue pairs.


  This website has five parts: 'Home', 'Server', 'Manual', 'Download', and 'Contact'. The 'Home' page shows the abstract and the main process of our method. The 'Server' page constructs feature vectors: when you input protein sequences, it outputs the corresponding feature vectors. The 'Manual' page (this page) explains how to use the website. The 'Download' page provides the source code of the method. The 'Contact' page lists the authors of this site; if you have any questions, you can contact them at any time by e-mail.


  Input can be provided in two ways. The first is a text area where users can enter protein sequences in FASTA format and then click the construct button to obtain the feature vectors. The second is a file selection box where users can choose a FASTA-format file from the local computer and then click the construct button to obtain the feature vectors. The reset button clears all the information just entered.
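
  Before submitting, it can help to check that the input really is in FASTA format. The following Python code is only a minimal sketch of such a check and is not part of the server: the function name parse_fasta and the example records (seq1, seq2) are hypothetical, and the sketch assumes the usual FASTA convention of a '>' header line followed by one or more sequence lines.

```python
def parse_fasta(text):
    """Return a list of (header, sequence) tuples from FASTA-formatted text."""
    records = []
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # a new header closes the previous record, if any
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:].strip()
            chunks = []
        elif header is not None:
            chunks.append(line)  # sequence may span several lines
        else:
            raise ValueError("sequence data found before the first '>' header")
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# Hypothetical input, for illustration only.
example = """>seq1
MKVLA
GHT
>seq2
ACDEF
"""

for name, seq in parse_fasta(example):
    print(name, len(seq))
```

  If the function raises an error or returns an empty list, the text is probably not valid FASTA and should be corrected before it is pasted into the text area or uploaded as a file.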


  After the analysis, the results are shown as a table with one row per input sequence: the sequence name followed by its feature vector. The feature vectors are constructed using a distance threshold of 3.
An output example:

         Name          Feature
         >1HBG          30.0 3.0 6.0 1.0 7.0 3.0 5.0 8.0 20.0 3.0 11.0 12.0 5.0 4.0 3.0 10.0 1.0 2.0 3.0 10.0 8.0 0.0 2.0 ...
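
  For further processing, each result row can be split into a name and a list of numbers. The following Python code is a sketch that assumes the row is whitespace-separated as in the example above; the function name parse_feature_line is hypothetical, and the sample row uses only the first few values shown above.

```python
def parse_feature_line(line):
    """Split one result row into (name, list of float feature values)."""
    parts = line.split()
    if not parts:
        raise ValueError("empty result line")
    name = parts[0].lstrip(">")
    # ignore a trailing "..." that marks a truncated display
    values = [float(x) for x in parts[1:] if x != "..."]
    return name, values


# First few values of the example row above; the rest is omitted here.
row = ">1HBG          30.0 3.0 6.0 1.0 7.0 ..."
name, features = parse_feature_line(row)
print(name, features)
```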

