Was this page helpful?

GENSCAN output-predicted exons


    GENSCANW output for sequence Smo8



    
    GENSCAN 1.0	Date run: 21-Oct-108	Time: 20:49:08
    
    Sequence Smo8 : 96116 bp : 47.80% C+G : Isochore 2 (43 - 51 C+G%)
    
    Parameter matrix: Arabidopsis.smat
    
    Predicted genes/exons:
    
    
    Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
    ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
    
     1.34 Intr -    303     17  287  1  2   61   77   329 0.995  31.16
     1.33 Intr -    616    362  255  2  0   -1   61   160 0.887   6.82
     1.32 Intr -    789    668  122  2  2   39   87   123 0.922  12.44
     1.31 Intr -    993    839  155  0  2   24   64   102 0.999   5.27
     1.30 Intr -   1273   1044  230  2  2   64  101   114 0.739  12.89
     1.29 Intr -   1354   1325   30  2  0  114   74   -17 0.648   2.60
     1.28 Intr -   1541   1469   73  2  1   72  131    66 0.999  13.28
     1.27 Intr -   1743   1604  140  1  2   70   78   132 0.999  15.58
     1.26 Intr -   1998   1791  208  0  1   60  116   215 0.980  25.15
     1.25 Intr -   2213   2049  165  2  0   13   62   203 0.999  15.36
     1.24 Intr -   2363   2265   99  2  0   63   88   185 0.997  21.31
     1.23 Intr -   2594   2423  172  1  1   19   67   171 0.997  13.05
     1.22 Intr -   2791   2663  129  0  0  113   81   215 0.999  28.11
     1.21 Intr -   2964   2877   88  1  1  -11   97    48 0.285  -0.17
     1.20 Intr -   3359   3230  130  2  1   61   41    51 0.316   2.97
     1.19 Intr -   4085   3692  394  1  1   82   21   432 0.181  35.56
     1.18 Intr -   4276   4136  141  0  0   80   83   163 0.999  19.47
     1.17 Intr -   4503   4343  161  0  2   19   71   210 0.952  16.19
     1.16 Intr -   4654   4554  101  2  2  -30  110   104 0.383   5.63
     1.15 Intr -   5150   4896  255  0  0  -39   43   319 0.378  17.12
     1.14 Intr -   5473   5180  294  2  0   21   28   296 0.253  18.88
     1.13 Intr -   5802   5597  206  2  2  -64   91   110 0.203  -0.06
     1.12 Intr -   5931   5816  116  0  2   58   -2   147 0.124   6.95
     1.11 Intr -   6134   5995  140  0  2   70   77     2 0.479   2.48
     1.10 Intr -   6422   6194  229  2  1   68    8   333 0.805  25.74
     1.09 Intr -   6520   6470   51  1  0   43  108    49 0.981   6.50
     1.08 Intr -   6793   6568  226  0  1    5   68   231 0.970  15.79
     1.07 Intr -   6911   6841   71  2  2   91   52    51 0.995   4.78
     1.06 Intr -   7090   6944  147  1  0  104   85   118 0.992  18.53
     1.05 Intr -   7341   7147  195  0  0   66   86   148 0.913  17.01
     1.04 Intr -   7718   7657   62  0  2   82   91    71 0.977  10.25
     1.03 Intr -   7995   7883  113  2  2   29   35    65 0.008   0.32
     1.02 Intr -   8287   8217   71  1  2   59   89    46 0.573   4.78
     1.01 Init -   8463   8335  129  0  0   89   99   106 0.649  16.95
     1.00 Prom -   8517   8478   40                             -14.63
    
     2.00 Prom +   8528   8567   40                             -10.35
     2.01 Sngl +   8612   9613 1002  1  0   43   43  1051 0.994  98.05
     2.02 PlyA +   9688   9693    6                              -4.04
    
     3.00 Prom +   9720   9759   40                              -8.46
     3.01 Sngl +   9778  10713  936  0  0   47   41  1073 0.166  99.99
     3.02 PlyA +  10743  10748    6                               1.05
    
     4.02 PlyA -  10756  10751    6                              -4.33
     4.01 Sngl -  11751  10777  975  0  0   70   37   858 0.999  80.67
     4.00 Prom -  11797  11758   40                             -14.47
    
     5.00 Prom +  11821  11860   40                             -16.88
     5.01 Init +  11876  12125  250  1  1   92  -21   181 0.825   8.43
     5.02 Intr +  12191  12460  270  0  0   43  -59   218 0.774   5.11
     5.03 Intr +  12474  12640  167  1  2   82   19   289 0.894  26.08
     5.04 Intr +  12718  12988  271  0  1   51   47   147 0.863   9.01
     5.05 Intr +  13046  13162  117  0  0   54   52   116 0.588  10.04
     5.06 Intr +  13379  13644  266  0  2   22   60   244 0.986  17.23
     5.07 Intr +  13780  13990  211  0  1   23  117   184 0.423  18.29
     5.08 Intr +  14016  14200  185  1  2   18   26   255 0.641  16.81
     5.09 Intr +  14458  14662  205  0  1   22   80    85 0.453   4.87
     5.10 Intr +  14968  15162  195  2  0  -17   20   229 0.545   9.99
     5.11 Intr +  15215  15686  472  0  1   73   28   598 0.660  49.44
     5.12 Term +  15713  15806   94  2  1   12   47   171 0.845   7.70
     5.13 PlyA +  16215  16220    6                              -0.45
    
     6.05 PlyA -  16326  16321    6                              -0.45
     6.04 Term -  16892  16734  159  2  0   52   53   269 0.999  22.74
     6.03 Intr -  17356  17108  249  1  0   34   34   253 0.014  17.13
     6.02 Intr -  17657  17496  162  2  0   51  -11   155 0.014   7.07
     6.01 Init -  18529  18470   60  1  0   74   91     8 0.011   4.17
     6.00 Prom -  18667  18628   40                             -15.60
    
     7.02 PlyA -  19049  19044    6                               1.05
     7.01 Sngl -  20066  19080  987  2  0   65   46  1443 0.999 139.65
     7.00 Prom -  20420  20381   40                              -8.76
    
     8.00 Prom +  20518  20557   40                             -11.63
     8.01 Init +  20594  21454  861  1  0   56   63   972 0.000  91.11
     8.02 Intr +  22012  22317  306  0  0   35   13   267 0.000  15.55
     8.03 Intr +  22706  22977  272  1  2   -9   53   274 0.031  15.54
     8.04 Intr +  23030  23055   26  2  2   16   40    46 0.070  -4.73
     8.05 Intr +  23498  24018  521  0  2  -11  -43   725 0.024  47.17
     8.06 Term +  24099  24503  405  2  0   17   42   415 0.765  29.99
     8.07 PlyA +  25328  25333    6                               1.05
    
     9.08 PlyA -  25833  25828    6                               1.05
     9.07 Term -  26115  25891  225  0  0   98   37   113 0.826   8.98
     9.06 Intr -  26232  26140   93  0  0   12   68    77 0.768   3.26
     9.05 Intr -  26444  26375   70  1  1  -10   80    43 0.614  -2.12
     9.04 Intr -  27651  27505  147  2  0   71   48   156 0.078  14.25
     9.03 Intr -  29491  29306  186  0  0   40   46   315 0.509  26.30
     9.02 Intr -  29539  29521   19  2  1   24   53    32 0.849  -6.03
     9.01 Init -  29839  29779   61  1  1   85  131   113 0.852  21.71
     9.00 Prom -  30117  30078   40                              -8.16
    
    10.00 Prom +  30334  30373   40                             -12.59
    10.01 Init +  30535  30602   68  0  2   54   74    98 0.603  10.57
    10.02 Term +  30984  31218  235  0  1   41   38   149 0.878   5.99
    10.03 PlyA +  31228  31233    6                              -3.94
    
    11.02 PlyA -  31453  31448    6                               1.05
    11.01 Sngl -  32470  31484  987  1  0   65   46  1249 0.999 120.25
    11.00 Prom -  32525  32486   40                              -9.75
    
    12.00 Prom +  33050  33089   40                             -12.49
    12.01 Init +  33309  33615  307  2  1   76   47   461 0.023  43.26
    12.02 Term +  34647  35494  848  1  2   53   48   856 0.018  76.14
    12.03 PlyA +  35581  35586    6                              -6.47
    
    13.09 PlyA -  35614  35609    6                              -5.41
    13.08 Term -  35964  35644  321  0  0  111   42   344 0.992  31.92
    13.07 Intr -  36099  36024   76  2  1   77   74   200 0.970  22.02
    13.06 Intr -  36360  36164  197  0  2   72   62   190 0.990  18.01
    13.05 Intr -  36561  36418  144  0  0   86   95   231 0.991  29.08
    13.04 Intr -  36814  36694  121  0  1   65   86    76 0.968  10.60
    13.03 Intr -  36929  36835   95  2  2   45   20    21 0.116  -5.24
    13.02 Intr -  37040  36974   67  1  1  119   -9   126 0.120   9.91
    13.01 Init -  37266  37094  173  0  2   74   68   282 0.999  28.71
    13.00 Prom -  37312  37273   40                             -16.89
    
    14.04 PlyA -  37327  37322    6                              -4.73
    14.03 Term -  38304  37357  948  0  0   63   54  1212 0.092 112.38
    14.02 Intr -  38776  38505  272  2  2   46  -38   195 0.045   4.96
    14.01 Init -  40507  38917 1591  1  1  105   61  1813 0.904 177.92
    14.00 Prom -  40560  40521   40                             -16.88
    
    15.00 Prom +  40623  40662   40                             -14.78
    15.01 Init +  40666  40904  239  0  2   63   96   501 0.999  50.88
    15.02 Intr +  40959  41005   47  0  2   49   74    64 0.922   4.15
    15.03 Intr +  41076  41387  312  1  0   64  105   336 0.915  34.06
    15.04 Intr +  41453  41620  168  0  0   53   56   138 0.906  12.02
    15.05 Intr +  41670  41762   93  1  0   99   59   125 0.990  15.64
    15.06 Intr +  41811  41917  107  1  2   65   76    90 0.546  10.43
    15.07 Intr +  42062  42135   74  1  2   46  107    59 0.520   6.80
    15.08 Intr +  43242  43475  234  0  0   10  -34   353 0.010  17.10
    15.09 Intr +  43746  43948  203  0  2  -45   25   233 0.039   7.63
    15.10 Intr +  45035  45166  132  0  0  -20   80   113 0.926   5.32
    15.11 Intr +  45678  45835  158  1  2  108   39   159 0.999  17.73
    15.12 Intr +  46076  46156   81  1  0   43   82    65 0.939   6.23
    15.13 Intr +  46221  46334  114  2  0   95   70   204 0.963  24.94
    15.14 Intr +  46392  46487   96  2  0  130    3    64 0.953   7.31
    15.15 Intr +  46548  46625   78  2  0   85   52    74 0.984   8.25
    15.16 Intr +  46939  47046  108  0  0   70   88    56 0.970   9.28
    15.17 Intr +  47097  47272  176  2  2   57   60   169 0.975  14.74
    15.18 Term +  47330  47441  112  2  1   51   38   107 0.826   5.03
    15.19 PlyA +  47531  47536    6                               1.05
    
    16.08 PlyA -  47607  47602    6                              -8.62
    16.07 Term -  48079  47613  467  2  2   18   41   480 0.105  36.48
    16.06 Intr -  48457  48320  138  2  0    8  -21   201 0.037   6.64
    16.05 Intr -  50592  48950 1643  2  2  -25   92  1857 0.072 167.12
    16.04 Intr -  51054  50854  201  2  0    8   71   153 0.647   9.00
    16.03 Intr -  51214  51165   50  1  2   18   98    26 0.629  -0.92
    16.02 Intr -  51952  51575  378  1  0   21   53   475 0.775  37.46
    16.01 Init -  52984  52013  972  1  0   53   88  1140 0.971 108.81
    16.00 Prom -  53106  53067   40                             -11.92
    
    17.00 Prom +  53393  53432   40                             -11.82
    17.01 Init +  53559  53783  225  2  0   41   75   345 0.706  31.97
    17.02 Intr +  53848  53979  132  0  0   92   89   264 0.992  32.74
    17.03 Intr +  54030  54314  285  2  0   48  -80   260 0.713   7.74
    17.04 Intr +  54441  54544  104  2  2  109   34    55 0.708   6.17
    17.05 Intr +  54942  55139  198  0  0   73   68   176 0.760  17.57
    17.06 Intr +  55196  55268   73  2  1   96   51    17 0.999   3.21
    17.07 Intr +  55325  55466  142  1  1   43   89   137 0.972  14.23
    17.08 Intr +  55883  56039  157  0  1   56   92   118 0.878  13.07
    17.09 Intr +  56083  56219  137  1  2   34   53    20 0.969  -1.69
    17.10 Intr +  56276  56589  314  0  2   82  105   152 0.993  17.20
    17.11 Intr +  56636  56829  194  1  2   78   78    82 0.807   9.49
    17.12 Term +  56883  56988  106  0  1   21   42    71 0.723  -1.32
    17.13 PlyA +  58157  58162    6                               1.05
    
    18.08 PlyA -  58654  58649    6                              -3.44
    18.07 Term -  59562  59410  153  0  0   -7   39   111 0.542  -0.38
    18.06 Intr -  60640  59857  784  0  1   80   71   266 0.764  20.58
    18.05 Intr -  61026  60848  179  0  2   50   76    92 0.857   7.92
    18.04 Intr -  61486  61352  135  1  0   23   75    34 0.338   1.36
    18.03 Intr -  61950  61924   27  0  0   62   76    18 0.654   1.31
    18.02 Intr -  62110  62084   27  1  0   25  108    17 0.184   0.71
    18.01 Init -  62820  62701  120  0  0   52  -13   121 0.291   3.59
    18.00 Prom -  63721  63682   40                              -7.16
    
    19.13 PlyA -  64303  64298    6                               1.05
    19.12 Term -  65486  64624  863  0  2   70   50   520 0.987  44.29
    19.11 Intr -  66109  65534  576  2  0   22   53   367 0.536  24.29
    19.10 Intr -  66210  66157   54  1  0   44   54    43 0.543   0.45
    19.09 Intr -  66328  66314   15  2  0   72  115   -11 0.587   0.52
    19.08 Intr -  66952  66386  567  2  0   60   99   706 0.923  66.65
    19.07 Intr -  67143  67000  144  1  0   60   63   139 0.990  13.85
    19.06 Intr -  67282  67193   90  2  0   44   72    79 0.957   6.87
    19.05 Intr -  68754  67665 1090  0  1   76   61  1417 0.275 132.42
    19.04 Intr -  69987  68809 1179  0  0   68  115  1181 0.401 112.37
    19.03 Intr -  70319  70227   93  2  0   12  115    64 0.652   6.66
    19.02 Intr -  70517  70331  187  1  1   79   93   205 0.999  24.79
    19.01 Init -  70659  70655    5  0  2   85  -20     0 0.241  -6.63
    19.00 Prom -  70834  70795   40                              -4.36
    
    20.00 Prom +  70886  70925   40                             -13.24
    20.01 Init +  71436  71677  242  2  2   42   74    58 0.265   2.15
    20.02 Intr +  72578  72766  189  2  0   79  -26   188 0.235  10.20
    20.03 Intr +  73155  76210 3056  0  2   95  100  1632 0.385 157.47
    20.04 Intr +  77042  77175  134  0  2   75   22   112 0.797   8.66
    20.05 Term +  77641  77796  156  0  0   33   48    23 0.141  -4.27
    20.06 PlyA +  77824  77829    6                               1.05
    
    21.00 Prom +  77996  78035   40                              -4.96
    21.01 Init +  78170  78439  270  1  0   46   44   185 0.430  11.87
    21.02 Intr +  79273  79474  202  0  1   54   92   121 0.941  12.96
    21.03 Intr +  79530  79729  200  1  2   97   27   145 0.936  13.37
    21.04 Intr +  79954  80190  237  0  0   74   -6   118 0.572   3.81
    21.05 Intr +  80229  80310   82  2  1  -12   77    67 0.618  -0.19
    21.06 Intr +  80544  80645  102  1  0   53  115    12 0.901   5.55
    21.07 Intr +  81524  81744  221  0  2   59   80   104 0.370   9.72
    21.08 Term +  82290  82640  351  2  0  109   44   118 0.951   8.99
    21.09 PlyA +  82642  82647    6                               1.05
    
    22.05 PlyA -  82848  82843    6                              -0.45
    22.04 Term -  83436  83219  218  1  2   22   37    85 0.473  -0.89
    22.03 Intr -  83995  83713  283  1  1   -4   30   325 0.557  19.49
    22.02 Intr -  84375  84065  311  1  2  -30   25   280 0.910  10.93
    22.01 Init -  84489  84435   55  0  1   72   63   122 0.654  14.55
    22.00 Prom -  85679  85640   40                              -9.65
    
    23.02 PlyA -  85778  85773    6                               1.05
    23.01 Sngl -  86397  86026  372  0  0   51   48   237 0.428  17.13
    23.00 Prom -  88662  88623   40                              -7.66
    
    24.02 PlyA -  88839  88834    6                              -1.75
    24.01 Sngl -  89791  89345  447  1  0   57   42   509 0.650  42.43
    24.00 Prom -  89954  89915   40                             -11.82
    
    25.11 PlyA -  91579  91574    6                               1.05
    25.10 Term -  93029  92828  202  1  1   61   42   119 0.764   6.46
    25.09 Intr -  93439  93090  350  1  2  114   64   295 0.987  28.85
    25.08 Intr -  93925  93491  435  1  0   53   93   185 0.573  14.28
    25.07 Intr -  94095  93980  116  1  2  103   92    47 0.748  11.77
    25.06 Intr -  94357  94157  201  2  0   84   61   161 0.999  17.26
    25.05 Intr -  94616  94422  195  0  0  -21   25   136 0.632   0.79
    25.04 Intr -  95061  94825  237  1  0   57   69   238 0.740  21.39
    25.03 Intr -  95311  95113  199  1  1   57   88    92 0.990  10.02
    25.02 Intr -  95570  95360  211  1  1   80   94    88 0.944  12.52
    25.01 Init -  96081  95618  464  0  2   73   66   292 0.776  25.67

    Click here to view a PDF image of the predicted gene(s)

    Click here for a PostScript image of the predicted gene(s)


    Predicted peptide sequence(s):
    >Smo8|GENSCAN_predicted_peptide_1|1795_aa
    MESLIGLVNKIQRACTVLGDYGGDHSMPTLWESLPSVAVVGGQQSELGQIICAGKRRRPR
    LSSQRISSLAALVRKEIEDETDRMTGHTKQISPVPIHLSIYSPNEGQPDSIVADIENMVR
    SYVEKVLEGRAYRLQFQWVGVVNRSQADINKSVDMIAARKKEREFFASSPDYGHLANRMG
    SEYLAKMLSKHLETVIKTRLPSILALINKSIDELEQELNQLGRPISHDAGVWFHRVCSCA
    QLYTILELCRAFDHVFKAHLDGGRPGGERIYVVFDNQLPAALKKLPVDKHLSMQNVRKIV
    TEADGYQPHLIAPEQGYRRLIEGTLGLFRGPAEAVVDAVHSVLKELVRKAIAETQELKRF
    PTLQAELAAATTEALERFRDESRKFVLRLVDMEASYLTVEYFRKLPPELEKGGNPSAPTA
    DRYTEAHLRKIGSHVTSYIMIVCETLRHSIPKAVVHCQVREAKRTLLDTFYTQVGKKEEK
    QLLQMLDEDPALMERRVALAKRLELYKNARDEIDAVIKSRNPEARRSLTLRYPKHLEQQQ
    QQYNRSGHLGVRSQQHNGVLKETNLHLGTRAGGAVVFGTETPTPGAGLAVLVVVTGESRG
    LGVEDILCDAPPATDTLTGPAKLAGDTKNPPGPLARAPGGGGGGGGGPAMKFIPPPLANL
    QSSVAGAGARFDPTRRTGPAFGGARWRERMERHGERYRERHRGGGGGPWRIRGRLSVVVE
    LFDSFVDSSWNGRLLEILGGNSMELLLLLLLDLVLVEVIDRGSLRRRRQNDGIAWERRQG
    DSYRVSEEEIAAFHRDGYVHLQGVLAEDEIAQLEREFMAFLRREIRVEGRDFCDMSADYS
    KPVESFSIINIMLPSKYNPSLKGNIYQRRAASISQQLLGDGMVFDYDQLLAKPPNKPDAV
    FKWHQDLAYWPVTKDTRTASFWLAIDDSTIQNGCLQFVPGTHLEQQLRDHGPAHGDREKS
    HTLFATLAPADAPKAMEIRRGDVTVHHERLLHGSSGNVSSDSWRRAWVIAFRSKETVEEE
    RRIGFTHSHNDKLEVFQNGRDARISDERDEPSIARGAGARLMRSWSLELNPRRAAVTGIV
    TRKVSSMKRAREDSTSGPQLKRTAGESSEQPRLTTEDAMLYLKAVKEKFKDDNGKYAEFL
    EVMKDFKAQRIDTSGVIAKVKDLFKGHNKLILGFNTFLPKNYQIVLPEEKKQPVEFDQAI
    NFVNKIKNRFNDNEHVYKAFLEILNKYRKGTKSINEVYDEVASLFRDHPDLLDEFTRFLP
    DTGNAVQTSFRQGTSNQRKEEKGPSGRSSQYPVIKKETERSSLKAEKEHRRKLERERDRN
    DEHERDRDKEDLDRDDLDGQRLPHKRKSARRADELIRKQSQTAEGTSTQIDYAFFEKVKG
    RLRNRDSYKELIKIINLYTEQIINRGELHSFATDILGKHPDLLEGFNNFLQGENAEGCLG
    GVFARRLESEGVASKGRDKDHDREKDWEKDRDKDKERYASDKTGQKMSLLPSKDKFTNKP
    ISELDLSNCETCTPSYRLLPKNYPRLPSNHRNELANSVLNDSWVSVTSGSEDSSFKHMRR
    NQYEESLFRCEDDRFELDMLLESTALTAKRVGELVEKLDSGQLDPSIRIDDYLSAINLRC
    IERIYGDHGLDIIDLMRKNASSVLAVVHCRLRQKEEEWAKCRADMNKVWAEVYTKNYHKS
    LDHRSFYFKQQDKKSLSSKGLLAEIKDVHEKKRKEDESVLHLIMGNKRLPTPDMKFGYPD
    SSIHEDLFQIMKYSADEVCNTMEQSDKIMRVWTMSLELLFGVPPRPRGTDDTEEA
    >Smo8|GENSCAN_predicted_peptide_2|333_aa
    MASQEIAREFPDGVFFVDRSFARKSMPKFLRVPANPSASPIASRDAVIDEEHGIWARIFL
    PTDQAQGKGEGDSSKLPVVLFFHGGGFVTLSADFCVFHVLCSSIAEKLGALVIGVNYRLA
    PENRLPAAYEDGFAALKWLADEQGGRRDPWLASHADLSKILVMGDSAGGNLAHHVTVRAA
    VEDLGEMRIMGQVLIQPFFGGIARFPSETKPQPPNSTLTTDLSDQLWELALPIGASRDHP
    YCHVVAPDLKAQLREIEALPKALVVAGSEDVLCDRVVEFAEVMRECGKDLELLVVENAGH
    AFYIVPESEKTAQLLEKISAFVHGLIPKINSKV>Smo8|GENSCAN_predicted_peptide_3|311_aa
    MEFPDGVFFADRSFARRSMPKSLCVEANPGAHPIASRDVIIDEERGLWARIFLPADQVIH
    HSRQVPVAFYFHGGGFVCFTADTMEYHVLCELLAKKMGAIVISVNYRLAPENRLPAAYHD
    GFAALKWLAQEQGGRKDPWLAAHADLSKTLLVGDSSGANLVHHVLPMLAAAEDPAMSDIQ
    VVGTVLIQPFFGGVARVPSETKHRSPTPLISTDMCDRFWELALPIGADRDHPYCRVAAPD
    HPLPKTLIVAGGEDVLCDRAKEFMETMGGSSKDLELLVIENAAHAFYIALESQETAHFLDKVATFAQGIFA
    >Smo8|GENSCAN_predicted_peptide_4|324_aa
    MAENVARDLVLPGATLHADGSFSRQAIRHPVSARSDSSSSIVSRDVTIDDGLGLWARIFL
    PKRLKGECVDPNALKSPVLMYFHGGGFVAMSASFFGFHDFCEEISRWLGVLVVSVEYRLA
    PENRLPVAYEDGFAALKWLGQDQGGLSDPWLAAHADLSSVFLVGDSSGANLAQHLSVRAA
    APASWGDLGPVRIVGRVLIQPTFASVARKPSGMLRDDPSKVSPSTLMMDRFWELALPIGA
    SRDHPFCNIAVARGDLAGILLPRTLVVVGGLDVLRDHGVEYSGILRECGKNVKLVEFESC
    DHAFYLNGSTESTSKLMDSSFLHD>Smo8|GENSCAN_predicted_peptide_5|900_aa
    MEFHASFLVLLAAGSGIFGGFLGASSACVFPAIFNFGDSTSDTGGIQTAFPTFSQSEFPP
    YGMTFPGRPFLRYSDGRLGIDFITEALGIPYLSSFFQAVGSNFTTGVNFATAGATSQAVT
    YISPFSLNVQLNQFREFKQKVLVTGKDMNPRIYSIPFSPRLIATLPAHTFLINRNSLNAL
    PARDAFSRALYIVDIGGNDFSYGYNRNMNFDQLKAYIFRAVDGIIALVKGVYAEGGRTFL
    VSDVGPQGCIPYFLTNFPNLRVSYDQAGCAIEFNQVTQHYNGLLKQALSSLRSQLPGSTI
    IYTNTYDIKYSLALKAASNGFQFATKACCGIGGNYNYNFAVQCGESKVMAGKTVASTTSP
    IKHYRLEGNAQWNEKRLPDDTAKTKGRNAILVNGCQTLCTSNPVSSDSQSTRQSTKSTLF
    NNGQQRYKASDFVQDKDGFLAASVPHESPGPQSREISRQFQRRVAKWMVAVGRALGERRQ
    PAKLGGVEREHRFGGPEPELRVGPGRASIKGLEEDLAARNATRAARFPPVAHDEDLGEIR
    VSESVLIAAALGDLSDPVENVVAIVIGRREPVLLRLDAQLFRTENSHPNSPILVENHVAA
    RHAVLAGTSAKNPAGPVGGGARTQDPSSTNTKPPICLRSSSPILSVLVLDNVQWQYSALY
    GVVDVALCRLEHRIQRVWSNACNQANVSEYAATKAIVRERPSYLSDDNLCYYANKRRPTS
    IEAVQEELQQQQQQEMPLLSSPLIKYIRAKPSIPREIRGRGPGSRKFRQELVSWVERDLF
    GGVGSLDMQARLVFDERMLAHEWQALFGFRYCKYEQPVDKDLSTPDVVFDETCGSFDHEK
    YDELVEHLEAGKIIAVCGGFNKLSAWPGNAGGVEELIKWYGGLRFKVNPKRRRAYDRNTE
    >Smo8|GENSCAN_predicted_peptide_6|209_aa
    MARNTLTGYVLAWGEGVFYVALLSRTLISLDVQSGRESGRCTSGRAKAASLRSSAIVIRH
    RLNAYIARRSNAPSRCSNSRENARETAEEEAEDVAPKQRGRGAFTYGRERDGFHSDHTTA
    MDDETHEELTTSSDDIDGKSGKFVSLWMIKCGVVIRWTWSLRHPPKLLNAYDRRDRAQNK
    VRFTVQEKKEHKQRLLAREKLREETWGGD>Smo8|GENSCAN_predicted_peptide_7|328_aa
    MGDEERKSSGKQIEGFVFAEDGSYVRTPPPTGPAGFFEEVPANPSFIDGVASRDVILDKD
    RGLWVRVFRPEELENRSTLPIVIFYHGGGFIYLSAANAIVHRFCEALSRKLGAIVVSVNY
    RLAPEHRLPAAYDDGYDALKWVRGIAKSSSDQDAFAHADFSKIFVMGDSAGGNLAARVAL
    RAAQDGIPLAGQILLQPFYGGTSRTESELKLGSSNPMITLDTTDFCWLATLPEGAADRDH
    PFCNPTLEFPGDLARLGAGELPRALVVVGGKDLLYDRQVEFARILEDAGNAVKLIDYENA
    SHGFYAVGDASCQEYVLVLDEIASFLRE>Smo8|GENSCAN_predicted_peptide_8|796_aa
    MAAQLRRKLWDESVILGQVLDTLGHRQWQYSAFYSAVEFALCRLEHRIQRVWSNACKQAN
    VREYAATNAIVRERLSYLSDDELCYYGIWSDDLTYTNVRLCVDSYKRRPTSIEAVQEELQ
    QQQQQQEMPLLSPPLIKYIRAKPSIPRDIRGRGPGLQKFRQELVSWVERDLFGGVGSLEM
    QARLVFDARVLAHEWEELFGFRCCEYELLVDKNLSTPDVVFNENCGPFDQEKYDKLVVQL
    EAGKMIAVCCGFNKLSTWPGNACVDLFNFQVGGVEELIKRYGELRFKETMFFSECMLLDG
    SRPSDIVGGCLDKTSVVDHSVQHEKDHLYAMVDGRLSELMKTQWLAACQKRSYGHLYVNR
    AEQGHLTPEKLTQKALLRKKRSLRATPVVEELQQQQQQQQQRRQEMPLLSLPLIKYIRAK
    PSIHCEIRGRGPGSRKFRQELVFWVELDLFGGVGSLEMQAWLVFDARVLAHEWEELFSFS
    CRRKGKRWSPKNGRRGAQSSGNQIGGFVFAEDGSPPTGPAWFFAEVPANPASIDGVASRD
    VILDKDRGLWVRVFRLEELENRTLPIVIFYHGGGFVYMSAANAIFHRFCEALSRKLGAIV
    VSVNYRLAPEHRLPAAYDDGYDALNWVREIAKSSSDQDAFAHADFSKIFVMGDSAGGNLA
    ARDRVGAQAGSSDPMITLRITDFCWLAALPEGAVDRDHPSCNMTLELPGDLARLGARGLA
    RALVVVGGKDVLHDHQVEFAKILEDAGNAVKLIEYENASHGFYLVGTIAARNPSLSWTKSLASCESDPWLAAIDPR
    >Smo8|GENSCAN_predicted_peptide_9|266_aa
    MDDETHEELTTSSDDIDGKSAKFTAESPFSDRPLFVQTWSLRHPPKLLNAYDRRDRAQNK
    VRFTVQEKEERKQRLLAREKLREETCGGDIPEKNIVSCVNVDQALSLCPSVKLRNQDPGL
    LELDAWSLGYFRMLSNTSVSPENLQIFSTSKCLKCVNLAGKRSKACQKSRRPRDFPEDSS
    LATKVSSIDYFIVSSRSFSLASDRWLCSSFSWTVNLLVGGCQRLQVYKEGAVVALMQGCR
    ERLSPAKKTLKAPVSFFFVPAKTVPP>Smo8|GENSCAN_predicted_peptide_10|100_aa
    MVTRKATRLEFSGVGRIRARDRRIPVCQQGNYNYNFEVQCGQSKVMAGKMVASTTCKNPL
    NWDGVHYNEAASWIITRQILSGSFFEPSFPLGMLCTLKNI
    >Smo8|GENSCAN_predicted_peptide_11|328_aa
    MGDEERKSSGKQIGGFVFAEDGSYVRTPPPTGPAGFFEEVPANPSFIDGVASRDVILDKD
    RGLWVRVFRPEELENRSTLPIVIFYHGGGFIYMSAANAIFHRFCEALSRKLGAIVVSVNY
    RLAPEHRLPAAYDDGYDALKWVRGIAKSSSDQDAFAHADFSKIFVMGDSAGGNLAARVAL
    RAAQDGIPLAGQILLQPFYGGTSRTESELRLGSSNPMITLDSSDFCWLATLPEGAADRDH
    PFCNPTLELPGDLARLGARGLARALVVVGGKDLLHDRQVEFAKILEDAGNTVKLIEYENA
    SHGFYAVGDASCQESVLVLDEIASFLRE>Smo8|GENSCAN_predicted_peptide_12|384_aa
    MQVRLVFDARMLAHEWQALLGFGCLSYESLVDKDPSTPDVVFDEHYGPFVQEKYDELVKD
    LEAEKMIAACGGFNKLSAWLGNACVDLFNFQVADVEELIKRYRLQGWRRSCSSKSSWDGD
    LGSSSRQFWQYDGFYGVADLALRRLDHRIQQVCRSGVSEYAAINAIVRERPSYLSHDVLR
    RYLERRPHFVNVRLCVDSYKRRPTSIEAVQEEARTAGLFSPPLIKYIRAKPSIPRQIRGR
    GPGSRKFQQEVVSWVERDLFGGAGSLEMQARLVFDVRMLTHEWQALLGFGCFSYESLVDK
    DPSTPDVVFDKHYGPFVQEKYDELVEDLEAGKMITACGGFNKLSALLGNACVDLFNFQVG
    ELIKRYGELRFKVNLKRRRAHDCD>Smo8|GENSCAN_predicted_peptide_13|397_aa
    MEHQEDSKKRKIKEESDLHDAPMEGYEAPVPLRLSRDDLKRLLEPLSKEQLVALLTDASY
    RYSTFVLRFGSEFRKERAAMDIGGVFVFDLLLILLFFILDFSGWIDWECLSLGSQYSAVA
    DDIREVASKDPAHRKLFVRGLAWETTSQDLRDAFEQFGEIEEGAVIIDKATGKSRGFGFI
    TFKHMDSAQRALKEPSKTIDGRITVCNLASVGTSGSGGTNDQAQRKLYIGGLSYETSNET
    LLNIFSKYGEIEEGAVAYDKNTNKSRGFAFVTYKTVEAARNAIDDPNKTIEGRHVIVKLA
    AEGQKEKAPQVSAPSQGPQAQPGYNVVNPNIPAYARPPPPGTILGFSPHTAVPAYSAAYA
    GIATMASQYTPQYAGAQYGIPAPQPGQNGTGGYYAPN
    >Smo8|GENSCAN_predicted_peptide_14|936_aa
    MARDLERGRSIHSQVVDSGAAANPFVASSLVSMYAKCGSVREARRIFDAMPCHSVVSWTA
    LIQGYVENKEELLALDLFAAMARAGGDCQPNGRTFVAVLKACSRLAAKEAMQALGDELVK
    IKALESVVALHAVIDQTGCGVDIFVANALLDAYARCGSLVDASKVFSGMAYKDVVSWTSL
    ILGFAENSQGEMALQLFELMQRQGCSPNPRTFLAAVKAIAALAAEERENPLELDDGMSKV
    VALEKVMALHSLCVKSGCQDRYVTSSLVDLYAKCGSLDDSRVVFERARFFDDVVLWTALI
    LAHADSSGNEEQALEIYVRMRSHGCAPDARTYVALLKVCGKAVASRLAKVIHGEICRHGL
    EHGELVANALVDVYGKCGSAHNAEKVFLSLLSPDAVAHTSLMAGLCRQGKDGSTKIFQIF
    HDMLDNGLSIDGVTLLCLLTASSHAGLVTQGKQIFHGMQRRYGVSPEIEHYHAVIDLLGR
    TNHLDEAMNFARTMPFKPTAVTWTTILTWCSKWKNPSLGRLSFEKCLEIDTVAANVVALV
    MGTARERGLSESLAKNTREAERLIARGKLYSQVHRRRIQSLTTAPSHESPPDGRAPSCRF
    RRSSAEVPRFISAFVVPQIEPVDRSRAMPPVVLDNISSCRSSPIHWNPTPVPAPAFDGFV
    PDLAVPEDLSWEHCIRILNNSGRLLPRSSGYPVFGSQRYKRSGIARNLASSISRATSPMA
    SEFSTISPLFDGQPHLKRSATARQLNACSVSRATSPLPSEFSDFPTTTKDFEQHSKDFDQ
    RSLDIDLASSPGKNSGPGIMKKIFGQLVGAGKNRLSRSSSLSSSARRSAKKSGPKSPPPR
    LSCVGLTGDGDPRISSANLGADDEISIDCEQELGSRNMRRWLQYQQWMAMVARAENNLSE
    LSSMLQYAVKSMADSREMDFFVEEMQQRQSSIYYAS>Smo8|GENSCAN_predicted_peptide_15|843_aa
    MVKNWSERVEELLDEGNSDGAAQFLQGVVDEMQSLQGADANLSLAAAMEDLGALYERQGL
    SIKADSLRSSAIVIRHKLNARSDAPSIGKEDDEEEEWEKAVHDIPGSSSNPPPSRRAPSP
    PRFSQLSLSDPTPKRRSNSRENAREAEEEAEDVAPKQRGRGAFTYGRERDGFHSDHTTAM
    DDETHEELTTSSDDIDGKSLVYGASHVLVLGGFPPKTTTRDLEQVVMPYASRGVVIRWVN
    DTTALAVFRNPTIGLSREALATIRHPKYVLKTYDDSTEVHGLVTERDLEPPTTRPATSAR
    AAQRMIVGALTNQGIVRGAQTRPAESRRNRPQVFSTSNAANNNADLWLDYRIQHVFGHAC
    EQGKVSGISRITMEAIVHEKPTSYEEQYEGIWDNDLTYTNVRLCVNGYKRAPTTIEAVQE
    ELQQQLLRMDTDYSLDVEFNDGMSQEDFDQETYDELVEDLEAAKMVAQNGGFDKIWAWPR
    NTCIDLFNFQMGDEVNWQIRKEKFVAYKIIKWYNELKFKPNRNDEYNNNIEDNFKLDAKI
    QLHSSQVVMGHMASATKRLEDEKFCDEPVKYKRIARLKFQLSELQIQKDPRCSSQKSLAR
    SCFCAASDESTSTLKNDFRLRTSRQEDDSDVLIECRDVYKSFGEKHILKGASFKVRHGEA
    VGIIGPSGTGKSTILKIMAGLLVPDKGEVFIRGVRRDGLISDQDVSGLRIGLGVEERLPA
    ELSGGMKKRVALARAIISDEREETNEPEVVMYDEPTAGLDPIASTVVEDLIRSVHIKGRD
    ARGKPGQITSYVVVTHQHSTIRRAVDRLLFLHDGKIVWEGETSKFGNTSNPIVRQVRVDYFGR
    >Smo8|GENSCAN_predicted_peptide_16|1282_aa
    MACGEDDLPDDVLARILSLIVSSRHLYWCSLVSKRWARLAKLVTHFSLEDEQYGFLSWFD
    SSASNLRSLSVHSAPRSSLGWLPAIGKTLHSLTISVDVDGFWSSIVACQQLRCLHIINDV
    KLGGAITTTTAAAPMLPSLIYCVMLTGLDVGTMQQLLALCPSLVYIQLMLLLVESPGTYL
    LKSHSLDCLALRGKSVANFSLALDMPKLRWMFVSAVPRLELRLELLPSTCNVEWIKPHTG
    VRIIQGLRMSKLNKLTFTNTCQSAADVTQILKRFVCQPSAVSGITRLDLFVKSSIFDPGS
    LNLAQLLEPFPALEVFTLSHSSIKCLSLDKWSSREAPLCVRIEVACGIAGWARDEISFVR
    ELVESSSCVTRLDVSGVVYATKKPPKFRTQIPRAIVKLGEEFSGRVKVHPIRFVPRRCDV
    IWRCMKAKFDSEAPYTAIHHRCTYFDDTTEASINRLLKHKICEAAPSYGVKFNDQAAAVK
    LAICNEIGQESYQVTGGACREIKALANSVCSSAITSDKTMQETSRYRIDAEDCSAQALKE
    QDYFAHLLKNAKNTLRIRQLHAQILHTSHRGNTFLRNLVVASYAQCGSVGSARQAFDSIP
    CRNLFSWNILVAAFAQSGHLEEARSIHGLAPARDPVATNAILAAYAQCGRAIDAKETFDS
    MPHRNIVSWNTLIQANTQIGHLQFAKAVFQRMPQYNVVSWNGIITGYCQIAQSTNAKILF
    DIMPERDITSWSPMIHAYAQCGHFEEAKELYGKMPSHDVVSASAMITAYGLTSSWEEAKF
    IFENSREKNVVLWTATMVALGQSGDPHGAAEFFRKMPEHGLVPSTAIVTAFAQNGRSADA
    KEVFDLMPRRNIVSWNTMIVAAAAAAAATPSDAREVMSRMPQHSVVSWTAMIVALSQHKL
    LSEARAVFDSMPEKNIVSWTALLQAYALGMDVDRALSLFQEMPERNLVSWNSMLGACAQS
    GEFQLAIETFHCMLLDGSRPNEISFTIVLGACSHGGCLDKSRDYFTSMVIDHGVQHEKEH
    YYAMVDVLGRAGRLSKAEELIKTMPFVPASAQYEVLLGASTVQRNAQVGGRAAEGLIALD
    PEDPTPRKLLEEPMFTFENSRGKRRVVDGDHGGSRAGRSRQDARARPAYALGMDVDRAVS
    LFLEMPERPVLGPVLTREFQLAIETFHCMLLDGSRPNEISIAIVHGACSHGGCLDKSRDY
    FTSMVVDHGVQHKKKHSMRFGRAGRLSKAEEMIKTMPFSALSVRSRLPSNVMVSAVCWHS
    AVEAFKATPFFPFLRNHLPLLR>Smo8|GENSCAN_predicted_peptide_17|688_aa
    MTGSSVVFSAAAAAEGANAIATGELKCLFEEAFFKCSKNDLGHYDKAFEFIIENGGIDSE
    GFGLNFRNKTCFFLERDFTIDGYEHVLPNNEEALKKAVAHQPVSVMIDAGCPAFKFYKSG
    ILTSSCGTDLNHAVTIVGYGTTSDGKKYWIVKNSWGTEWGDDGYVYMQRDTGVSTGLCGI
    NMNPSYPTKQGFPKIQDEGLSTSNAANINADLWPLLGHFGLCSANDTFVYESSDYHWVDV
    AGLSVGGSKRMEAREVEEDAHIGAMLRAMASSRKIFSLSSGISRADFFRRVAALSKGLRG
    LVQYGERVAIAALTSEYYLEWMLAVPCAGGIVAPLNYRWSFDEANAAIQLVRPAILVLDE
    QCRHWGEPLSELYPWMMQVFLEEDCESGISSVLAMVMAGSQQLFLPKFETSAVRQALKDY
    NISTMIVVPTILRDILEPGIFRKSGSDQVFPSMLTILNGGGSVPSRLLPAVKKTFPNARL
    FSAYGMTEACSSMTFLRLDGHVPLGVGSCVGKPPPHVQVGIKDGRIFTRGPHVMHGYWGQ
    TIETAGVLQQDGWLDTGDIGKIDKAGNLWLLGRAKDVVKTGGENVYASEVEMVLSQHPSV
    SSVAVVGVPESRLSEIVAAVVRLHDGWTWSISEHPMTVSESDLRLHCSKQGLSRYKIPKL
    IVQRQDPFPVTSTGKVRKDMIKAGLSKL>Smo8|GENSCAN_predicted_peptide_18|474_aa
    MQGVKDKVVILFNDESEEDELPDIASHTNKCIPSGPVFEADYVVDTLLQQNKELDVRRLL
    NNEPGRNPTFMVSWQMLLVLTGWLYNKYSQCDTYKVIKVNHYKDQPLDDNTLTPTINLSK
    SKHWSWRASKAKKGKVSLYEDSFSDLTKSLEQSSKALAAQEASSHCPEFDYHSISRHQKR
    TKEHGVEEVPIVTQVNPMPEVEKIIAHKLVSRINWAIRRVNAKSNVHCSGMIGSRGQCVA
    FVRSSSSSSAVPYFWGCQDLGVKGCQSSWIYCCGTNVTHTNRIHNSIISKPDSVLEEWPV
    EQSYNLLKEEIESLERVGFKLKTSSNVAIDTAMPPMSTLDWHTRLGRNGKAVQYRCSISK
    GAQKKIKLTKVLKCTITTINYISAVAKGFVVVIESSKGMSYTIRICENPSCTCPDFTQRE
    NKGKAPIMREDLKVMGLPEWMIVQAIAPIGKGHNKNIKTLEDYFQCCKSNTKKA
    >Smo8|GENSCAN_predicted_peptide_19|1620_aa
    MTFLVCIGVHSRLHHTRARIALVCIGVHFKTHPVCNFAANSAAHSATHTQPTWRPGRFQH
    HSPTSSLAATIRCTHSAHSPASPASIVTASIRLQMARNCVLLPATVNVDAEKGEELTSWT
    FGMEGEDLMLYNTDGGRKECMGILIQVTDTPCSSMQEYGLALVVETASAFKIHLAHPSGF
    PEGGSGYEEFLEGARERLSMRPTISKTLRVYEDHMESTDLTYSDVQKLYSNIEPLRVPPV
    CEAKGKGVDLETTPKQKRVVKLKPTSDSAAPKKKLLGDAIASGSNTTPTKKKSRKEQKEQ
    EEAAKIVADNSTCYYSNKVYTIDVSRIEIQSKYNHRLVSDEWVEKLKQKMLASPVTPDIA
    DVIPYYRTMKGETRFPDLINRKVLDDQNIRFYAVSGQHSSLAMQQIVVDIMVGEEVKKVF
    KTRVVRLIKGNSPLRSIIYISHAQNVKQETCYKSGLHETLARTRKLWVQLKKPQKPGQGC
    ALATSAWQEFSSLLSKTLGVNDTLSIHNILLATKVVFDKWVDVLRRYHDGELLNSMGVED
    GRKPNDLKLNALRFSRSLDDDDLLIVANALLRGEVVFNKTKMTPGHIVTYEDLCTTLRAR
    ALLKDWIFEQVAVARVSIDTTQEMMEELDLSNEDIDEYVKGAPKFIDLLRDATKSKAEKL
    DQYPNFIKRDVDRRLQKQLDPAAEAYREPYRVYVNSDEDQRWRSIVSRGDEQVKNIFCVM
    DFTNEEITHADEVSSIIDLMVGLAEVESSVHYMIYDSESYHKFWKALNEVKARLKGKEMR
    EIHGIMAKKKIPTNVRIFTVALVIGPAAEWDWSDFPASFNKADRIDLREVNLPVPEELKT
    AMELSNPKKAPDRCEYSDTRTEKRTDGASKKDKQTSNRKKGGDTSSKAKQASTSQAQTDN
    GHKKRKHVGQSSGSEAFDSKRKKGNEKMIDQGLSERSQSPPSDSDSYGGLPFPNLNGEVQ
    WMRDAFQVRATFFSSRRLRRVQEHAETMLRQSGEDTVLGHALKQAMDVLYFEMETRPDDF
    PKMVLEKQAIQPEQPMEEMQEGDEEPEKDNEQEEQKDEEEDKEEEDEEEEDEEEEDEEDL
    DGPHNGDDHNNDDDDDDDDGQPEVAKEQRRKVIQDKPSGRRRNESTVRPSDIGPSSSIAV
    DVAREDTISIEGRYLDMHMDPPVDSTCLQESSELVEATIHINAKQADHAPLDAAHTTESI
    LSTYPTDGDQAPLDPHKAAPAAEPVVSSFPTCAQSSDYAPLEPPYTAPAACSVMSTLPTE
    APAEMACDHTPSEPPHATVEPVVPESTAEMACAHTPSDAIDPTLQRRVFPPFTFPRPKPS
    ERPSGAPFANIVDESRRKYCIVFDLNGVLFSHERGSSSNVTAPSKACGRKLVKRDRFTLI
    TREKLRDLIMTIRHLKQRAGFNLQLGVWTSMMRHNALTLLQMMHDQTRVILPFDFLLTQE
    DCLTMVHVRGSKPIFVKAEAVVMNRTARSSEILIIDDSLIKTAVNTHTYAVHPISCSMET
    REKKPYVLGDLAKRLEAMVIENRPVQEVAAEINQMADGTVYQKPFDRTDTSHHLWHVYCL
    AKHYPHLHNLLNRMGSSTTEHERGRIREEMALLLGLNPDSVESDEIPRLLYRRWRQKITV
    >Smo8|GENSCAN_predicted_peptide_20|1258_aa
    MVYNVFLDDPAKKHLDRTPRETTRALQCFRGICHRSDRYFAPRFYSLVCIDLSIETAKGH
    HNQMRDHKKLTSLTLVSPRSTSIIFVRCVLGSAERSERCLCKRRCGLWFMRLRAVAALRA
    AVYASGSLCERRYEQLRCCERRYEMAFRVGEKVYILNGGKRVAKGTLVSMDRNFVIHCQR
    LGEGNAAVSVKTVFDGNVVAIERGEMVVGVNVVVALKNLQKKADVGDSASATHVPSVDLK
    NRATWIQAKVFLNNPEGEVVAEGLATVVDPRDAVANQELGWDDIGVAILEVVDTEENSLT
    IGSIVRWKLRHLVFENVKDSQSNTTTTDDRTSLHGKRKYTFLKRSQRDLSNEPKRQKMAM
    EVRMVSTVACCKRRCCQHADWDAIEEERARYKTLSWKAARNFVYEALRSSVDASEYQVGG
    MVFQRGLVCMKAWRIIYGVPRTTFYRIKTEFEEQRVQQAVHGNDGTHKTRGHVVYEEAIF
    KRFVETYSEQQPHKTRLLEDGTRDTQRVLLSMYTKGAILEDVNRELQSNGYRKLSQSRFS
    RLWSNRFRDVSIGKYSSFSKCDECTAIKMAFSKNSCSAQERETLVRRRLDHNIEQRSCRT
    AYYHNRILSMERPDSYLCIIHDKMDQKKTCLPRLNPAPKAISSSMQLPVSLFGIIAHGHG
    AQNYGNFFVPLWGGGGANMMIASLAKHLRSLQSAQTELNGQPLHKALVDRTPYRVQLEEE
    QRLAVSHGAPCHPSEYVGRQIVGLSREASCHSDEAGRSVNAQPHPNKAESQGSSCGRCGL
    PPHLLLQTDNAGSENKNMYVFAYLAMLVAKGIFETVTLGFLMVGHTHEDVDAMFSHLAES
    LRRSNVLTLPQLYDEFQNCIKGTVEAALLTEVPDFKGFVRGYIRDGSETLIGHSKPLQFH
    FSMCDGAPVMRYKMREWDTVWLPVDGIELFKRDPITGELMIPQGSPWPMHPDGLKDQESI
    VARIRQHIAFWKRGLVFGGESYFQRCKPLLEYWEKIISAIQNGPVVYEELEESFWPEPRA
    PDSSGTMSSGNAVTDNDVEDHYCGPRNKKPDSAFNPVQVCKGMVVLVRPSNKKQMIWVGR
    CITNAEEDAETNSFNVTVEWFKPSSGYTNDCLEKRWVLNPRDPSMSISVDNVVHGWMRKQ
    SATITLPDHARDAARAFITRVQGDHFSEQTATLTTIENAPVTSETTIQSKTTDPEKDDST
    AENTKKELQIRLVHLDNEIYYTTRWTQSGHKVTALVTMPQICLAPGQKTYNFTSFSNT
    >Smo8|GENSCAN_predicted_peptide_21|554_aa
    MVDPDLGVSMQELDGSREHGNVEASYSTREVAGGSAIGGGVELGGRNEAPSAIGGGVDLG
    GGNGRHAIAPEGHRNEVVTMLEADTILNEQPIVKVEQNSYLKIMQVMDEVQKNLELQQIE
    QCILNLKKSKVKKLLAIADIRRNKDWLDCIAKICSHGRNGKALEELIYTIAKSKLSFNKQ
    TSCLELLNTWHHNNDKRLESKLKNVMKEKLYITQSIAAAKELLQEPEVQGLKPLNDRTVP
    NSQAWNQGVSEPNHGETPEHKAQGVMGQEPGATWNSPGATSIPMPGPARKTEPNGVWCYF
    GIPHGMELGLRPKPDAEQRQARKRELDYASDMPNEQIWTLSINSIWDRNETNTKGMIARV
    FGRHAPIAQNVRRQLHSDDVQESKNLISNDLAYCNLVVDVGETSQHAVCRCTNAFFPSLP
    TAIEDDLQELSSSASYKQVLVWNYHALCHYSCHHNITRADDRASLALLGYDSSTDIIPNV
    INEDHHIKFPGYRHRSPKAMQILLEFFYPSKGIVLDTITGFGLMVLAAKETRHAVFALEKESAFESVLGTFHNR
    >Smo8|GENSCAN_predicted_peptide_22|288_aa
    MMEAIVQEKPMSYEEHYEEGTNNDRSNLGRAPAAAEQQQLQEMPLFSGRLIKYMRLGMIP
    REIRGCGLGSRKFRQKMVSWVERDLFSSARSLEVQALLVFDVRMLAHEWELVLRTFDKSM
    LLDFDQETYDEYVEDLEAAKMVAQNGGFNKMWAWPGNACVDLFNFQVGGVEEIIKRYGEL
    KFKPDINDEYNNNVEDNFRLSEHPNRQRVKYKMTKRGHLQQMLEVMGGPEEKPVWLWNAM
    IQMLMMFGNFNEAMEMFRMIPKHRDEVSWNSLVTAFALKGDFEASSSL
    >Smo8|GENSCAN_predicted_peptide_23|123_aa
    MYAIIREDLKVMGVPEWTIVQAMAKMGKGHSKNIETFEDYFQCCKSNAKKAHHQQLQHFL
    ASRIHPDNPYHSLIFLDALFLGASRNVFGDDVKKLNQEVMKLQRLQNVFTHCTEALYERNKAL
    >Smo8|GENSCAN_predicted_peptide_24|148_aa
    MVSWVEQDLFSGVRSLEMQALLVFDVRMLAHEWELVLRTFDKSMLLVRLLRMDMDCSPDV
    EFDDGISQQDFDQETYDEHVEDLEAAKMVAQNGGLNKMWAWPGNACLDLFNSQVGGVEEI
    IKRYGELKFKPDINDEYNNNVEDNFRLG>Smo8|GENSCAN_predicted_peptide_25|869_aa
    MAWNGPHRFLVPLCRLVQSHWVRSAYDLNMAIYSSFAEVGFGSTAGTFIVSPTFPGTGRT
    ESVTTELEEKWDPIWLRKSREFDGFLGQHVETAGFIGKLFWVIDGNHRWLAFWKCIQEKF
    RDDEERHISVPVTLVEPMTREHTVKLNMACHKINMLNQLAFQAHTLVHDIEHLRFMGMLS
    LEEVQTLYNREVFEAIRVKLAARKGQELQTWYQLPLDFLVAVWAKPIIKGERASFFKNAR
    GDEQQKEMEWHAIEQRILNTKRTKVKKLLSIADIRRGQEWLDSIAEICSRVRTGKAPEEL
    IYNISKSKISFAEQSVCLELLNTWLRNNEKRCEIRMEKVMKEKAYISQSIATGEKMLQVW
    DEKFLSSTRTAPAHRYTQPVFCEKLLTRGKKLGMYEPEFYCPCTHERELCPWYLELVEQM
    PTSDFELDLYFECHPEALSDCDNDVVEVQQLNLGVSTQETIQEMPNSNTVEPEETVQEIP
    AQEIPVPAPRNPRRGRHEAFHENVAESSHQSFRRTKTFFPSLPTATDDDLEELSSAAMHK
    QVRPLSPTASVGEVAILIGDYTQLLDATSRELYMLDRWIHGLGQLTPYGKIDFIFFDLPD
    GMAPPQGDTPSWNVLQAQHISQALKVASSFMTYEGVVVMILPRSLMWEGLLAHLADNDDG
    FVEFSSGCLLSSMAVDIKGTSRQTGKIYSWNYHALCHIGNRRNATRADERAALALPGYDS
    STDIIPNVIDSDVLHPAFPGYRHRSVTAMQILLELFCPPKGIVLDATAGFGSMVQAAKET
    GRAVFALEKEGAFEDILRRFCDRSTLITHSVSVHTLDLDDMDDHMDRSQFHLPIYAERMA
    QHYIQHEPIVKEHSTIPISTFLDMEAEEDExplanation
    Gn.Ex : gene number, exon number (for reference)
    Type  : Init = Initial exon (ATG to 5' splice site)
            Intr = Internal exon (3' splice site to 5' splice site)
            Term = Terminal exon (3' splice site to stop codon)
            Sngl = Single-exon gene (ATG to stop)
            Prom = Promoter (TATA box / initation site)
            PlyA = poly-A signal (consensus: AATAAA)
    S     : DNA strand (+ = input strand; - = opposite strand)
    Begin : beginning of exon or signal (numbered on input strand)End   : end point of exon or signal (numbered on input strand)Len   : length of exon or signal (bp)
    Fr    : reading frame (a forward strand codon ending at x has frame x mod 3)Ph    : net phase of exon (exon length modulo 3)
    I/Ac  : initiation signal or 3' splice site score (tenth bit units)Do/T  : 5' splice site or termination signal score (tenth bit units)CodRg : coding region score (tenth bit units)
    P     : probability of exon (sum over all parses containing exon)Tscr  : exon score (depends on length, I/Ac, Do/T and CodRg scores)CommentsThe SCORE of a predicted feature (e.g., exon or splice site) is a
    log-odds measure of the quality of the feature based on local sequence
    properties. For example, a predicted 5' splice site with
    score > 100 is strong; 50-100 is moderate; 0-50 is weak; and
    below 0 is poor (more than likely not a real donor site).
    The PROBABILITY of a predicted exon is the estimated probability under
    GENSCAN's model of genomic sequence structure that the exon is correct.
    This probability depends in general on global as well as local sequence
    properties, e.g., it depends on how well the exon fits with neighboring
    exons.  It has been shown that predicted exons with higher probabilities
    are more likely to be correct than those with lower probabilities.
    Was this page helpful?
    Tag page (Edit tags)
    • No tags
    You must login to post a comment.