Sequence 200 Supplemental Data

     

    Below are some of the data generated for this project that were not presented in full on the main page, to keep that page readable.

    S1. RepeatMasker (transposon and retroelement analysis)

    RepeatMasker accepts a DNA sequence and returns it with the repetitive segments, such as those derived from transposons and retroelements, masked with Ns. To simplify our search for genes, we used this masked sequence, instead of the original sequence, when running the gene prediction programs FGENESH and GeneMark. RepeatMasker also returns a table summarizing the types of retroelements and transposons found, along with the number of elements of each type and the share of the sequence they occupy.

    The search was run with the cross_match search engine at the default sensitivity.

    ==================================================
    file name: RM2sequpload_1284664640  
    sequences:             1
    total length:     400000 bp  (396200 bp excl N/X-runs)
    GC level:         46.67 %
    bases masked:     331379 bp ( 82.84 %)
    ==================================================
                   number of      length   percentage
                   elements*    occupied  of sequence
    --------------------------------------------------
    Retroelements          123       327504 bp   81.88 %
       SINEs:                0            0 bp    0.00 %
       Penelope              0            0 bp    0.00 %
       LINEs:                7         1765 bp    0.44 %
        CRE/SLACS            0            0 bp    0.00 %
         L2/CR1/Rex          0            0 bp    0.00 %
         R1/LOA/Jockey       0            0 bp    0.00 %
         R2/R4/NeSL          0            0 bp    0.00 %
         RTE/Bov-B           0            0 bp    0.00 %
         L1/CIN4             7         1765 bp    0.44 %
       LTR elements:       116       325739 bp   81.43 %
         BEL/Pao             0            0 bp    0.00 %
         Ty1/Copia          43        98801 bp   24.70 %
         Gypsy/DIRS1        73       226938 bp   56.73 %
           Retroviral        0            0 bp    0.00 %
    
    DNA transposons         12         3022 bp    0.76 %
       hobo-Activator        7         1643 bp    0.41 %
       Tc1-IS630-Pogo        0            0 bp    0.00 %
       En-Spm                0            0 bp    0.00 %
       MuDR-IS905            0            0 bp    0.00 %
       PiggyBac              0            0 bp    0.00 %
       Tourist/Harbinger     0            0 bp    0.00 %
       Other (Mirage,        0            0 bp    0.00 %
        P-element, Transib)
    
    Rolling-circles          0            0 bp    0.00 %
    
    Unclassified:            0            0 bp    0.00 %
    
    Total interspersed repeats:      330526 bp   82.63 %
    
    
    Small RNA:               0            0 bp    0.00 %
    
    Satellites:              0            0 bp    0.00 %
    Simple repeats:          8          383 bp    0.10 %
    Low complexity:          8          470 bp    0.12 %
    ==================================================
    
    * most repeats fragmented by insertions or deletions
      have been counted as one element
                                                          
    
    The query species was assumed to be zea           
    RepeatMasker version open-3.2.9 , default mode
                                       
    run with cross_match version 0.990329
    RepBase Update 20090604, RM database version 20090604
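
As a rough illustration of what masking means and how the summary figures above fit together, the following Python sketch replaces a set of repeat intervals with Ns and recomputes two of the reported percentages. The function names `mask_intervals` and `percent_of_sequence` are made up for this example and are not part of RepeatMasker. The coordinates in the toy call are invented, while the base-pair totals are copied from the table.

```python
def mask_intervals(seq, intervals):
    """Return seq with each 1-based, inclusive (start, end) interval replaced by Ns."""
    chars = list(seq)
    for start, end in intervals:
        for i in range(start - 1, end):
            chars[i] = "N"
    return "".join(chars)


def percent_of_sequence(bp, total_bp):
    """Share of the sequence covered by bp bases, as a percentage with two decimals."""
    return round(100.0 * bp / total_bp, 2)


# Toy example with invented coordinates: positions 3..6 are treated as a repeat.
toy_masked = mask_intervals("ACGTACGTAC", [(3, 6)])

# Totals copied from the summary table above.
TOTAL_BP = 400000
retroelements_bp = 327504
dna_transposons_bp = 3022
simple_repeats_bp = 383
low_complexity_bp = 470

# Interspersed repeats are the retroelements plus the DNA transposons.
interspersed_bp = retroelements_bp + dna_transposons_bp
# All masked bases also include simple repeats and low-complexity regions.
masked_bp = interspersed_bp + simple_repeats_bp + low_complexity_bp

print(toy_masked)
print(interspersed_bp, percent_of_sequence(interspersed_bp, TOTAL_BP))
print(masked_bp, percent_of_sequence(masked_bp, TOTAL_BP))
```

The sums reproduce the "Total interspersed repeats" and "bases masked" lines of the table, which is a quick way to check that a copied summary is internally consistent.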

    S2. CPGPLOT

     

    The following output lists the portions of sequence 200 that have an unusually high CpG content (observed/expected ratio > 0.6, C+G content > 50%, length of roughly 200 bp or more). Such regions are called CpG islands. CpG islands often coincide with the promoter regions of genes, so we use them to help find genes when annotating new genomes.
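
The observed/expected ratio is commonly computed for a window of sequence as the number of CG dinucleotides divided by (number of C × number of G) / window length. The Python sketch below applies that formula to a single window. It is only an illustration of the criterion, not cpgplot's implementation, which slides windows along the sequence and averages the results. The function name `cpg_window_stats` is hypothetical.

```python
def cpg_window_stats(window):
    """Return (observed/expected CpG ratio, percent C+G) for one window of DNA."""
    window = window.upper()
    n = len(window)
    if n == 0:
        return 0.0, 0.0
    c = window.count("C")
    g = window.count("G")
    observed = window.count("CG")  # CG cannot overlap itself, so count() is exact
    expected = c * g / n
    ratio = observed / expected if expected else 0.0
    gc_percent = 100.0 * (c + g) / n
    return ratio, gc_percent


def looks_like_island(window, min_ratio=0.6, min_gc_percent=50.0):
    """Apply the ratio and C+G thresholds quoted in the cpgplot output header."""
    ratio, gc_percent = cpg_window_stats(window)
    return ratio > min_ratio and gc_percent > min_gc_percent


print(cpg_window_stats("CGCGCGCG"))
print(looks_like_island("ATATATAT"))
```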

     

     

    CPGPLOT islands of unusual CG composition
    seq200 from 1 to 400000
    
         Observed/Expected ratio > 0.60
         Percent C + Percent G > 50.00
         Length > 200
    
     Length 1099 (1686..2784)
    
     Length 418 (4211..4628)
    
     Length 207 (13131..13337)
    
     Length 426 (14873..15298)
    
     Length 362 (15965..16326)
    
     Length 231 (17005..17235)
    
     Length 252 (20069..20320)
    
     Length 201 (20925..21125)
    
     Length 285 (21210..21494)
    
     Length 1110 (21554..22663)
    
     Length 1411 (22668..24078)
    
     Length 1014 (28394..29407)
    
     Length 240 (29444..29683)
    
     Length 1164 (29713..30876)
    
     Length 330 (31396..31725)
    
     Length 270 (31985..32254)
    
     Length 389 (32285..32673)
    
     Length 980 (32701..33680)
    
     Length 418 (33770..34187)
    
     Length 265 (34199..34463)
    
     Length 1256 (37644..38899)
    
     Length 1286 (41541..42826)
    
     Length 741 (44973..45713)
    
     Length 232 (46185..46416)
    
     Length 432 (53577..54008)
    
     Length 200 (54712..54911)
    
     Length 586 (55659..56244)
    
     Length 463 (56253..56715)
    
     Length 276 (57227..57502)
    
     Length 798 (62827..63624)
    
     Length 293 (64165..64457)
    
     Length 632 (64481..65112)
    
     Length 363 (65120..65482)
    
     Length 884 (65557..66440)
    
     Length 1097 (66465..67561)
    
     Length 249 (67568..67816)
    
     Length 1230 (67827..69056)
    
     Length 356 (69070..69425)
    
     Length 343 (69446..69788)
    
     Length 924 (69914..70837)
    
     Length 520 (70888..71407)
    
     Length 304 (71443..71746)
    
     Length 495 (72046..72540)
    
     Length 501 (72554..73054)
    
     Length 1417 (73129..74545)
    
     Length 1231 (74577..75807)
    
     Length 582 (75856..76437)
    
     Length 333 (76515..76847)
    
     Length 284 (78122..78405)
    
     Length 405 (80662..81066)
    
     Length 423 (81238..81660)
    
     Length 417 (81985..82401)
    
     Length 1099 (86587..87685)
    
     Length 1158 (91755..92912)
    
     Length 479 (103708..104186)
    
     Length 235 (104881..105115)
    
     Length 951 (105679..106629)
    
     Length 873 (114321..115193)
    
     Length 806 (119551..120356)
    
     Length 413 (125311..125723)
    
     Length 241 (127736..127976)
    
     Length 209 (137406..137614)
    
     Length 295 (140795..141089)
    
     Length 360 (145524..145883)
    
     Length 883 (145960..146842)
    
     Length 356 (146870..147225)
    
     Length 952 (147292..148243)
    
     Length 1413 (148306..149718)
    
     Length 453 (149722..150174)
    
     Length 226 (150193..150418)
    
     Length 1726 (150460..152185)
    
     Length 4248 (152197..156444)
    
     Length 471 (156560..157030)
    
     Length 356 (157089..157444)
    
     Length 342 (159661..160002)
    
     Length 286 (160118..160403)
    
     Length 255 (162698..162952)
    
     Length 210 (166320..166529)
    
     Length 212 (166556..166767)
    
     Length 746 (166964..167709)
    
     Length 322 (169204..169525)
    
     Length 901 (170077..170977)
    
     Length 906 (176089..176994)
    
     Length 732 (194562..195293)
    
     Length 287 (195344..195630)
    
     Length 310 (196821..197130)
    
     Length 231 (198333..198563)
    
     Length 376 (199068..199443)
    
     Length 729 (204489..205217)
    
     Length 390 (206465..206854)
    
     Length 363 (207894..208256)
    
     Length 478 (208396..208873)
    
     Length 375 (210245..210619)
    
     Length 267 (211651..211917)
    
     Length 610 (217925..218534)
    
     Length 297 (218541..218837)
    
     Length 424 (219901..220324)
    
     Length 263 (227206..227468)
    
     Length 737 (228301..229037)
    
     Length 443 (231273..231715)
    
     Length 274 (233510..233783)
    
     Length 200 (233834..234033)
    
     Length 424 (235431..235854)
    
     Length 228 (241396..241623)
    
     Length 352 (241827..242178)
    
     Length 576 (244747..245322)
    
     Length 700 (245340..246039)
    
     Length 787 (246719..247505)
    
     Length 724 (247999..248722)
    
     Length 253 (249633..249885)
    
     Length 267 (250010..250276)
    
     Length 540 (250355..250894)
    
     Length 252 (250936..251187)
    
     Length 317 (251886..252202)
    
     Length 564 (256793..257356)
    
     Length 285 (258701..258985)
    
     Length 320 (259088..259407)
    
     Length 797 (259463..260259)
    
     Length 956 (260286..261241)
    
     Length 299 (261265..261563)
    
     Length 1004 (261810..262813)
    
     Length 325 (270381..270705)
    
     Length 406 (270874..271279)
    
     Length 390 (274122..274511)
    
     Length 208 (275289..275496)
    
     Length 1192 (281076..282267)
    
     Length 371 (285621..285991)
    
     Length 237 (286065..286301)
    
     Length 260 (286412..286671)
    
     Length 361 (289292..289652)
    
     Length 684 (294824..295507)
    
     Length 204 (295517..295720)
    
     Length 1455 (301128..302582)
    
     Length 758 (302596..303353)
    
     Length 323 (303356..303678)
    
     Length 774 (303702..304475)
    
     Length 304 (307085..307388)
    
     Length 601 (307541..308141)
    
     Length 1064 (311312..312375)
    
     Length 484 (312644..313127)
    
     Length 235 (313213..313447)
    
     Length 263 (314611..314873)
    
     Length 388 (317150..317537)
    
     Length 695 (318164..318858)
    
     Length 364 (318993..319356)
    
     Length 370 (321768..322137)
    
     Length 324 (329471..329794)
    
     Length 450 (332025..332474)
    
     Length 476 (332504..332979)
    
     Length 307 (333497..333803)
    
     Length 212 (333899..334110)
    
     Length 588 (334393..334980)
    
     Length 453 (335137..335589)
    
     Length 699 (336085..336783)
    
     Length 451 (336791..337241)
    
     Length 270 (337417..337686)
    
     Length 299 (355818..356116)
    
     Length 257 (358802..359058)
    
     Length 262 (359102..359363)
    
     Length 214 (359937..360150)
    
     Length 320 (362337..362656)
    
     Length 740 (363020..363759)
    
     Length 245 (363800..364044)
    
     Length 460 (364095..364554)
    
     Length 762 (364567..365328)
    
     Length 363 (365487..365849)
    
     Length 437 (365965..366401)
    
     Length 268 (366798..367065)
    
     Length 382 (367353..367734)
    
     Length 486 (367769..368254)
    
     Length 478 (368538..369015)
    
     Length 486 (369159..369644)
    
     Length 1876 (369717..371592)
    
     Length 1044 (371682..372725)
    
     Length 1234 (372755..373988)
    
     Length 301 (374413..374713)
    
     Length 219 (374924..375142)
    
     Length 203 (379797..379999)
    
     Length 269 (387594..387862)
    
     Length 348 (389558..389905)
    
     Length 275 (393622..393896)
    
     Length 502 (394455..394956)
    
     Length 229 (395036..395264)
    
     Length 702 (395411..396112)
    
     Length 233 (398151..398383)
    
     Length 435 (398939..399373)
    
     Length 261 (399425..399685)

     

     

     

    S3. COMPSEQ

    # Output from 'compseq'
    #
    # The Expected frequencies are calculated on the (false) assumption that every
    # word has equal frequency.
    #
    # The input sequences are:
    #	seq200
    
    
    Word size	2
    Total count	399999
    
    #
    # Word	Obs Count	Obs Frequency	Exp Frequency	Obs/Exp Frequency
    #
    AA	32908		0.0822702	0.0625000	1.3163233
    AC	21489		0.0537226	0.0625000	0.8595621
    AG	25875		0.0646877	0.0625000	1.0350026
    AT	26494		0.0662352	0.0625000	1.0597626
    CA	26654		0.0666352	0.0625000	1.0661627
    CC	23814		0.0595351	0.0625000	0.9525624
    CG	16791		0.0419776	0.0625000	0.6716417
    CT	25504		0.0637602	0.0625000	1.0201626
    GA	25670		0.0641752	0.0625000	1.0268026
    GC	21912		0.0547801	0.0625000	0.8764822
    GG	23646		0.0591151	0.0625000	0.9458424
    GT	20884		0.0522101	0.0625000	0.8353621
    TA	21531		0.0538276	0.0625000	0.8612422
    TC	25548		0.0638702	0.0625000	1.0219226
    TG	25805		0.0645127	0.0625000	1.0322026
    TT	31636		0.0790902	0.0625000	1.2654432
    
    Other	3838		0.0095950	0.0000000	10000000000.0000000
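
The table above can be reproduced in outline with a few lines of Python. The sketch below counts overlapping words of size 2, uses the same equal-frequency assumption stated in the compseq header (expected frequency 1/16 = 0.0625), and puts any word containing a character other than A, C, G or T into an "other" bin. The function name `word_stats` is made up for this example; it is not the compseq code.

```python
from collections import Counter
from itertools import product


def word_stats(seq, k=2):
    """Count overlapping k-mers and compare them with an equal-frequency expectation.

    Returns (rows, other), where rows maps each A/C/G/T word to
    (count, observed frequency, observed/expected) and other is the number
    of words that contain any other character, such as N.
    """
    seq = seq.upper()
    total = len(seq) - k + 1
    if total < 1:
        raise ValueError("sequence is shorter than the word size")
    counts = Counter(seq[i:i + k] for i in range(total))
    expected = 1.0 / 4 ** k  # every word assumed equally likely
    rows = {}
    for letters in product("ACGT", repeat=k):
        word = "".join(letters)
        freq = counts[word] / total
        rows[word] = (counts[word], freq, freq / expected)
    other = total - sum(count for count, _, _ in rows.values())
    return rows, other


rows, other = word_stats("AACGN")
print(rows["AA"], rows["CG"], other)
```

For a 400000 bp sequence this gives 399999 overlapping words, which matches the total count in the table. The AA row, for example, follows from 32908 / 399999 / 0.0625.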

     

    S4. GeneMark

    GeneMark.hmm (Version 2.2a)
    Sequence name: Mon Oct 18 20:45:54 EDT 2010
    Sequence length: 400000 bp
    G+C content: 91.08%
    Matrix: corn
    Mon Oct 18 20:45:57 2010
    
    Predicted genes/exons
    
    Gene Exon Strand Exon           Exon Range     Exon      Start/End
      #    #         Type                         Length       Frame
    
      1     1   +  Initial      28886     29119     234          1 3
      1     2   +  Terminal     29959     30066     108          1 3
    
      2     1   +  Initial      37729     37772      44          1 2
      2     2   +  Internal     38009     38078      70          3 3
      2     3   +  Terminal     38718     38837     120          1 3
    
      3     1   +  Initial      41528     41899     372          1 3
      3     2   +  Internal     42005     42058      54          1 3
      3     3   +  Terminal     42146     42229      84          1 3
    
      4     1   +  Initial      42374     42413      40          1 1
      4     2   +  Internal     42499     42555      57          2 1
      4     3   +  Internal     42666     42805     140          2 3
      4     4   +  Terminal     43352     43426      75          1 3
    
      5     1   +  Initial      46226     46348     123          1 3
      5     2   +  Internal     48621     48707      87          1 3
      5     3   +  Internal     48795     48881      87          1 3
      5     4   +  Internal     48989     49041      53          1 2
      5     5   +  Internal     49314     49464     151          3 3
      5     6   +  Internal     49590     49737     148          1 1
      5     7   +  Internal     50473     50855     383          2 3
      5     8   +  Internal     50949     51329     381          1 3
      5     9   +  Terminal     52918     52938      21          1 3
    
      6     1   +  Initial      86608     86721     114          1 3
      6     2   +  Internal     87267     87367     101          1 2
      6     3   +  Terminal     87446     87575     130          3 3
    
      7     1   +  Initial     119785    119926     142          1 1
      7     2   +  Terminal    120028    120107      80          2 3
    
      8     1   +  Initial     160164    160212      49          1 1
      8     2   +  Terminal    160350    160408      59          2 3
    
      9     2   -  Terminal    182620    182636      17          3 2
      9     1   -  Initial     182739    182790      52          1 1
    
     10     3   -  Terminal    231071    231292     222          3 1
     10     2   -  Internal    231384    231452      69          3 1
     10     1   -  Initial     231546    231599      54          3 1
    
     11     1   +  Initial     281323    281380      58          1 1
     11     2   +  Internal    281472    281528      57          2 1
     11     3   +  Terminal    281654    282255     602          2 3
    
     12     1   +  Initial     283469    283483      15          1 3
     12     2   +  Terminal    288051    288179     129          1 3
    
     13    18   -  Terminal    337907    337954      48          3 1
     13    17   -  Internal    343511    343596      86          3 2
     13    16   -  Internal    343737    343962     226          1 1
     13    15   -  Internal    344040    344120      81          3 1
     13    14   -  Internal    344221    344340     120          3 1
     13    13   -  Internal    344450    345531    1082          3 2
     13    12   -  Internal    345937    346135     199          1 1
     13    11   -  Internal    346875    346965      91          3 3
     13    10   -  Internal    347101    347168      68          2 1
     13     9   -  Internal    347246    348094     849          3 1
     13     8   -  Internal    348495    348577      83          3 2
     13     7   -  Internal    348684    349529     846          1 2
     13     6   -  Internal    349773    350862    1090          1 1
     13     5   -  Internal    350935    352980    2046          3 1
     13     4   -  Internal    353089    353960     872          3 2
     13     3   -  Internal    354046    354090      45          1 2
     13     2   -  Internal    354186    354331     146          1 3
     13     1   -  Initial     354509    354585      77          2 1
    
     14     1   +  Initial     357184    357222      39          1 3
     14     2   +  Internal    357334    357429      96          1 3
     14     3   +  Internal    358852    359049     198          1 3
     14     4   +  Terminal    359162    359230      69          1 3
    
    
    >Mon Oct 18 20:45:54 EDT 2010_1|GeneMark.hmm|gene 1|113_aa
    MGARVAASQIGADALTALQARAHAAAAQTGVDAPAVLPAAATESTEIAASTGGAVLPPRI
    GTMVAAWPRPLSGTSVPAANVAQAEAEEEALLLAHASIEPSPAAPGHSGTPPP
    
    >Mon Oct 18 20:45:54 EDT 2010_2|GeneMark.hmm|gene 2|77_aa
    MTNGDDVMASRIMLRAPASCEDGWRRMAPGAGGIEIGQGGVEVAAQSQKGTTADVPTAAA
    LSLRAGEEPEDDQCWEG
    
    >Mon Oct 18 20:45:54 EDT 2010_3|GeneMark.hmm|gene 3|169_aa
    MEHRVHDPIKDHKGASEKLAVVKGDAHTIGRHVDSGMAKDGRQQSRCESGHVAARKVPYA
    CAGVGLGKPKFNSAGETVGHNGRHCGREEQWRPTIVGERPTTKPGVVKKLGSIAAIEGGH
    RVLRGRKRTPNKPDEGVATDAERGCWWTECGRKVGGSVVSEEMDVGLLS
    
    >Mon Oct 18 20:45:54 EDT 2010_4|GeneMark.hmm|gene 4|103_aa
    MRHEFDEEGDEEAWDEVARRRSRAGEERAQLCGVRDGWGRVHDGWESGGASCSGISGGGS
    RNYVWQKRLADGRGSLAGRVLGKALGIPENSVRTYTEAEIRAR
    
    >Mon Oct 18 20:45:54 EDT 2010_5|GeneMark.hmm|gene 5|477_aa
    MPHLTAPPPLVTSPLAPILVSLMPALVHGPVSPAIIPDASSVERIAPGSLPSSMKEPVVL
    VVNKADGDEEVKAAGDNIVGVVLLQELPHLSHLGVRARQEKVVFVTCEDDDTIKNMRLGA
    SSNNVDLSVVSNKDDCAAMSSEPSAGGDLFAQQFSLLTTDKKLELSEGPSLQQKSYTSVA
    NGMSGVLELSEASIESSGAKAAACGTLSVLSSMSNKVYNDQGTPAAFRVPAGAVIPFGSM
    EDALKKSGSLKSYTNLLERIETAQIENGELDSLSSKLQATVSLLSPSEEIIESLKKTFDQ
    NVRLIVRSTANVEDLAGMSAAGLYESIPNVSLSDPRSFGAAVGQEMLQPDLSFVLHTISP
    VDHDPKLVEAEVAPGLGETLASGTRGTPWRLSCHKLDGKVTTLAFANFSEELMVLNSGPT
    DGEMSRRTVDYSKKPLSVDATFREQFGQRLAAIGQYLEQKFGSAQDVEGCLGPGYWI
    
    >Mon Oct 18 20:45:54 EDT 2010_6|GeneMark.hmm|gene 6|114_aa
    MGPSCQQLGGRSSHTRPPSPRFDVGPFGCGVVFTESHPAVGQATGNSSPDFDLGCESTVR
    RGHSSVGHLPRCESRSTSHGAAVLLFYAVGLTPVREKGKGRGSLTVDSQLEGGD
    
    >Mon Oct 18 20:45:54 EDT 2010_7|GeneMark.hmm|gene 7|73_aa
    MARNSGHLDEPLGLILHTSDPHTLMNAPDSSPAFDRWYKHELSAADDADPRFVPSAKSAR
    LLALLSSRHGQLG
    
    >Mon Oct 18 20:45:54 EDT 2010_8|GeneMark.hmm|gene 8|35_aa
    MGVNRAEDAETAERPGFFAFCLPPDGIDPRWYALS
    
    >Mon Oct 18 20:45:54 EDT 2010_9|GeneMark.hmm|gene 9|22_aa
    MVPDDIVVDVLVAFVDTKNTFH
    
    >Mon Oct 18 20:45:54 EDT 2010_10|GeneMark.hmm|gene 10|114_aa
    MRELAAPVVFVANGHAEQTLVVTRRVPISKMTLRVVVASELALSGLVNLEGMTMSAIDHA
    LEEIIPEGLLLELPEAGGAGIHVEVPDDVHSASLLVGPEITRAVPMHLRPLRVV
    
    >Mon Oct 18 20:45:54 EDT 2010_11|GeneMark.hmm|gene 11|238_aa
    MVSQLGLYRRDEETNAREAGALLSCSYSPRQTKQGESPDDADELEPGAGAGHRAETGRSA
    GELRERARELRELETDARSKEKRAQRREEQGHGKETLRRGRSRGRGRAQRKTATTGKGAR
    RPGEMRAWERRTGARARELSRAPELEAEREYAAACHGRESTEQRELGAPWEESWSAGRGQ
    KEGDEAEGAAQRPGELGETSAGRTMADRNNQAARRAQLPWEIRPAASRQGRKYPKRRR
    
    >Mon Oct 18 20:45:54 EDT 2010_12|GeneMark.hmm|gene 12|47_aa
    MGTHKVPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPTGGRNDSA
    
    >Mon Oct 18 20:45:54 EDT 2010_13|GeneMark.hmm|gene 13|2684_aa
    MHLQKEDDLSILKQCVDKFNVPPEHRLVRSEDSVPGSIRALAMLALGAQLAAYASSHERA
    RILSGSSIISAGGNLFSAPCTVFIKFRDYVKTLQKLMEYSSPAVSLFKDLGGVELLSQRL
    HVEVQRVIGTADGHNSMVTDAVKSDDNHMYSQKRLIKALLKALGSATYSPGNPARSQSSQ
    DNSLPVSLSLIFQNVDKFGGDIYFSAVTVMSEIIHKDPTCFITLKELGVPDAFISSVTAG
    VIPSCKALICVPNGLGAICLNNQGLEAVRETSALRFLVDTFTSRKYLIPMNEGVVLLANA
    VEELLRHVQSLRSIGVDIIIEIINKLSSSQEYKNNETATLQEKTDMETDVEGRDLVSAMD
    SSVDGSNDEQFSHLSIFHVMSSGGMPIALHSTMVFKGFTQHHSTPLARAFCSSLKEHLKS
    ALKELDKVSNSFDMTKIEKGAIPSLFVVEFLLFLAASKDNRWMNALLSEFGDASREVLED
    VGQVHREVLWKISLFEKNKIVAETSSSSSTSEAQQPDMSASDIGDSRYTSFRQYLDPILR
    RRGSGWNIESQVSDLINMYRDIGRAASDSQRVGSDRYSSLGLPSSSQDQFSSSSDANAST
    RSEEDKKKSEHSSCFDMMRSLSYHINHLFLELGKAMLFASRRENSPVNLSPAVISVANNI
    ASIVLEHLNFEGHSVSFERDMTVTTKCRYLGKVVEFVDGMLLDRPESCNSIMVNSFYCRG
    VIQAILTTFQATSELLFTMSRPPSSPMETDSKTGKDGKEMDSSWIYGPLTSYGAIMDHLV
    TSSFILSSSTRQLLEQPIFNGSVRFPQDAETFMKLLQSKVLKTVLPIWAHPQFPECNIEL
    ISSVMSIMRHVCSGVEVKDTVGNGGARLAGPPPDESAISLIVEMGFSRARAEEALRQVGT
    NSVEIATDWLFAHPEEPQEEDDELARALAMSLGNSVTPAQEGDSRSNDLELEEATVQPPP
    IDEMLRSCLQLLQRKEALAFSVRDMLVTISSQNDGQNRVKVLTYLIDNLKQCVVASEPSN
    DTALSALLHVLALILHGDTAAREVASKAGLVKVALDLLCSWEMLQLEPKLPDVTELHVLK
    RDNSNIKTSLVIDDSKRKDSESLPNVGLLDMEDQFQLLKICCKCIGKQLPSASMHAILQL
    SATLTKVHAAAICFLESGGLNALLSLPTSSLFSGFNNMASTIIRHILEDPHTLQQAMELE
    IRHSLVTAANRHANPRVTPRNFIQNLAFVVYRDPVIFMKAAQSVCQIEMVGDRPYVVLLK
    DREKERIKEKDKDKSVDKDKATVAVTKVVSGDTAAGSPANSHGKQSDLNSRNVKSHRKPP
    QSFVTVIEHLLDLLMSFVPPPRPEDQVDVSGTALSSDMDIDCSSAKGKGKAVSVPPEESK
    HAIQESTASLAKTAFFLKLLTDVLLTYASSIHVVLRHDAELSNMHGYNAPVPRMNVYVDL
    LNDILSARSPTGSSLSAESAVIFVEAGLVHSLSTMLQVLDLDHPDSAKIVTAVVKALELV
    SKEHIHSADNAKGVNSSKIASDSNNVNSSSNRFQALDMTSQPTEMVTDHRETFNAVRTSQ
    ISDSVADEMDHDRDMDGGFARDGEDDFMHEMAEDGTGDGSTMEIRIEIPRNREDDMAPAA
    DDTDEDISAEDGEDDEDEDEENNNLEEDDAHRMSHPDTDQEDREMDEEEFDEDLLEEDDE
    DEDEEGVILRLEEGINGINVLDHVEVFGLVIMVFLIIHFSKSLLQQRIFQTKGIQRTPVG
    GAQENDQPNHGSDAEAREVAPAQQNENSESTLNPLDLSECAGPAPPDSDALQRDVSNASE
    LATEMQYERSDAITRDVEAVSQASSGSGATLGESLRSLEVEIGSVEGHDDGDRHGTSGTS
    ERLPLGDIQAAARSRRPSGNAVPVSSRDMSLESVSEVPQNPDQEPDQNASEGNQEPTRAA
    GADSIDPTFLEALPEDLRAEVLSSRQNQVTQTSNDQPQDDGDIDPEFLAALPPDIREEVL
    AQQRTQRMQQQSQELEGQPVEMDAVSIIATFPSEIREEVLLTSPDTLLATLTPALVAEAN
    ISTSKPIETEGAPLVDEDGLKALIRLLRVVQLLNLLDVVMHNAENEIKQAKLEASSEKPS
    APDNAVQDGKNNSDISVSYGSELNPEDGSKAPAVDNRSEVLKKIVALAPFFCCHFINELA
    RSMQNLTLRAMKELHLYENSEKALLSSSSANGTAVLRVVQALSSLVNTLQERKDPEQPAE
    KDHSDAVSQISEINTALDSLWLELSNCISKIESSSEYASNLSPASASAAMLTTGVAPPLP
    AGTQNLLPYIESFFVTCEKLRPGQPDAVQDASTSDMEDASTSSGGQRSSACQASLDEKQN
    AFVKFSEKHRRLLNAFIRQNSGLLEKSFSLMLKIPRLIDFDNKRAYFRSKIKHQYDHHHH
    SPVRISVRRPYILEDSYNQLRMRSPQDLKGRLTVQFQGEEGIDAGGLTREWYQSISRVIV
    DKSALLFTTVGNDLTFQPNPNSVYQTEHLSYFKFVGRVLLDAHFTRSFYKHILGVKVTYH
    DIEAIDPSYYKNLKWMLENDISDVLDLTFSMDADEEKLILYEKAEVTDCELIPGGRNIRV
    TEENKHEYVDRVAEHRLTTAIRPQINAFLEGFNELIPRELISIFNDKELELLISGLPDID
    FVQWFWEIVQGFSKEDKARFLQFVTGTSKASNSSKLSKGSEDVF
    
    >Mon Oct 18 20:45:54 EDT 2010_14|GeneMark.hmm|gene 14|133_aa
    MPSSVATTYRSTQGQEQHRGGRDEGSGQHGSRDWQRRVKEMRTGADGKQSSKDAGDGEEN
    RGLTGSDSILIAGETARLPARICCRRSGKLALCAAIAAAAAAISDRNPYLLGRGALAAWR
    LVTGVENWARTYL
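
The exon table and the protein lengths in the FASTA headers above can be cross-checked against each other. The Python sketch below assumes that the exon lengths of a complete gene add up to a whole number of codons and that the last codon is a stop codon that is not translated. That assumption is ours, but it agrees with the values listed above. The helper name `expected_protein_length` is hypothetical, and only a few genes are copied in.

```python
def expected_protein_length(exon_lengths):
    """Protein length in amino acids for a complete coding sequence.

    Assumes the summed exon length is a multiple of 3 and includes one stop codon.
    """
    total = sum(exon_lengths)
    if total % 3 != 0:
        raise ValueError("coding length %d is not a multiple of 3" % total)
    return total // 3 - 1


# Exon lengths copied from the GeneMark exon table (gene number -> exon lengths).
genemark_exons = {
    1: [234, 108],
    2: [44, 70, 120],
    8: [49, 59],
    14: [39, 96, 198, 69],
}

# Protein lengths copied from the corresponding FASTA headers.
reported_aa = {1: 113, 2: 77, 8: 35, 14: 133}

for gene, exons in genemark_exons.items():
    print(gene, expected_protein_length(exons), reported_aa[gene])
```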
    

     

    S5. FGENESH

     

     FGENESH 2.6 Prediction of potential genes in Monocot genomic DNA
     Time    :   Mon Oct 18 22:49:05 2010
     Seq name:  test sequence 
     Length of sequence: 400000 
     Number of predicted genes 6: in +chain 4, in -chain 2.
     Number of predicted exons 41: in +chain 22, in -chain 19.
     Positions of predicted genes and exons: Variant   1 from   1, Score:747.084717 
       G Str   Feature   Start        End    Score           ORF           Len
    
       1 -    1 CDSi        27 -       230   11.88        27 -       230    204
       1 -    2 CDSf       914 -      1081   10.52       914 -      1081    168
       1 -      TSS       1119               -8.78
    
       2 +      TSS       1434               -4.78
       2 +    1 CDSf     24269 -     24276   -8.62     24269 -     24274      6
       2 +    2 CDSi     25004 -     25088   14.85     25005 -     25088     84
       2 +    3 CDSi     25535 -     25702   19.00     25535 -     25702    168
       2 +    4 CDSi     25780 -     25841   -2.31     25780 -     25839     60
       2 +    5 CDSi     25957 -     26162   15.48     25958 -     26161    204
       2 +    6 CDSl     26266 -     26408    3.39     26268 -     26408    141
       2 +      PolA     26762               -1.06
    
       3 +      TSS      39868               -1.48
       3 +    1 CDSo     41528 -     41923   21.38     41528 -     41923    396
       3 +      PolA     42023               -4.26
    
       4 +      TSS      42059               -2.78
       4 +    1 CDSf     42567 -     42617    3.37     42567 -     42617     51
       4 +    2 CDSi     43352 -     43422   11.75     43352 -     43420     69
       4 +    3 CDSi     43546 -     43663   -0.46     43547 -     43663    117
       4 +    4 CDSi     44227 -     44232    1.40     44227 -     44232      6
       4 +    5 CDSi     46062 -     46139    1.31     46062 -     46139     78
       4 +    6 CDSi     46541 -     46724    4.00     46541 -     46723    183
       4 +    7 CDSi     47311 -     47363   10.38     47313 -     47363     51
       4 +    8 CDSi     48621 -     48707   15.30     48621 -     48707     87
       4 +    9 CDSi     48795 -     48881    0.30     48795 -     48881     87
       4 +   10 CDSi     48989 -     49065   -1.52     48989 -     49063     75
       4 +   11 CDSi     49314 -     49464   15.19     49315 -     49464    150
       4 +   12 CDSi     49605 -     49737   -1.93     49605 -     49736    132
       4 +   13 CDSl     50473 -     51377   48.66     50475 -     51377    903
       4 +      PolA     51411                0.44
    
       5 +      TSS     281065               -8.18
       5 +    1 CDSf    281323 -    281380   -0.77    281323 -    281379     57
       5 +    2 CDSl    281654 -    282255   14.67    281656 -    282255    600
       5 +      PolA    282577                0.44
    
       6 -      PolA    342034                0.44
       6 -    1 CDSl    343050 -    343152    4.38    343050 -    343151    102
       6 -    2 CDSi    343242 -    343348   -1.55    343244 -    343348    105
       6 -    3 CDSi    343511 -    343650   16.92    343511 -    343648    138
       6 -    4 CDSi    343737 -    343962   25.18    343738 -    343962    225
       6 -    5 CDSi    344040 -    344120   17.26    344040 -    344120     81
       6 -    6 CDSi    344221 -    344367    9.65    344221 -    344367    147
       6 -    7 CDSi    344450 -    345560   37.67    344450 -    345559   1110
       6 -    8 CDSi    345861 -    346135   22.59    345863 -    346135    273
       6 -    9 CDSi    346213 -    346469   -1.08    346213 -    346467    255
       6 -   10 CDSi    346578 -    346674    8.72    346579 -    346674     96
       6 -   11 CDSi    346875 -    347168   13.74    346875 -    347168    294
       6 -   12 CDSi    347246 -    348321   94.70    347246 -    348319   1074
       6 -   13 CDSi    348495 -    354981  369.38    348496 -    354981   6486
       6 -   14 CDSi    355093 -    355296    4.84    355093 -    355296    204
       6 -   15 CDSi    355382 -    355477   10.50    355382 -    355477     96
       6 -   16 CDSi    356803 -    356886    1.13    356803 -    356886     84
       6 -   17 CDSf    358908 -    359024   23.50    358908 -    359024    117
       6 -      TSS     394549                0.02
    
    Predicted protein(s):
    >FGENESH:[mRNA]   1   2 exon (s)     27  -   1081   372 bp, chain -
    ATGTACAGTGGGCGAATGACTTCCACCCAGAGATCTGAGAGCCAAAACAGGGTTTTGAAA
    GATGGTTATGTTAACAATGTGACAAGCCTGCATATATTTGCAAAGAGGGTGCTTGACTCG
    ATTCAGCACACAGACCACATGGATGCTGGGGAGTCACACTACTCACAGACTGAAGTTGTG
    AGGGCCTGCAAATCACGATTTGATGAGCAACTCAGCAGGGTGTACACCAGGGCTGTGTAC
    AATGAATACAAGAGGGAATATATTAACAACACAGCTTTTGTGATAGAGCCTGATCCAGGA
    ATGGAATGCGGTTACTTGGTGAAACATGAGAAAAGCAATGGGACATTTTGCTGGGCACAA
    CACGCATTCAAG
    >FGENESH:   1   2 exon (s)     27  -   1081   124 aa, chain -
    MYSGRMTSTQRSESQNRVLKDGYVNNVTSLHIFAKRVLDSIQHTDHMDAGESHYSQTEVV
    RACKSRFDEQLSRVYTRAVYNEYKREYINNTAFVIEPDPGMECGYLVKHEKSNGTFCWAQ
    HAFK
    >FGENESH:[mRNA]   2   6 exon (s)  24269  -  26408   672 bp, chain +
    ATGGATTGTCTACTGGAGCAACTGGAATCTATTGAGCAATCTTTGAATGAGTCTGGCTTA
    GAAGCTCTCTCATCCTTTCTGAAAACCAAAAAGAATTTAGACCAACTGGAAGATGCAAAA
    GATTTGGATGAAAATGGCGGCGTTCACGTTTTGCTGAAGACCTTGCTGTCATTATCTTAT
    CTAAGATCAATTCTAATGAAGGGTCTGGAAAGTGGCCTTAGAAATGATGCTCCAGATAGT
    GCTATTGCAATGCGACAAAAGGTTTTGCAGTGGCGTCTTTGTGAAATTGGGCTTGAGGAT
    TATTCGTTTGTATTGTTAAGTAGATACATCAATGCTCTTGAAGCTTTAGGTGGATCAGCT
    TCACTTGCAGAGGGTCTTCCTACAAATACAAGTCTATGGGATGATGCCCTTGATGCCCTT
    ATCATTGGCATAAATCAAGTTTGCTTTTCAGGTTGGAAACCAAATGAGTGTTCTGCAATA
    GTGAATGAGCTTCTTTCTTGGAAGCAGAAAGGTCTATCTGAATTTGAAGGCAATGAGGAT
    GGAAAGTATATTTGGGCACTGAGACTCAAAGCCACTCTTGATAGAACAGGAAGACTAACA
    GAAGAATACTCTGAAGCACTTCTTTCAATATTCCCTGAAAAAGTCAAGGTGAGTTTACCA
    AATGAAGAGTAG
    >FGENESH:   2   6 exon (s)  24269  -  26408   223 aa, chain +
    MDCLLEQLESIEQSLNESGLEALSSFLKTKKNLDQLEDAKDLDENGGVHVLLKTLLSLSY
    LRSILMKGLESGLRNDAPDSAIAMRQKVLQWRLCEIGLEDYSFVLLSRYINALEALGGSA
    SLAEGLPTNTSLWDDALDALIIGINQVCFSGWKPNECSAIVNELLSWKQKGLSEFEGNED
    GKYIWALRLKATLDRTGRLTEEYSEALLSIFPEKVKVSLPNEE
    >FGENESH:[mRNA]   3   1 exon (s)  41528  -  41923   396 bp, chain +
    ATGGAACACAGGGTTCATGACCCGATCAAAGACCACAAAGGTGCCTCGGAGAAGCTGGCT
    GTCGTGAAGGGAGACGCGCACACAATAGGGAGGCATGTGGATTCTGGCATGGCGAAGGAC
    GGGCGTCAACAGAGCAGGTGCGAGAGCGGACATGTCGCTGCCAGGAAAGTGCCCTACGCG
    TGTGCCGGGGTGGGGTTAGGGAAACCGAAGTTTAACAGTGCTGGGGAAACCGTTGGGCAC
    AACGGGCGTCACTGCGGTCGCGAGGAGCAGTGGAGGCCGACAATTGTTGGGGAGAGGCCA
    ACCACCAAGCCGGGCGTCGTGAAGAAGCTGGGGAGCATCGCCGCAATCGAAGGAGGACAT
    AGGGTGCTGCGGGTGCCACGACTGCAGCTAGGATAA
    >FGENESH:   3   1 exon (s)  41528  -  41923   131 aa, chain +
    MEHRVHDPIKDHKGASEKLAVVKGDAHTIGRHVDSGMAKDGRQQSRCESGHVAARKVPYA
    CAGVGLGKPKFNSAGETVGHNGRHCGREEQWRPTIVGERPTTKPGVVKKLGSIAAIEGGH
    RVLRVPRLQLG
    >FGENESH:[mRNA]   4  13 exon (s)  42567  -  51377  2001 bp, chain +
    ATGGCCCCCCTGCGTAGCAGGGGGTCGTCGGTCGCCCGCGCAGCAAGGGAGGTTCTTGGG
    AAAGCCCTTGGAATACCCGAGAACAGTGTCAGAACATACACTGAGGCTGAAATTCGTGCT
    AGTGTTATTTTTCAGGTCTCGAAACTTTGCACTGTACTTTTAAAAGCAACTCGAGCAGTT
    CTTGGGTCGTCTGTGTGGGATGTTCTTGTTCCTGGAGTGGCCCATGGAGCCTTGATACAG
    GTTCAGAACGACATCAACCCTCGTGTGCTCCACTTCCTCCTCCTAGTGTACTATCGTCCG
    CCCCTGCCTCCGCCACCTGCGTTGGCAACCCTGGCACATGGGTCCCCTTCCATCTGCCCT
    CCCGATTATGGCTCCACTACTGCTAATCTTTCCGATCACGAGACGACTGGTCCTCCTCCC
    CACGACATTGTTGCTGCTCATTCTACGCTCGCCGCTGGCCTCGCGGTCGCTAATGAGCGA
    GTCGTGAACCTCACTTGGGAGCAAGAAGGCTTTCTGGGTGCCTTGTTCGGTGTGGTATCT
    GGTGCCACACCGGACACAACAGTTGAAAGAATAGCTCCTGGATCATTGCCGTCATCCATG
    AAAGAACCTGTCGTGCTAGTTGTAAACAAGGCAGATGGAGATGAAGAGGTCAAAGCTGCT
    GGGGATAACATAGTGGGCGTTGTTCTTCTACAAGAATTACCTCACCTATCACATCTTGGT
    GTTAGAGCTCGTCAAGAGAAAGTTGTATTTGTAACTTGTGAAGACGATGACACGATTAAA
    AACATGAGGTTACTTGAGGGAAAACATGTCAGGTTGGGTGCATCATCCAATAATGTTGAT
    CTTTCAGTTGTATCAAACAAAGATGACTGTGCTGCCATGTCTTCAGAGCCATCCGCTGGA
    GGAGATCTATTTGCACAGCAATTCTCACTATTGACTACAGACAAGAAGTTAGAATTGTCT
    GAGCAAAAGAGCTATACTTCCGTAGCAAATGGTATGTCTGGTGTGCTCGAGCTTTCAGAG
    GCATCGATTGAATCATCTGGTGCAAAAGCTGCTGCATGTGGAACTCTTTCGGTTCTTTCC
    TCAATGTCAAATAAAGTTTACAATGACCAAGGAACTCCAGCAGCATTCAGAGTTCCTGCA
    GGTGCTGTGATTCCTTTTGGATCCATGGAGGACGCCTTGAAGAAAAGTGGATCACTCAAA
    TCCTACACTAATCTTCTAGAGAGGATCGAAACAGCTCAAATTGAAAATGGTGAACTTGAT
    AGCCTATCCTCGAAGCTTCAAGCAACTGTTTCACTTCTTTCTCCATCGGAAGAAATCATT
    GAATCTTTGAAGAAAACATTTGATCAGAATGTTCGACTAATTGTGAGATCTACTGCTAAT
    GTGGAGGATTTGGCTGGAATGTCTGCTGCTGGACTCTATGAATCAATTCCTAATGTCAGT
    CTCTCTGACCCTAGATCCTTTGGAGCTGCGGTTGGACAGGTTTGGGCATCATTATACACA
    AGGAGAGCAATCCTTAGCCGTCGAGCAGCCGGTGTACCTCAGAGAGACGCAAAGATGGCG
    GTTCTCGTGCAGGAAATGCTGCAGCCAGATCTCTCTTTTGTGCTTCATACAATTAGCCCA
    GTTGACCATGATCCCAAATTAGTAGAGGCTGAGGTTGCCCCTGGTCTTGGAGAAACCCTC
    GCTTCTGGTACCCGTGGCACTCCATGGAGACTCTCATGTCACAAGCTTGATGGCAAAGTC
    ACAACTCTAGCTTTTGCGAATTTCAGTGAAGAGCTTATGGTGCTCAACTCTGGCCCTACC
    GATGGTGAAATGAGTCGTAGGACCGTAGACTACAGCAAGAAGCCGTTATCTGTTGATGCA
    ACCTTTAGGGAGCAGTTTGGTCAGCGCCTTGCCGCCATTGGACAGTATCTGGAACAGAAG
    TTTGGAAGTGCCCAAGATGTGGAAGGTTGCCTGGTTGGCCCCGATATATTTATAGTTCAG
    AGCAGGCCACAACCACAATAA
    >FGENESH:   4  13 exon (s)  42567  -  51377   666 aa, chain +
    MAPLRSRGSSVARAAREVLGKALGIPENSVRTYTEAEIRASVIFQVSKLCTVLLKATRAV
    LGSSVWDVLVPGVAHGALIQVQNDINPRVLHFLLLVYYRPPLPPPPALATLAHGSPSICP
    PDYGSTTANLSDHETTGPPPHDIVAAHSTLAAGLAVANERVVNLTWEQEGFLGALFGVVS
    GATPDTTVERIAPGSLPSSMKEPVVLVVNKADGDEEVKAAGDNIVGVVLLQELPHLSHLG
    VRARQEKVVFVTCEDDDTIKNMRLLEGKHVRLGASSNNVDLSVVSNKDDCAAMSSEPSAG
    GDLFAQQFSLLTTDKKLELSEQKSYTSVANGMSGVLELSEASIESSGAKAAACGTLSVLS
    SMSNKVYNDQGTPAAFRVPAGAVIPFGSMEDALKKSGSLKSYTNLLERIETAQIENGELD
    SLSSKLQATVSLLSPSEEIIESLKKTFDQNVRLIVRSTANVEDLAGMSAAGLYESIPNVS
    LSDPRSFGAAVGQVWASLYTRRAILSRRAAGVPQRDAKMAVLVQEMLQPDLSFVLHTISP
    VDHDPKLVEAEVAPGLGETLASGTRGTPWRLSCHKLDGKVTTLAFANFSEELMVLNSGPT
    DGEMSRRTVDYSKKPLSVDATFREQFGQRLAAIGQYLEQKFGSAQDVEGCLVGPDIFIVQ
    SRPQPQ
    >FGENESH:[mRNA]   5   2 exon (s) 281323  - 282255   660 bp, chain +
    ATGGTGTCGCAACTGGGGCTATACAGGCGCGACGAAGAGACGAACGCCAGGGAGGCTGAC
    GACGCGGACGAGCTGGAGCCGGGAGCTGGGGCTGGGCACCGAGCTGAAACAGGGAGGAGC
    GCCGGCGAGCTGAGGGAGCGAGCTAGGGAGCTCCGCGAGCTGGAGACCGATGCGCGCAGC
    AAGGAGAAGCGCGCGCAGAGAAGAGAGGAGCAGGGGCACGGCAAGGAAACTCTACGGCGA
    GGAAGGAGCAGAGGTCGGGGGCGAGCTCAGAGGAAGACAGCCACCACAGGAAAGGGAGCT
    CGGCGGCCAGGAGAGATGCGCGCGTGGGAACGAAGAACAGGGGCGCGCGCCAGGGAACTC
    AGCCGGGCGCCGGAGCTGGAGGCCGAGCGAGAATACGCGGCTGCGTGCCATGGGCGAGAG
    AGCACCGAGCAGAGGGAGCTGGGCGCGCCATGGGAGGAAAGTTGGAGCGCCGGCAGAGGG
    CAGAAGGAGGGAGATGAAGCCGAGGGGGCTGCGCAGCGGCCAGGCGAGCTGGGCGAGACC
    TCGGCTGGGCGGACCATGGCGGACAGAAACAACCAGGCAGCTCGACGTGCTCAGCTGCCG
    TGGGAGATAAGACCGGCGGCAAGCCGGCAAGGCAGGAAATATCCCAAGCGGAGGAGATAA
    >FGENESH:   5   2 exon (s) 281323  - 282255   219 aa, chain +
    MVSQLGLYRRDEETNAREADDADELEPGAGAGHRAETGRSAGELRERARELRELETDARS
    KEKRAQRREEQGHGKETLRRGRSRGRGRAQRKTATTGKGARRPGEMRAWERRTGARAREL
    SRAPELEAEREYAAACHGRESTEQRELGAPWEESWSAGRGQKEGDEAEGAAQRPGELGET
    SAGRTMADRNNQAARRAQLPWEIRPAASRQGRKYPKRRR
    >FGENESH:[mRNA]   6  17 exon (s) 343050  - 359024 10902 bp, chain -
    ATGGCGGCTGCGGCTGCGGCGATGGCCGCACATAGAGCCAGCTTCCCGCTGCGCCTGCAG
    CAGATCCTTGCTGGCAGCCGCGCCGTCTCACCTGCGATCAAGATCGAGTCTGAGCCGGTT
    CAAGATAGGCAGCCACTAGTCACCAGTGAGGGGTCTGTTGCAGATGGAAATGGTGCTGGC
    ATTTCCTTACCGGTGGAAATGCCTGCAAACATCAAAGCATTTATTGATCGAGTAGTCAAT
    ATTCCACTACATGACATTGCCATACCGCTATCAGGCTTCTGTTGGGAGTTCAATAAGGGA
    AATTTCCACCATTGGAGGCCTTTATTTATTCATTTTGACACCTATTTCAAGACATACATT
    TCTTCTCGGAAGGATCTTCTTTTATCTGATGATATGACCGAGGCTGACCCTATGCCAAAG
    AATGCCATCTTGAAAATCTTAAGAGTTATGCAGATTATTTTGGAGAACTGCCAGAATAGA
    AGTTCTTTTACTGGTCTTGCGCATCTTAAGCTTCTCCTGGCATCGTCAGACCCTGAGATA
    GTTGTAGCAGCTTTGGAAACTCTTGTTGCCTTGGTCAAAATAAATCCTTCAAAGTTGCAT
    ATGAATGGAAAGCTAATTAGCTGCGGACCTATAAACACCCATCTTCTATCATTGGCCCAA
    GGATGGGGCAGCAAGGAGGAAGGTCTTGGCATATATTCTTGTGTTGTGGCAAATGAAGGA
    AACCATCAGGGGGGTTTGTCCTTGTTCCCTGTAGACTTGGAGAACAAATATGGTGGCACA
    CAACATCGTCTTGGTTCAACTCTTCATTTTGAGTATAATTTGGGTCCTGCACAGTATCCT
    GGCCAAACTAGTGACAAGGGCAAGTCATCTAATTTGTGTGTGATACATATTCCAGACATG
    CATCTTCAGAAAGAGGATGACTTGAGCATACTGAAGCAATGTGTAGACAAGTTCAATGTG
    CCCCCAGAGCACAGGTTTGCATTGCTTACAAGGATAAGATATGCCCGTGCTTTCAACTCT
    GCACGGACTTGTAGGATCTATAGCCGCATAAGTCTTCTTTCGTTCATTGTTCTTGTGCAA
    TCGAGTGATGCTCATGATGAACTCACATATTTCTTTACAAATGAACCAGAGTACATAAAC
    GAGTTAATCAGGCTTGTTCGGTCAGAGGATTCTGTTCCTGGGTCCATTCGTGCTCTGGCT
    ATGCTAGCACTGGGAGCACAGCTGGCAGCATATGCATCATCTCATGAGCGTGCTAGGATA
    TTGAGTGGCTCAAGTATTATCTCTGCTGGTGGCAATCGTATGGTCTTGCTCAGTGTCCTG
    CAGAAAGCAATATCCTCACTCAATAGCCTGAATGATACCTCATCTCCTCTAATTGTTGAT
    GCCCTTCTTCAGTTTTTTCTGCTCCATGTACTGTCTTCATCAAGTTCAGGGACTACGGTT
    AGAGGCTCAGGGATGGTTCCTCCTCTCTTGCCCTTGCTGCGAGATAATGATCCTTCTCAT
    ATGCATCTTGTCTGCTTGGCAGTGAAGACTCTCCAAAAGCTTATGGAGTACAGCAGCCCT
    GCTGTTTCTCTGTTTAAAGACTTGGGTGGTGTAGAGCTTTTGTCTCAGAGGTTACATGTA
    GAAGTTCAGCGTGTTATTGGTACTGCTGATGGACATAATTCAATGGTTACTGATGCAGTG
    AAATCAGATGATAACCACATGTACTCTCAGAAGCGATTGATTAAAGCTTTGCTCAAGGCT
    TTGGGGTCTGCTACTTATTCCCCTGGGAATCCTGCTCGTTCTCAAAGCTCCCAAGATAAC
    TCATTGCCTGTCTCACTTTCCCTTATATTTCAGAATGTAGATAAGTTTGGGGGTGACATT
    TATTTTTCAGCAGTTACTGTTATGAGCGAAATAATCCATAAAGATCCTACATGCTTTATT
    ACTTTGAAGGAACTCGGTGTTCCAGATGCTTTTATCTCATCAGTGACTGCTGGGGTAATA
    CCATCTTGTAAAGCCCTTATCTGTGTTCCTAATGGCCTGGGTGCAATATGCCTCAATAAT
    CAAGGACTTGAGGCTGTCAGGGAAACTTCAGCACTGCGATTTCTTGTAGATACCTTCACT
    AGCAGGAAATACTTGATACCAATGAATGAGGGTGTGGTCCTTCTAGCTAATGCAGTGGAA
    GAGCTTCTTCGTCATGTGCAATCCCTGAGAAGTATTGGTGTTGACATCATTATTGAAATA
    ATCAATAAACTAAGTTCGTCTCAGGAATATAAAAACAATGAAACAGCCACCTTGCAAGAA
    AAAACAGATATGGAAACTGATGTTGAAGGGCGTGATTTGGTAAGTGCTATGGACTCAAGC
    GTGGATGGAAGTAATGATGAACAGTTTTCTCATCTGAGCATTTTCCATGTGATGGTATTG
    GTGCACCGGACAATGGAAAACTCGGAAACCTGCCGGTTATTTGTAGAAAAGGGAGGGCTA
    CATGCTCTTTTGACACTCTTGTTGAGGCCTAGCATCACACAGTCATCTGGAGGGATGCCT
    ATTGCTTTGCATAGCACGATGGTGTTCAAGGGTTTTACTCAGCATCACTCTACTCCACTT
    GCACGTGCATTTTGCTCATCTTTAAAGGAGCATTTGAAGAGTGCATTGAAGGAACTTGAT
    AAAGTTTCCAACTCATTTGATATGACCAAGATTGAAAAAGGAGCGATCCCATCTTTGTTT
    GTTGTTGAATTTTTGCTATTCCTAGCTGCCTCAAAAGATAATCGTTGGATGAATGCTTTG
    CTCTCAGAATTTGGCGATGCCAGCAGGGAGGTCCTGGAAGATGTTGGGCAAGTTCACCGG
    GAAGTGCTTTGGAAAATTTCGCTTTTTGAAAAAAACAAGATTGTCGCTGAAACCAGTTCA
    TCTTCTTCCACATCTGAAGCTCAGCAGCCTGACATGTCTGCCAGCGACATTGGTGATAGC
    AGATACACTTCTTTCAGGCAATATCTTGATCCAATTTTAAGGCGAAGGGGTTCTGGGTGG
    AACATAGAGTCTCAAGTGTCTGACCTCATTAATATGTATCGAGACATTGGCCGTGCAGCT
    AGTGACTCGCAGAGGGTTGGTTCTGATAGATATTCTAGTCTAGGTTTGCCCTCAAGTTCC
    CAGGACCAGTTTTCCAGTTCATCTGATGCAAATGCCAGCACAAGGTCAGAAGAGGACAAG
    AAAAAATCTGAGCATTCCTCCTGCTTTGACATGATGAGATCTCTCTCCTACCATATCAAC
    CATCTTTTTTTGGAGCTTGGTAAAGCAATGCTTTTTGCATCTCGTCGGGAGAATAGTCCT
    GTAAACCTATCCCCAGCCGTTATATCTGTTGCTAACAATATTGCTTCTATTGTGTTGGAG
    CACCTCAACTTTGAAGGTCATTCGGTAAGTTTTGAGAGGGACATGACTGTTACCACAAAA
    TGCAGATATCTTGGAAAGGTGGTTGAATTTGTTGATGGGATGTTATTAGACCGGCCAGAA
    TCATGTAATTCGATTATGGTGAATTCATTTTATTGCCGTGGTGTCATTCAGGCCATCCTG
    ACCACATTTCAGGCCACCAGTGAGTTGCTCTTCACAATGAGCAGGCCACCGTCTTCACCT
    ATGGAGACGGATAGTAAAACCGGAAAGGATGGAAAGGAGATGGATTCTTCATGGATCTAT
    GGTCCCCTCACCAGTTATGGGGCAATTATGGATCATCTTGTGACCTCATCTTTTATTCTC
    TCTTCATCAACTAGACAATTACTTGAGCAACCTATTTTTAATGGATCTGTAAGGTTCCCT
    CAAGATGCTGAGACGTTCATGAAGTTACTTCAGTCAAAGGTGTTGAAGACAGTTCTTCCC
    ATCTGGGCCCATCCCCAGTTCCCAGAATGCAACATCGAGCTGATTAGTTCAGTCATGTCC
    ATCATGAGGCATGTTTGCTCTGGGGTGGAAGTGAAAGACACTGTTGGCAATGGTGGTGCT
    CGTTTGGCTGGTCCACCTCCTGATGAGAGTGCAATTTCACTTATTGTAGAGATGGGTTTT
    TCTCGTGCCAGGGCTGAAGAAGCATTAAGACAAGTTGGAACAAACAGTGTTGAAATAGCA
    ACTGATTGGTTATTCGCACACCCAGAGGAACCACAAGAGGAGGATGATGAACTTGCTCGG
    GCGCTTGCAATGTCTCTTGGGAATTCTGTTACCCCTGCACAAGAGGGAGATAGTAGGTCC
    AATGATCTTGAGCTGGAAGAAGCAACAGTTCAGCCGCCTCCTATAGATGAAATGTTACGT
    TCATGTCTTCAGCTACTTCAGAGAAAGGAAGCATTAGCTTTCTCTGTCAGGGATATGCTT
    GTTACTATCAGCTCACAGAATGATGGCCAAAACCGAGTAAAGGTGCTTACATATTTGATT
    GATAATCTGAAACAATGTGTGGTGGCATCTGAGCCTTCAAATGATACTGCATTGTCTGCT
    CTTTTACATGTTCTTGCTTTGATTCTTCATGGAGATACTGCTGCCCGTGAAGTTGCCTCA
    AAGGCAGGACTTGTGAAGGTTGCTTTGGATCTGCTGTGCAGCTGGGAGGTGCAAATTAGG
    GAAAGTAGCATGATTGAGGTTCCTAATTGGGTAATTTCCTGTTTCCTTTCTGTTGATCAG
    ATGCTCCAGTTGGAACCAAAGTTGCCAGATGTTACAGAGCTACATGTCCTGAAAAGGGAT
    AACTCAAATATAAAAACATCACTTGTGATTGATGACAGCAAGAGAAAAGATTCAGAATCC
    TTGCCAAATGTTGGGTTACTAGACATGGAGGACCAGTTTCAACTTTTGAAGATCTGTTGT
    AAGTGCATAGGAAAACAGTTACCTTCGGCTTCCATGCATGCTATTCTCCAGCTCTCTGCA
    ACACTGACAAAAGTCCATGCAGCTGCTATTTGTTTCCTTGAATCTGGTGGTTTGAATGCA
    TTACTATCACTGCCGACAAGTAGTTTGTTTTCTGGGTTCAACAATATGGCTTCTACAATA
    ATTCGTCACATTTTGGAGGACCCTCACACCCTTCAGCAAGCAATGGAATTAGAGATACGT
    CACAGTCTTGTTACAGCTGCTAATCGACATGCAAATCCCAGGGTTACACCACGCAATTTT
    ATCCAGAATTTGGCTTTTGTCGTGTATAGGGACCCAGTGATATTTATGAAAGCTGCCCAG
    TCTGTGTGCCAGATTGAGATGGTTGGAGATAGGCCATATGTTGTTCTGTTAAAGGATCGT
    GAGAAGGAAAGGATCAAGGAGAAAGATAAGGACAAGTCAGTTGATAAGGATAAAGCAACA
    GTCGCAGTCACTAAGGTAGTTTCTGGGGACACAGCTGCAGGATCTCCTGCAAATTCTCAT
    GGAAAACAGTCTGATTTGAATTCAAGAAACGTGAAATCTCACCGCAAGCCACCACAAAGT
    TTTGTCACTGTTATAGAGCATCTCTTAGATCTGCTGATGTCCTTTGTTCCGCCACCCCGT
    CCAGAGGATCAAGTTGATGTCTCTGGTACTGCCTTATCTTCAGACATGGATATTGACTGC
    AGTTCTGCAAAAGGGAAAGGGAAAGCTGTGTCTGTCCCACCTGAAGAGAGCAAGCATGCT
    ATCCAAGAGTCTACTGCATCCCTTGCCAAAACTGCATTTTTCCTTAAGTTGCTGACAGAT
    GTATTGTTAACTTACGCATCATCCATTCATGTTGTTCTTCGTCATGATGCTGAGCTGAGT
    AACATGCACGGTCCAAATCGAACAAGTGCCAGACTAACTAGTGGTGGAATCTTTAACCAT
    ATCCTGCAGCATTTCCTTCCTCATGCTACAAGGCAAAAGAAAGAGAGAAAAAATGATGGA
    GATTGGATGTATAAATTGGCAACAAGGGCTAATCAGTTCTTGGTGGCTTCATCTATTCGT
    TCTGCAGAGGCCCGAAAAAGGATTTTCTCTGAAATCTGCAGTATATTTCTTGATTTCACT
    GACTCGTCTGCAGGGTACAATGCCCCAGTACCAAGGATGAATGTGTATGTCGATTTACTG
    AATGATATTCTGTCAGCCCGATCACCAACTGGCTCCTCGCTGTCAGCAGAATCTGCAGTT
    ATATTTGTTGAAGCTGGTCTTGTTCATTCATTATCGACAATGCTTCAAGTTCTTGATTTG
    GATCACCCTGATTCTGCAAAGATTGTTACTGCTGTTGTTAAGGCCTTGGAACTCGTCAGT
    AAGGAACATATTCATTCAGCAGATAATGCGAAAGGAGTGAACTCGTCAAAGATTGCTTCT
    GACAGCAACAACGTGAATTCATCATCTAATAGATTCCAGGCTCTTGACATGACTTCTCAG
    CCCACAGAGATGGTTACTGATCACAGGGAAACGTTTAATGCTGTCCGAACTTCACAAATA
    TCAGATTCAGTGGCAGATGAGATGGATCATGATCGTGATATGGATGGCGGCTTTGCTCGT
    GATGGTGAAGATGATTTTATGCATGAGATGGCTGAAGATGGAACTGGTGATGGATCTACA
    ATGGAAATTAGAATTGAAATTCCACGCAATAGAGAGGATGATATGGCCCCTGCTGCTGAT
    GACACTGATGAAGATATATCTGCCGAGGATGGTGAAGATGATGAAGACGAAGATGAGGAG
    AACAACAATCTGGAGGAAGATGATGCCCATCGGATGTCTCATCCTGACACAGATCAGGAA
    GACCGTGAAATGGATGAAGAGGAATTTGATGAGGATCTGCTAGAAGAAGACGATGAAGAT
    GAGGATGAGGAAGGAGTCATCCTTCGCCTGGAGGAGGGGATTAATGGCATTAATGTGTTG
    GACCATGTTGAAGTATTTGGTGGGAGCAACAATTTGTCTGGGGATACGCTGCGTGTGATG
    CCTCTAGACATTTTTGGTACAAGAAGGCAAGGCCGAAGTACCTCCATATACAATCTTCTT
    GGTAGGGCTAGTGATCATGGTGTTCTTGATCATCCACTTCTCGAAGAGCCTTCTTCAACA
    ACGAATTTTTCAGACCAAGGGCATCCAGAGAATCTTGTTGAGATGGCTTTCTCTGATCGA
    AACCATGAAAGTAGCTCCTCTCGTTTGGATGCTATATTCCGCAGCTTGCGAAGTGGACGT
    AATGGACACCGCTTTAATATGTGGCTAGATGACGGTCCACAACGTAATGGATCTGCTGCC
    CCTGCAGTACCTGAAGGCATTGAAGAATTGCTGATCTCTCATCTGAGACGGCCCACTCCT
    CAACCTGATGGCCAGAGAACACCTGTTGGTGGTGCTCAAGAAAATGACCAGCCCAACCAT
    GGATCAGATGCTGAGGCAAGGGAAGTAGCACCAGCTCAGCAAAATGAGAACAGTGAAAGT
    ACACTGAATCCTCTAGATCTGTCAGAATGTGCTGGCCCTGCACCTCCTGATAGTGATGCA
    CTTCAAAGAGATGTGTCCAATGCAAGTGAGCTTGCTACAGAGATGCAGTATGAACGTAGT
    GATGCTATAACACGTGACGTTGAAGCAGTGAGCCAAGCAAGCAGTGGCAGTGGTGCTACT
    TTAGGGGAGAGCCTCAGAAGTTTAGAGGTTGAAATAGGAAGTGTTGAAGGGCATGATGAT
    GGTGACCGGCATGGAACTTCAGGCACCTCAGAACGTTTACCTTTGGGTGATATTCAAGCA
    GCTGCTCGATCACGGAGGCCATCTGGAAATGCTGTGCCAGTAAGCAGTAGAGATATGTCT
    CTGGAGAGTGTTAGTGAGGTTCCTCAAAATCCAGACCAAGAACCTGATCAGAATGCTAGT
    GAGGGGAATCAGGAGCCTACCAGAGCGGCAGGTGCTGACTCAATTGATCCTACATTTTTG
    GAGGCTCTTCCTGAGGACTTGCGAGCTGAAGTTCTTTCTTCACGTCAAAATCAAGTAACT
    CAGACTTCTAATGACCAACCTCAGGATGATGGTGATATTGATCCTGAATTCCTTGCTGCA
    CTTCCCCCTGATATAAGAGAGGAGGTGCTAGCTCAGCAACGTACTCAAAGGATGCAACAA
    CAATCACAAGAATTAGAAGGACAACCTGTTGAAATGGATGCTGTTTCAATTATTGCAACA
    TTCCCTTCAGAAATCCGGGAAGAGGTGCTTTTGACATCTCCTGACACACTACTAGCCACA
    CTCACGCCTGCACTAGTTGCTGAGGCCAACATGTTAAGGGAGAGATTTGCTCATCGGTAT
    CACAGTAGCTCCCTTTTTGGTATGAACTCCAGGAACAGGAGAGGTGAGTCCTCCCGACGT
    GACATAATGGCAGCAGGACTTGATAGAAACACAGGTGATCCCTCCAGATCAACTAGCAAG
    CCAATTGAGACTGAAGGTGCTCCTCTTGTCGATGAAGATGGCCTTAAAGCTCTTATTCGG
    TTACTTCGAGTTGTTCAGGGGTTTTCAAAGAAATCAATAGATGCACCTGAGCCACCATTT
    AGGTTGTATGGTTGCCATGCAAATATCGCATATTCACGCCCTCAATCTTCGGATGGTCTT
    CCTCCATTAGTCTCACGTCGTGTGCTCGAAACGCTCACGAACTTGGCAAGAAGCCATCCA
    AATGTTGCTAAACTCTTGCTCTTTCTTGAATTCCCTTGCCCATCAAGATGTTTCCCCGAA
    GCACATGATCATAGGCATGGCAAGGCTGTTCTCCTAGATGATGGTGAAGAACAGAAAACC
    TTTGCACTTGTTCTTCTGTTAAATCTGTTGGATCAGCCACTTTATATGAGAAGTGTAGCT
    CATCTTGAACAGTTACTAAACCTTCTTGACGTTGTCATGCACAATGCTGAGAATGAAATT
    AAGCAAGCTAAGCTAGAAGCATCATCTGAGAAGCCATCTGCACCTGATAATGCAGTGCAA
    GATGGTAAAAATAACTCCGACATTTCTGTATCATATGGATCAGAGTTGAATCCTGAGGAT
    GGCAGCAAAGCCCCTGCTGTTGACAACAGAAGTAACTTGCAAGCTGTTTTGCGAAGTCTT
    CCTCAACCAGAGCTTAGATTGCTATGTTCATTGCTTGCGCATGATGGATTGTCGGACAGC
    GCTTATCTCCTTGTAGGTGAAGTGTTGAAAAAGATTGTGGCTCTTGCCCCTTTCTTCTGT
    TGCCATTTCATAAATGAGCTTGCACGTTCAATGCAAAACTTGACACTTCGTGCAATGAAG
    GAGCTTCATTTGTACGAGAATTCTGAAAAGGCTCTTCTTAGCTCTTCATCAGCCAATGGT
    ACTGCAGTTCTTAGAGTAGTGCAGGCTTTGAGCTCTCTTGTCAATACTCTTCAAGAGAGA
    AAGGATCCAGAACAGCCTGCTGAAAAGGACCATTCGGACGCAGTTTCCCAGATTTCTGAA
    ATTAACACAGCATTGGATTCTCTGTGGTTGGAATTGAGCAACTGCATAAGCAAGATAGAA
    AGCTCTTCAGAGTATGCATCAAATCTAAGTCCTGCTTCTGCAAGTGCAGCTATGTTAACA
    ACAGGAGTAGCACCTCCATTGCCTGCAGGAACTCAAAACTTATTACCATATATAGAATCA
    TTTTTTGTGACATGTGAGAAGTTGCGTCCTGGGCAACCTGATGCTGTACAAGATGCTTCA
    ACTTCTGATATGGAAGATGCTTCGACATCTAGTGGTGGGCAGAGATCATCTGCTTGTCAA
    GCTAGTCTTGATGAAAAGCAAAATGCTTTTGTGAAATTCTCAGAGAAGCACAGAAGATTG
    CTGAATGCATTCATCCGTCAAAACTCTGGATTGCTGGAAAAGTCATTCTCGCTGATGTTG
    AAAATTCCTCGCCTGATTGATTTTGACAATAAACGTGCATATTTTCGGTCTAAAATCAAG
    CATCAGTATGATCATCATCATCATAGCCCTGTTAGAATTTCTGTGCGTCGTCCATATATT
    CTTGAGGACTCCTACAATCAGCTTAGGATGCGATCTCCTCAGGATTTGAAGGGTAGGCTC
    ACTGTCCAATTTCAAGGAGAAGAAGGAATTGATGCTGGTGGGCTGACAAGGGAATGGTAC
    CAGTCGATATCACGAGTCATTGTTGACAAGAGTGCCCTTCTGTTTACGACAGTCGGGAAT
    GATTTGACGTTTCAACCGAATCCTAATTCTGTGTATCAAACTGAGCACCTCTCATACTTC
    AAATTTGTTGGACGAGTGGTTGGCAAAGCTCTCTTCGATGGTCAGCTTTTAGATGCCCAC
    TTCACTAGATCTTTCTACAAACACATACTTGGTGTCAAGGTTACATACCATGACATTGAA
    GCCATCGATCCTTCGTACTACAAAAACTTGAAATGGATGCTTGAGAATGACATAAGTGAT
    GTTTTGGATCTCACCTTTAGCATGGATGCAGACGAAGAGAAACTGATATTGTACGAGAAG
    GCAGAGGTCACTGATTGTGAGTTGATTCCTGGAGGGCGTAACATCAGGGTTACTGAGGAG
    AATAAGCATGAATATGTCGACCGGGTAGCAGAGCATAGATTGACCACTGCCATCAGGCCT
    CAGATTAATGCCTTCCTGGAAGGGTTTAATGAACTCATTCCACGGGAGCTGATATCTATC
    TTCAACGACAAAGAGCTTGAGCTTCTGATTAGTGGTCTTCCAGATATTGACTTGGACGAT
    TTGAAAACAAATACAGAATATTCTGGCTACAGCATAGCCTCTCCAGTCGTCCAGTGGTTC
    TGGGAAATCGTTCAGGGGTTCAGCAAGGAAGACAAAGCTCGGTTCCTCCAGTTCGTCACT
    GGCACTTCAAAGGTACCGTTGGAAGGTTTTAGTGAACTTCAAGGGATCTCTGGGCCACAA
    AGGTTTCAGATACACAAGGCATACGGCAGCACGAACCATCTGCCTTCAGCTCATACTTGC
    TTTAACCAACTAGACTTGCCTGAGTACACGTCCAAAGAGCAGCTCCAGGAGAGGTTACTG
    TTGGCAATCCACGAAGCAAATGAAGGTTTCGGATTTGGCTAA
    >FGENESH:   6  17 exon (s) 343050  - 359024  3633 aa, chain -
    MAAAAAAMAAHRASFPLRLQQILAGSRAVSPAIKIESEPVQDRQPLVTSEGSVADGNGAG
    ISLPVEMPANIKAFIDRVVNIPLHDIAIPLSGFCWEFNKGNFHHWRPLFIHFDTYFKTYI
    SSRKDLLLSDDMTEADPMPKNAILKILRVMQIILENCQNRSSFTGLAHLKLLLASSDPEI
    VVAALETLVALVKINPSKLHMNGKLISCGPINTHLLSLAQGWGSKEEGLGIYSCVVANEG
    NHQGGLSLFPVDLENKYGGTQHRLGSTLHFEYNLGPAQYPGQTSDKGKSSNLCVIHIPDM
    HLQKEDDLSILKQCVDKFNVPPEHRFALLTRIRYARAFNSARTCRIYSRISLLSFIVLVQ
    SSDAHDELTYFFTNEPEYINELIRLVRSEDSVPGSIRALAMLALGAQLAAYASSHERARI
    LSGSSIISAGGNRMVLLSVLQKAISSLNSLNDTSSPLIVDALLQFFLLHVLSSSSSGTTV
    RGSGMVPPLLPLLRDNDPSHMHLVCLAVKTLQKLMEYSSPAVSLFKDLGGVELLSQRLHV
    EVQRVIGTADGHNSMVTDAVKSDDNHMYSQKRLIKALLKALGSATYSPGNPARSQSSQDN
    SLPVSLSLIFQNVDKFGGDIYFSAVTVMSEIIHKDPTCFITLKELGVPDAFISSVTAGVI
    PSCKALICVPNGLGAICLNNQGLEAVRETSALRFLVDTFTSRKYLIPMNEGVVLLANAVE
    ELLRHVQSLRSIGVDIIIEIINKLSSSQEYKNNETATLQEKTDMETDVEGRDLVSAMDSS
    VDGSNDEQFSHLSIFHVMVLVHRTMENSETCRLFVEKGGLHALLTLLLRPSITQSSGGMP
    IALHSTMVFKGFTQHHSTPLARAFCSSLKEHLKSALKELDKVSNSFDMTKIEKGAIPSLF
    VVEFLLFLAASKDNRWMNALLSEFGDASREVLEDVGQVHREVLWKISLFEKNKIVAETSS
    SSSTSEAQQPDMSASDIGDSRYTSFRQYLDPILRRRGSGWNIESQVSDLINMYRDIGRAA
    SDSQRVGSDRYSSLGLPSSSQDQFSSSSDANASTRSEEDKKKSEHSSCFDMMRSLSYHIN
    HLFLELGKAMLFASRRENSPVNLSPAVISVANNIASIVLEHLNFEGHSVSFERDMTVTTK
    CRYLGKVVEFVDGMLLDRPESCNSIMVNSFYCRGVIQAILTTFQATSELLFTMSRPPSSP
    METDSKTGKDGKEMDSSWIYGPLTSYGAIMDHLVTSSFILSSSTRQLLEQPIFNGSVRFP
    QDAETFMKLLQSKVLKTVLPIWAHPQFPECNIELISSVMSIMRHVCSGVEVKDTVGNGGA
    RLAGPPPDESAISLIVEMGFSRARAEEALRQVGTNSVEIATDWLFAHPEEPQEEDDELAR
    ALAMSLGNSVTPAQEGDSRSNDLELEEATVQPPPIDEMLRSCLQLLQRKEALAFSVRDML
    VTISSQNDGQNRVKVLTYLIDNLKQCVVASEPSNDTALSALLHVLALILHGDTAAREVAS
    KAGLVKVALDLLCSWEVQIRESSMIEVPNWVISCFLSVDQMLQLEPKLPDVTELHVLKRD
    NSNIKTSLVIDDSKRKDSESLPNVGLLDMEDQFQLLKICCKCIGKQLPSASMHAILQLSA
    TLTKVHAAAICFLESGGLNALLSLPTSSLFSGFNNMASTIIRHILEDPHTLQQAMELEIR
    HSLVTAANRHANPRVTPRNFIQNLAFVVYRDPVIFMKAAQSVCQIEMVGDRPYVVLLKDR
    EKERIKEKDKDKSVDKDKATVAVTKVVSGDTAAGSPANSHGKQSDLNSRNVKSHRKPPQS
    FVTVIEHLLDLLMSFVPPPRPEDQVDVSGTALSSDMDIDCSSAKGKGKAVSVPPEESKHA
    IQESTASLAKTAFFLKLLTDVLLTYASSIHVVLRHDAELSNMHGPNRTSARLTSGGIFNH
    ILQHFLPHATRQKKERKNDGDWMYKLATRANQFLVASSIRSAEARKRIFSEICSIFLDFT
    DSSAGYNAPVPRMNVYVDLLNDILSARSPTGSSLSAESAVIFVEAGLVHSLSTMLQVLDL
    DHPDSAKIVTAVVKALELVSKEHIHSADNAKGVNSSKIASDSNNVNSSSNRFQALDMTSQ
    PTEMVTDHRETFNAVRTSQISDSVADEMDHDRDMDGGFARDGEDDFMHEMAEDGTGDGST
    MEIRIEIPRNREDDMAPAADDTDEDISAEDGEDDEDEDEENNNLEEDDAHRMSHPDTDQE
    DREMDEEEFDEDLLEEDDEDEDEEGVILRLEEGINGINVLDHVEVFGGSNNLSGDTLRVM
    PLDIFGTRRQGRSTSIYNLLGRASDHGVLDHPLLEEPSSTTNFSDQGHPENLVEMAFSDR
    NHESSSSRLDAIFRSLRSGRNGHRFNMWLDDGPQRNGSAAPAVPEGIEELLISHLRRPTP
    QPDGQRTPVGGAQENDQPNHGSDAEAREVAPAQQNENSESTLNPLDLSECAGPAPPDSDA
    LQRDVSNASELATEMQYERSDAITRDVEAVSQASSGSGATLGESLRSLEVEIGSVEGHDD
    GDRHGTSGTSERLPLGDIQAAARSRRPSGNAVPVSSRDMSLESVSEVPQNPDQEPDQNAS
    EGNQEPTRAAGADSIDPTFLEALPEDLRAEVLSSRQNQVTQTSNDQPQDDGDIDPEFLAA
    LPPDIREEVLAQQRTQRMQQQSQELEGQPVEMDAVSIIATFPSEIREEVLLTSPDTLLAT
    LTPALVAEANMLRERFAHRYHSSSLFGMNSRNRRGESSRRDIMAAGLDRNTGDPSRSTSK
    PIETEGAPLVDEDGLKALIRLLRVVQGFSKKSIDAPEPPFRLYGCHANIAYSRPQSSDGL
    PPLVSRRVLETLTNLARSHPNVAKLLLFLEFPCPSRCFPEAHDHRHGKAVLLDDGEEQKT
    FALVLLLNLLDQPLYMRSVAHLEQLLNLLDVVMHNAENEIKQAKLEASSEKPSAPDNAVQ
    DGKNNSDISVSYGSELNPEDGSKAPAVDNRSNLQAVLRSLPQPELRLLCSLLAHDGLSDS
    AYLLVGEVLKKIVALAPFFCCHFINELARSMQNLTLRAMKELHLYENSEKALLSSSSANG
    TAVLRVVQALSSLVNTLQERKDPEQPAEKDHSDAVSQISEINTALDSLWLELSNCISKIE
    SSSEYASNLSPASASAAMLTTGVAPPLPAGTQNLLPYIESFFVTCEKLRPGQPDAVQDAS
    TSDMEDASTSSGGQRSSACQASLDEKQNAFVKFSEKHRRLLNAFIRQNSGLLEKSFSLML
    KIPRLIDFDNKRAYFRSKIKHQYDHHHHSPVRISVRRPYILEDSYNQLRMRSPQDLKGRL
    TVQFQGEEGIDAGGLTREWYQSISRVIVDKSALLFTTVGNDLTFQPNPNSVYQTEHLSYF
    KFVGRVVGKALFDGQLLDAHFTRSFYKHILGVKVTYHDIEAIDPSYYKNLKWMLENDISD
    VLDLTFSMDADEEKLILYEKAEVTDCELIPGGRNIRVTEENKHEYVDRVAEHRLTTAIRP
    QINAFLEGFNELIPRELISIFNDKELELLISGLPDIDLDDLKTNTEYSGYSIASPVVQWF
    WEIVQGFSKEDKARFLQFVTGTSKVPLEGFSELQGISGPQRFQIHKAYGSTNHLPSAHTC
    FNQLDLPEYTSKEQLQERLLLAIHEANEGFGFG
    

    END
