Was this page helpful?

GENEMARK rice

    Table of contents
    No headers

    gene#                 Exon                Position            Amino acids
        1                        1                   200-592                  130
        2                        2                 1701-3128                245
        3                        3               23948-4690                227
        4                        3                 5014-6603                264
        5                        3                 7506-8569                265
        6                        4                 9409-11579              501
        7                        2               12476-14094              482
        8                        6               14409-15254              606
        9                        5               17202-18382              325
      10                        2               19414-19492                80
      11                        3               21592-21698              155
      12                        3               23480-23489                55
      13                      19               27260-32821             1291
      14                        2               33140-33501              494
      15                        2               35757-36196              125
      16                        1               36525-37001              158
      17                        2               37784-39157              432
      18                        1               39285-39719              144
      19                        7               39791-42449              515
      20                        3               42617-43119              126
      21                        1               43595-45184              529
      22                        2               45423-45782                98
      23                        7               46663-48636              510
      24                        4               48850-49940              302
      25                        3               51330-57622              116
      26                      13               58137-60796              539
      27                        8               61158-63495              433
      28                      10               63789-68416              591
      29                       2               69058-69300                97
      30                       1               69545-71152               535
      31                     10               71531-74212               606
      32                       3               74341-74752                95
      33                       1               74886-76625              579
      34                     15               76983-80618              638
      35                       2               82214-83338              496
      36                       1               83970-85256              428
      37                       6               85335-88965              259
      38                       3               89425-91614              593
      39                     12               92473-99893              668

     

    Predicted peptide sequences

    >gene_1|GeneMark.hmm|130_aa
    MACYQWKKILEQGTLHGSRNFGYEELTSLPLIVSGPAPSNAIKLGNDRSPVSGRLSISGR
    GHIAPDDGTAPNTDFFSDSFSAWFLSTSSRLIFFLVWSRTLLMPSPPAGPLSEEALTGSK
    NLKDRLDRDL
    
    >gene_2|GeneMark.hmm|245_aa
    MFSIQGSSISSVGRESKSSPREVVEHRRSCHHHHQFGRDVELVQALTERKPAGPVVQHPW
    NSAHGTSRFSNHQFLGIIIVRHRPPVVEEVCEWHPSSRTILQFTQLANVLDPSVHVLDNH
    LRAVSHLTNLSDVDELGVGLKPSRHSETELPCSWLWEGHIVHANCDTPETSSGNVSTRQM
    RWYSSRMAVPDQLGLLRSVGNDWRLVYRLVALHPYLYASSVDVANSGKGLEGGECWRGDC
    RKSWW
    
    >gene_3|GeneMark.hmm|227_aa
    MALATSSSLAALPSSLSKTPLGGASAHKSRSTRSLGFLLPVRASSDKGPAGGDGISSVLD
    QTKKNISREGVLKNQAENESEKKSVFGAVPSSGAMWPRPEIERRPETGDRSFPSLMAFDG
    AGPETINGRLAMVGIVWAFAVERITGQTVFEQLYAPGNFGLFNFLMVAQLFAYASLVPMF
    KGESPDSRALGPFNAMAERWNGRTAMLGFLALVLTESFIKTPVFNMM
    
    >gene_4|GeneMark.hmm|264_aa
    MAVATSSSIAAIPSSLSKTPLGGAKSRSNRSLGFLLPVRASSENGRGDGISSVLDETKKN
    ITREDVLKNQAENESEKRSVFGAVPSSGALWPRPEIERRPETGDRSFASLMAFDGAGPET
    INGRLAMVGILWAFAVERMTGQTVAEQLYTPGNFGLFNFLAVAQLFAYASLVPMFKGESP
    DSRSLGPFRAMAERWNGRTAMLGFLALVITEFFTKTPVFNMITRRRNVPCSMCALAKFWA
    RGAGIRFAKNAIKLYVVLNTRMET
    
    >gene_5|GeneMark.hmm|265_aa
    MALATSSSIAVIPSSLSKTPLGGASAYKSRSGRSLGFLPPVRASSDNGRGDGISSVLDQT
    KKEITREDVLKNQAENESEKRSVFGAVPSSGALWPRPEIERRPETGDRSFASLMAFDGAG
    PETINGRLAMVGILWAFAVERMTGQTVAEQLYTPGNFGLFNFLAVAQLFAYASLVPMFKG
    ESPDSRSLGPFRAMAERWNGRTAMLGFLALVLTEFFTKTPVFNMITISPARPMLSPKRLA
    PIHTACCAYTYKMEASKSMDAEQWT
    
    >gene_6|GeneMark.hmm|501_aa
    MDRPGAEAEAVEIRAIFVEMDAASRDSGIQVWMRDGDLESIRRNDRRIAYIKRVMEYLVS
    HEKQGSVNLRKLHAVEGLDDNTKHGLRRNAWNYPGFFTVTVNNRQKEDLVALTDEAKQLM
    EDERRARWDRDESSCVQILKKILMMSRDRRIRRSKLHCLQEYYAFPYDYNTGFLERHPDH
    IRLVETPKDSYVELVSWDEELAVTEREKAVAQGRTPKELGQWAFVISYPEGYTPNRVRLE
    QLDNFQRLPFPSPYEFSQRAFDTPAGQKEALGIFHELLSFTVEKRALVSDFTTLAGSLNI
    PRYFSDSLLSCHPGIFYVSKWKNQHYVFLREAYRGKNLVAEEVDPLVTIRWRYLELMKPK
    PEKLHLSSRDEDYLYVPISKRGITSNQDKANAMASSAPNRLPKSSGRSLSSRRTKEIEPT
    GKPTIYSFKHGKYVCVYYFITTRITDELAITQGLWARDMDAIEEGNASCWMSCAVSCMEV
    HRKLECPLDLHGKTKQATGQA
    
    >gene_7|GeneMark.hmm|482_aa
    MIFWRKWRRRWNSCSVEVSNGAIGQRGGGGGTLLCIPSDTLADDKRLKKRSRRLLVSDST
    GESHNNAKRSAAMYSTAFNFKLPGTTKTYPGTSTTNSSLDPPVDSSRMPEPESCDQSRHE
    TWEFETLVPFSCETPSLLSSDVISTIDYDETGQLIATGGLARKIRICSYQELVNGMGREC
    FQGRNVKNLFTTICMPAKLSSLKWRPGGSEVIACGDYDGSVTEWDVEHGVTVSERYEHTG
    RTVWSIDYSRDFRGLLASASSDSTVRFWSRNVERSVGIIKSLKRNSMCCVEFGRSSGPCC
    YVAVACADASVYLYDMRSLGSPVATLRGHERSVSYVRWLGENSLVSSSPDGTIRLWDIAS
    TVTGTGESWHARNDELPIARTFGCHSNTRNFVGLSVASSGGGSGGLIASGSENNEVFVYS
    SSVSERPVFRHKFNDAVVLDDKAFVGSVCWTKQQDHLSLISANSEGIVQVIRATTTATAR
    HT
    
    >gene_8|GeneMark.hmm|606_aa
    MAIVRVVGVVGAGQMGAGIAQLAAAAQMAVVMADSDGAALTRGLQSISSSLARFVKKGVI
    SEDEANATLARVSTTTSLADMSSADVVIEAVSERENVKKGIFSELDRLLKPSAILASNTS
    SISITRLAASTQRPQQVIGMHFMNPPPLMKLVEIVRGLATADEVFEQTKELAERFGKTVV
    SSRDFPGFVVNRILMPMINEAFYALLEGVSSAEEIDTAMKLGTNQPMGPLALADFIGLDT
    CLSIMGVLHAGLGDAKYRPCPLLAQYVDAGWLGRKSGRGVYHYLSKIGKQRLAARALVPV
    LPPWGERSLGEWRASGGSALDRDQQIISRELRPMVDAPGVDPPKTGTPRPPEEGGAASPV
    KKRRGRPRKCDTIAAAAAAAAASPQLGTNGGVGPTPARKVRKRKIDPLSPPQVDASLVGQ
    SVHGVLDGSFDAGYILTVRVGETDTILRGVVFGPGLCVPITTANDIAPNVRFAGGSRGGA
    AATARLSSPVPASSTTPSSSQAVVMPPALAENPLDVSSSRHHTPVRDHGQQLGFAPVVPP
    LFGVQTPTAGFIEAGKSCLVEQQQQQQQPLPVELGCSLQLANGSSPQREQQQRTLAEASI
    EDMDRA
    
    >gene_9|GeneMark.hmm|325_aa
    MARLRLAAGHGWEEAEFHSPRRKNRVLKCVAAKAETELDGGSSSGTTVEEAPPAPSTRPD
    EKISSTISQDSSSPISSWLYPDKEDLPDDKEMTIFEHLEELRERLLLSVGAVGVAMLGCF
    AFAKDLIMYLESPAHVQGVRFLQLSPGEYFFTTLKVSGYCGLLIASPVILYEIIAFVVPG
    LTLSERKFLGPIVLGSSILFYAGLAFSYSVLTPAALNFFVSYAEGVVESIWSIDQYFEFI
    LVLMFSTGLAFQVPVIQLLLGQTKLVTGDQMLSVWRYVVVGAVVAAAVLTPSTDPLTQIL
    LAGPLIGLYLGGASLVKLLQAGETS
    
    >gene_10|GeneMark.hmm|80_aa
    MSVVLWGFGHGIGHVKHLDQVQRLSGRDAILEVSKPKSKIRLMKVVSETFPSGRRFEMQV
    PLGAMMTEIYSIKNGKLQVY
    
    >gene_11|GeneMark.hmm|155_aa
    MESEVETETKTETETKTGTKTETETKIGTETETKAETEAEFESESKSESKFESERVKAQS
    IRLLDPGGFVQVWVALPTWIPSVAQGNLTLSPQLQHPKAEGGESTHEACIRPWYSEMLPQ
    ERDLVGTIKGILGKAIPHIYRHPKDAKEWIWDDHS
    
    >gene_12|GeneMark.hmm|55_aa
    MARDAGSFKIALTSFCASVPSPLGVDSCNNNVNIAIHVIYAAKEGSELTFIDPCH
    
    >gene_13|GeneMark.hmm|1291_aa
    MGAVSLLLLACPTNPSARSRIPAPHSSRRGKILVRAKAGSNAGHTLAVERLRSAPRRRAA
    DFPADLPSIERRDSSWLGSLAKILGVGIAVVALAALGAHRPAFAAPAFVGEEEEQKITAA
    DLHSRFVEWKDGFHREMEQMGPMRKRLLEIYGLDKRFEKFRQGIEERELMYREVLGVKDM
    LKEIEELAGKSEARSLEVEFRVREALVQARRKLTMGARRLQGEVLLKTSSHLAKLRKEEA
    SLMKELNEAMDGFPGLRKKFDAAETQPAEDDLGGSTKAFSPKSESDDQVDAQESLFFSKT
    EELRAKIVNVKAELVGSEYEGWLRAWHEIPNFTAIFAAELEEAAKFRLPERLESRIANDL
    RASGKEIWEESILPLALDKQEKGVVPVDSTVKSEVDELLRRHIEFAGSDERKAVSAVDGK
    LGSIELYSALGGSVSDVPGPEAVKTILGWRKWCSIKKEELKQHLLDHPEAGKKYIQEQQE
    KLLLARDRVLQNTWYNEKSRRWEMSDAAAMYAVEKNLVHHVRIRHDLRVIFVGLKGDDQE
    YVVNVEDLNERLEDVGGFDAFYAMLKENKIPTVMEKMWIPLREWSAFQLIRLPFVMLIDA
    VLYVWNLGVVAAARDLYFESLDALLGELMVRFGYPAILKLPRLVREMIGFELPKGSEFLE
    PTFLMKWQTAAQAKFDARQIALPFWMDLPLRIYVVGVPLLFVVTRVAKAIYRFIGPPPLP
    KSQYDKAVELHMKVYKEMTPLLKQVTTNPIKRVFDKMKRIRKPPVSLNDFVGIETFREEV
    DEVVSFLRDPGAFKKLGARAPRGVIIVGETGTGKTTLARAIASEARVPVVELQSFDLQGG
    GEWVGQKAANAPIILFMDDFDDFVGKRGETMHLDTQELETLINQMLVELDGFETQEGVVV
    LATTSHPERIDEALRRPGRMDRTIKLLPPNRTQREQMLRLIARNTMFPHVVDWVNWAEVA
    EKTEGLTYVDLEPIPVNLQQCATHWKTKDEEELFSYLCILDKYNKIVPEWIRKTPLLKKW
    DQSVIDWLGLRVTKEDFEMTVRYMDVRGRSKPGIEFHDPLYPWTRETKYPHAVWAAARGL
    LARLLPNFDIVEYIWLDETSWEGIAWTRLTRRVEEGYLETHTQTRNYLEKKLVLCYASHV
    ACCMLMPVLDRNNLAEPQLEEAKQIAADMVMEYGWALDDSPMVYRSKGEGPMDMGLQEMT
    VIERKVTKLLTVACDRASQILARNKQVLEALVEQLVVHENITQEFMEKTLKEKGATFEGE
    PFTLQDLDSSLQSTDGKLLDMGRIGFIENHS
    
    >gene_14|GeneMark.hmm|494_aa
    MGSIALPSRRWILIALVAAAVALAAMTGVDSVGVNWGTVTSHRLPDKMIVKLLQESRISK
    VKLFDADPSVIRAFAGTDLELMVAVPNDLLEDMAFSEKAARRFVRRNITKFLSHQDGVNI
    RYIAVGNEPFLKAYNGSYEDVTIPAIRNMQQAIEQAGIQHKVTLVVPLNADILTNSGNSG
    KPSQGAIRPDIRRLMRTILEFLDKHKAPFVINMYPFLSLQQDSHFPSDFAFFDGSAHVLS
    DGRNFYSNVFDASYDLLVSALAREGFPDMEIVVGEVGWPTDGDIYANIPNAQRFNQQLIR
    HVTSNRGTPLRPGIPIEIYIFGLVDEDRKSVLPGNFERHWGLYRYDGKPKYSLDVSGRGG
    NGLSSPINLMSVSGVTYLPSRWCVLNPEVDDLSKLPATISYACSYADCSTLAYGGSCNHI
    GQTGNASYAFNSFYQMNNQRTESCHFGGLGMITETDPSSGNCQFRVEITPWSAASTVWRG
    GAAVLVVSLLVALS
    
    >gene_15|GeneMark.hmm|125_aa
    MENKRDHPAASMIAQMAKAGSTGSLASHPDPNLAEDLQDKMEHRFDIDEIQVKNEQLAIW
    AIENGFSLFSTRLKAPVFRRLLVAVRCFLLLQSFPTAESLLLAKSGECLGKSIVTTVAMD
    KNHDN
    
    >gene_16|GeneMark.hmm|158_aa
    MKAEDFLGYEYPRKFLRKMNFKVMEYLSGSGSSSRKITSSSEELQTKKTWALALFLETFE
    TSSSMVKAKDFLRSDNALAGGEKEEEEAKRRSSQLILGAPRNQENQKKIFRSSHITGRID
    QSDSIDAERSRKTVKTSSRQEKFHKHARRSERLHRRSA
    
    >gene_17|GeneMark.hmm|432_aa
    MGDDNEEDRQSVVSRSSSCRSSSSKPHKANDAGWEAIRSVEARDGNINLSHFKLLQRLGS
    GDIGSVYLSELRGFRCLFAMKVMDKTALAARNKLLRAATERSILEKLDHPFLPTLYAHFD
    TANFSCLIMEYCPGGDLHTLRQRQLTKRFDNEAVRFYAAEILLALEYLHMMGVVYRDLKP
    ENVLVRHDGHIMLSDFDLSLICDVSPTVIQSPPPGTAARRRAPSFSSSSSSSSTSKLGRL
    GGGASPSCILPACVAPCTVDRPMPPAGQLRSTRVNPLPELVAEPTGARSMSFVGTHEYLA
    PEIISGYGHGSAVDWWTLGIFLFEMFHGRTPFKGGDNESTLVNVLTKPLEFGGAAEGVEL
    GEDARSLIRGLLAKDPAKRIASARGAVEIKQHPFFAGTNWALVRCAAPPEVPKALLWRKK
    NTGSKSDDVEYF
    
    >gene_18|GeneMark.hmm|144_aa
    MATLGVAGMCLVRARGERFGGNSALGRARMRGARLQCAGPRVFEVEMEHEGKIHTLRVPE
    DETILSKALEEGVEVPHDCKLGVCMTCPAKLERGRVNQSEGMLSDDVVDKGYALLCVAYP
    LEDCRIRTIPEDELVSLQLVTSSD
    
    >gene_19|GeneMark.hmm|515_aa
    MAPATGERSRIAFSAPAQQILSAGVLRAPVKRSLFGNDSGACISLEFFVVFLEETFDFWN
    SFSYALFMRFLARAEFFVLAVLSCEDYVQRFLVCYYFTDSWEQFRIHGDIEVISHTEDDP
    LKKEIREKSWFSSSLQTRKQFTWPHPGQPKESHPEEVRLESTQPPVDTFCVVTLHPVESQ
    VAGYASRGSFARNFAMPSLQVAKEFPDGLFFVDRSFARKSMPKSLCVEADPSGNPIASRD
    VTIDEKLRIWARVFLPKGKNEKLPVVLYFHGGGFVSFTANTLEFHVLCESISKKLGALVV
    SVNYRLAPENRLPAAYDDGFAALKWLAQEQGGRKDPWIAAHADLSKILVMGDSAGGNLAH
    HVAMRAAAEDLGELQIKGRVLIQPFFGGIARLPSETNLQSPTSLLSTDMCDRFWELALPV
    GASRNHPYCRVFAPDLKAQLRELDLPSTLVVAGGLDVLRDRALEFVEVMRECGMDPELLL
    LEAADHAFYVAPGSREVAQFLDKLCSFARGIFVSS
    
    >gene_20|GeneMark.hmm|126_aa
    MASSTQIVLLLLALVFAESLHDAHAYTAGFEMSEAQVTAASKSSTGCATDFSKVDYAEVT
    SVCKGPQYHQEACCGAFKKMACKYTTQVNDFSTTCPVEFMAYLNYAGNYPNGVFVGRCNS
    GSSLCS
    
    >gene_21|GeneMark.hmm|529_aa
    MNNRRARTVLALLSCCFCLSLAQQPGRFNIVLQNAGISSMHTAVTHYGNVIFLDRTNIGP
    SAINLVGNCRDNPADMMTTHDCTAHSVIYDPSSNTVRPVFIYSDTWCSSGQFLPNGTLMQ
    TGGSSDGGSIIRYFTPCSSGSWCNWMESSTNLQSSRWYASNQILPDGRIIVVGGRGVYNY
    EFQPTGGQFYLQFLKDTADFQDDNLYPYLHLLPSNLLYIFANRDSILLNYFTNTVVRKFP
    TIPGEPRNYPCSGSSVMLALDTANSYSKAEVLVCGGANQASFKNSDAQYGASQTCGRMEV
    TSNSPYWDMSYMPFRRNMGDMVLLPTAKVLIINGAQNGSQGYLLASNPILNPLLYDPDKK
    TFEIQAPSTIPRVYHSTANLLPDGRVLVAGSNTRYTYQYTGPFPTELRVETFSPAYLDAT
    NDWLRPRIAKNPFTITYGMPFSVDVAIPGKLVGNIQLTLLSSPFTTHSFSQGQRQLKLPV
    AASVLSYANTYYVASTAPPSSVVAPPSYYMLFALHNGIPSQAVWVLVTS
    
    >gene_22|GeneMark.hmm|98_aa
    MLQGSGHVYLTMELLLRAFQSRPHQSCRLGRAVLFTETREAHTSTSRPLLRHCYTLLQAS
    AFLWRCSRKKRRAKWIQKHDGGTERQASETTGKQQLEL
    
    >gene_23|GeneMark.hmm|510_aa
    MANLSIFLCVFLAIAIACCCGQSALASHDHTGKKKPRDGDRDWSPKHSPPSHKAPKAPKV
    RSPPLFEDESPPPPHHAPKAPKAPKVRSPPPFEDESPLPPHHAPKPPKAPKAPKAPKVRS
    PPPAKAPPPPSVTLPRSPPPPSSSPLTGKEFNVKNYGAVGDGKHDDAQAFLDAYNAACQA
    GDNAVILVPSTSAGYYLSPISFGECTSVTMKVDGTLHAIPRSAWLSKFSSEKSWLLFTKV
    TDFTLTGGTFNGNGQDWWAHSCKKDKSQAVRFEDSKNIKVEGITITNSPQIHITFSDSQA
    IQATDVVINSPESSPNTDGIHVSGSTNVVVRDADISAGDDCVSIVSGSSNIQVLGGRCGP
    GHGISIGSLGKGGSYATVSNVQVSGVKIDAATNGVRIKTWQGGKGYVSNVIFENISMDNV
    KNPIIIDQNYCDGGCGKKRGSSLTIQGVTYSNIVGTSASPDGINLHCSSSGACTNIHFSN
    VKLTLGSSGKAAGAVCENVQGYTSDCRSSS
    
    >gene_24|GeneMark.hmm|302_aa
    MQQQCGKEMPAISRNPGFSRRFPPAQVCLSRRFSGAASRAATRALARSEPVSTVEVEALL
    DLIKWDSSGLAVAIAQDVDTGAILMQGFVNRDAVSATIASKRATYFSRSRRSLWTKGETS
    SNFIDVVDVYLDCDRDSIIYLGKPDGPTCHTGADTCYFTRAADILQNKPLIGENGLATST
    LYDLERVIQQRKSEPDGKKPSWTKRLLQDQKLLCSKIRCALLSCLSRLTRNFGREEAGEL
    CKTVEENEGRKRTVSEMADVLYHSMVLLAAQDVKMAEVMEVLRARFSQSGIEEKSSRGRS
    SS
    
    >gene_25|GeneMark.hmm|116_aa
    MEAQQSHHHCSTLVPDLIPVAPLKIWILFSMGVITMQHVNVPRLHCWKVPSSFPGSCRCI
    QSSGSFLAPFCRKLKGVSFSLDKVVARFKLSVDMFLSQKALSPVVSLDMNGWNEVL
    
    >gene_26|GeneMark.hmm|539_aa
    MATVRGVPQQGEWRLILRSKTEDNEDLRQTSTWGKSSRKLAPDGSNHPKFVTYFAIIRDT
    VSISTPFLLAGLPALKYFQKDGHNWRKKKDGKAVREAHEQKSGSIDVLHCYCARGEEDPN
    FQRSYWMLEGAYEHIVLVQYLQVHQGRKSAYRAPEYLKPHAESPSILSSEHGDSSDDVEQ
    LSSKYSILPETQGSFFQSDIDLNDLLDRPDMFLGQKALTPAVSLDMSGWKEVLRSYLENP
    TNGPVKPEDETLEQRTTRDAFLEQASPMKFEDGMFKLSPRASLSPKAIMEVLSPRGLGRQ
    PQTHFEAQLRAATAENTKRVSSPPPAQNVLVDMGRASQQEGRIKQLASFGRWALEPAPAS
    SSSVWLLPTKKPRFRLTYIFISPLGSKWQVLVSGTTGRGAPASRSKPVSTVEVGALPDLI
    SNGLAVAIVDTGAILMQGYVMRTLSRPRLHPMSISVVSDTCYIDILQNKPLVCVNGQYLR
    SRVEGSLNSEWNAGLTGHMPWVQKTKQVLSKSHCLIEMAMPQQGEWRNFKIEDCEITRI
    
    >gene_27|GeneMark.hmm|433_aa
    MDDYKDREETSNLPMPMELEMSAQFPRLSITDFKKPLALIKIAMEWYAYALGIRINKPNK
    WVVEYLMVEYQFHIGPKMATVRGMPQQDFDMRQIIQEACVRWLKPPEVCKILRNYQSYGF
    DLSHVPPSKPASECSFLLASIVTWTDLPKLLGGSLLLFDRKAVKYFRKDGHNWRKKKGGK
    AVREAHKRLKFGSIDVLHCYYTHGEEDPNFQRSYWILEGSKLFLFQDLESAYKALEHPEA
    FSHAMDSPLLSSVGTQQGPSYQQGSLFQPGQDGGILEDCIDLNDLLDSPDKFLGKKALTP
    AVSLDMSGWNELLRSYRENPTTGPVKQEDETSEQCTTNALLGQVLPMKFEDGMFKESPGA
    SLSPKAIMEVLSPRGLGRQPQTHVEAQLRAAATNNAMKSLSPEFDTGSALVKKSKLVEQD
    LRELAGQNEATGR
    
    >gene_28|GeneMark.hmm|591_aa
    MVLSPLAFKHYSVIQPLDFFFLKAFVHHHVHNGRSSRASILHLAVKLDQSTSCITLCFVR
    GKHKWHKSFSAISKKSLAEYVLIESTFVDVLQARAGTERADKMSREERKEKRVDCAPRER
    REREKERAELCKTVGQNEGRKRTVSEMVLYHSTVLLAAQDCEDGGSYGSLESEILSIRHR
    GREQPRQEFKLRDLSTQVENLDMRQIIQEACVRWLKPHEVCDILRNYQSYGFDLNSVPPN
    RPASECSLLLALCHTDWLAMLLGGSLFLFDRKAVRCFRKDGHNWKKEGQAHERLKSGSID
    VLHCYYARGEEDPNFQRSYWVLEGAYEHIVLVHYLQVHQGRESAYGASPEHPEPFSHSEH
    GDSSDHVEQMEQLFSKDSLLSETQSGQGDMFLGHQPLSPAVSLDMSGWKEVLRSYRENPT
    NGPVKQEDSDALEQRTTVDASPGQVKFDDGIMFKLSPEAIPSPKAIMEVLSQPGLGRQPH
    TLLEAQLRAATAENAMKTAQSLSLNVLVDMGRSSRQEESDIKSLASFGRWALAKFGNDDD
    AGAPLEAAPSVSSSVWAAMDVDKDREETSNLPTPMELEMSAQFQRFSITDL
    
    >gene_29|GeneMark.hmm|97_aa
    MDCQLLFGKSYSTASLMGKSRRGYCRPAAAFKVLFELLFVSCVRIRRNFGREEAGDLCKT
    DRRQGKDCISNGKKSDLLTQLGRRCCKSMEWKCCLQQ
    
    >gene_30|GeneMark.hmm|535_aa
    MACPGRRNFLWFLSVVVLISSSTRAQPGRFDVIAQNAGVASMHTVVTHFSNAIFLDRTNI
    GPSQINLAAGGCRDNPDDRTLKHDCTAHSVMFDYFSGASRALSIYSDTWCSSGQFLPNGT
    LLQTGGDFDGFFKVRYMTPCPNGGTCDWQESKTEFLHSGRWYASNQLLPDGRVIVVGGRS
    AFSYEFIPDRGAGQFELPFLKETNDPTFNNLYPFLHLLPDNNLFVFANRDSILLNYFTNT
    VLRRYPTLPGEPRNYPSAGSSVMLPLDSANSFSNAEILVCGGSNKDAYAYPAGQLPASQT
    CGRMVATSGDPNWNILNMPTRRNMGDMVLLPTGQVLIINGAQSGSQGWGYASSPCLNPVI
    FDPVSSKFETQAASTIPRMYHSTANLLPDGRVLVAGSNTHEYYTFTGAFPTELRVEAFSP
    AYLDPANDWQRPKLVNYPGVINYGMPFSVDVSLPGNLTGDIELTLLSAPFTTHSFSQGQR
    QLKLAVSTPLRANGNTFTVKSSAPPSAVIAPPSFYMLFPLHNGIPGTATWVMVTY
    
    >gene_31|GeneMark.hmm|606_aa
    MVRKILRSRLLRRLAKQTVGLKRVPEPLQESIKGYLAGMVLEISRILGFSHDFSPPDYSN
    HQIRQHVFNYTLMLSSKSEEPPTAVTPAFEPSAKVLAAVEDYFSNKGPFAPVPKKYRTAR
    LKPRYDEKQVAAYVAAKMPAVYSVIHTVLSEVARRLPDFKPENVLDYGSGPGTSIWAMSQ
    VWPKTVKLVNMVETSPSMLAASKKILEELPTVEEQITTARQLWALTRDILKLRRRRKFLT
    GSTSTDTPMIGTSEEDGNKRNLDEQLVVHSEEEVEIPGGGAHVIAPCPHDGVCPMDGTTV
    FCHFVQRLERTFTQRMAKKHSRTMLRGYEDEKYSYVVLRRGHRPRVDWPLDHVELQLDKD
    EPVENDLLVDYEEDEDEEEEEYLEDENDEDRETRDDDEGGENSDIETKMEEEPGEDENED
    QIEEQEGDDDECKETAANMSSGWGRVIFKPFRRGKHVTLDVCRSTSPDGSSGSFDRLTVT
    RAKHRVLHKEAKKTPETTAYRYREKGSIRMNEKWKSREASAPTGRIQKERKKAAERKSEV
    SEKSVRSGWSFVCTAVKAGVSKALCARDFKTKQWISHFKPDQASKTTLRKRLWKNSTKWQ
    QAYLLV
    
    >gene_32|GeneMark.hmm|95_aa
    MAWRSEISRRARELRILFCQTSPGSETTRDYILKNYKQLKTLNPTLPILLRECSGIQPRL
    WIRYPYGVEQSATLDGLSVEQIDAKLEELVKDAPE
    
    >gene_33|GeneMark.hmm|579_aa
    MARSWTRHKLLDLALDCDGFARALRASRDAIEVAALHRQILHSPHCCDDRFLANLVVQMY
    GKCGDVESARLAFDSMECPNLYSWAILLGVYARNAHLRDAKETFDRMPQRNEVAWNSLLT
    MFEEQRMIDPVREIFDRMPSTTVVSWSSIVRANAQTGHLGKAKAVFDRMPERNVVAWTAM
    VSEFAYADSIDLASETFDRMPGWDLIAWTAMVTAVAANGHLGRAFDLYDRMPERGIPSHN
    AMIIACAQNGLARESQKIFDEMGDRNIVSWNSILSVCATEEELGSVEAVFRSMPEWSVIS
    WIVLLGAYASAGRIPQVEELFQSMPERDLVAWNAMISSYGRHGYVERSKNTFSRMPEHDL
    ISWNSLLTAFSANSHPREAQAVFDSMPERTTVSWAAMVAMRSQQGHLDSARELFDSMPDR
    SLASWNAMLAGYSQNGHSKPAMELFALMNLDGSQPSAATFVEILGACRDTGKAELSYGYF
    ASMVGDFSLDPVPRHYCCVVDALAKAGHLREAEEIIKGVPGLESAGIAWRSLLDGCRTHQ
    DLQRGSDAARMAIQFDPGAGASYALVTDVLRSSSSSQEP
    
    >gene_34|GeneMark.hmm|638_aa
    MAEHKIIFVEPPAGCEERRRSKRMMACPSGHRYCCSEIPETPRKAVRKSPKKIGAGSPPS
    PTKIRNSPRSRTAGPVTRKIKRRNLAEIYEITPACELPKEEEEDEDEDDEEDDYEEVDDV
    KCGNCDRANDPQRFLLCDGCDRGYHMYCLSPILVAVPKGDWFCPHCSKDRQQVKVFPMVQ
    RKLIDFFGIEKVGPIVLALDRSEATTTQWLSGDLQEKQEAAAIHAIQGSIAEAGANGITG
    VSLDDFRSSSKACEQSRARERRNAGVGTLPRFSVSMIFTSSLTLQVIGKEDKATYELCKA
    MCLRGEHPPLMVTRDPRQGFVVEANNHIKDMTLIAEYTGDVDFMCNREDDEGDSIMGLLF
    PEDASQELVICPDKRGNIARFISGINNHTPDGRKKQNLRCIRFDIDGEVHALLVSIRDIA
    KGERLYYDYNAYQKEYPTEHFDGPEDPRSFWSSFSARDRGSRLILISGAMGASLIQGFLK
    STAMTIVSEIGDKTFFVAALMAMRHPRGVVLTGALLALVISRKLTHNGATFLFFVFGLRS
    LWDAISNEEGESELAEVEAKLGRTDDIKKKKKQQASVFLSPVLIEAFSLTFLGEWGDRSQ
    GPCLVHECCCLGREASSFKHLRKIEERLITSALKNKCP
    
    >gene_35|GeneMark.hmm|496_aa
    MALSEVLHPEATKRTISRRPMSDRLDFTGQIPRVCKFWQQGRCSKGAVCGFVHGEADSGE
    SAPLALERPSRKRSVAAADPSSSASGSRPVLKKNTWIKEGLADKARVIGFDQTKGPEIIK
    NKKIIVDPAALPPPPKLKRSSSTEALILGSKKPARKPGTSPSPHTQRACAYWLEGSCRYG
    ERCKFLHAATTVTGLALLTTLKEHKESITGIAMVPDSAVLFTGATDGTLRAWDCNSGTVS
    DTLRLEGPVEALASGFGWIFAGAGHEVLAWNVKFSQQTLQARAPGNVNALAVGKGLLVAG
    LGSGEVCAWEFGSGELKSTGTLSKHPSAVTALTVAGGRVYSGSRDGSIRVCEAETGKSCA
    TIVKAHAGELTGLLCWESFLLSCSLDSFIKVWAATPAGTLENYFTFPEGEEEIVGRSGVT
    GMCGSVDSDGKPVLVCSYRDSTVKVYGLPLFEERGALFSSAEVISLSSAAAGNNLVFSGD
    KQGAVKVWKWSKDLIK
    
    >gene_36|GeneMark.hmm|428_aa
    MLVIWKKSQNADALASSLWRRECGNPWVIQRRFMARERKRKRIRQVSFDVMISREKHVRQ
    ALWLKDLLVTRPGHTISMIDFREEVKNLGMRVRRLYYLLEYYDTLFQTRVDRAKVEWIEL
    GEDGRRIVELERRLMAEYEPCLVENLRKLLMMSEGEKICLKRIALLREPLGLPHDFEQNL
    VHKYPQYFDVVIAKDKKYRDLQPFLKLTSWDPLLAISRREADAEESERDPHSFRMRFPGV
    KFVRGRDAQFLKSFQMLEFPSPYDPNHGYPKLSREAVKRAVAVIHEFLCLTQESKALVDS
    IAEIRRETGIPKKIGELICRHPGIFYLSWKGALARHPHMEVVYLKEAYSKPYEGERLKAA
    RLLRKGPLVQVKEAMALTMWHADLAYDKRHGTDCFLEGRQFPMITYDAFESTFQSELYRE
    QKIKYENL
    
    >gene_37|GeneMark.hmm|259_aa
    MDGKPRIGRGIQEEVDYQLIGCELSRLCFLKSIDALDAIKTGLYLYEQEVGIAGPVISFP
    LELVEKLVNDRVLRSDKHKDCSKSLRAPVPSLLGLYARELLRDISKFEWAQLLISGCEKL
    VTFFTKKPKLLTTFQKIAPWDIVKLTTTQFTYSYLVISQFVDEKMLWKNNEFWIQCSELL
    MIMALILKLLKIADKDGVGMGIIFEAIDKIQESLQKLADDDVVDKRWLYEFSANAFQFLM
    LKEIGALGELFSLREKLGC
    
    >gene_38|GeneMark.hmm|593_aa
    MLCATIALLLALALYGTSGQGCKPGNCGQLEIPPPFSCEDSPFFLSCEDDKVRISNESYT
    IVAFLGPFVVVDRLRPPPCAARTCRIDTSCGIRVIGEWRDGLGCYITSPPPGDESRYHAV
    IQFDGPDDSCNPNLLQLHLATVVNCSRSASLSSPSLGSGRTSKTGAIVGGSVGGGLGVLV
    LLGCVCCVARRRRLAAKQRGLILGDDPKQPPPQLLSKSGRIVYSNNSGSYGTSNSYGSSV
    SVTVENGDGGSNSSRFSYRELQEATNNFSEDGRLGDGGFGTVYKGKLRDGRLVAVKKLNP
    WNAQGKYQFDNEVTILSRVTHPHLVRLYGCCIEQELLLVYEFVAHGTLADHLYDNPRDYL
    GWDARLTVAVQCAEALAFLHTNVCYHRDVKSTNILLDERYHCKVGDFGLSRLVPSLELTH
    ITTAPQGTPGYLDPDYHQSYQLTDKSDVYSLGVVLMELVSSQRAVDMARERKEINLAALA
    VSRIQCGELDKLVDPRLGAGEDSVRQRMVECVAELGFECLATEKEDRPCMKDVAARLRAI
    EEEGKQRYLEQMVAIRKVEVVDDDKKHTRSSPTSVQMQWPSNSTSPNDSSSSM
    
    >gene_39|GeneMark.hmm|668_aa
    MDTKSQGPKWLPMLISLARITRPENQLGLKCVCSRTFKNFCTRCEDVVCDACHDDNHHEI
    VKILKSSRQMSIRVDEIKHLLDVSDVQWYYCNFKFGVYIDRPTSNQQNPQGKAQICISCG
    RRHSTGKEKHRTPEEIADLENGKFKFCSIGCKLRYIVQHPGEGYTLTLDSRSASSSGFRG
    QALPVSEMRSDDAGMEISEEIVGLSLSLSLATGGKRKGGRGEVVKSPAGKLAKLRDSANV
    EGIELGIAPPPLRNHYARQEVVTSFTSEDLSFVQLQPSGETYEEAGRLCEWAEESVVDVL
    GTFHAQRDDREIREVGTTRGRRGWIAASRTPAMKKRPSIFHRWKWRISVLALLFFLALCG
    LRFPSSPGTMPMPAASIRATVPEKIGAPRIALLFLARNRLAVEEVWDLFFKGAQEHLYSI
    YIHARPGFVYDATNTESSFFWNRQINNSVMVEWGEASMIDAERILLHRALQDASLSHFVL
    LSDSFIESKNTRYNFRMFPTVTHEKWRKGSQWFMLLRKHAEIVVGDSRILLKFYEHCKRF
    SQLKQKAVPNDQHKTTSENDCVPDEHYIQTLLAIKTVENEIERRTLTYTLWKASDRREND
    RWHPVTFNTADVSAQTIKDIKGIHSVKYETEGRTEWCSCNGIPRACFLFARKFSRGAVSK
    LLHNMEAK
    
    Was this page helpful?
    Tag page (Edit tags)
    • No tags
    You must login to post a comment.